Release Notes
The CHANGELOG for the current development version is available at https://github.com/rasbt/biopandas/blob/main/docs/sources/CHANGELOG.md.
0.5.0dev (UNRELEASED)
Downloads
New Features
- Added a new
PandasMmcif.convert_to_pandas_pdb()
class that converts the mmCIF file into a PDB structure. (Via Arian Jamasb, PR #107) - Added ability to write PDBs to a filestream (Via Arian Jamasb, PR #107)
- Adds a method
PandasPdb.gyradius
to calculate the radius of gyration of a molecule (via goniochromatic, PR #117) - Adds MMTF export & improves MMTF parsing robustness (via Arian Jamasb, PR #119)
- Adds support for parsing MMTF files. (via Arian Jamasb, PR #111)
- Adds primitive functions for parsing PDB, mmCIF, and MMTF into dataframes. (via Arian Jamasb, PR #111)
- Added support for AlphaFolds 200M+ structures via
PandasMmcif().fetch_mmcif(uniprot_id='Q5VSL9', source='alphafold2-v3')
andPandasPdb().fetch_pdb(uniprot_id='Q5VSL9', source='alphafold2-v3')
. (Via Arian Jamasb, PR #102)
Bug Fixes
- Fix the
return
statement inPandasPdb.to_pdb_stream()
to returnoutput
instead ofoutput.seek(0)
. (via goniochromatic, PR #116) - Change the
records
default argument inPandasPdb.to_pdb_stream()
to be immutable. (via goniochromatic, PR #116) - Fix some typos and general style issues. (via goniochromatic, PR #116)
- Fix link for "How to contribute" in
README.md
. (via goniochromatic, PR #116)
0.4.1 (05-13-2022)
Downloads
Changes
- Remove walrus operator for Python 3.7 compatibility.
0.4.0 (05-11-2022)
Downloads
New Features
-
Adds support for extracting structures from PDB files containing multiple models. See the documentation for details. (via Arian Jamasb, PR #101).
-
Adds support for fetching mmCIF (
PandasMmcif().fetch_mmcif(uniprot_id='Q5VSL9', source='alphafold2-v2')
) and PDB structures (e.g.,PandasPdb().fetch_pdb(uniprot_id='Q5VSL9', source="alphafold2-v2")
) (via Arian Jamasb, PR #102).
Changes
- Instead of raising a warning when no ATOM entries are loaded, raise the warning only when neither ATOM nor HETAM entries are loaded.
Bug Fixes
- None
0.3.0 (04-06-2022)
Downloads
New Features
- Adds support for parsing mmCIF protein structure files (via Arian Jamasb, PR #94)
Changes
- -
Bug Fixes
- Fixes a bug where coordinates with more than 4 digits before the decimal point caused a column shift when saving a PDB file. (via PR #90)
- Fixes a bug where the invert parameter in get_carbon was selecting the wrong case. (via Arian Jamasb PR #96)
0.2.9 (08-30-2021)
Downloads
New Features
- -
Changes
- Now also allow
.ent
and.ent.gz
file endings for PDB files. (via PR #82 - Added Python 3.8 and 3.9 to setup.py in order to support these versions via conda-forge. (via PR #87
Bug Fixes
- -
0.2.8 (03-30-2021)
Downloads
New Features
- A
PandasPdb.read_pdb_from_list
method was added analogous to the existingPandasMol2.read_mol2_from_list
(via PR #72 by dominiquesydow)
Changes
ValueError
raising and improved file format error messages forread_pdb
andread_mol2
functionality. (via PR #73 by dominiquesydow)
Bug Fixes
- -
0.2.7 (08-04-2020)
Downloads
New Features
- -
Changes
- -
Bug Fixes
- Fix Manifest file to include license file in the PyPI tar.gz file so that BioPandas can be packaged by conda-forge.
0.2.6 (08-03-2020)
Downloads
New Features
- -
Changes
- Uses more modern
https
queries for the RCSB server via thefetch_pdb
function. - Updates the documentation (incl. a code of conduct)
Bug Fixes
- -
0.2.5 (07-09-2019)
Downloads
New Features
- -
Changes
- -
Bug Fixes
- The
PandasPdb.amino3to1
method now also considers insertion codes when converting the amino acid codes; before, inserted amino acides were skipped.
0.2.4 (02-05-2019)
Downloads
New Features
- -
Changes
- Minor adjustments to support to address deprecation warnings in pandas >= 23.0
Bug Fixes
- -
0.2.3 (03-29-2018)
Downloads
New Features
- -
Changes
PandasMol2.distance_df
was added as a static method that allows distance computations based for external data frames with its behavior otherwise similar toPandasMol2.distance
.PandasPdb.distance_df
was added as a static method that allows distance computations based for external data frames with its behavior otherwise similar toPandasPdb.distance
.PandasPdb.distance
now supports multiple record sections to be considered (e.g.,records=('ATOM', 'HETATM')
to include both protein and ligand in a query. Now also defaults torecords=('ATOM', 'HETATM')
for concistency with the impute method.PandasPdb.get(...)
now supports external data frames and lets the user specify the record section to be considered (e.g.,records=('ATOM', 'HETATM')
to include both protein and ligand in a query. Now also defaults torecords=('ATOM', 'HETATM')
for concistency with the impute method.- The
section
parameter ofPandasPdb.impute_element(...)
was renamed torecords
for API consistency.
Bug Fixes
-
0.2.2 (06-07-2017)
Downloads
New Features
- -
Changes
- Raises a meaningful error message if attempting to overwrite the
df
attributes ofPandasMol2
andPandasPdb
directly. - Added
PandasPdb.pdb_path
andPandasMol2.mol2_path
attributes that store the location of the data file last read.
Bug Fixes
- The
rmsd
methods ofPandasMol2
andPandasPdb
don't return a NaN anymore if the array indices of to structures are different.
0.2.1 (2017-05-11)
Downloads
New Features
- -
Changes
- The
amino3to1
method ofbiopandas.pdb.PandasPDB
objects now returns a pandasDataFrame
instead of a pandasSeries
object. The returned data frame has two columns,'chain_id'
and'residue_name'
, where the former contains the chain ID of the amino acid and the latter contains the 1-letter amino acid code, respectively. - Significant speed improvements of the
distance
method of bothPandasPdb
andPandasMol2
(now about 300 percent faster than previously).
Bug Fixes
- The
amino3to1
method ofbiopandas.pdb.PandasPDB
objects now handles multi-chain proteins correctly. - The
amino3to1
method ofbiopandas.pdb.PandasPDB
objects now also works as expected if the'ATOM'
entry DataFrame contains disordered DataFrame indices or duplicate DataFrame index values.
0.2.0 (2017-04-02)
Downloads
New Features
- Added an
amino3to1
method toPandasPdb
data frames to convert 3-amino acid letter codes to 1-letter codes. - Added a
distance
method toPandasPdb
data frames to compute the Euclidean distance between atoms and a reference point. - Added the
PandasMol2
class for working with Tripos MOL2 files in pandas DataFrames.
Changes
PandasPDB
was renamed toPandasPdb
.- Raises a warning if
PandasPdb
is written to PDB and ATOM and HETAM section contains unexpected columns; these columns will now be skipped.
Bug Fixes
- -
0.1.5 (2016-11-19)
Downloads
New Features
- Added an
impute_element
method toPandasPDB
objects to infer the Element Symbol from the Atom Name column. - Added two new selection types for
PandasPDB
ATOM and HETATM coordinate sections:'heavy'
and'carbon'
.
Changes
- Include test data in the PyPI package; add install_requires for pandas.
- The
'hydrogen'
atom selection inPandasPDB
methods is now based on the element type instead of the atom name. - By default, the RMSD is now computed on all atoms unless a specific selection is defined.
Bug Fixes
- -
0.1.4 (2015-11-24)
Downloads
New Features
-
Changes
- Needed to bump the version number due to a bug in the PyPI setup.py script.
- Support for the old pandas sorting syntax (
DataFrame.sort
vsDataFrame.sort_values
) incl. DeprecationWarning.
Bug Fixes
-
0.1.3 (2015-11-23)
New Features
-
Changes
-
Bug Fixes
- Exception handling in tests if PDB goes down (which just happened).
- Added a separate ANISOU engine to handle those records correctly.
0.1.2 (2015-11-23)
- First Release.