Hi Illimar,
When reading a PDB file that has multiple models, the RDKit PDB Parser
creates a separate conformer for each model, it shouldn't be creating
additional atoms.
Here's a little demo:
>>> from rdkit import Chem
>>> import rdkit
>>> print(rdkit.__version__)
2019.03.2
>>> !grep -c '^ATOM ' ./5zsw.pdb
27740
>>> mol = Chem.MolFromPDBFile('./5zsw.pdb',removeHs=False)
>>> mol.GetNumAtoms(),mol.GetNumConformers()
(1387, 20)
Are you seeing something different when you read in a multi-model PDB file?
-greg
On Wed, May 29, 2019 at 8:27 PM Illimar Hugo Rekand <[email protected]>
wrote:
> Hey, RDKitters!
>
>
> I am currently trying to figure out how to only read in the first model of
> a pdb-file. I've designed a script that performs calculations on a per-atom
> basis, and this is very slow when it tries to account for multiple models,
> for example with a NMR-structure.
>
>
> my code is structured something like this:
>
>
> prot = Chem.MolFromPDBFile(prot_cofac, sanitize = False)
>
>
> for atom in prot.GetAtoms():
>
> property = atom.GetProp("property)
>
> etc.
>
>
> Hope to hear from you soon
>
>
> Illimar Rekand
> Ph.D. candidate,
> Brenk-lab, Haug-lab
> Department of Biomedicine
> Department of Chemistry
> University of Bergen
>
>
>
> _______________________________________________
> Rdkit-discuss mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>
_______________________________________________
Rdkit-discuss mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss