Re: [Rdkit-discuss] Atom coordinates from PDB-file
Hi all, Le 25/02/2019 à 12:38, Lukas Pravda a écrit : Hi Illimar, If you need to access coordinates without creating conformer object do you really need to use rdkit I the first place? PDB file is column based format, so extracting coordinates for atoms for example with python is very straightforward. Lukas To pick certains atoms on a PDB file, I'd go for biopython's Bio.PDB -> https://biopython.org/wiki/The_Biopython_Structural_Bioinformatics_FAQ HTH, Stéphane -- Lecturer, UFIP, UMR 6286 CNRS, Team Protein Design In Silico UFR Sciences et Techniques, 2, rue de la Houssinière, Bât. 25, 44322 Nantes cedex 03, France Tél : +33 251 125 636 / Fax : +33 251 125 632 http://www.ufip.univ-nantes.fr/ - http://www.steletch.org ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
Re: [Rdkit-discuss] Evaluating ETKDG method with the Platinum Dataset
To give some context, it's not that we're trying to sample a diverse set of conformers (and find something close to the experimental). In this case, we're generating initial geometries - to assess Naruki's fragment-based builder in Open Babel. But you raise an excellent point - by picking only one random conformer (as we're doing), we'll absolutely going to have a higher RMSD than sampling 50 conformers per compound and picking the best. We'll try a quick test to be safe, but thanks for the suggestion. -Geoff On Mon, Feb 25, 2019 at 3:55 PM Greg Landrum wrote: > > Hi Naruki, > > You're only generating a single conformer per molecule; I wouldn't expect > that to do particularly well. It's generally better to call > EmbedMultipleConfs(). > > As an aside: I've looked at the platinum set too, it might be worth checking > out this RDKit blog post: > http://rdkit.blogspot.com/2017/05/looking-at-platinum-dataset.html > > -greg > > > On Mon, Feb 25, 2019 at 11:53 AM Naruki Yoshikawa > wrote: >> >> Dear all, >> >> I'm evaluating ETKDG method implemented in RDKit using the Platinum >> Dataset introduced in a benchmark paper >> https://pubs.acs.org/doi/abs/10.1021/acs.jcim.7b00505/ >> SMILES generated from the dataset is served as input and a 3D >> conformer is generated. >> We evaluate RMSD between generated structure and experimental structure. >> >> Although the author of the benchmark paper reported the mean RMSD to >> be below 1.0 angstrom, my evaluation code reports around 1.5 angstrom. >> I can't figure out why such a big difference occurs. >> >> My evaluation code is here: >> https://gist.github.com/n-yoshikawa/0ba04a1b0c718c4cc8d83702f3759afa >> There is a link to data in this gist. >> >> Thanks, >> Naruki >> >> >> ___ >> Rdkit-discuss mailing list >> Rdkit-discuss@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
Re: [Rdkit-discuss] Evaluating ETKDG method with the Platinum Dataset
Hi Naruki, You're only generating a single conformer per molecule; I wouldn't expect that to do particularly well. It's generally better to call EmbedMultipleConfs(). As an aside: I've looked at the platinum set too, it might be worth checking out this RDKit blog post: http://rdkit.blogspot.com/2017/05/looking-at-platinum-dataset.html -greg On Mon, Feb 25, 2019 at 11:53 AM Naruki Yoshikawa < naruki.yoshik...@gmail.com> wrote: > Dear all, > > I'm evaluating ETKDG method implemented in RDKit using the Platinum > Dataset introduced in a benchmark paper > https://pubs.acs.org/doi/abs/10.1021/acs.jcim.7b00505/ > SMILES generated from the dataset is served as input and a 3D > conformer is generated. > We evaluate RMSD between generated structure and experimental structure. > > Although the author of the benchmark paper reported the mean RMSD to > be below 1.0 angstrom, my evaluation code reports around 1.5 angstrom. > I can't figure out why such a big difference occurs. > > My evaluation code is here: > https://gist.github.com/n-yoshikawa/0ba04a1b0c718c4cc8d83702f3759afa > There is a link to data in this gist. > > Thanks, > Naruki > > > ___ > Rdkit-discuss mailing list > Rdkit-discuss@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/rdkit-discuss > ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
[Rdkit-discuss] Evaluating ETKDG method with the Platinum Dataset
Dear all, I'm evaluating ETKDG method implemented in RDKit using the Platinum Dataset introduced in a benchmark paper https://pubs.acs.org/doi/abs/10.1021/acs.jcim.7b00505/ SMILES generated from the dataset is served as input and a 3D conformer is generated. We evaluate RMSD between generated structure and experimental structure. Although the author of the benchmark paper reported the mean RMSD to be below 1.0 angstrom, my evaluation code reports around 1.5 angstrom. I can't figure out why such a big difference occurs. My evaluation code is here: https://gist.github.com/n-yoshikawa/0ba04a1b0c718c4cc8d83702f3759afa There is a link to data in this gist. Thanks, Naruki ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
Re: [Rdkit-discuss] Chem.GetMolFrags and 3D coordinates
Hi Greg, Thank you, I can reproduce your example, and my own case works fine now... Best, Michal On Mon, 25 Feb 2019 at 14:26, Greg Landrum wrote: > Hi Michal, > > Which version of the RDKit are you using? This should already be working. > Here's an example demonstrating that: > > In [16]: m = Chem.AddHs(Chem.MolFromSmiles('c1c1.N')) > > In [17]: AllChem.EmbedMolecule(m) > Out[17]: 0 > > In [18]: fs = Chem.GetMolFrags(m,asMols=True) > > In [19]: print(Chem.MolToMolBlock(fs[0])) > > RDKit 3D > > 12 12 0 0 0 0 0 0 0 0999 V2000 > 0.7943 -1.14900.0150 C 0 0 0 0 0 0 0 0 0 0 0 0 > 1.36070.1298 -0.0022 C 0 0 0 0 0 0 0 0 0 0 0 0 > 0.58981.2681 -0.0170 C 0 0 0 0 0 0 0 0 0 0 0 0 >-0.78661.1450 -0.0149 C 0 0 0 0 0 0 0 0 0 0 0 0 >-1.3413 -0.11240.0019 C 0 0 0 0 0 0 0 0 0 0 0 0 >-0.5806 -1.26560.0170 C 0 0 0 0 0 0 0 0 0 0 0 0 > 1.4026 -2.04280.0267 H 0 0 0 0 0 0 0 0 0 0 0 0 > 2.41830.2113 -0.0036 H 0 0 0 0 0 0 0 0 0 0 0 0 > 1.00752.2686 -0.0304 H 0 0 0 0 0 0 0 0 0 0 0 0 >-1.41822.0293 -0.0265 H 0 0 0 0 0 0 0 0 0 0 0 0 >-2.4222 -0.21720.0037 H 0 0 0 0 0 0 0 0 0 0 0 0 >-1.0241 -2.26520.0304 H 0 0 0 0 0 0 0 0 0 0 0 0 > 1 2 2 0 > 2 3 1 0 > 3 4 2 0 > 4 5 1 0 > 5 6 2 0 > 6 1 1 0 > 1 7 1 0 > 2 8 1 0 > 3 9 1 0 > 4 10 1 0 > 5 11 1 0 > 6 12 1 0 > M END > > > In [20]: print(Chem.MolToMolBlock(fs[1])) > > RDKit 3D > > 4 3 0 0 0 0 0 0 0 0999 V2000 >-0.0066 -0.00990.2620 N 0 0 0 0 0 0 0 0 0 0 0 0 >-0.41360.8845 -0.0859 H 0 0 0 0 0 0 0 0 0 0 0 0 >-0.5574 -0.8197 -0.0901 H 0 0 0 0 0 0 0 0 0 0 0 0 > 0.9775 -0.0549 -0.0860 H 0 0 0 0 0 0 0 0 0 0 0 0 > 1 2 1 0 > 1 3 1 0 > 1 4 1 0 > M END > > > In [21]: print(Chem.MolToMolBlock(m)) > > RDKit 3D > > 16 15 0 0 0 0 0 0 0 0999 V2000 > 0.7943 -1.14900.0150 C 0 0 0 0 0 0 0 0 0 0 0 0 > 1.36070.1298 -0.0022 C 0 0 0 0 0 0 0 0 0 0 0 0 > 0.58981.2681 -0.0170 C 0 0 0 0 0 0 0 0 0 0 0 0 >-0.78661.1450 -0.0149 C 0 0 0 0 0 0 0 0 0 0 0 0 >-1.3413 -0.11240.0019 C 0 0 0 0 0 0 0 0 0 0 0 0 >-0.5806 -1.26560.0170 C 0 0 0 0 0 0 0 0 0 0 0 0 >-0.0066 -0.00990.2620 N 0 0 0 0 0 0 0 0 0 0 0 0 > 1.4026 -2.04280.0267 H 0 0 0 0 0 0 0 0 0 0 0 0 > 2.41830.2113 -0.0036 H 0 0 0 0 0 0 0 0 0 0 0 0 > 1.00752.2686 -0.0304 H 0 0 0 0 0 0 0 0 0 0 0 0 >-1.41822.0293 -0.0265 H 0 0 0 0 0 0 0 0 0 0 0 0 >-2.4222 -0.21720.0037 H 0 0 0 0 0 0 0 0 0 0 0 0 >-1.0241 -2.26520.0304 H 0 0 0 0 0 0 0 0 0 0 0 0 >-0.41360.8845 -0.0859 H 0 0 0 0 0 0 0 0 0 0 0 0 >-0.5574 -0.8197 -0.0901 H 0 0 0 0 0 0 0 0 0 0 0 0 > 0.9775 -0.0549 -0.0860 H 0 0 0 0 0 0 0 0 0 0 0 0 > 1 2 2 0 > 2 3 1 0 > 3 4 2 0 > 4 5 1 0 > 5 6 2 0 > 6 1 1 0 > 1 8 1 0 > 2 9 1 0 > 3 10 1 0 > 4 11 1 0 > 5 12 1 0 > 6 13 1 0 > 7 14 1 0 > 7 15 1 0 > 7 16 1 0 > M END > > > > Best, > -greg > > > On Mon, Feb 25, 2019 at 5:54 AM Michal Krompiec > wrote: > >> Hello, >> Let mol be a molecule with a conformer with 3D coordinates, consisting of >> 2 fragments. Chem.GetMolFrags(mol, asMols=true) returns these fragments as >> Molecule objects, but the 3D coordinates are lost. Is there any way to >> preserve them? >> Best, >> Michal >> ___ >> Rdkit-discuss mailing list >> Rdkit-discuss@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss >> > ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
Re: [Rdkit-discuss] Chem.GetMolFrags and 3D coordinates
Hi Michal, Which version of the RDKit are you using? This should already be working. Here's an example demonstrating that: In [16]: m = Chem.AddHs(Chem.MolFromSmiles('c1c1.N')) In [17]: AllChem.EmbedMolecule(m) Out[17]: 0 In [18]: fs = Chem.GetMolFrags(m,asMols=True) In [19]: print(Chem.MolToMolBlock(fs[0])) RDKit 3D 12 12 0 0 0 0 0 0 0 0999 V2000 0.7943 -1.14900.0150 C 0 0 0 0 0 0 0 0 0 0 0 0 1.36070.1298 -0.0022 C 0 0 0 0 0 0 0 0 0 0 0 0 0.58981.2681 -0.0170 C 0 0 0 0 0 0 0 0 0 0 0 0 -0.78661.1450 -0.0149 C 0 0 0 0 0 0 0 0 0 0 0 0 -1.3413 -0.11240.0019 C 0 0 0 0 0 0 0 0 0 0 0 0 -0.5806 -1.26560.0170 C 0 0 0 0 0 0 0 0 0 0 0 0 1.4026 -2.04280.0267 H 0 0 0 0 0 0 0 0 0 0 0 0 2.41830.2113 -0.0036 H 0 0 0 0 0 0 0 0 0 0 0 0 1.00752.2686 -0.0304 H 0 0 0 0 0 0 0 0 0 0 0 0 -1.41822.0293 -0.0265 H 0 0 0 0 0 0 0 0 0 0 0 0 -2.4222 -0.21720.0037 H 0 0 0 0 0 0 0 0 0 0 0 0 -1.0241 -2.26520.0304 H 0 0 0 0 0 0 0 0 0 0 0 0 1 2 2 0 2 3 1 0 3 4 2 0 4 5 1 0 5 6 2 0 6 1 1 0 1 7 1 0 2 8 1 0 3 9 1 0 4 10 1 0 5 11 1 0 6 12 1 0 M END In [20]: print(Chem.MolToMolBlock(fs[1])) RDKit 3D 4 3 0 0 0 0 0 0 0 0999 V2000 -0.0066 -0.00990.2620 N 0 0 0 0 0 0 0 0 0 0 0 0 -0.41360.8845 -0.0859 H 0 0 0 0 0 0 0 0 0 0 0 0 -0.5574 -0.8197 -0.0901 H 0 0 0 0 0 0 0 0 0 0 0 0 0.9775 -0.0549 -0.0860 H 0 0 0 0 0 0 0 0 0 0 0 0 1 2 1 0 1 3 1 0 1 4 1 0 M END In [21]: print(Chem.MolToMolBlock(m)) RDKit 3D 16 15 0 0 0 0 0 0 0 0999 V2000 0.7943 -1.14900.0150 C 0 0 0 0 0 0 0 0 0 0 0 0 1.36070.1298 -0.0022 C 0 0 0 0 0 0 0 0 0 0 0 0 0.58981.2681 -0.0170 C 0 0 0 0 0 0 0 0 0 0 0 0 -0.78661.1450 -0.0149 C 0 0 0 0 0 0 0 0 0 0 0 0 -1.3413 -0.11240.0019 C 0 0 0 0 0 0 0 0 0 0 0 0 -0.5806 -1.26560.0170 C 0 0 0 0 0 0 0 0 0 0 0 0 -0.0066 -0.00990.2620 N 0 0 0 0 0 0 0 0 0 0 0 0 1.4026 -2.04280.0267 H 0 0 0 0 0 0 0 0 0 0 0 0 2.41830.2113 -0.0036 H 0 0 0 0 0 0 0 0 0 0 0 0 1.00752.2686 -0.0304 H 0 0 0 0 0 0 0 0 0 0 0 0 -1.41822.0293 -0.0265 H 0 0 0 0 0 0 0 0 0 0 0 0 -2.4222 -0.21720.0037 H 0 0 0 0 0 0 0 0 0 0 0 0 -1.0241 -2.26520.0304 H 0 0 0 0 0 0 0 0 0 0 0 0 -0.41360.8845 -0.0859 H 0 0 0 0 0 0 0 0 0 0 0 0 -0.5574 -0.8197 -0.0901 H 0 0 0 0 0 0 0 0 0 0 0 0 0.9775 -0.0549 -0.0860 H 0 0 0 0 0 0 0 0 0 0 0 0 1 2 2 0 2 3 1 0 3 4 2 0 4 5 1 0 5 6 2 0 6 1 1 0 1 8 1 0 2 9 1 0 3 10 1 0 4 11 1 0 5 12 1 0 6 13 1 0 7 14 1 0 7 15 1 0 7 16 1 0 M END Best, -greg On Mon, Feb 25, 2019 at 5:54 AM Michal Krompiec wrote: > Hello, > Let mol be a molecule with a conformer with 3D coordinates, consisting of > 2 fragments. Chem.GetMolFrags(mol, asMols=true) returns these fragments as > Molecule objects, but the 3D coordinates are lost. Is there any way to > preserve them? > Best, > Michal > ___ > Rdkit-discuss mailing list > Rdkit-discuss@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/rdkit-discuss > ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
Re: [Rdkit-discuss] Atom coordinates from PDB-file
Hi Illimar, If you need to access coordinates without creating conformer object do you really need to use rdkit I the first place? PDB file is column based format, so extracting coordinates for atoms for example with python is very straightforward. Lukas -- Lukas Pravda, Ph.D. Bioinformatician/Scientific Programmer Protein Data Bank in Europe (PDBe) European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD United Kingdom On 25/02/2019, 10:11, "Illimar Hugo Rekand" wrote: Hello, I am currently trying to access the xyz-coordinates for specific atoms (in a loop) from a .PDB-file. Is there an easy way to do this without creating a conformer of the molecule? all the best, Illimar Rekand Ph.D. Candidate Department of Biomedicine University of Bergen ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
[Rdkit-discuss] Chem.GetMolFrags and 3D coordinates
Hello, Let mol be a molecule with a conformer with 3D coordinates, consisting of 2 fragments. Chem.GetMolFrags(mol, asMols=true) returns these fragments as Molecule objects, but the 3D coordinates are lost. Is there any way to preserve them? Best, Michal ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
[Rdkit-discuss] Atom coordinates from PDB-file
Hello, I am currently trying to access the xyz-coordinates for specific atoms (in a loop) from a .PDB-file. Is there an easy way to do this without creating a conformer of the molecule? all the best, Illimar Rekand Ph.D. Candidate Department of Biomedicine University of Bergen ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
Re: [Rdkit-discuss] Smiles to Mol file
Hi, The "Getting Started" guide is a great place to look for things like this; http://rdkit.org/docs/GettingStartedInPython.html#writing-molecules -greg On Mon, Feb 25, 2019 at 3:06 AM tech tech wrote: > Hi, all, > > Could Rdkit convert smiles file to mol file? > If so, could anyone point out the code to do so? > > Thanks, > > Tom > > > ___ > Rdkit-discuss mailing list > Rdkit-discuss@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/rdkit-discuss > ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
[Rdkit-discuss] Smiles to Mol file
Hi, all, Could Rdkit convert smiles file to mol file? If so, could anyone point out the code to do so? Thanks, Tom ___ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss