Dear all,

I could successfully use the code from Lukas (thanks again!) for scoring the 2D depictions of a molecule and retrieve the "best" one.

However, I noticed that the depictions seem to be dependent on the input format? (i.e. SDF or Smiles).

Since I remove any input conformer before I compute a depiction anyway,  I thought this could have something to do with the atom index order.

However, it does not look this is the case, as this gist shows:

https://gist.github.com/jose-manuel/bc130351c4979d2c0b165f84a8469d2a

I even get a new depiction for CoordGen when converting sdf -> smiles -> sdf, so I get three different results that are reproducible.

Would anyone have an idea why this is happening? Am I missing something obvious?

Cheers,
Jose Manuel

On 10.04.19 17:20, Lukas Pravda wrote:

Sorry guys for the late reply,

The code is available from here: https://gitlab.ebi.ac.uk/pdbe/ccdutils/blob/master/pdbeccdutils/core/depictions.py Hope you will find any use of it. The class you are after is DepictionValidator. Presently, I have a few changes on the development branch, which are soon to be merged to master (basically I started to calculate angle between two bonds if they share common atom because of this http://www.ebi.ac.uk/pdbe-srv/pdbechem/chemicalCompound/show/CPT).

Let me know if you would have any questions, or suggestions.

Lukas

*From: *Thomas Evangelidis <teva...@gmail.com>
*Date: *Tuesday, 9 April 2019 at 15:43
*To: *Lukas Pravda <lpra...@ebi.ac.uk>
*Cc: *Jose Manuel Gally <jose.manuel.ga...@gmail.com>, RDKIT mailing list <rdkit-discuss@lists.sourceforge.net> *Subject: *Re: [Rdkit-discuss] Which method to prefer for computing 2D coordinates

Hello Lukas,

I am also struggling with 2D coordinate generation quite a long time as well as what criteria to use for choosing the most appropriate. Therefore, I would be very interest to use your code for 2D coordinate selection.

With best regards,

Thomas

PS: very nice notebook Jose. I also wanted to write something similar but never really found enough time to finish it.

On Tue, 9 Apr 2019 at 16:31, Lukas Pravda <lpra...@ebi.ac.uk <mailto:lpra...@ebi.ac.uk>> wrote:

    Hi Jose,

    As you have shown there is no single method which would be perfect
    for everything. If you don’t care that much about speed, the
    possible solution could be to compute coordinates with all three
    approaches and then simply select the best conformer based on some
    criteria.

    The solution I use is to generate 2D coordinates using multiple
    approaches and then I have a set of methods which computes number
    of bond collisions and atoms being close to each other using
    KD-tree. Altogether this all is expressed as penalty score, where
    the lower is better.

    Should you need any code, let me know.

    Lukas

    On 09/04/2019, 14:35, "Jose Manuel Gally"
    <jose.manuel.ga...@gmail.com <mailto:jose.manuel.ga...@gmail.com>>
    wrote:

        Dear all,

        This might sound naive, but I want to compute 2D coordinates
    for a set
        of molecules.

        For now I am considering the 3 methods below [1].

        I was wondering if there was any recommendation to use one
    method over
        another in some cases?

        For instance, very large rings are not displayed round for
    CoordGen but
        sometimes this method performs worse than the default (AllChem).

        Computational time is not really an issue here as I generate
    those
        coordinates on the fly for a very small set of compounds.

        Here is a gist with a few examples:
    https://gist.github.com/jose-manuel/0f2a5e8eae8bf2a72c0faad7f2f2a263

        Thanks in advance, any suggestion is welcome!

        Cheers,
        Jose Manuel

        [1] Methods:

        1) rdkit.AllChem.Compute2dCoors (equivqlent to
        rdkit.Chem.rdDepictor.Compute2DCoords)
        2) rdkit.Avalon.pyAvalonTools.Generate2DCoords
        3) rdkit.Chem.rdCoordGen.AddCoords + rescale





        _______________________________________________
        Rdkit-discuss mailing list
    Rdkit-discuss@lists.sourceforge.net
    <mailto:Rdkit-discuss@lists.sourceforge.net>
    https://lists.sourceforge.net/lists/listinfo/rdkit-discuss





    _______________________________________________
    Rdkit-discuss mailing list
    Rdkit-discuss@lists.sourceforge.net
    <mailto:Rdkit-discuss@lists.sourceforge.net>
    https://lists.sourceforge.net/lists/listinfo/rdkit-discuss


--

======================================================================

Dr Thomas Evangelidis

Research Scientist

IOCB - Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences <https://www.uochb.cz/web/structure/31.html?lang=en>

Prague, Czech Republic

&

CEITEC - Central European Institute of Technology <https://www.ceitec.eu/>
Brno, Czech Republic

email: teva...@gmail.com <mailto:teva...@gmail.com>

website: https://sites.google.com/site/thomasevangelidishomepage/

_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to