Sorry my fault. The SmilesGenerator removed the stereochemistry. Just noticed that I have to use .absolute() instead of .unique()

Am 03.09.2015 um 12:36 schrieb Martin Gütlein:
Hi John,

Awesome, thanks. SuppressHydrogens works good, but it suppresses Hs with stereochem-information as well. Can this be avoided?

> although possible you should perhaps store and index (unique check) you structures separately.
Thats what we try to not do, if possible.

Martin


Am 03.09.2015 um 11:32 schrieb John M:
Hi Martin,

By design CDK separates out standardisation (H rep, tautomers, protonation) from canonicalisation (ordering). You've found the method to "sprout" hydrogens but you actually want the opposite - suppressHydrogens(mol) <http://cdk.github.io/cdk/1.5/docs/api/org/openscience/cdk/tools/manipulator/AtomContainerManipulator.html#suppressHydrogens%28org.openscience.cdk.interfaces.IAtomContainer%29>.

I think you might be conflating two things, although possible you should perhaps store and index (unique check) you structures separately. For your example you could assign a unique tautomer but you'll be back at square one with your first example.

O=C[CH]1Cc2[nH]cnc2CC1 -> 1,3 proton shift -> C(=O)[CH]1Cc2c(CC1)[nH]cn2
CC(=O)N -> 1,3 proton shift -> CC(O)=N

Thanks,


Regards,
John W May
john.wilkinson...@gmail.com <mailto:john.wilkinson...@gmail.com>

On 3 September 2015 at 09:27, Martin Gütlein <guetl...@uni-mainz.de <mailto:guetl...@uni-mainz.de>> wrote:

    One more thing:
    I noticed that unique SMILES differentiate explicit and implicit
    Hydrogens, e.g. "[H]Cl" is different form "Cl". This can be
    solved by running
    AtomContainerManipulator.convertImplicitToExplicitHydrogens(mol).
    However, I do not like having my all Hs defined explicitly. Is
    there an option in the CDK to convert explict Hs back to
    implicit, leaving only thoses Hs as explict that are relevant?

    Martin


    Am 03.09.2015 um 09:48 schrieb Martin Gütlein:
    Hi John,

    thanks for your reply, I tried to use unique (kekulized) SMILES
    instead of InChIs.
    Whats good is that the structure for (most) compounds is stored
    correctly (i.e., I can create an IAtomContainer that is
    apparently equal).

    However, I found an example were the unique SMILES of two
    identical structures is different (see below).

    Kind regards,
    Martin


        for (String smi : new String[] { "O=C[CH]1Cc2[nH]cnc2CC1",
    "C(=O)[CH]1Cc2c(CC1)[nH]cn2" })
            {
                IAtomContainer mol = new
    SmilesParser(SilentChemObjectBuilder.getInstance()).parseSmiles(smi);
    System.out.println(SmilesGenerator.unique().create(mol));
            }






    Am 02.09.2015 um 20:58 schrieb John M:
    Just to add on - if you really want to use InChI (don't) then
    you could store the AuxInfo but the CDK doesn't have a
    conversion method that accepts it when turning it back into an
    AtomContainer.

    I also notice you're using unique SMILES (default by old APIs),
    you probably want isomeric that a non-canonical but store
    stereochemistry.

    IAtomContainer mol = SmilesGenerator.isomeric().create(container);

    John

    Regards,
    John W May
    john.wilkinson...@gmail.com <mailto:john.wilkinson...@gmail.com>

    On 2 September 2015 at 19:54, John M
    <john.wilkinson...@gmail.com> wrote:

        Hi Martin,

        The InChI is an identifier and not a structure
        representation it should never be used as such. For maximum
        preservation you should store compounds as Kekulé SMILES or
        Molfile. You can store additional data such as coordinates
        supplementary to the SMILES.

        You might find a recent presentation by Noel (O Babel) and
        Rajarshi (CDK) useful:
        
http://baoilleach.blogspot.co.uk/2015/08/the-whole-of-cheminformatics-best.html

        John




-- Dr. Martin Gütlein
    Phone:
    +49 (0)6131 39 23336 <tel:%2B49%20%280%296131%2039%2023336>  (office)
    +49 (0)177 623 9499 <tel:%2B49%20%280%29177%20623%209499>  (mobile)
    Email:
    guetl...@uni-mainz.de <mailto:guetl...@uni-mainz.de>



--
Dr. Martin Gütlein
Phone:
+49 (0)6131 39 23336 (office)
+49 (0)177 623 9499 (mobile)
Email:
guetl...@uni-mainz.de


--
Dr. Martin Gütlein
Phone:
+49 (0)6131 39 23336 (office)
+49 (0)177 623 9499 (mobile)
Email:
guetl...@uni-mainz.de

------------------------------------------------------------------------------
Monitor Your Dynamic Infrastructure at Any Scale With Datadog!
Get real-time metrics from all of your servers, apps and tools
in one place.
SourceForge users - Click here to start your Free Trial of Datadog now!
http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140
_______________________________________________
Cdk-user mailing list
Cdk-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/cdk-user

Reply via email to