To keep it 'simple' let's set some assumptions:

- I would like to cover 'only' organic compounds
- I want something that works for 95% of compounds and produces a
clear warning for the remaining 5%

Now, how can I classify a compound as an alkane, acid, ester, etc. using RDKit?
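
To make the question concrete, here is a rough sketch of the kind of
classifier I have in mind. It assumes RDKit's Python bindings are installed.
The SMARTS table, the classify() helper and the warning text are my own
illustrative choices, not anything RDKit ships, and the patterns cover only
a handful of classes:

```python
import warnings

from rdkit import Chem

# Hypothetical, deliberately small SMARTS table (not part of RDKit).
FUNCTIONAL_GROUPS = [
    ("carboxylic acid", "[CX3](=O)[OX2H1]"),
    ("ester", "[CX3](=O)[OX2H0][#6]"),
    ("ketone", "[#6][CX3](=O)[#6]"),
    ("alcohol", "[CX4][OX2H]"),
    # Exclude oxygens attached to a carbonyl carbon so esters are not
    # also reported as ethers.
    ("ether", "[OD2]([#6;!$(C=O)])[#6;!$(C=O)]"),
]
PATTERNS = [(name, Chem.MolFromSmarts(smarts)) for name, smarts in FUNCTIONAL_GROUPS]


def is_alkane(mol):
    """True if every heavy atom is carbon and every bond is a single bond."""
    only_carbon = all(atom.GetAtomicNum() == 6 for atom in mol.GetAtoms())
    only_single = all(
        bond.GetBondType() == Chem.BondType.SINGLE for bond in mol.GetBonds()
    )
    return only_carbon and only_single


def classify(smiles):
    """Return the class labels that match; warn if none do."""
    mol = Chem.MolFromSmiles(smiles)
    if mol is None:
        raise ValueError(f"could not parse SMILES: {smiles!r}")
    if is_alkane(mol):
        return ["alkane"]
    labels = [name for name, patt in PATTERNS if mol.HasSubstructMatch(patt)]
    if not labels:
        # The "remaining 5%" case: say so clearly instead of guessing.
        warnings.warn(f"no functional group class matched for {smiles!r}")
    return labels


if __name__ == "__main__":
    for smi in ["CCCC", "CC(=O)O", "CC(=O)OCC", "CC(C)=O", "CCOCC"]:
        print(smi, classify(smi))
```

A real namer would need far more than this (parent chain selection,
locants, stereo descriptors), but substructure matching looks like the
natural first step.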

On Tue, Jan 14, 2014 at 7:29 PM, David Hall <li...@cowsandmilk.net> wrote:
> Certainly, RDKit can help. Any cheminformatics toolkit that offers
> SMARTS/substructure matching (to quickly classify something as a ketone,
> ester, ether, etc.), bond orders, ring finding, and path finding can help.
>
> As for how hard it is, I think for drug discovery, most would start with the
> Nomenclature of Organic Chemistry:
> http://www.iupac.org/home/publications/provisional-recommendations/under-review-by-the-authors/under-review-by-the-authors-container/nomenclature-of-organic-chemistry.html
> That's only 1306 pages to go through.
>
> After that, there's inorganic chemistry, although sections of that may not
> be handled well by RDKit? I'm not sure about how RDKit handles crystals and
> coordination compounds, etc.
> http://www.iupac.org/nc/home/publications/iupac-books/books-db/book-details.html?tx_wfqbe_pi1[bookid]=5
>
> Since RDKit recently added PDB support, people may be interested in naming
> for peptides and amino acids
> http://www.iupac.org/publications/pac/56/5/0595/
> And cyclic peptides
> http://www.iupac.org/home/publications/provisional-recommendations/under-review-by-the-authors/under-review-by-the-authors-container/nomenclature-of-cyclic-peptides.html
>
> So there's a decent amount of work. I think it is possible, but not a single
> weekend project.
>
> I should mention that many commercial vendors have products for this, off
> the top of my head:
> http://www.acdlabs.com/products/draw_nom/nom/name/
> http://www.chemaxon.com/library/iupac-name/
> https://www.cambridgesoft.com/Ensemble_for_Chemistry/ChemDraw/
> http://www.eyesopen.com/docs/lexichem/current/html/mol2nam.html#chapter-mol2nam
>
> So, at least 4 companies have some sort of algorithm. The only one I've used
> is ChemDraw's, and even then I have only done maybe 5 molecules, so I cannot
> claim that any of these products work.
>
> -David
>
>
>
>
> On Tue, Jan 14, 2014 at 11:04 AM, Michał Nowotka <mmm...@gmail.com> wrote:
>>
>> Yes, I know there is no algorithm for that. This is why I'm wondering
>> how hard it is to write one and how RDKit can help here.
>>
>> On Tue, Jan 14, 2014 at 4:01 PM, Dimitri Maziuk <dmaz...@bmrb.wisc.edu>
>> wrote:
>> > On 1/14/2014 6:09 AM, Michał Nowotka wrote:
>> >> Hi,
>> >>
>> >> Since there is no open source software converting a structure to an IUPAC
>> >> name (I'm not talking about web services), I was wondering if one can
>> >> be implemented using RDKit. Which parts of RDKit would help in doing
>> >> that? Any pointers, suggestions?
>> >
>> > I have a suspicion there's no algorithm for it -- cactus is the only one I
>> > found that does it, and my impression is they're doing a database
>> > search... I'd like to be able to generate systematic names without web
>> > searches, too.
>> >
>> > Dima
>> >
>> >
>> >
>> >
>> > ------------------------------------------------------------------------------
>> > CenturyLink Cloud: The Leader in Enterprise Cloud Services.
>> > Learn Why More Businesses Are Choosing CenturyLink Cloud For
>> > Critical Workloads, Development Environments & Everything In Between.
>> > Get a Quote or Start a Free Trial Today.
>> >
>> > http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk
>> > _______________________________________________
>> > Rdkit-discuss mailing list
>> > Rdkit-discuss@lists.sourceforge.net
>> > https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>>
>>
>
>

