I would definitely expect the SmilesParser to return full-fletched, 
fully connected molecules.

Actually, I don't find it intuitive to need a post-processor for the 
SmilesParser.

So the SMILES parser should:
1. Parse the SMILES into a connectivity table
2. Resolve the connectivity table and deduce bond orders
3. Return a fully connected molecules with correct bond orders

It doesn't need to
1. Detect aromaticity, unless explicitly given in the SMILES
2. Add hydrogens unless explicitly given in the SMILES

I was hoping on fixing the remaining SmilesParser bugs with your new, 
improved atom typing.

Cheers,
Andreas

Egon Willighagen wrote:
> (OK, KMail + Gmail is not a friendly combination... the slighest
> mistype results in an undesired submit :( )
> 
> On 9/25/07, Egon Willighagen <[EMAIL PROTECTED]> wrote:
>> in this email I would like to bring forward a request for comments
>> (RFC) about the SMILES parser.
>>
>> One thing that CDK 1.0 has clarified is the difficulties it has with
>> deducing bond orders, for which it was tried to write a
>> one-size-fits-all algorithm using available atom type lists. This
>> failed. One workarou
> 
> ... One workaround is the DeduceBondOrderTool which takes a rule based
> approach, which will never provide a full coverage, but at least
> succeeds to do the most common things right.
> 
> Getting to the SMILES parser, I would like to request comments on what
> functionality is to be expected from a SMILES parser. To make the
> SMILES parser in the CDK back into a consistent working state again, I
> would like to suggest it will no longer try to resolve bond orders and
> leave that to the user as post-processing. This will fix all standing
> open bugs in the CDK SmilesParser, which are all related to failing
> bond order resolution.
> 
> The SMILES parser should:
> 1. parse the SMILES into a connectivity table
> 2. mark atoms which lower case symbols for steps 2 and 3
> 3. try to resolve bond orders for atom groups which do not include ring atoms
> 4. try to deduce bond orders for atom groups which include ring atoms only for
>   very well defined groups using a rule-based approach
> 5. detect aromaticity for single, isolated ring systems only
> 
> Not really too different from what is happening right now, but cleaner
> and better defined.
> 
> Is the above all we should expect the SMILES parser to do?
> 
> Egon
> 

-- 
Andreas Schüller
PhD student in the research group of Prof. Dr. Gisbert Schneider

Johann Wolfgang Goethe University
Beilstein-endowed Chair for Chemoinformatics
Building B - 3rd floor
Siesmayerstr. 70
60323 Frankfurt am Main
Germany

Tel.: +49 69 798 24879
Fax: +49 69 798 24880
http://www.modlab.de/

-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
Cdk-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/cdk-user

Reply via email to