Greg,
Thanks for this.
I, for one, think it is useful - not in a "parse this particular smiles
string" fashion.
But consider this use case. I have 8,000,000 molecules in a few hundred
smiles files on which I am calculating descriptors on the cloud. I only
have access to log files.
I get some ten thousand "SMILES Parse Error" without any additional info.
Also, I think this error should be just one line (no need to bloat log
files with redundant static data).
These should have a bit of static info which is the same for both (so you
can grep on that) and must have (on the same line) the offending smiles
string, which you could extract easily with regex, so I suggest something
structured like:
In [2]: Chem.MolFromSmiles('Ccc1XXXcCCC')
[06:06:25] SMILES Parse Error: Ccc1XXXcCCC (reason: unknown atoms X)
In [3]: Chem.MolFromSmiles('C1C')
[06:06:28] SMILES Parse Error: C1C (reason: unclosed ring for input)
In the words of the master, as it stands:
Useless, it is. More info, I need.
>From smiles errors, young Padawan must learn.
Jean-Paul Ebejer
Early Stage Researcher
InhibOx Ltd
Pembroke House
36-37 Pembroke Street
Oxford
OX1 1BP
UK
(+44 / 0) 1865 262 034
This email and any files transmitted with it are confidential and intended
solely for the use of the individual or entity to whom they are addressed.
Any unauthorised dissemination or copying of this email or its attachments,
and any use or disclosure of any information contained in them, is strictly
prohibited and may be illegal. If you have received this email in error
please notify the sender and delete all copies from your system.
We and our group companies accept no liability or responsibility for
personal emails or emails unconnected with our business.
Internet communications including emails and access and use of web sites
cannot be guaranteed to be secure or error free as information can be
intercepted, corrupted, lost or arrive late. Furthermore, while we have
taken steps to control the spread of viruses on our systems, we cannot
guarantee that this email and any files transmitted with it are virus free.
No liability is accepted for any errors, omissions, interceptions, corrupted
mail, lost communications or late delivery arising as a result of receiving
this message via the Internet or for any virus that may be contained in it.
On 10 May 2011 05:14, Greg Landrum <[email protected]> wrote:
> Best,
> -greg
>
------------------------------------------------------------------------------
Achieve unprecedented app performance and reliability
What every C/C++ and Fortran developer should know.
Learn how Intel has extended the reach of its next-generation tools
to help boost performance applications - inlcuding clusters.
http://p.sf.net/sfu/intel-dev2devmay
_______________________________________________
Rdkit-discuss mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss