Re: [Rdkit-discuss] Atom Symbol Case in MolFile?

2016-01-19 Thread Paul Emsley

Hi Greg,

Thanks for that.

Why do I ask?  Because the sdf files [1] distributed by the wwPDB, such 
as this one:


http://www.rcsb.org/pdb/files/ligand/CQ8_ideal.sdf

from this page:

http://www.rcsb.org/pdb/ligand/ligandsummary.do?hetId=CQ8

are upper-cased.  I didn't know whether that was right or not (and, as 
you imply, RDKit will not parse it).  I'll get in touch with them and 
see if they can get it changed.


Paul.

[1] I thought that they were molfiles when I wrote the mail - and I 
suppose the same thinking applies.


On 18/01/2016 20:59, Greg Landrum wrote:

Hi Paul,

ctfile.pdf says: "entry in periodic table"
I interpret this to mean "Cl", since that's what one finds in the 
typical periodic table.


The RDKit, curiously, agrees with me. ;-)


:-)



-greg


On Mon, Jan 18, 2016 at 8:45 AM, Paul Emsley 
> wrote:



Dear RDKitters,

Is is clear what the case of the second character of an atom symbol in
the atom block of a MolFile should be? i.e. "CL", "Cl" or either?

Thanks,

Paul.



--
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140
___
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net

https://lists.sourceforge.net/lists/listinfo/rdkit-discuss




--
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140___
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss


Re: [Rdkit-discuss] Atom Symbol Case in MolFile?

2016-01-19 Thread Greg Landrum
Paul,

On Tue, Jan 19, 2016 at 7:59 PM, Paul Emsley 
wrote:

>
> Thanks for that.
>
> Why do I ask?  Because the sdf files [1] distributed by the wwPDB, such as
> this one:
>
> http://www.rcsb.org/pdb/files/ligand/CQ8_ideal.sdf
>
> from this page:
>
> http://www.rcsb.org/pdb/ligand/ligandsummary.do?hetId=CQ8
>
> are upper-cased.  I didn't know whether that was right or not (and, as you
> imply, RDKit will not parse it).  I'll get in touch with them and see if
> they can get it changed.
>

It's an important data source, so it would be great if they were supplying
data that's correctly formatted (assuming, of course, that my reading of
that "spec" is correct). In the meantime, it would be pretty easy to modify
the RDKit to handle these cases correctly when the "strictParsing" option
is set to false. I'll add a github issue for this and get it in there.


> [1] I thought that they were molfiles when I wrote the mail - and I
> suppose the same thinking applies.
>

Yeah, the format of the CTAB piece is identical for mol files and SDFs.

-greg
--
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140___
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss