Dissent is fine, but it's important to remember that there are *always*
going to be edge cases and that we're not trying to model something
physically observable here. The concept of aromaticity is primarily there
to make canonicalization easier. Section 3.4.2 here:
http://www.daylight.com/dayhtml/doc/theory/theory.smiles.html has more info
about this, as does the RDKit documentation:
http://rdkit.org/docs/RDKit_Book.html#aromaticity

I'm willing to change the current behavior, but someone would need to
explain what it should be changed to in a way that is clear, unambiguous,
and that would allow a human being looking at the structure to relatively
easily figure out whether or not a given ring is aromatic.


On Tue, Oct 23, 2018 at 1:17 PM Chris Earnshaw <cgearns...@gmail.com> wrote:

> Sorry about this, but I think that 'perhaps sub-optimal' should be
> replaced by 'definitely wrong'. The 'quasi-aromatic' system in these two
> structures is identical and should behave as such, but in practice one of
> them matches a pyridine SMARTS pattern and the other doesn't. That
> shouldn't be affected by whether the saturated substituents form a ring or
> not. I do appreciate that it gets messy to deal with as fused rings may be
> fully conjugated, but the current behaviour seems to be disturbingly
> inconsistent. It would be suboptimal to say that no exocyclic bond is
> allowed to steal electrons, but that may be better than what's happening
> here.
>
> Apologies for the dissent!
>
> Chris Earnshaw
>
>
>
> On Tue, 23 Oct 2018 at 11:57, Greg Landrum <greg.land...@gmail.com> wrote:
>
>> The current implementation requires "exocyclic" bonds to actually be
>> *non-ring* bonds in order to be recognized as such.
>> This is perhaps sub-optimal, but it's clearly defined and avoids
>> arguments about when exactly an "exocyclic" bond starts stealing electrons.
>>
>> -greg
>>
>> On Tue, Oct 23, 2018 at 12:46 PM Francis Atkinson <fran...@ebi.ac.uk>
>> wrote:
>>
>>> Ian,
>>>
>>>     I make it 6 electrons: two from the N, none from the C double bonded
>>> to the exocyclic N, and one each from four other carbons in the ring. It's
>>> isoelectronic with *e.g.* pyridone, which is aromatic in RDKit...
>>>
>>> In [1]: from rdkit import Chem
>>>
>>> In [2]: Chem.MolToSmiles(Chem.MolFromSmiles('O=c1[nH]cccc1'))
>>> Out[2]: 'O=c1cccc[nH]1'
>>>
>>>     The protonated/tautomerised version are indeed aromatic
>>> (interconverting bewteen these species was actually how I came across this
>>> issue), but I still reckon the unprotonated bicyclic should be aromatic
>>> too...
>>>
>>>         Francis
>>> On 23/10/2018 11:18, Ian Tickle wrote:
>>>
>>>
>>> Hi, it seems to me that neither is aromatic since the N-substituted
>>> hetero ring breaks the Huckel rule by having 7 e- (2 from the N and 1 each
>>> from the 5 Cs).  If you remove 1 e- from the N (so it's [n+]) and also make
>>> the external double bond into a single (picking up a proton on the other N)
>>> it becomes pyridinium which is certainly aromatic.
>>>
>>> [n+]12ccccc1NCCC2
>>>
>>> [n+]12ccccc1NC.CC2
>>>
>>> What does it make of those?
>>>
>>> Cheers
>>>
>>> -- Ian
>>>
>>>
>>> On Tue, 23 Oct 2018 at 10:37, Francis Atkinson <fran...@ebi.ac.uk>
>>> wrote:
>>>
>>>> Hello,
>>>>
>>>>      In the following pair of molecules, the bicyclic is non-aromatic,
>>>> whereas the 'ring-opened' analogue is aromatic...
>>>>
>>>> In [1]: from rdkit import Chem
>>>>
>>>> In [2]: Chem.MolToSmiles(Chem.MolFromSmiles('n12ccccc1=NCCC2'))
>>>> Out[2]: 'C1=CC2=NCCCN2C=C1'
>>>>
>>>> In [3]: Chem.MolToSmiles(Chem.MolFromSmiles('n12ccccc1=NC.CC2'))
>>>> Out[3]: 'CCn1ccccc1=NC'
>>>>
>>>> Notebook version:
>>>>
>>>> https://nbviewer.jupyter.org/gist/flatkinson/b88eb42510a79594a9e37042eeb7e224
>>>>
>>>> This seems counter-intuitive to me: I don't see why the pyridine in the
>>>> first molecule shouldn't be aromatic, just as it is in the second.
>>>>
>>>> Am I missing something here?
>>>>
>>>>      Thanks,
>>>>
>>>>          Francis
>>>>
>>>> --
>>>> Dr Francis L Atkinson
>>>>
>>>> Chemogenomics Group
>>>> European Bioinformatics Institute (EMBL-EBI)
>>>> European Molecular Biology Laboratory
>>>> Wellcome Genome Campus
>>>> Hinxton
>>>> Cambridge CB10 1SD
>>>> United Kingdom
>>>>
>>>> (01223) 494473
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> Rdkit-discuss mailing list
>>>> Rdkit-discuss@lists.sourceforge.net
>>>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>>>>
>>> --
>>> Dr Francis L Atkinson
>>>
>>> Chemogenomics Group
>>> European Bioinformatics Institute (EMBL-EBI)
>>> European Molecular Biology Laboratory
>>> Wellcome Genome Campus
>>> Hinxton
>>> Cambridge CB10 1SD
>>> United Kingdom
>>>
>>> (01223) 494473
>>>
>>> _______________________________________________
>>> Rdkit-discuss mailing list
>>> Rdkit-discuss@lists.sourceforge.net
>>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>>>
>> _______________________________________________
>> Rdkit-discuss mailing list
>> Rdkit-discuss@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>>
>
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to