Mea culpa - I hit Reply rather than Reply All and so only sent this to Greg...
On Tue, 23 Oct 2018 at 13:53, Chris Earnshaw <cgearns...@gmail.com> wrote: > Hi Greg > > Apologies again, I'm not trying to stir things up here. As we can see from > some of the the other discussion there's no clear view of what constitutes > aromaticity in these cases. I'm of the school which says that pyridone is > at least somewhat aromatic because, in crude terms, the electronegative > carbonyl oxygen 'steals' the electron from the carbon, the carbon provides > an empty p-orbital to the conjugated ring, and the ring nitrogen provides a > pair of electrons - hence 4n + 2 and aromatic. > > However, the thing that really worries me is that the 'iminopyridine' ring > in n12ccccc1=NCCC2 *should* be perceived in the same way as in > n12ccccc1=NC.CC2 but in practice that doesn't happen - one matches the > pyridine SMARTS c1ccccn1 and the other doesn't. This seems to be > potentially dangerous. The question of 'aromatic or not' is interesting, > but I'm actually more concerned with the consequences for compound > searching and filtering. > > As an approach, rather than simply checking if the exocyclic bond is in > another ring (of any type), would it be possible to check if that other > ring is fully conjugated? If it is, then the Huckel rule could/should be > applied to the fused system to determine aromaticity. If not, then the fact > the substituents form a ring is irrelevant and the potential aromatic > should be treated in the same way as the non-fused analogue. This would > avoid the current inconsistency, but there would no doubt still be some > challenging edge cases... > > Best regards, > Chris > > On Tue, 23 Oct 2018 at 12:43, Greg Landrum <greg.land...@gmail.com> wrote: > >> Dissent is fine, but it's important to remember that there are *always* >> going to be edge cases and that we're not trying to model something >> physically observable here. The concept of aromaticity is primarily there >> to make canonicalization easier. Section 3.4.2 here: >> http://www.daylight.com/dayhtml/doc/theory/theory.smiles.html has more >> info about this, as does the RDKit documentation: >> http://rdkit.org/docs/RDKit_Book.html#aromaticity >> >> I'm willing to change the current behavior, but someone would need to >> explain what it should be changed to in a way that is clear, unambiguous, >> and that would allow a human being looking at the structure to relatively >> easily figure out whether or not a given ring is aromatic. >> >> >> On Tue, Oct 23, 2018 at 1:17 PM Chris Earnshaw <cgearns...@gmail.com> >> wrote: >> >>> Sorry about this, but I think that 'perhaps sub-optimal' should be >>> replaced by 'definitely wrong'. The 'quasi-aromatic' system in these two >>> structures is identical and should behave as such, but in practice one of >>> them matches a pyridine SMARTS pattern and the other doesn't. That >>> shouldn't be affected by whether the saturated substituents form a ring or >>> not. I do appreciate that it gets messy to deal with as fused rings may be >>> fully conjugated, but the current behaviour seems to be disturbingly >>> inconsistent. It would be suboptimal to say that no exocyclic bond is >>> allowed to steal electrons, but that may be better than what's happening >>> here. >>> >>> Apologies for the dissent! >>> >>> Chris Earnshaw >>> >>> >>> >>> On Tue, 23 Oct 2018 at 11:57, Greg Landrum <greg.land...@gmail.com> >>> wrote: >>> >>>> The current implementation requires "exocyclic" bonds to actually be >>>> *non-ring* bonds in order to be recognized as such. >>>> This is perhaps sub-optimal, but it's clearly defined and avoids >>>> arguments about when exactly an "exocyclic" bond starts stealing electrons. >>>> >>>> -greg >>>> >>>> On Tue, Oct 23, 2018 at 12:46 PM Francis Atkinson <fran...@ebi.ac.uk> >>>> wrote: >>>> >>>>> Ian, >>>>> >>>>> I make it 6 electrons: two from the N, none from the C double >>>>> bonded to the exocyclic N, and one each from four other carbons in the >>>>> ring. It's isoelectronic with *e.g.* pyridone, which is aromatic in >>>>> RDKit... >>>>> >>>>> In [1]: from rdkit import Chem >>>>> >>>>> In [2]: Chem.MolToSmiles(Chem.MolFromSmiles('O=c1[nH]cccc1')) >>>>> Out[2]: 'O=c1cccc[nH]1' >>>>> >>>>> The protonated/tautomerised version are indeed aromatic >>>>> (interconverting bewteen these species was actually how I came across this >>>>> issue), but I still reckon the unprotonated bicyclic should be aromatic >>>>> too... >>>>> >>>>> Francis >>>>> On 23/10/2018 11:18, Ian Tickle wrote: >>>>> >>>>> >>>>> Hi, it seems to me that neither is aromatic since the N-substituted >>>>> hetero ring breaks the Huckel rule by having 7 e- (2 from the N and 1 each >>>>> from the 5 Cs). If you remove 1 e- from the N (so it's [n+]) and also >>>>> make >>>>> the external double bond into a single (picking up a proton on the other >>>>> N) >>>>> it becomes pyridinium which is certainly aromatic. >>>>> >>>>> [n+]12ccccc1NCCC2 >>>>> >>>>> [n+]12ccccc1NC.CC2 >>>>> >>>>> What does it make of those? >>>>> >>>>> Cheers >>>>> >>>>> -- Ian >>>>> >>>>> >>>>> On Tue, 23 Oct 2018 at 10:37, Francis Atkinson <fran...@ebi.ac.uk> >>>>> wrote: >>>>> >>>>>> Hello, >>>>>> >>>>>> In the following pair of molecules, the bicyclic is >>>>>> non-aromatic, >>>>>> whereas the 'ring-opened' analogue is aromatic... >>>>>> >>>>>> In [1]: from rdkit import Chem >>>>>> >>>>>> In [2]: Chem.MolToSmiles(Chem.MolFromSmiles('n12ccccc1=NCCC2')) >>>>>> Out[2]: 'C1=CC2=NCCCN2C=C1' >>>>>> >>>>>> In [3]: Chem.MolToSmiles(Chem.MolFromSmiles('n12ccccc1=NC.CC2')) >>>>>> Out[3]: 'CCn1ccccc1=NC' >>>>>> >>>>>> Notebook version: >>>>>> >>>>>> https://nbviewer.jupyter.org/gist/flatkinson/b88eb42510a79594a9e37042eeb7e224 >>>>>> >>>>>> This seems counter-intuitive to me: I don't see why the pyridine in >>>>>> the >>>>>> first molecule shouldn't be aromatic, just as it is in the second. >>>>>> >>>>>> Am I missing something here? >>>>>> >>>>>> Thanks, >>>>>> >>>>>> Francis >>>>>> >>>>>> -- >>>>>> Dr Francis L Atkinson >>>>>> >>>>>> Chemogenomics Group >>>>>> European Bioinformatics Institute (EMBL-EBI) >>>>>> European Molecular Biology Laboratory >>>>>> Wellcome Genome Campus >>>>>> Hinxton >>>>>> Cambridge CB10 1SD >>>>>> United Kingdom >>>>>> >>>>>> (01223) 494473 >>>>>> >>>>>> >>>>>> >>>>>> _______________________________________________ >>>>>> Rdkit-discuss mailing list >>>>>> Rdkit-discuss@lists.sourceforge.net >>>>>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss >>>>>> >>>>> -- >>>>> Dr Francis L Atkinson >>>>> >>>>> Chemogenomics Group >>>>> European Bioinformatics Institute (EMBL-EBI) >>>>> European Molecular Biology Laboratory >>>>> Wellcome Genome Campus >>>>> Hinxton >>>>> Cambridge CB10 1SD >>>>> United Kingdom >>>>> >>>>> (01223) 494473 >>>>> >>>>> _______________________________________________ >>>>> Rdkit-discuss mailing list >>>>> Rdkit-discuss@lists.sourceforge.net >>>>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss >>>>> >>>> _______________________________________________ >>>> Rdkit-discuss mailing list >>>> Rdkit-discuss@lists.sourceforge.net >>>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss >>>> >>>
_______________________________________________ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss