Re: [Rdkit-discuss] single SMARTS for two patterns with Boolean OR

2017-09-19 Thread James T. Metz via Rdkit-discuss
-matches? Cheers, Dante On Tue, Sep 19, 2017 at 9:13 AM, James T. Metz via Rdkit-discuss <rdkit-discuss@lists.sourceforge.net> wrote: Hello, Is it possible to write a single SMARTS for two separate patterns involving a Boolean OR? For example, I want to write a single SMARTS that can mat

[Rdkit-discuss] single SMARTS for two patterns with Boolean OR

2017-09-19 Thread James T. Metz via Rdkit-discuss
Hello, Is it possible to write a single SMARTS for two separate patterns involving a Boolean OR? For example, I want to write a single SMARTS that can match the patterns of [C]-[C] or [N]-[N] I realize that I could write something like [C,N]-[C,N] but that would also match

[Rdkit-discuss] SMARTS for heteroaromatic rings?

2017-09-20 Thread James T. Metz via Rdkit-discuss
Hello, I would like to write a SMARTS that will match all of the individual atoms in all possible heteroaromatic rings. Does anyone know of an elegant, compact way to do this? If one SMARTS will not work, I can concatenate SMARTS using a vertical pipe, "|", as I proposed in an

Re: [Rdkit-discuss] SMARTS for heteroaromatic rings?

2017-09-20 Thread James T. Metz via Rdkit-discuss
just that it's aromatic, you should use [a], so [a]1[a][a][a][a][a]1 would match any 6-membered aromatic ring Jason Biggs On Wed, Sep 20, 2017 at 7:57 PM, James T. Metz via Rdkit-discuss <rdkit-discuss@lists.sourceforge.net> wrote: Hello, I would like to write a SMARTS tha

Re: [Rdkit-discuss] SMARTS for heteroaromatic rings?

2017-09-21 Thread James T. Metz via Rdkit-discuss
]:[a]:1', '[a]:1:[a]:[a]:[a]:[a]:[a]:[a]:1'] The rest is just some calls to MolFromSmarts() and then mol.GetSubstructMatches() for the molecules you want to test. -greg On Thu, Sep 21, 2017 at 3:56 AM, James T. Metz via Rdkit-discuss <rdkit-discuss@lists.sourceforge.net> wrote: Jason, Thanks! I just t

Re: [Rdkit-discuss] single SMARTS for two patterns with Boolean OR

2017-09-19 Thread James T. Metz via Rdkit-discuss
hatic carbon OR an aliphatic nitrogen singly bonded to an aliphatic nitrogen'. Regards, Chris Earnshaw On 19 September 2017 at 15:01, James T. Metz via Rdkit-discuss <rdkit-discuss@lists.sourceforge.net> wrote: Dante, Yes. In principle, if one can figure out all of the

[Rdkit-discuss] 2017.09.1 RDKit release

2017-10-09 Thread James T. Metz via Rdkit-discuss
Many thanks to all for this new RDKit release. I have used commercial software in my past, and I am more and more impressed by the capabilities of RDKit and other open source software. Please keep up the good work! Regards, Jim Metz -Original Message- From:

[Rdkit-discuss] SMARTS pattern matching of canonical forms of aromatic molecules

2017-09-08 Thread James T. Metz via Rdkit-discuss
Hello, Suppose I read in the SMILES of an aromatic molecule e.g., for benzene c1c1 I then want to convert the molecule to a Kekule representation and then perform various SMARTS pattern recognition e.g. [C]=[C]-[C] I have tried various Kekule commands in

[Rdkit-discuss] how to output multiple Kekule structures

2017-09-11 Thread James T. Metz via Rdkit-discuss
Hello, Suppose I read in an aromatic SMILES e.g., for benzene c1c1 I would like to generate the major canonical resonance forms and save the results as two separate molecules. Essentially I am trying to generate m1 = 'C1=CC=CC-C1' m2 = 'C1C=CC=CC1' Can

Re: [Rdkit-discuss] how to output multiple Kekule structures

2017-09-11 Thread James T. Metz via Rdkit-discuss
you trying to do? -greg On Mon, Sep 11, 2017 at 5:04 PM, James T. Metz via Rdkit-discuss <rdkit-discuss@lists.sourceforge.net> wrote: Hello, Suppose I read in an aromatic SMILES e.g., for benzene c1c1 I would like to generate the major canonical resonance for

Re: [Rdkit-discuss] how to output multiple Kekule structures

2017-09-11 Thread James T. Metz via Rdkit-discuss
i Jim, The code currently has no way to enumerate Kekule structures. I don't recall this coming up in the past and, to be honest, it doesn't seem all that generally useful. Perhaps there's an alternate way to solve the problem; what are you trying to do? -greg On Mon,

Re: [Rdkit-discuss] Rdkit-discuss Digest, Vol 119, Issue 6

2017-09-07 Thread James T. Metz via Rdkit-discuss
his. R6 means in 6 rings. r6 means in ring of size 6. http://www.daylight.com/dayhtml/doc/theory/theory.smarts.html TJ O'Donnell On Wed, Sep 6, 2017 at 4:34 PM, James T. Metz via Rdkit-discuss < rdkit-discuss@lists.sourceforge.net> wrote: > Hello, > > Given the following SMILES for a

[Rdkit-discuss] Fwd: Need SMARTS to distinguish 6-ring vs macrocyclic ether oxygens

2017-09-06 Thread James T. Metz via Rdkit-discuss
Hello, Given the following SMILES for a macrocyclic hexaose OCC1OC2OC3C(CO)OC(OC4C(CO)OC(OC5C(CO)OC(OC6C(CO)OC(OC7C(CO)OC(OC1C(O)C2O)C(O)C7O)C(O)C6O)C(O)C5O)C(O)C4O)C(O)C3O can anyone suggest a SMARTS pattern that will distinguish ether oxygens in the smaller 6-membered

Re: [Rdkit-discuss] how to output multiple Kekule structures

2017-09-11 Thread James T. Metz via Rdkit-discuss
C1C=CC=CC=1 C1=CC=CC=C1 Best, Paolo On 09/11/2017 05:22 PM, James T. Metz via Rdkit-discuss wrote: Greg, Thanks! Yes, very helpful

Re: [Rdkit-discuss] need SMARTS query with a specific exclusion

2017-09-24 Thread James T. Metz via Rdkit-discuss
6-ring aromatic pattern a:1:a:a:a:a:a:1, with recursive SMARTS applied to the first atom to ensure that this can't match any of the 6 ring atoms in your undesired system. Regards, Chris Earnshaw On 24 September 2017 at 05:04, James T. Metz via Rdkit-discuss <rdkit-discuss@lists.sourceforge.net

[Rdkit-discuss] suggestions for comprehensive searchable database of natural products

2017-11-27 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group, My apologies in advance if my request is not appropriate for this discussion group. Given a small molecule that might have some resemblance to natural products, can someone suggest a free, comprehensive, PYTHON/RDkit searchable database of natural products

[Rdkit-discuss] ctest

2017-11-05 Thread James T. Metz via Rdkit-discuss
RDkit Experts, What is ctest? Is it fully documented somewhere? I am running PYTHON 3.5.2/RDkit 2017.03.1 on WINDOWS 7 via Pycharm 2017.2.3 Is it possible to run ctest in this environment? ctest seems to be a good way to exercise and test many of the capabilities of RDkit,

[Rdkit-discuss] Python code to merge tuples from a SMARTS match

2017-11-07 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group, I have written a SMARTS to detect vicinal chlorine groups using RDkit. There are 4 atoms involved in a vicinal chlorine group. SMARTS = '[Cl]-[C,c]-,=,:[C,c]-[Cl]' I am trying to count the number of ("unique") occurrences of this pattern. For some

[Rdkit-discuss] transformation of an atom or group of atoms by atom indices

2017-11-09 Thread James T. Metz via Rdkit-discuss
RDKit Discussion Group, Suppose I have a molecule smiles1 = '' The carbon atoms will be assigned indices 0, 1, 2, and 3. Suppose I want to specifically change carbon 3 to a hydrogen. Is this possible using RDkit? I am aware of using SMARTS to match a pattern and

Re: [Rdkit-discuss] transformation of an atom or group of atoms by atom indices

2017-11-09 Thread James T. Metz via Rdkit-discuss
Chem.MolToSmiles(mol) Out[12]: '[H]CCC' In [13]: Chem.MolToSmiles(Chem.RemoveHs(mol)) Out[13]: 'CCC' Jason Biggs On Thu, Nov 9, 2017 at 3:12 PM, James T. Metz via Rdkit-discuss <rdkit-discuss@lists.sourceforge.net> wrote: RDKit Discussion Group, S

Re: [Rdkit-discuss] Python code to merge tuples from a SMARTS match

2017-11-08 Thread James T. Metz via Rdkit-discuss
t = (match[1], match[2]) if match[1] < match[2] ): t = (match[1], match[2]) else: t = (match[2], match[1]) d[t] = match You will wind up with as many dictionary elements as there are matches. -P. On Tue, Nov 7, 2017 at 7:38 PM, James T. Metz via Rdkit-dis

Re: [Rdkit-discuss] Python code to merge tuples from a SMARTS match

2017-11-08 Thread James T. Metz via Rdkit-discuss
(Cl)(Cl)(Cl)' >>> count_unique_substructures(smiles1, SMARTS) 1 >>> count_unique_substructures(smiles2, SMARTS) 1 -Brian On Tue, Nov 7, 2017 at 7:38 PM, James T. Metz via Rdkit-discuss <rdkit-discuss@lists.sourceforge.net> wrote: RDkit Discussion Group, I have wri

[Rdkit-discuss] RDkit ionization state prediction at a given pH

2018-02-02 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group, Can RDkit predict the ionization state of a molecule at a given pH by calculating the pKa, predominant species at that pH, etc? I have already checked the RDkit discussion email archive, and per my searching there was a response from Greg Landrum about

[Rdkit-discuss] automatic generation of reaction SMARTS in RDkit

2018-02-01 Thread James T. Metz via Rdkit-discuss
RDkit discussion group, Is it possible to automatically generate reaction SMARTS in RDkit from reactant(s) and product(s)? If so, are there various "trims" that can be performed to focus on the reaction center going out certain number of atoms which could be used to specify reaction

[Rdkit-discuss] descriptors beyond rotatable bond count and possible correlations with entropy

2018-08-27 Thread James T. Metz via Rdkit-discuss
RDkit users, Is there a RDkit descriptor (or code) to determine the largest number of contiguous rotatable bonds in a small molecule? Hmmm... it seems likely that ligand conformational flexibility might be somehow related to the entropy component of ligand binding. Has anyone made

[Rdkit-discuss] How to get more CPU utilization for RDkit calculations

2018-07-06 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group, I am running the RDkit diverse max min picker code on my WINDOWS 7 64 bit 4 dual core I7 computer. I have tested the code on a small set of compounds and the results are OK. I am now running diverse subset picking on an SDF containing about 40k compounds.

[Rdkit-discuss] suggestions or ideas for SMILES for an electron?

2018-02-28 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group, I am trying to get RDkit to recognize reactants and products in RedOx reactions including reactions which include explicit electrons using SMILES as patterns. Does anyone have any suggestions or ideas for how to deal with an explicit electron? I have tried:

[Rdkit-discuss] Interest in a RDkit UGM in the USA midwest?

2018-04-10 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group, Is there interest in having a RDkit UGM in the USA midwest, perhaps somewhere in the Chicago area? I would recommend a similar format to the European UGM and I would insist that presenters deposit their slides and code on the RDkit GitHub site. Regards,

[Rdkit-discuss] appropriate literature reference for RDkit 2017.03.1

2018-03-27 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group, What is an appropriate literature reference for RDkit 2017.03.1? Thank you. Regards, Jim Metz Northwestern University -- Check out the vibrant tech community on one of

[Rdkit-discuss] How to encode atomic contributions to logP (hydrophobicity) in MOL2 formatted charge slot?

2018-11-13 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group,     Given a set of small molecules as a SDF file, I would like to generate a MOL2 file where the atomic contributions to logP (hydrophobicity) from each atom including hydrogens have been calculated and are now encoded in the partial atomic charge"slot" in a MOL2 file. 

[Rdkit-discuss] Butina clustering with additional output

2018-09-20 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group,     I note that RDkit can perform Butina clustering.  Given an SDF ofsmall molecules I would like to cluster the ligands, but obtain additionalinformation from the clustering algorithm.  In particular, I would like to obtainthe cluster number and Tanimoto distance from

Re: [Rdkit-discuss] atomic contributions to molecular polarizability

2019-01-10 Thread James T. Metz via Rdkit-discuss
Carlos,     Thank you for your suggestion.  However, rather than wrestle with the technical details of obtaining polarizabilities from new quantum mechanics schemes, I am simply interested in extracting the atomic contributions that are apparently already available for adescriptor called apol

[Rdkit-discuss] atomic contributions to molecular polarizability

2019-01-10 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group,     I would like to calculate and be able to visualize the atomic contributionsto the total molecular polarizability of small organic molecules.  Apparentlythere is a molecular descriptor, apol, that is the sum total from the atomiccontributions to polarizability

Re: [Rdkit-discuss] atomic contributions to molecular polarizability

2019-01-11 Thread James T. Metz via Rdkit-discuss
-discuss] atomic contributions to molecular polarizability Hi Jim, On Thu, Jan 10, 2019 at 11:59 PM James T. Metz via Rdkit-discuss wrote:     I would like to calculate and be able to visualize the atomic contributionsto the total molecular polarizability of small organic molecules

[Rdkit-discuss] Structure-Based Drug Design

2019-03-16 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group,     RDkit has quite a number of useful tools and algorithms for ligand-based drug design (LBDD).  However, what about structure-based drug design (SBDD)?   Perhaps a few questions to motivate the discussion.     1) Since RDkit supposedly includes the force field MMFF, does

[Rdkit-discuss] AM1-BCC charges for small molecules

2019-03-11 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group,     I am interested in generating and assigning AM1-BCC charges to small molecules, preferably in batch mode.  I understand this topic has been discussed previously, buthas there been RDkit code written to do this?  Since this relies on the results of AM1 calculations, 

[Rdkit-discuss] molecular dynamics using RDkit only

2019-04-13 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group,     I am aware of RDkit scripts that use the MMFF force field to minimize smallmolecules.  Has anyone written RDkit code to perform molecular dynamics (MD) of small molecules or protein-ligand complexes using only RDkit and existingRDkit force fields.  I am aware of a

[Rdkit-discuss] calculation of spherical harmonics coefficients for 3D molecules?

2020-08-03 Thread James T. Metz via Rdkit-discuss
RDkit Discussion Group,     Has anyone implemented code to calculate spherical harmonicscoefficients for 3D conformations of molecules?  The approach I havein mind is described in Morris, R.J. et al., Real Spherical Harmonic expansion coefficients for protein binding pocket and ligand