[Rdkit-discuss] Calculating MorganFingerprint Counts for large number of Molecules

2018-05-18 Thread Jennifer Hemmerich
Hi all, I am trying to calculate Morgan Fingerprints for approximately 100.000 Molecules. I do not want to use the folded FPs. I want to use the counts for the bits which are on and afterwards drop all the infrequent features. So I would ultimately need a dataframe with the counts for each

Re: [Rdkit-discuss] Calculating MorganFingerprint Counts for large number of Molecules

2018-05-18 Thread Jennifer Hemmerich
May 18, 2018 at 9:41 AM Jennifer Hemmerich <jennyhemmeric...@gmail.com <mailto:jennyhemmeric...@gmail.com>> wrote: Hi all, I am trying to calculate Morgan Fingerprints for approximately 100.000 Molecules. I do not want to use the folded FPs. I want to use the counts for

[Rdkit-discuss] BulkTanimotoSimilarity

2018-06-06 Thread Jennifer Hemmerich
Hi all, I was trying to calculate a Similarity Matrix with Morgan Fingerprints and TanimotoSimilarity. If I use DataStructs.TanimotoSimilarity(fp,fp) I get a Simlarity of 1, which I would expect. If I do the same with the DataStructs.BulkTanimotoSimilarity(fps[i], fps[i]) i get a Similarity

Re: [Rdkit-discuss] Inchi/smiles conversion issue

2019-06-18 Thread Jennifer Hemmerich
Dear Alexis, if you calculate the Standard Inchi it is invariant to tautomers (see here: https://www.inchi-trust.org/technical-faq-2/#6.1). Therefore the information which tautomer was converted is lost due to the Inchi conversion. If you want to keep the tautomer information you need to use

Re: [Rdkit-discuss] rdkit.MolStandardize tautomer

2019-07-24 Thread Jennifer Hemmerich
Hi Ansgar, that might be a trivial solution but have you already considered using the Inchi? It is supposed to be invariant to (most) forms of tautomers (https://www.inchi-trust.org/technical-faq-2/#6). Therefore tautomers should have the same InChI Code/Key. Afterwards you would probably

Re: [Rdkit-discuss] Interactive plots in Jupyter notebooks

2020-01-15 Thread Jennifer Hemmerich
Hi Chris, I remember that a colleague of mine once used images of the molecules for that. However he did it in R and so far I also have not really succeeded to redo this as his code is not available anymore (but did also not try too hard). If images are an option you can look here, it