Hi Hongbin, Thank you for your reply. I will take your suggestions as a good starting point. I too was afraid that I underestimated the difficulty of the task...
That's an interesting point regarding the examples in the recited paper; one of the cited papers (https://pubs.acs.org/doi/10.1021/ci300206e) seemed to suggest that the matrices were made "with aid of RDKit," but I may have read too much into that. I did know that StarDrop could produce such tables, but it has some limitations (with respect to my specific needs) that motivated me to try to devise a more customized solution. I just learned (like an hour ago) that my organization has a license to this software, so I will see if I can live with its limitations in the short term while I work on my own solution. Thanks again! Ken On Sun, Aug 18, 2019, at 11:27 AM, Hongbin Yang wrote: > > Hi Ken, > > I am afraid that you may have underestimated the amount of your demand. At > least it includes R group decomposition (and it seems that only two > substitutes are considered in your example), drawing molecules, and rendering > the “table”. > > You can use RDKit to decompose the compounds. For example, use maximum common > substructure to setup the scaffold, followed by identification of the > connecting points by comparing the the compounds and the scaffold. > > As for drawing molecules, I think RDKit cannot draw the substituent group > (I’m not sure) like what ChemDraw does. > > If you just want to analyse the compounds by R-group decomposition, I believe > some kinds of commercial software such as Schrodinger and StarDrap are pretty > good. > > And as for the example in the paper by Gupta-Ostermann et al. I guess that > they drew the figures via ChemDraw and Powerpoint rather than scripts. > > Best regards, > > > > Hongbin Yang 杨弘宾, Ph.D. > Research: Toxicophore and Chemoinformatics > On 08/18/2019 20:23,ken<k...@postinbox.com> wrote: >> Hello, >> >> I am trying to build a 2-D R-group grid (or table, or spreadsheet), where >> the row headers contain R1 values and the column headers contain R2 values >> (or vice versa). Compounds that have given R1 and R2 groups would be >> represented on the table as a filled cell that intersects those R1 and R2. >> For example, the input could be an SD file containing the following three >> compounds: >> >> The desired output grid from the sd file would look something like this ("Y" >> can be replaced with cell formatting or some other indicator): >> >> >> >> The closest thing to this that I have been able to find is the "SAR Matrix" >> (https://f1000research.com/articles/3-113/v2), but the code that was used to >> generate the matrices does not appear to be available. Does anyone happen to >> have such code or know how I can generate it? I imagine the first step would >> be to perform an R-group decomposition, but I'm not sure what to do from >> there. >> >> I started to see if I could build the program from scratch, but then I >> thought that someone must've done this before and I shouldn't needlessly >> reinvent it. I've been (re)learning Python for the past year or so and I >> *think* I have a pretty good handle on the language, but I wouldn't mind >> putting said learning to the test on a "real" project, so if anyone has a >> solution that outputs something that even vaguely resembles the desired >> grid/matrix, maybe I can modify it to fit my needs. >> >> At some point, I would need the grid to be editable in Word, but I'll cross >> that bridge when I get to it... >> >> Thank you in advance for your help, >> Ken > > *Attachments:* > * r_matrix example01.png > * r_matrix example02.png
_______________________________________________ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss