Also worth considering align-it
<http://silicos-it.be.s3-website-eu-west-1.amazonaws.com/software/align-it/1.0.4/align-it.html>
in this regard (also from silicos-it).
This focusses on the pharmacophoric features rather than the shape which
may be more sensible depending on the use case.
Seems pretty quick *shrug*.

J.



On Wed, 4 Nov 2020 at 19:12, Greg Landrum <greg.land...@gmail.com> wrote:

>
>
> On Wed, 4 Nov 2020 at 18:27, Mark Mackey <m...@cresset-group.com> wrote:
>
>>  I did look at Shape-It way back when Silicos open-sourced it: as far as
>> I can remember the code looked clean enough but it was slow. Unfortunately
>> from the RDKit point of view it’s LGPL so can’t be used as the basis of an
>> RDKit shape algorithm.
>>
> Hans actually gave permission to relicense an RDKit shape-it port. There
> is a very old PR that made some progress on this but we never finished it
> since the performance was just no good. I can’t find that PR/fork at the
> moment, but if someone thinks that may be a starting point (I’m skeptical)
> I can look
>
>
>
>
>>
>> Regards,
>>
>> Mark
>>
>>
>>
>> *From:* Chris Swain <sw...@mac.com>
>> *Sent:* 04 November 2020 15:56
>> *To:* rdkit-discuss@lists.sourceforge.net; Mark Mackey <
>> m...@cresset-group.com>
>> *Subject:* Re: Rdkit-discuss Digest, Vol 157, Issue 2
>>
>>
>>
>> Hi Mark,
>>
>>
>>
>> Have you ever looked at Optipharm for shape comparison?
>>
>>
>>
>> https://www.nature.com/articles/s41598-018-37908-6
>>
>>
>>
>> Or Shape-it
>>
>>
>>
>>
>> http://silicos-it.be.s3-websiteu-west-1.amazonaws.com/software/shape-it/1.0.1/shape-it.html
>>
>>
>>
>>
>>
>> Cheers
>>
>>
>>
>> Chris
>>
>>
>>
>>
>>
>>
>>
>> On 4 Nov 2020, at 14:28, rdkit-discuss-requ...@lists.sourceforge.net
>> wrote:
>>
>>
>>
>> From: Mark Mackey <m...@cresset-group.com>
>> To: Lewis Martin <lewis.marti...@gmail.com>, RDKit Discuss
>>               <rdkit-discuss@lists.sourceforge.net>
>> Subject: Re: [Rdkit-discuss] GPU Implementation of shape-based 3D
>>               overlap on rdkit?
>> Message-ID:
>> <
>> dbbpr08mb4235128b45e0f546acfc5adb97...@dbbpr08mb4235.eurprd08.prod.outlook.com
>> >
>>
>> Content-Type: text/plain; charset="utf-8"
>>
>> Hi Lewis,
>>
>> The standard shape alignment algorithm that everyone uses is from Grant &
>> Pickup 1996 (
>> https://onlinelibrary.wiley.com/doi/abs/10.1002/%28SICI%291096-987X%2819961115%2917%3A14%3C1653%3A%3AAID-JCC7%3E3.0.CO%3B2-K
>> ).
>>
>> It?s a Taylor-series-like expansion using spherical Gaussians as
>> stand-ins for hard spheres - you take the atomic volumes, subtract off the
>> pairwise overlaps, add back in the three-way overlaps, subtract off the
>> four-way overlaps, and so on. I did a fair few tests some years back and
>> you really need to go to 6 terms to get decent accuracy. However, all of
>> the commercial algorithms (ROCS, Phase Shape, etc) seem to truncate at 2,
>> so go figure. OTOH the ?high throughput? versions all seem to be operated
>> with ludicrously low number of conformations so the error in incomplete
>> coverage of conformer space dwarfs the 5% noise that you get from
>> truncating at 2 terms rather than 6.
>>
>> If you want something slightly more accurate at the same computational
>> cost, look at WEGA (
>> https://onlinelibrary.wiley.com/doi/abs/10.1002/jcc.23603 and references
>> therein) which heuristically corrects for some flaws in the truncated
>> Grant&Pickup calculations.
>>
>> If you want a fast GPU-accelerated version, then forget about actually
>> applying the algorithm directly[*]. Instead, to compare a reference
>> molecule A to a database molecule B, precompute a grid over A containing
>> the pairwise overlap value of an atom at each point in the grid with A. You
>> can then compute the shape overlap for a given orientation of B by a simple
>> 3D texture lookup rather than faffing around trying to compute exponential
>> functions.. This is simplified by assuming that all atoms have the same
>> atomic radius and neglecting hydrogens (we?re going for speed over accuracy
>> here, remember?) You can get a similar lookup texture for gradients, I
>> think. One thing GPUs are really good at is texture lookups and
>> interpolation. They?re less good at evaluating exponential functions. Your
>> GPU algorithm is then a massively parallel CG or NR optimiser with the
>> objective function computing shape overlap values for as many molecules as
>> you can cram into GPU memory all in parallel.
>>
>> [*] gWEGA (I believe) is a GPU-accelerated version of the standard WEGA
>> algorithm and based on the published timings is an order of magnitude or
>> more slower than fastROCS
>>
>> Having said all of that, our GPU-accelerated shape similarity function
>> just brute forces through the overlap series to sixth order, as (a) my
>> happy place is on the accuracy side of the speed/accuracy tradeoff, and (b)
>> our electrostatic similarity calculations are sufficiently complex that
>> making the shape function faster wouldn?t be that much of a net win. As a
>> result, take all of the above with a grain of salt ?.
>>
>> Regards,
>> Mark
>>
>> --
>> Mark Mackey
>> Chief Scientific Officer
>> Cresset
>> New Cambridge House, Bassingbourn Road, Litlington, Cambridgeshire, SG8
>> 0SS, UK
>> tel: +44 (0)1223 858890    mobile: +44 (0)7595 099165    fax: +44 (0)1223
>> 853667
>> email: m...@cresset-group.com<mailto:m...@cresset-group.com
>> <m...@cresset-group.com>>    web: www.cresset-group.com<
>> http://www.cresset-group.com/>    skype: mark_cresset
>>
>>
>> _______________________________________________
>> Rdkit-discuss mailing list
>> Rdkit-discuss@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>>
> _______________________________________________
> Rdkit-discuss mailing list
> Rdkit-discuss@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to