Hi Ganesh,

To delve a bit deeper into this, if I recall correctly, SA score is calculated 
based on both:

  1.  The prevalence of your molecule’s chemical motifs in a virtual library of 
synthesizable compounds.
  2.  A set of logarithmic formulas that take as parameters molecular features 
associated with chemical complexity, like the ring complexity or number of 
stereocenters.
Consequently, like Axel and Nils have pointed out before me, the sum of 
fragment SA scores shouldn’t be the same as that of a whole molecule. This is 
because you have more chemical motifs in your larger molecule (think of how 
fingerprints are generated) and because you can’t sum logarithms in the same 
way you sum real numbers.

You can find the details in the original publication:

Ertl, P., & Schuffenhauer, A. (2009). Estimation of synthetic accessibility 
score of drug-like molecules based on molecular complexity and fragment 
contributions. Journal of Cheminformatics, 1(1), 1–11. 
https://doi.org/10.1186/1758-2946-1-8
From: Nils Weskamp<mailto:nils.wesk...@gmail.com>
Sent: 01 April 2020 12:48
To: Ganesh Shahane<mailto:ganesh7shah...@gmail.com>
Cc: 
rdkit-discuss@lists.sourceforge.net<mailto:rdkit-discuss@lists.sourceforge.net>
Subject: Re: [Rdkit-discuss] Synthetic Accessibility (SA) score

Hi Ganesh,

I would like to challenge your premise. Why do you think that synthetic 
accessibility should add up like that?

Theoretically, I would expect that the combination of A,B and C to ABC will 
require some synthetic effort - so should be SA(A) + SA(B) + SA(C) < SA(ABC).

Technically, the combination of the three fragments will change the properties 
and environment of at least some atoms in the molecule, so that should have an 
influence on the result. I suspect it will be difficult to define a score with 
the desired properties without making use of the fragmentation scheme you are 
using.

Best regards,
Nils


On Wed, Apr 1, 2020 at 12:36 PM Ganesh Shahane 
<ganesh7shah...@gmail.com<mailto:ganesh7shah...@gmail.com>> wrote:
Hi Axel,

Thank you for your response.

Yes, I tried to implement the aforementioned script. It works very well on 
whole molecules.

However, I am trying to implement the script on fragments. For example, if I 
have fragments: A, B and C that makes up a whole molecule "ABC", then the sum 
of SA scores of the fragments should be equal to SA score of the whole molecule.

Right now, the summation doesn't add up.

I was wondering if there is a correction that needs to be made to the sum of SA 
scores such that it is equal to the SA score of the whole molecule.

--
Best,
Ganesh


On Mon, Mar 30, 2020 at 4:30 PM Axel Pahl 
<axelp...@gmx.de<mailto:axelp...@gmx.de>> wrote:
Hi Ganesh,

are you aware that the SA Score IS implemented in RDKit:

https://github.com/rdkit/rdkit/tree/master/Contrib/SA_Score

Kind regards,
Axel


On 30.03.20 16:55, Ganesh Shahane wrote:
Dear RDKitters,

I am trying a to find a way to implement the SA score as a sum of fragments 
contributions from this paper:

Ertl, Peter, and Ansgar Schuffenhauer. 2009. “Estimation of Synthetic 
Accessibility Score of Drug-like Molecules Based on Molecular Complexity and 
Fragment Contributions.” Journal of Cheminformatics 1 (1): 8.

For example, if I have three different fragments: A, B and C and their 
respective SA scores, then the SA score of molecule "A+B+C" should be the sum 
of all three fragment SA scores.

In practice though, the fragment SA scores do not add up.

I was wondering if anyone across the cheminformatics community has successfully 
implemented this or a similar strategy?

--
Best,
Ganesh




_______________________________________________

Rdkit-discuss mailing list

Rdkit-discuss@lists.sourceforge.net<mailto:Rdkit-discuss@lists.sourceforge.net>

https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net<mailto:Rdkit-discuss@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net<mailto:Rdkit-discuss@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to