We usually cluster the compounds with an appropriate method, and then sample
an equal number of molecules from each cluster that contains molecules from
the smaller reference library.
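
As a rough illustration, the sampling step might look like the sketch below.
It assumes the large library and the reference library have already been
pooled and clustered by whatever method suits the data (Butina clustering on
fingerprints is one common choice), so that every molecule ID maps to a
cluster label. The function name, the `cluster_of` mapping, and the toy IDs
are hypothetical and only there for illustration; only the Python standard
library is used.

```python
import random
from collections import defaultdict


def sample_from_reference_clusters(cluster_of, reference_ids, per_cluster, seed=0):
    """Pick up to `per_cluster` library molecules from every cluster that
    contains at least one reference molecule.

    cluster_of    -- dict: molecule id -> cluster label (pooled libraries)
    reference_ids -- ids belonging to the smaller reference library
    """
    rng = random.Random(seed)
    reference_ids = set(reference_ids)

    # Group molecule ids by cluster label.
    members = defaultdict(list)
    for mol_id, label in cluster_of.items():
        members[label].append(mol_id)

    picked = []
    for label in sorted(members):
        ids = members[label]
        # Skip clusters with no reference molecule in them.
        if not any(i in reference_ids for i in ids):
            continue
        # Only molecules from the large library are candidates.
        candidates = sorted(i for i in ids if i not in reference_ids)
        k = min(per_cluster, len(candidates))
        picked.extend(rng.sample(candidates, k))
    return picked


# Toy example: clusters 0 and 2 contain a reference molecule, cluster 1 does not.
cluster_of = {
    "ref1": 0, "libA": 0, "libB": 0,
    "libC": 1, "libD": 1,
    "ref2": 2, "libE": 2,
}
picked = sample_from_reference_clusters(cluster_of, ["ref1", "ref2"], per_cluster=1)
print(picked)
```

Clusters with fewer library members than `per_cluster` contribute everything
they have, so the sample is only approximately equal across clusters; whether
that is acceptable depends on the use case.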



On Sun, Dec 11, 2022 at 11:25 AM Christopher Mayer-Bacon <cmaye...@umbc.edu>
wrote:

> Hello all,
>
> I’m starting a project that explores the sampling of a large compound
> library.  My question is not so much about how to do something, but rather
> the specific use cases for weighted sampling from a compound library.
>
> Given a large compound library and a smaller, reference library, I want to
> take random samples from the large library such that the samples resemble
> the reference library in some way.  At the moment I’m focused on element
> composition (% of carbon atoms, % of oxygen atoms, etc.), but I’m open to
> using other features in the future.
>
> I have an idea of how to perform this sampling; my question for this
> community concerns a possible use case.  What would be the benefit of
> sampling from a compound library such that the samples resemble another
> library in some way?  I can think of a use case for my specific research
> niche (adaptive properties of the canonical amino acid alphabet), but I
> can’t think of another potential use case.  I know the RDKit community has
> a wide variety of backgrounds and expertise, hence why I wanted to pose
> this question to you all.
>
> -Chris
>
>
> --
> -Christopher Mayer-Bacon (*he/him/his*)
> PhD student
> Department of Biological Sciences
> University of Maryland, Baltimore County
> _______________________________________________
> Rdkit-discuss mailing list
> Rdkit-discuss@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>