Hi Giuseppe,

Is there a specific highly-cited reference for these methods. I did a
quick search on Google scholar, and it seemed that I could mostly find
them used in chemistry.

Cheers,

Gaƫl

On Tue, Aug 19, 2014 at 12:05:13PM +0200, Giuseppe Marco Randazzo wrote:
> Hello,

> i'm interested to contribute in scikit learn implementing some 
> algorithms to make an optimal selection of objects in a N-dimensional 
> space. These techniques are used when sampling is needed in large data 
> and when the sampling must be done with a specifi criterion:

> - Most Descriptive Compound: The aim of this algorithm is to select a 
> subset of compounds which most effectively represents the compounds in 
> the original population[Hudson, B; Quantitative Structure-Activity 
> Relationships 1996, 15, 285]

> - Dissimilarity Selection: The aim of this algorithm is to select a 
> subset of compounds which are really different each others [Lajiness, M; 
> Perspectives in Drug Discovery and Design 1997, 7(8), 65].


> - others....

> I can implement the Dissimilarity Selection, the Most Descriptive 
> Compound for the moment. Maybe lather other algorithms.


> Are you intrested?

> Giuseppe Marco Randazzo
-- 
    Gael Varoquaux
    Researcher, INRIA Parietal
    Laboratoire de Neuro-Imagerie Assistee par Ordinateur
    NeuroSpin/CEA Saclay , Bat 145, 91191 Gif-sur-Yvette France
    Phone:  ++ 33-1-69-08-79-68
    http://gael-varoquaux.info            http://twitter.com/GaelVaroquaux

------------------------------------------------------------------------------
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to