Thanks, Takayuki,

For both Repeated Bisection clustering and K-means clustering, they all
need the number of clusters as input, right?


Best,
Jing

On Sun, Aug 23, 2015 at 1:17 AM, Taka Seri <serit...@gmail.com> wrote:

> Dear Jing,
>
> How about your trying using bayon ?
> https://code.google.com/p/bayon/
> It's not function of RDKit, but I think the library can cluster molecules
> using ECFP4.
>
> Unfortunately, input file format of bayon is not distance matrix but easy
> to prepare the format.
>
> Best regards.
>
> Takayuki
>
>
> 2015年8月23日(日) 12:03 Jing Lu <ajin...@gmail.com>:
>
>> Currently, I prefer fingerprint based clustering, because it's hard to
>> set the cutoff for scaffold based clustering. Does RDKit have scaffold
>> based clustering?
>>
>> On Sat, Aug 22, 2015 at 10:56 PM, <abhik1...@gmail.com> wrote:
>>
>>> Hi, how about scaffold based clustering . You extract the scaffolds and
>>> then cluster it and then put the respective scaffold compounds inside the
>>> cluster .
>>>
>>> Sent from my iPhone
>>>
>>> > On Aug 22, 2015, at 8:43 PM, Jing Lu <ajin...@gmail.com> wrote:
>>> >
>>> > Dear RDKit users,
>>> >
>>> > If I want to cluster more than 1M molecules by ECFP4. How could I do
>>> it? If I calculate the distance between every pair of molecules, the size
>>> of distance matrix will be too big. Does RDKit support any heuristic
>>> clustering algorithm without calculating the distance matrix of the whole
>>> library?
>>> >
>>> >
>>> >
>>> > Thanks,
>>> > Jing
>>> >
>>> ------------------------------------------------------------------------------
>>> > _______________________________________________
>>> > Rdkit-discuss mailing list
>>> > Rdkit-discuss@lists.sourceforge.net
>>> > https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>>>
>>
>>
>> ------------------------------------------------------------------------------
>> _______________________________________________
>> Rdkit-discuss mailing list
>> Rdkit-discuss@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>>
>
------------------------------------------------------------------------------
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to