On Monday 11 June 2007 15:34, Rajarshi Guha wrote:
> On Jun 11, 2007, at 9:20 AM, sushil ronghe wrote:
> > Therefore a fast bistring comparision to rule out most
> > structures (e.g. has it got an aromatic ring represented in the
> > bitstring) before the more deep substructure matching would be very
> > beneficial.
> >
> > This is absolutely fantastic idea.
> > But what would be the benefit of that kind of search.
> > If some one really want to use the information of the atoms given
> > in MOL
> > file we can use the atomtype class.
> >
> > Also if we are using the fingerprinting CDK need to Process the
> > molecule.
> > and same for substructure search. SO i don't think there will be
> > any great difference
> > i think!
>
> I think the idea is to have the bit strings precalculated for a set
> of molecules. Then rather than doing a substructure search over all
> the molecules, you can do a fast bit comparison, and get a smaller
> number of molecules to perform the substructure search on.
Indeed. This is the way to go. I can say that a bitsearch on the precalculated 
fingeprints of the 20000 structures in NMRShiftDB is below a second, whereas 
the substructure search takes that on just a few structures. So it really 
helps.
>
> That is the way we do it in our databases here - definitely saves time!
>
> -------------------------------------------------------------------
> Rajarshi Guha  <[EMAIL PROTECTED]>
> GPG Fingerprint: 0CCA 8EE2 2EEB 25E2 AB04  06F7 1BB9 E634 9B87 56EE
> -------------------------------------------------------------------
> There is no truth to the allegation that statisticians are mean.
> They are just your standard normal deviates.
>
>
>
> -------------------------------------------------------------------------
> This SF.net email is sponsored by DB2 Express
> Download DB2 Express C - the FREE version of DB2 express and take
> control of your XML. No limits. Just data. Click to get it now.
> http://sourceforge.net/powerbar/db2/
> _______________________________________________
> Cdk-user mailing list
> Cdk-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/cdk-user

-- 
Stefan Kuhn BSc MA
IPB Halle
AG Bioinformatik & Massenspektrometrie
Weinberg 3
06120 Halle
http://www.ipb-halle.de http://msbi.bic-gh.de
[EMAIL PROTECTED] +49 (0) 345 5582 1474 Fax.+49 (0) 345 5582 1409

-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Cdk-user mailing list
Cdk-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/cdk-user

Reply via email to