Hi all,

  I've just released chemfp 3.5.1 with support for "licensed
FPB files". These are fingerprint datasets which can be used
under the terms of chemfp's base license agreement even without
a chemfp license key or source code distribution.

As the first (and so far only) data set, I've converted the
RDKit Morgan fingerprints from the ChEMBL 27 release into FPB
format and made it available from https://chemfp.com/datasets/ .

If you are on a Linux-based OS and RDKit is already installed
then here are the steps to get started:

1) Install a pre-compiled version of chemfp for Linux:

  python -m pip install chemfp -i https://chemfp.com/packages/

2) Download the ChEMBL data set in FPB format using one of
the following:

  wget https://chemfp.com/datasets/chembl_27.fpb.gz
        -or-
  curl -O https://chemfp.com/datasets/chembl_27.fpb.gz

3) (Optional but recommended) Uncompress it:

    gunzip chembl_27.fpb.gz

4) Do a similarity search, for example, with a query SMILES
or query file:

    simsearch --query c1ccccc1O chembl_27.fpb
    simsearch --queries queryfile.sdf chembl_27.fpb

5) View the ChEMBL license agreement and legal notices
included with the dataset.
 
    python -m chemfp fpb_text chembl_27.fpb


Chemfp is a Python package for cheminformatics fingerprints.

It can be used to:
  - generate fingerprints using the  RDKit, Open Babel,
      CDK, and OEChem/OEGraphSim toolkits;
  - extract pre-computed fingerprints from SDF tag data;
  - do high-performance Tanimoto and Tversky similarity search;
  - integrate fingerprints and search results with NumPy/SciPy.
  - ... and much more!

It includes an extensive and well-documented Python API for
working with fingerprints and a set of command-line tools
for fingerprint generation, conversion, and similarity search.

Chemfp natively supports two fingerprint file formats. The
FPS format is a text format which is easy to read and write.
The FPB format is a binary format which is quick to load.

For more information see https://chemfp.com/ .

License keys for academic use are available at no cost.

Best regards,

                                Andrew
                                da...@dalkescientific.com




_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to