I am pleased to announce the availability of Apache Nutch 1.0.
Apache Nutch, a subproject of Apache Lucene, is open source web-search
software. It builds on Lucene Java, adding web-specific components such
as a crawler, a link-graph database, and parsers for HTML and other
document formats.
Apache Nutch 1.0 contains a number of bug fixes and improvements,
including Solr integration, a new indexing framework, and a new scoring
framework, to mention just a few. Details can be found in the changes file:
Apache Nutch is available for download from the following page:
When downloading from a mirror site, please remember to verify the
downloads using signatures found on the Apache site:
For more information on Apache Nutch, visit the project home page:
-- Sami Siren (on behalf of the Apache Nutch community)