This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git
commit 5d7a6e6ddeb7ffef7107ed271e12eb7113eaf7c1 Author: Sebastian Nagel <[email protected]> AuthorDate: Tue Feb 17 16:39:35 2026 +0100 Convert legacy news --- content/news/2000th-jira-issue.md | 11 + content/news/legacy-nutch-news.md | 407 ------------------------- content/news/nutch-0.7-release.md | 12 + content/news/nutch-0.7.1-release.md | 11 + content/news/nutch-0.7.2-release.md | 11 + content/news/nutch-0.8-release.md | 11 + content/news/nutch-0.8.1-release.md | 11 + content/news/nutch-0.9-release.md | 11 + content/news/nutch-1.0-release.md | 11 + content/news/nutch-1.1-release.md | 11 + content/news/nutch-1.10-release.md | 19 ++ content/news/nutch-1.11-release.md | 19 ++ content/news/nutch-1.12-release.md | 19 ++ content/news/nutch-1.13-release.md | 19 ++ content/news/nutch-1.14-release.md | 19 ++ content/news/nutch-1.15-release.md | 19 ++ content/news/nutch-1.16-release.md | 20 ++ content/news/nutch-1.17-release.md | 20 ++ content/news/nutch-1.2-release.md | 11 + content/news/nutch-1.3-release.md | 11 + content/news/nutch-1.4-release.md | 11 + content/news/nutch-1.5-release.md | 11 + content/news/nutch-1.5.1-release.md | 11 + content/news/nutch-1.6-release.md | 11 + content/news/nutch-1.7-release.md | 11 + content/news/nutch-1.8-release.md | 11 + content/news/nutch-1.9-release.md | 11 + content/news/nutch-10th-birthday.md | 11 + content/news/nutch-2.0-release.md | 11 + content/news/nutch-2.1-release.md | 11 + content/news/nutch-2.2-release.md | 11 + content/news/nutch-2.2.1-release.md | 11 + content/news/nutch-2.3-release.md | 34 +++ content/news/nutch-2.3.1-release.md | 32 ++ content/news/nutch-2.4-release.md | 22 ++ content/news/nutch-at-apachecon-eu-2009.md | 20 ++ content/news/nutch-at-apachecon-eu-2014.md | 13 + content/news/nutch-at-apachecon-na-2009.md | 32 ++ content/news/nutch-at-apachecon-na-2014.md | 13 + content/news/nutch-dev-focus-1.x.md | 11 + content/news/nutch-graduates-from-incubator.md | 11 + content/news/nutch-graduates-tlp.md | 11 + content/news/nutch-gsoc-2014.md | 15 + content/news/nutch-joins-incubator.md | 11 + content/news/nutch-search-creative-commons.md | 13 + content/news/nutch-search-osu.md | 13 + content/news/nutch-wiki-migrated.md | 11 + content/news/wicket-webapp-gsoc.md | 26 ++ 48 files changed, 696 insertions(+), 407 deletions(-) diff --git a/content/news/2000th-jira-issue.md b/content/news/2000th-jira-issue.md new file mode 100644 index 00000000..46fb5e33 --- /dev/null +++ b/content/news/2000th-jira-issue.md @@ -0,0 +1,11 @@ ++++ +date = "2015-04-23T00:00:00+00:00" +title = "Apache Nutch Reaches 2000th Jira Issue" +tags = ["Jira","issues"] +categories = ["news"] +draft = false +description = "23 April 2015 - Apache Nutch Reaches 2000th Jira Issue" +weight = 10 ++++ + +[NUTCH-2000](https://issues.apache.org/jira/browse/NUTCH-2000) is the 2000th Jira issues opened. diff --git a/content/news/legacy-nutch-news.md b/content/news/legacy-nutch-news.md deleted file mode 100644 index 1c6b4fb5..00000000 --- a/content/news/legacy-nutch-news.md +++ /dev/null @@ -1,407 +0,0 @@ -+++ -date = "2017-03-02T21:56:55+01:00" -title = "Legacy Nutch News Announcements" -tags = ["news","legacy"] -categories = ["news"] -draft = false -description = "This page covers all Nutch news before the Nutch 1.18 release." -weight = 10 -+++ - -## 02 July 2020 - Nutch 1.17 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.17, we advise all -current users and developers of the 1.X series to upgrade to this release. - -An account of the CHANGES in this release can be seen in the -[release report](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=10680&version=12346090). -Breaking changes are listed in the [changelog](https://apache.org/dist/nutch/1.17/CHANGES.txt). - -As usual in the 1.X series, release artifacts are made available as both source and binary and also available within -[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch%20AND%20v:1.17) as a Maven dependency. -The release is available from our [downloads page](/downloads.html). - -## 11 October 2019 - Nutch 1.16 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.16, we advise all -current users and developers of the 1.X series to upgrade to this release. - -An account of the CHANGES in this release can be seen in the -[release report](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=10680&version=12343430). -Breaking changes are listed in the [changelog](https://apache.org/dist/nutch/1.16/CHANGES.txt). - -As usual in the 1.X series, release artifacts are made available as both source and binary and also available within -[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch%20AND%20v:1.16) as a Maven dependency. -The release is available from our [downloads page](/downloads.html). - -## 11 October 2019 - Nutch 2.4 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v2.4, we advise all -current users and developers of the 2.X series to upgrade to this release. - -This release contains 81 issues addressed. For a complete overview of these -issues please see the [release report](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=10680&version=12324540). - -As usual in the 2.X series, release artifacts are made available as only source and also available within -[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch%20AND%20v:2.4) as a Maven dependency. -The release is available from our [downloads page](/downloads.html). - -We expect that v2.4 is the last release on the 2.X series. We've decided to freeze the development on the 2.X branch for now, as no committer is -actively working on it. - - -## 26 July 2019 - Nutch Wiki Migrated - -The [Apache Nutch wiki](https://cwiki.apache.org/confluence/display/NUTCH/Home) has been migrated from MoinMoin to Confluence. - -## 9 August 2018 - Nutch 1.15 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.15, we advise all -current users and developers of the 1.X series to upgrade to this release. - -An account of the CHANGES in this release can be seen in the -[release report](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=10680&version=12342302). - -As usual in the 1.X series, release artifacts are made available as both source and binary and also available within -[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. -The release is available from our [DOWNLOADS PAGE](/downloads.html). - -## 23 December 2017 - Nutch 1.14 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.14, we advise all -current users and developers of the 1.X series to upgrade to this release. - -An account of the CHANGES in this release can be seen in the -[release report](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=10680&version=12340218). - -As usual in the 1.X series, release artifacts are made available as both source and binary and also available within -[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. -The release is available from our [DOWNLOADS PAGE](/downloads.html). - -## 02 April 2017 - Nutch 1.13 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.13, we advise all -current users and developers of the 1.X series to upgrade to this release. - -An account of the CHANGES in this release can be seen in the -[release report](https://s.apache.org/wq3x). - -As usual in the 1.X series, release artifacts are made available as both source and binary and also available within -[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. -The release is available from our [DOWNLOADS PAGE](/downloads.html). - -## 18 June 2016 - Nutch 1.12 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.12, we advise all -current users and developers of the 1.X series to upgrade to this release. - -This release is the result of many months of work and over 40 issues addressed. For a complete overview of these issues please see the -[release report](https://s.apache.org/nutch1.12). - -As usual in the 1.X series, release artifacts are made available as both source and binary and also available within -[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. -The release is available from our [DOWNLOADS PAGE](/downloads.html). - -## 21 January 2016 - Nutch 2.3.1 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v2.3.1, we advise all -current users and developers of the 2.X series to upgrade to this release. - -This bug fix release contains around 40 issues addressed. For a complete overview of these issues please see the -[release report](https://s.apache.org/nutch_2.3.1). - -As usual in the 2.X series, release artifacts are made available as only source and also available within -[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. -The release is available from our [DOWNLOADS PAGE](/downloads.html). - -The recommended Gora backends for this Nutch release are - - - Apache Avro 1.7.6 - - Apache Hadoop 1.2.1 and 2.5.2 - - Apache HBase 0.98.8-hadoop2 (although also tested with 1.X) - - Apache Cassandra 2.0.2 - - Apache Solr 4.10.3 - - MongoDB 2.6.X - - Apache Accumlo 1.5.1 - - Apache Spark 1.4.1 - -Thank you to everyone that contributed towards this release. - -## 07 December 2015 - Nutch 1.11 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.11, we advise all -current users and developers of the 1.X series to upgrade to this release. - -This release is the result of many months of work and around 100 issues addressed. For a complete overview of these issues please see the -[release report](https://s.apache.org/nutch11). - -As usual in the 1.X series, release artifacts are made available as both source and binary and also available within -[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. -The release is available from our [DOWNLOADS PAGE](/downloads.html). - -## 06 May 2015 - Nutch 1.10 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.10, we advise all -current users and developers of the 1.X series to upgrade to this release. - -This release is the result of many months of work and well over 100 issues addressed. For a complete overview of these issues please see the -[release report](https://s.apache.org/nutch10). - -As usual in the 1.X series, release artifacts are made available as both source and binary and also available within -[Maven Central](https://search.maven.org/) as a Maven dependency. -The release is available from our [DOWNLOADS PAGE](/downloads.html). - -## 23 April 2015 - Apache Nutch Reaches 2000th Jira Issue - -## 22 January 2015 - Nutch 2.3 Release - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v2.3, we advise all -current users and developers of the 2.X series to upgrade to this release. -After successful completion of the first [Nutch Google Summer of Code project](https://issues.apache.org/jira/browse/NUTCH-841) -we are pleased to announce that Nutch 2.3 release now comes packaged with a self -contained [Apache Wicket](https://wicket.apache.org/)-based Web Application. - -This release is the result of many months of work and 143 issues addressed. For a complete overview of these issues please see the -[release report](https://s.apache.org/nutch_2.3). - -As usual in the 2.x series, this release is made available only as source, but is also available within -[Maven Central](https://search.maven.org/) as a Maven dependency. -The release is available from our [DOWNLOADS PAGE](/downloads.html). - -The supported [Apache Gora](https://gora.apache.org/) v0.5 backends are; - - * [Apache Hadoop](https://hadoop.apache.org/) 1.0.1 & 2.4.0 - * [Apache Cassandra](https://cassandra.apache.org/) 2.0.2 - * [Apache HBase](https://hbase.apache.org/) 0.94.14 - * [Apache Accumulo](https://accumulo.apache.org/) 1.5.1 - * [MongoDB](https://mongodb.org/) 2.12.2 - * [Apache Solr](https://lucene.apache.org/solr) 4.8.1 - * [Apache Avro](https://avro.apache.org/) 1.7.6 - -Please note that the SQL backend for Gora has been deprecated. - -## 22 September 2014 - Wicket WebApp now part of Nutch 2.x Codebase - -After successful completion of the first [Nutch Google Summer of Code project](https://issues.apache.org/jira/browse/NUTCH-841) -we are pleased to announce that Nutch 2.X branch now comes packaged with a self -contained [Apache Wicket](https://wicket.apache.org/)-based Web Application. - -This not only greatly lowers the barrier for direct interaction with the Nutch 2.X -REST API but also provides a stepping stone from which we intend to backport this -work to the Nutch 1.X (trunk) series. - -Some of the Web Application features include: - - * Functionality to dynamically load seed URLs in order to bootstrap Nutch crawls - * Browsable and dynamic editing of [Configuration overrides](https://cwiki.apache.org/nutch/NutchPropertiesCompleteList) - * Complete [REST API documentation](https://cwiki.apache.org/nutch/NutchRESTAPI) and UML -model describing REST API calls, Administration and Job and Configuration Management. - -The new Web Application feature will be present within the upcoming Nutch 2.3 Release. - -## 16 August 2014 - Apache Nutch v1.9 Released -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.9, we advise all current users and developers of the 1.X series to upgrade to this release. This release addressed no fewer than 55 issues in total. Please see the [list of changes](https://www.apache.org/dist/nutch/1.9/CHANGES.txt) for a full breakdown, or see the [release report](https://s.apache.org/1.9-release). As usual in the 1.X series, this release is made available both as source and binary. Ad [...] - -31 July 2014 - Nutch tutorial at upcoming [ApacheCon Europe](http://events.linuxfoundation.org/events/apachecon-europe) in Budapest ------------------------------------------------------------------------------------------------------------------------------------ - - [](http://events.linuxfoundation.org/events/apachecon-europe "ApacheCon EU 2014") - -The upcoming [ApacheCon Europe](http://events.linuxfoundation.org/events/apachecon-europe) in Budapest, November 17 - 21, 2014, will offer a one-day [Nutch tutorial](http://sched.co/1pbE15n). Topics will span from Nutch installation and configuration up to plugin development. Both Nutch 1.x and 2.x are covered. The conference is a good opportunity to bring together both users and committers of Nutch and related projects. - -01 May 2014 - Apache Nutch Participates in [Google Summer of Code](https://www.google-melange.com/gsoc/homepage/google/gsoc2014) --------------------------------------------------------------------------------------------------------------------------------- - - [](http://www.us.apachecon.com/c/acus2009/ "ApacheCon US 2009") - -For the first time in Nutch project history, we are participating as part of Apache's mentoring efforts in the ever popular [Google Summer of Code](https://www.google-melange.com/gsoc/homepage/google/gsoc2014) program. This years project involves the [creation of a Apache Wicket-based Web Application](https://issues.apache.org/jira/browse/NUTCH-841) for Nutch 2.X branch. - -Keep your eyes peeled and check here for updates as the project progresses throughout the summer. - -07-09 April 2014 - Nutch at ApacheCon 2014, Denver Colorado ------------------------------------------------------------ - - [](http://events.linuxfoundation.org/events/apachecon-north-america "ApacheCon NA 2014") - -lots of talk and loads of exposure for this at ApacheCon NA 2014 in the beautiful city of Denver, CO. This year one presentation focused on [Building your Big Data Search Stack with Apache Nutch 2.x](http://sched.co/1pav9xl). You can see presentation slides below and follow the audio (sorry no video) [here](https://www.youtube.com/watch?v=rIv3Js-zBpE) - -. - -17 March 2014 - Apache Nutch v1.8 Released ------------------------------------------- - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.8, we advise all current users and developers of the 1.X series to upgrade to this release. Alhough this release includes library upgrades to [Crawler Commons](http://code.google.com/p/crawler-commons/) 0.3 and [Apache Tika](https://tika.apache.org/) 1.5, it also provides over 30 bug fixes as well as 18 improvements. Please see the [list of changes](http://www.apache.org/dist/nutch/1.8/CHANGES.txt) for [...] - -02 July 2013 - Apache Nutch v2.2.1 Released -------------------------------------------- - -The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v2.2.1, we advise all current users and developers of the 2.X series to upgrade to this release ASAP. Although this release includes library upgrades to [Apache Hadoop](https://hadoop.apache.org/) 1.2.0 and [Apache Tika](https://tika.apache.org/) 1.3, it is predominantly a bug fix for [NUTCH-1591 - Incorrect conversion of ByteBuffer to String](https://issues.apache.org/jira/browse/NUTCH-1591). Please see t [...] - -24th June 2013 - Apache Nutch v1.7 Released -------------------------------------------- - -The Apache Nutch PMC are extremely pleased to announce the immediate release of Apache Nutch v1.7. This release includes over 20 bug fixes, as many improvements; most noticeably featuring a new [pluggable indexing architecture](https://issues.apache.org/jira/browse/NUTCH-1047) which currently supports [Apache Solr](http://lucene.apache.org/solr) and [Elastic Search](http://www.elasticsearch.org/). Shadowing the recent Nutch 2.2 release, parsing of Robots.txt is now delegated to [Crawler- [...] - -08 June 2013 - Apache Nutch v2.2 Released ------------------------------------------ - -The Apache Nutch PMC are extremely pleased to announce the immediate release of Apache Nutch v2.2. This release includes over 30 bug fixes and over 25 improvements representing the third release of increasingly popular 2.x Nutch series. This release features inclusion of [Crawler-Commons](http://code.google.com/p/crawler-commons/) which Nutch now utilizes for improved robots.txt parsing, library upgrades to [Apache Hadoop](https://hadoop.apache.org/) 1.1.1, [Apache Gora](https://gora.apa [...] - -06 December 2012 - Apache Nutch v1.6 Released ---------------------------------------------- - -The Apache Nutch PMC are extremely pleased to announce the release of Apache Nutch v1.6. This release includes over 20 bug fixes, the same in improvements, as well as new functionalities including a new HostNormalizer, the ability to dynamically set fetchInterval by MIME-type and functional enhancements to the Indexer API inluding the normalization of URL's and the deletion of robots noIndex documents. Other notable improvements include the upgrade of key dependencies to [Tika 1.2](https [...] - -05 October 2012 - Apache Nutch v2.1 Released --------------------------------------------- - -The Apache Nutch PMC are very pleased to announce the release of Apache Nutch v2.1. This release continues to provide Nutch users with a simplified Nutch distribution building on the 2.x development drive which is growing in popularity amongst the community. As well as addressing ~20 bugs this release also offers improved properties for better [Solr](http://lucene.apache.org/solr/) configuration, upgrades to various [Gora](https://gora.apache.org/) dependencies and the introduction of th [...] - -10 August 2012 - Happy 10th Birthday Apache Nutch!! ---------------------------------------------------- - -It's official, Apache Nutch is now a decade old! The project has come a long long way since inception, through [acceptance into the Apache Incubator](#January+2005%3A+Nutch+Joins+Apache+Incubator) way back in Janurary 2005, to the [Top Level Project](#21+April+2010+-+Apache+Nutch+graduates+to+TLP) it became on 21st April 2010. Happy birthday Nutch and thanks to all contributors past and present! See [Doug Cutting's tweet](https://twitter.com/cutting/status/233415059798372353). - -10 July 2012 - Apache Nutch v1.5.1 Released -------------------------------------------- - -The Apache Nutch PMC are very pleased to announce the release of Apache Nutch v1.5.1. This release is a maintainence release of the popular 1.5.X mainstream version of Nutch which has been widely adopted within the community. Please see the [list of changes](http://www.apache.org/dist/nutch/1.5.1/CHANGES.txt) made in this version for a full breakdown. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -07 July 2012 - Apache Nutch v2.0 Released ------------------------------------------ - -The Apache Nutch PMC are very pleased to announce the release of Apache Nutch v2.0. This release offers users an edition focused on large scale crawling which builds on storage abstraction (via Apache Gora™) for big data stores such as Apache Accumulo™, Apache Avro™, Apache Cassandra™, Apache HBase™, HDFS™, an in memory data store and various high profile SQL stores. After some two years of development Nutch v2.0 also offers all of the mainstream Nutch functionality and it builds on Apac [...] - -07 June 2012 - Apache Nutch 1.5 Released ----------------------------------------- - -The 1.5 release of Nutch is now available. This release includes several improvements including upgrades of several major components including Tika 1.1 and Hadoop 1.0.0, improvements to LinkRank and WebGraph elements as well as a number of new plugins covering blacklisting, filering and parsing to name a few. Please see the [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.5.txt) made in this version for a full breakdown of the 50 odd improvements the release boasts. The relea [...] - -26 November 2011 - Apache Nutch 1.4 Released --------------------------------------------- - -The 1.4 release of Nutch is now available. This release includes several improvements including allowing Parsers to declare support for multiple MIME types, configurable Fetcher Queue depth, Fetcher speed improvements, tigther Tika integration, and support for HTTP auth in Solr indexing. Please see the [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.4.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -23 September 2011 - Apache Nutch focuses on 1.x series for main development ---------------------------------------------------------------------------- - -After some [discussion](http://www.mail-archive.com/[email protected]/msg03581.html) and a [vote](http://www.mail-archive.com/[email protected]/msg04348.html) about the issue, the Nutch development community decided to focus their efforts on maintaining and releasing the 1.x series of Nutch, and to branch the now former Nutch trunk based on Gora, allowing others to try and improve it, while the mainline development goes on. - -7 June 2011 - Apache Nutch 1.3 Released ---------------------------------------- - -The 1.3 release of Nutch is now available. This release includes several improvements (improved RSS parsing support, tighter integration with Apache Tika, external parsing support, improved language identification and an order of magnitude smaller source release tarball -- only about 2MB!). Please see the [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.3.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -24 September 2010 - Apache Nutch 1.2 Released ---------------------------------------------- - -The 1.2 release of Nutch is now available. This release includes several improvements (addition of parse-html as a selectable parser again, configurable per-field indexing), new features (including adding timing information to all Tool classes, and implementation of parser timeouts), and bug fixes (fixing an NPE in distributed search, fixing of XML formatting issues per Document fields). Please see the [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.2.txt) made in this versi [...] - -06 June 2010 - Apache Nutch 1.1 Released ----------------------------------------- - -The 1.1 release of Nutch is now available. This release includes several major upgrades of existing libraries (Hadoop, Solr, Tika, etc.) on which Nutch depends. Various bug fixes, and speedups (e.g., to Fetcher2) have also been included. See [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.1.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -21 April 2010 - Apache Nutch graduates to TLP ---------------------------------------------- - -[Passed by unanimous approval of the Apache Board](http://www.apache.org/foundation/records/minutes/2010/board_minutes_2010_04_21.txt), Nutch graduated to TLP status. We are in the process of updating the website, and moving things around, so if you notice anything out of place, [please let us know.](./mailing_lists.html) - -14 August 2009 - Lucene at US ApacheCon ---------------------------------------- - - [](http://www.us.apachecon.com/c/acus2009/ "ApacheCon US 2009") ApacheCon US is once again in the Bay Area and Lucene is coming along for the ride! The Lucene community has planned two full days of talks, plus a meetup and the usual bevy of training. With a well-balanced mix of first time and veteran ApacheCon speakers, the [Lucene track](http://www.us.apachecon.com/c/acus2009/schedule#lucene) at ApacheCon US prom [...] - -Training: - -* [Lucene Boot Camp](http://www.us.apachecon.com/c/acus2009/sessions/437) - A two day training session, Nov. 2nd & 3rd -* [Solr Day](http://www.us.apachecon.com/c/acus2009/sessions/375) - A one day training session, Nov. 2nd - -Thursday, Nov. 5th - -* [Introduction to the Lucene Ecosystem](http://www.us.apachecon.com/c/acus2009/sessions/428) \- Grant Ingersoll @ 9:00 -* [Lucene Basics and New Features](http://www.us.apachecon.com/c/acus2009/sessions/461) - Michael Busch @ 10:00 -* [Apache Solr: Out of the Box](http://www.us.apachecon.com/c/acus2009/sessions/331) - Chris Hostetter @ 14:00 -* [Introduction to Nutch](http://www.us.apachecon.com/c/acus2009/sessions/427) - Andrzej Bialecki @ 15:00 -* [Lucene and Solr Performance Tuning](http://www.us.apachecon.com/c/acus2009/sessions/430) - Mark Miller @ 16:30 - -Friday, Nov. 6th - -* [Implementing an Information Retrieval Framework for an Organizational Repository](http://www.us.apachecon.com/c/acus2009/sessions/332) - Sithu D Sudarsan @ 9:00 -* [Apache Mahout - Going from raw data to Information](http://www.us.apachecon.com/c/acus2009/sessions/333) - Isabel Drost @ 10:00 -* [MIME Magic with Apache Tika](http://www.us.apachecon.com/c/acus2009/sessions/334) - Jukka Zitting @ 11:30 -* [Building Intelligent Search Applications with the Lucene Ecosystem](http://www.us.apachecon.com/c/acus2009/sessions/335) - Ted Dunning @ 14:00 -* [Realtime Search](http://www.us.apachecon.com/c/acus2009/sessions/462) - Jason Rutherglen @ 15:00 - -23 March 2009 - Apache Nutch 1.0 Released ------------------------------------------ - -The 1.0 release of Nutch is now available. This release includes several major feature improvements such as new indexing framework, new scoring framework, Apache Solr integration just to mention a few. See [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.0.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -09 February 2009 - Lucene at ApacheCon Europe 2009 in Amsterdam ---------------------------------------------------------------- - - [](http://www.eu.apachecon.com/c/aceu2009/ "ApacheCon EU 2009") Lucene will be extremely well represented at [ApacheCon EU 2009](http://www.eu.apachecon.com/c/aceu2009/) in Amsterdam, Netherlands this March 23-27, 2009: - -* [Lucene Boot Camp](http://eu.apachecon.com/c/aceu2009/sessions/197) - A two day training session, March 23 & 24th -* [Solr Boot Camp](http://eu.apachecon.com/c/aceu2009/sessions/201) - A one day training session, March 24th -* [Introducing Apache Mahout](http://eu.apachecon.com/c/aceu2009/sessions/136) - Grant Ingersoll. March 25th @ 10:30 -* [Lucene/Solr Case Studies](http://eu.apachecon.com/c/aceu2009/sessions/137) - Erik Hatcher. March 25th @ 11:30 -* [Advanced Indexing Techniques with Apache Lucene](http://eu.apachecon.com/c/aceu2009/sessions/138) - Michael Busch. March 25th @ 14:00 -* [Apache Solr - A Case Study](http://eu.apachecon.com/c/aceu2009/sessions/251) - Uri Boness. March 26th @ 17:30 -* [Best of breed - httpd, forrest, solr and droids](http://eu.apachecon.com/c/aceu2009/sessions/250) - Thorsten Scherler. March 27th @ 17:30 -* [Apache Droids - an intelligent standalone robot framework](http://eu.apachecon.com/c/aceu2009/sessions/165) - Thorsten Scherler. March 26th @ 15:00 - -2 April 2007: Nutch 0.9 Released --------------------------------- - -The 0.9 release of Nutch is now available. This is the second release of Nutch based entirely on the underlying Hadoop platform. This release includes several critical bug fixes, as well as key speedups described in more detail at [Sami Siren's blog](http://blog.foofactory.fi/2007/03/twice-speed-half-size.html). See [list of changes](http://www.apache.org/dist/nutch/CHANGES-0.9.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -24 September 2006: Nutch 0.8.1 Released ---------------------------------------- - -The 0.8.1 release of Nutch is now available. This is a maintenance release to 0.8 branch fixing many serous bugs found in version 0.8. See [list of changes](http://www.apache.org/dist/nutch/CHANGES-0.8.1.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -25 July 2006: Nutch 0.8 Released --------------------------------- - -The 0.8 release of Nutch is now available. This is the first release of Nutch based on hadoop architecure. See [CHANGES.txt](http://svn.apache.org/viewvc/nutch/tags/release-0.8/CHANGES.txt?view=markup) for list of changes made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -31 March 2006: Nutch 0.7.2 Released ------------------------------------ - -The 0.7.2 release of Nutch is now available. This is a bug fix release for 0.7 branch. See [CHANGES.txt](http://svn.apache.org/viewcvs.cgi/nutch/branches/branch-0.7/CHANGES.txt?rev=390158) for details. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -1 October 2005: Nutch 0.7.1 Released ------------------------------------- - -The 0.7.1 release of Nutch is now available. This is a bug fix release. See [CHANGES.txt](http://svn.apache.org/viewcvs.cgi/nutch/branches/branch-0.7/CHANGES.txt?rev=292986) for details. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -17 August 2005: Nutch 0.7 Released ----------------------------------- - -This is the first Nutch release as an Apache Lucene sub-project. See [CHANGES.txt](http://svn.apache.org/viewcvs.cgi/nutch/trunk/CHANGES.txt?rev=233150) for details. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). - -June 2005: Nutch graduates from Incubator ------------------------------------------ - -Nutch has now graduated from the Apache incubator, and is now a Subproject of Lucene. - -January 2005: Nutch Joins Apache Incubator ------------------------------------------- - -Nutch is a two-year-old open source project, previously hosted at Sourceforge and backed by its own non-profit organization. The non-profit was founded in order to assign copyright, so that we could retain the right to change the license. We have now determined that the Apache license is the appropriate license for Nutch and no longer require the overhead of an independent non-profit organization. Nutch's board of directors and its developers were both polled and supported the move to th [...] - -September 2004: Creative Commons launches Nutch-based Search ------------------------------------------------------------- - -Creative Commons unveiled a beta version of its search engine, which scours the web for text, images, audio, and video free to re-use on certain terms a search refinement offered by no other company or organization. - -See the [Creative Commons Press Release](http://creativecommons.org/press-releases/entry/5064) for more details. - -September 2004: Oregon State University switches to Nutch ---------------------------------------------------------- - -Oregon State University is converting its searching infrastructure from Googletm to the open source project Nutch. The effort to replace the Googletm will realize significant cost savings for Oregon State University, while promoting both the Nutch Search Engine and transparency in search engine use and management. - -For more details see the announcement by OSU's [Open Source Lab](http://osuosl.org/news_folder/nutch). \ No newline at end of file diff --git a/content/news/nutch-0.7-release.md b/content/news/nutch-0.7-release.md new file mode 100644 index 00000000..98e5e729 --- /dev/null +++ b/content/news/nutch-0.7-release.md @@ -0,0 +1,12 @@ ++++ +date = "2005-08-17T00:00:00+00:00" +title = "Nutch 0.7 Released" +tags = ["0.7","release"] +categories = ["releases"] +draft = false +description = "17 August 2005 - Nutch 0.7 Released" +weight = 10 ++++ + +This is the first Nutch release as an Apache Lucene sub-project. See [CHANGES.txt](http://svn.apache.org/viewcvs.cgi/nutch/trunk/CHANGES.txt?rev=233150) for details. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). + diff --git a/content/news/nutch-0.7.1-release.md b/content/news/nutch-0.7.1-release.md new file mode 100644 index 00000000..efe2f226 --- /dev/null +++ b/content/news/nutch-0.7.1-release.md @@ -0,0 +1,11 @@ ++++ +date = "2005-10-01T00:00:00+00:00" +title = "Nutch 0.7.1 Released" +tags = ["0.7.1","release"] +categories = ["releases"] +draft = false +description = "1 October 2005 - Nutch 0.7.1 Released" +weight = 10 ++++ + +The 0.7.1 release of Nutch is now available. This is a bug fix release. See [CHANGES.txt](http://svn.apache.org/viewcvs.cgi/nutch/branches/branch-0.7/CHANGES.txt?rev=292986) for details. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). diff --git a/content/news/nutch-0.7.2-release.md b/content/news/nutch-0.7.2-release.md new file mode 100644 index 00000000..ced651fc --- /dev/null +++ b/content/news/nutch-0.7.2-release.md @@ -0,0 +1,11 @@ ++++ +date = "2006-03-31T00:00:00+00:00" +title = "Nutch 0.7.2 Released" +tags = ["0.7.2","release"] +categories = ["releases"] +draft = false +description = "31 March 2006 - Nutch 0.7.2 Released" +weight = 10 ++++ + +The 0.7.2 release of Nutch is now available. This is a bug fix release for 0.7 branch. See [CHANGES.txt](http://svn.apache.org/viewcvs.cgi/nutch/branches/branch-0.7/CHANGES.txt?rev=390158) for details. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). diff --git a/content/news/nutch-0.8-release.md b/content/news/nutch-0.8-release.md new file mode 100644 index 00000000..84e4216f --- /dev/null +++ b/content/news/nutch-0.8-release.md @@ -0,0 +1,11 @@ ++++ +date = "2006-07-25T00:00:00+00:00" +title = "Nutch 0.8 Released" +tags = ["0.8","release"] +categories = ["releases"] +draft = false +description = "25 July 2006 - Nutch 0.8 Released" +weight = 10 ++++ + +The 0.8 release of Nutch is now available. This is the first release of Nutch based on hadoop architecure. See [CHANGES.txt](http://svn.apache.org/viewvc/nutch/tags/release-0.8/CHANGES.txt?view=markup) for list of changes made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). diff --git a/content/news/nutch-0.8.1-release.md b/content/news/nutch-0.8.1-release.md new file mode 100644 index 00000000..629a8969 --- /dev/null +++ b/content/news/nutch-0.8.1-release.md @@ -0,0 +1,11 @@ ++++ +date = "2006-09-24T00:00:00+00:00" +title = "Nutch 0.8.1 Released" +tags = ["0.8.1","release"] +categories = ["releases"] +draft = false +description = "24 September 2006 - Nutch 0.8.1 Released" +weight = 10 ++++ + +The 0.8.1 release of Nutch is now available. This is a maintenance release to 0.8 branch fixing many serous bugs found in version 0.8. See [list of changes](http://www.apache.org/dist/nutch/CHANGES-0.8.1.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). diff --git a/content/news/nutch-0.9-release.md b/content/news/nutch-0.9-release.md new file mode 100644 index 00000000..8f7203b0 --- /dev/null +++ b/content/news/nutch-0.9-release.md @@ -0,0 +1,11 @@ ++++ +date = "2007-04-02T00:00:00+00:00" +title = "Nutch 0.9 Released" +tags = ["0.9","release"] +categories = ["releases"] +draft = false +description = "2 April 2007 - Nutch 0.9 Released" +weight = 10 ++++ + +The 0.9 release of Nutch is now available. This is the second release of Nutch based entirely on the underlying Hadoop platform. This release includes several critical bug fixes, as well as key speedups described in more detail at [Sami Siren's blog](http://blog.foofactory.fi/2007/03/twice-speed-half-size.html). See [list of changes](http://www.apache.org/dist/nutch/CHANGES-0.9.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). diff --git a/content/news/nutch-1.0-release.md b/content/news/nutch-1.0-release.md new file mode 100644 index 00000000..218b1dbc --- /dev/null +++ b/content/news/nutch-1.0-release.md @@ -0,0 +1,11 @@ ++++ +date = "2009-03-23T00:00:00+00:00" +title = "Apache Nutch 1.0 Released" +tags = ["1.0","release"] +categories = ["releases"] +draft = false +description = "23 March 2009 - Apache Nutch 1.0 Released" +weight = 10 ++++ + +The 1.0 release of Nutch is now available. This release includes several major feature improvements such as new indexing framework, new scoring framework, Apache Solr integration just to mention a few. See [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.0.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). diff --git a/content/news/nutch-1.1-release.md b/content/news/nutch-1.1-release.md new file mode 100644 index 00000000..6171acea --- /dev/null +++ b/content/news/nutch-1.1-release.md @@ -0,0 +1,11 @@ ++++ +date = "2010-06-06T00:00:00+00:00" +title = "Apache Nutch 1.1 Released" +tags = ["1.1","release"] +categories = ["releases"] +draft = false +description = "06 June 2010 - Apache Nutch 1.1 Released" +weight = 10 ++++ + +The 1.1 release of Nutch is now available. This release includes several major upgrades of existing libraries (Hadoop, Solr, Tika, etc.) on which Nutch depends. Various bug fixes, and speedups (e.g., to Fetcher2) have also been included. See [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.1.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). diff --git a/content/news/nutch-1.10-release.md b/content/news/nutch-1.10-release.md new file mode 100644 index 00000000..b75a2519 --- /dev/null +++ b/content/news/nutch-1.10-release.md @@ -0,0 +1,19 @@ ++++ +date = "2015-05-06T00:00:00+00:00" +title = "Nutch 1.10 Release" +tags = ["1.10","release"] +categories = ["releases"] +draft = false +description = "6 May 2015 - Apache Nutch 1.10 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.10, we advise all +current users and developers of the 1.X series to upgrade to this release. + +This release is the result of many months of work and well over 100 issues addressed. For a complete overview of these issues please see the +[release report](https://s.apache.org/nutch10). + +As usual in the 1.X series, release artifacts are made available as both source and binary and also available within +[Maven Central](https://search.maven.org/) as a Maven dependency. +The release is available from our [DOWNLOADS PAGE](/downloads.html). diff --git a/content/news/nutch-1.11-release.md b/content/news/nutch-1.11-release.md new file mode 100644 index 00000000..78920005 --- /dev/null +++ b/content/news/nutch-1.11-release.md @@ -0,0 +1,19 @@ ++++ +date = "2015-12-07T00:00:00+00:00" +title = "Nutch 1.11 Release" +tags = ["1.11","release"] +categories = ["releases"] +draft = false +description = "7 December 2015 - Apache Nutch 1.11 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.11, we advise all +current users and developers of the 1.X series to upgrade to this release. + +This release is the result of many months of work and around 100 issues addressed. For a complete overview of these issues please see the +[release report](https://s.apache.org/nutch11). + +As usual in the 1.X series, release artifacts are made available as both source and binary and also available within +[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. +The release is available from our [DOWNLOADS PAGE](/downloads.html). diff --git a/content/news/nutch-1.12-release.md b/content/news/nutch-1.12-release.md new file mode 100644 index 00000000..54e5b7ee --- /dev/null +++ b/content/news/nutch-1.12-release.md @@ -0,0 +1,19 @@ ++++ +date = "2016-06-18T00:00:00+00:00" +title = "Nutch 1.12 Release" +tags = ["1.12","release"] +categories = ["releases"] +draft = false +description = "18 June 2016 - Apache Nutch 1.12 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.12, we advise all +current users and developers of the 1.X series to upgrade to this release. + +This release is the result of many months of work and over 40 issues addressed. For a complete overview of these issues please see the +[release report](https://s.apache.org/nutch1.12). + +As usual in the 1.X series, release artifacts are made available as both source and binary and also available within +[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. +The release is available from our [DOWNLOADS PAGE](/downloads.html). diff --git a/content/news/nutch-1.13-release.md b/content/news/nutch-1.13-release.md new file mode 100644 index 00000000..3bdc7a8c --- /dev/null +++ b/content/news/nutch-1.13-release.md @@ -0,0 +1,19 @@ ++++ +date = "2017-04-02T00:00:00+00:00" +title = "Nutch 1.13 Release" +tags = ["1.13","release"] +categories = ["releases"] +draft = false +description = "2 April 2017 - Apache Nutch 1.13 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.13, we advise all +current users and developers of the 1.X series to upgrade to this release. + +An account of the CHANGES in this release can be seen in the +[release report](https://s.apache.org/wq3x). + +As usual in the 1.X series, release artifacts are made available as both source and binary and also available within +[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. +The release is available from our [DOWNLOADS PAGE](/downloads.html). diff --git a/content/news/nutch-1.14-release.md b/content/news/nutch-1.14-release.md new file mode 100644 index 00000000..6286cf33 --- /dev/null +++ b/content/news/nutch-1.14-release.md @@ -0,0 +1,19 @@ ++++ +date = "2017-12-23T00:00:00+00:00" +title = "Nutch 1.14 Release" +tags = ["1.14","release"] +categories = ["releases"] +draft = false +description = "23 December 2017 - Apache Nutch 1.14 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.14, we advise all +current users and developers of the 1.X series to upgrade to this release. + +An account of the CHANGES in this release can be seen in the +[release report](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=10680&version=12340218). + +As usual in the 1.X series, release artifacts are made available as both source and binary and also available within +[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. +The release is available from our [DOWNLOADS PAGE](/downloads.html). diff --git a/content/news/nutch-1.15-release.md b/content/news/nutch-1.15-release.md new file mode 100644 index 00000000..52404898 --- /dev/null +++ b/content/news/nutch-1.15-release.md @@ -0,0 +1,19 @@ ++++ +date = "2018-08-09T00:00:00+00:00" +title = "Nutch 1.15 Release" +tags = ["1.15","release"] +categories = ["releases"] +draft = false +description = "9 August 2018 - Apache Nutch 1.15 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.15, we advise all +current users and developers of the 1.X series to upgrade to this release. + +An account of the CHANGES in this release can be seen in the +[release report](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=10680&version=12342302). + +As usual in the 1.X series, release artifacts are made available as both source and binary and also available within +[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. +The release is available from our [DOWNLOADS PAGE](/downloads.html). diff --git a/content/news/nutch-1.16-release.md b/content/news/nutch-1.16-release.md new file mode 100644 index 00000000..85805285 --- /dev/null +++ b/content/news/nutch-1.16-release.md @@ -0,0 +1,20 @@ ++++ +date = "2019-10-11T00:00:00+00:00" +title = "Nutch 1.16 Release" +tags = ["1.16","release"] +categories = ["releases"] +draft = false +description = "11 October 2019 - Apache Nutch 1.16 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.16, we advise all +current users and developers of the 1.X series to upgrade to this release. + +An account of the CHANGES in this release can be seen in the +[release report](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=10680&version=12343430). +Breaking changes are listed in the [changelog](https://apache.org/dist/nutch/1.16/CHANGES.txt). + +As usual in the 1.X series, release artifacts are made available as both source and binary and also available within +[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch%20AND%20v:1.16) as a Maven dependency. +The release is available from our [downloads page](/downloads.html). diff --git a/content/news/nutch-1.17-release.md b/content/news/nutch-1.17-release.md new file mode 100644 index 00000000..eb5c0dbb --- /dev/null +++ b/content/news/nutch-1.17-release.md @@ -0,0 +1,20 @@ ++++ +date = "2020-07-02T00:00:00+00:00" +title = "Nutch 1.17 Release" +tags = ["1.17","release"] +categories = ["releases"] +draft = false +description = "2 July 2020 - Apache Nutch 1.17 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.17, we advise all +current users and developers of the 1.X series to upgrade to this release. + +An account of the CHANGES in this release can be seen in the +[release report](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=10680&version=12346090). +Breaking changes are listed in the [changelog](https://apache.org/dist/nutch/1.17/CHANGES.txt). + +As usual in the 1.X series, release artifacts are made available as both source and binary and also available within +[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch%20AND%20v:1.17) as a Maven dependency. +The release is available from our [downloads page](/downloads.html). diff --git a/content/news/nutch-1.2-release.md b/content/news/nutch-1.2-release.md new file mode 100644 index 00000000..52d9757b --- /dev/null +++ b/content/news/nutch-1.2-release.md @@ -0,0 +1,11 @@ ++++ +date = "2010-09-24T00:00:00+00:00" +title = "Apache Nutch 1.2 Released" +tags = ["1.2","release"] +categories = ["releases"] +draft = false +description = "24 September 2010 - Apache Nutch 1.2 Released" +weight = 10 ++++ + +The 1.2 release of Nutch is now available. This release includes several improvements (addition of parse-html as a selectable parser again, configurable per-field indexing), new features (including adding timing information to all Tool classes, and implementation of parser timeouts), and bug fixes (fixing an NPE in distributed search, fixing of XML formatting issues per Document fields). Please see the [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.2.txt) made in this versi [...] diff --git a/content/news/nutch-1.3-release.md b/content/news/nutch-1.3-release.md new file mode 100644 index 00000000..5ec2b3a8 --- /dev/null +++ b/content/news/nutch-1.3-release.md @@ -0,0 +1,11 @@ ++++ +date = "2011-06-07T00:00:00+00:00" +title = "Apache Nutch 1.3 Released" +tags = ["1.3","release"] +categories = ["releases"] +draft = false +description = "7 June 2011 - Apache Nutch 1.3 Released" +weight = 10 ++++ + +The 1.3 release of Nutch is now available. This release includes several improvements (improved RSS parsing support, tighter integration with Apache Tika, external parsing support, improved language identification and an order of magnitude smaller source release tarball -- only about 2MB!). Please see the [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.3.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). diff --git a/content/news/nutch-1.4-release.md b/content/news/nutch-1.4-release.md new file mode 100644 index 00000000..09a86fa9 --- /dev/null +++ b/content/news/nutch-1.4-release.md @@ -0,0 +1,11 @@ ++++ +date = "2011-11-26T00:00:00+00:00" +title = "Apache Nutch 1.4 Released" +tags = ["1.4","release"] +categories = ["releases"] +draft = false +description = "26 November 2011 - Apache Nutch 1.4 Released" +weight = 10 ++++ + +The 1.4 release of Nutch is now available. This release includes several improvements including allowing Parsers to declare support for multiple MIME types, configurable Fetcher Queue depth, Fetcher speed improvements, tigther Tika integration, and support for HTTP auth in Solr indexing. Please see the [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.4.txt) made in this version. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). diff --git a/content/news/nutch-1.5-release.md b/content/news/nutch-1.5-release.md new file mode 100644 index 00000000..f1818cb5 --- /dev/null +++ b/content/news/nutch-1.5-release.md @@ -0,0 +1,11 @@ ++++ +date = "2012-06-07T00:00:00+00:00" +title = "Apache Nutch 1.5 Released" +tags = ["1.5","release"] +categories = ["releases"] +draft = false +description = "07 June 2012 - Apache Nutch 1.5 Released" +weight = 10 ++++ + +The 1.5 release of Nutch is now available. This release includes several improvements including upgrades of several major components including Tika 1.1 and Hadoop 1.0.0, improvements to LinkRank and WebGraph elements as well as a number of new plugins covering blacklisting, filering and parsing to name a few. Please see the [list of changes](http://www.apache.org/dist/nutch/CHANGES-1.5.txt) made in this version for a full breakdown of the 50 odd improvements the release boasts. The relea [...] diff --git a/content/news/nutch-1.5.1-release.md b/content/news/nutch-1.5.1-release.md new file mode 100644 index 00000000..5bb413d0 --- /dev/null +++ b/content/news/nutch-1.5.1-release.md @@ -0,0 +1,11 @@ ++++ +date = "2012-07-10T00:00:00+00:00" +title = "Apache Nutch v1.5.1 Released" +tags = ["1.5.1","release"] +categories = ["releases"] +draft = false +description = "10 July 2012 - Apache Nutch v1.5.1 Released" +weight = 10 ++++ + +The Apache Nutch PMC are very pleased to announce the release of Apache Nutch v1.5.1. This release is a maintainence release of the popular 1.5.X mainstream version of Nutch which has been widely adopted within the community. Please see the [list of changes](http://www.apache.org/dist/nutch/1.5.1/CHANGES.txt) made in this version for a full breakdown. The release is available [here](http://www.apache.org/dyn/closer.cgi/nutch/). diff --git a/content/news/nutch-1.6-release.md b/content/news/nutch-1.6-release.md new file mode 100644 index 00000000..32fd65d8 --- /dev/null +++ b/content/news/nutch-1.6-release.md @@ -0,0 +1,11 @@ ++++ +date = "2012-12-06T00:00:00+00:00" +title = "Apache Nutch v1.6 Released" +tags = ["1.6","release"] +categories = ["releases"] +draft = false +description = "06 December 2012 - Apache Nutch v1.6 Released" +weight = 10 ++++ + +The Apache Nutch PMC are extremely pleased to announce the release of Apache Nutch v1.6. This release includes over 20 bug fixes, the same in improvements, as well as new functionalities including a new HostNormalizer, the ability to dynamically set fetchInterval by MIME-type and functional enhancements to the Indexer API inluding the normalization of URL's and the deletion of robots noIndex documents. Other notable improvements include the upgrade of key dependencies to [Tika 1.2](https [...] diff --git a/content/news/nutch-1.7-release.md b/content/news/nutch-1.7-release.md new file mode 100644 index 00000000..fc68e7ed --- /dev/null +++ b/content/news/nutch-1.7-release.md @@ -0,0 +1,11 @@ ++++ +date = "2013-06-24T00:00:00+00:00" +title = "Apache Nutch v1.7 Released" +tags = ["1.7","release"] +categories = ["releases"] +draft = false +description = "24th June 2013 - Apache Nutch v1.7 Released" +weight = 10 ++++ + +The Apache Nutch PMC are extremely pleased to announce the immediate release of Apache Nutch v1.7. This release includes over 20 bug fixes, as many improvements; most noticeably featuring a new [pluggable indexing architecture](https://issues.apache.org/jira/browse/NUTCH-1047) which currently supports [Apache Solr](http://lucene.apache.org/solr) and [Elastic Search](http://www.elasticsearch.org/). Shadowing the recent Nutch 2.2 release, parsing of Robots.txt is now delegated to [Crawler- [...] diff --git a/content/news/nutch-1.8-release.md b/content/news/nutch-1.8-release.md new file mode 100644 index 00000000..e3d677b3 --- /dev/null +++ b/content/news/nutch-1.8-release.md @@ -0,0 +1,11 @@ ++++ +date = "2014-03-17T00:00:00+00:00" +title = "Apache Nutch v1.8 Released" +tags = ["1.8","release"] +categories = ["releases"] +draft = false +description = "17 March 2014 - Apache Nutch v1.8 Released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.8, we advise all current users and developers of the 1.X series to upgrade to this release. Alhough this release includes library upgrades to [Crawler Commons](http://code.google.com/p/crawler-commons/) 0.3 and [Apache Tika](https://tika.apache.org/) 1.5, it also provides over 30 bug fixes as well as 18 improvements. Please see the [list of changes](http://www.apache.org/dist/nutch/1.8/CHANGES.txt) for [...] diff --git a/content/news/nutch-1.9-release.md b/content/news/nutch-1.9-release.md new file mode 100644 index 00000000..fd493873 --- /dev/null +++ b/content/news/nutch-1.9-release.md @@ -0,0 +1,11 @@ ++++ +date = "2014-08-16T00:00:00+00:00" +title = "Apache Nutch v1.9 Released" +tags = ["1.9","release"] +categories = ["releases"] +draft = false +description = "16 August 2014 - Apache Nutch 1.9 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.9, we advise all current users and developers of the 1.X series to upgrade to this release. This release addressed no fewer than 55 issues in total. Please see the [list of changes](https://www.apache.org/dist/nutch/1.9/CHANGES.txt) for a full breakdown, or see the [release report](https://s.apache.org/1.9-release). As usual in the 1.X series, this release is made available both as source and binary. Ad [...] diff --git a/content/news/nutch-10th-birthday.md b/content/news/nutch-10th-birthday.md new file mode 100644 index 00000000..854ff0e5 --- /dev/null +++ b/content/news/nutch-10th-birthday.md @@ -0,0 +1,11 @@ ++++ +date = "2012-08-10T00:00:00+00:00" +title = "Happy 10th Birthday Apache Nutch!!" +tags = ["birthday"] +categories = ["news"] +draft = false +description = "10 August 2012 - Happy 10th Birthday Apache Nutch!!" +weight = 10 ++++ + +It's official, Apache Nutch is now a decade old! The project has come a long long way since inception, through [acceptance into the Apache Incubator](#January+2005%3A+Nutch+Joins+Apache+Incubator) way back in Janurary 2005, to the [Top Level Project](#21+April+2010+-+Apache+Nutch+graduates+to+TLP) it became on 21st April 2010. Happy birthday Nutch and thanks to all contributors past and present! See [Doug Cutting's tweet](https://twitter.com/cutting/status/233415059798372353). diff --git a/content/news/nutch-2.0-release.md b/content/news/nutch-2.0-release.md new file mode 100644 index 00000000..375ae5e0 --- /dev/null +++ b/content/news/nutch-2.0-release.md @@ -0,0 +1,11 @@ ++++ +date = "2012-07-07T00:00:00+00:00" +title = "Apache Nutch v2.0 Released" +tags = ["2.0","release"] +categories = ["releases"] +draft = false +description = "07 July 2012 - Apache Nutch v2.0 Released" +weight = 10 ++++ + +The Apache Nutch PMC are very pleased to announce the release of Apache Nutch v2.0. This release offers users an edition focused on large scale crawling which builds on storage abstraction (via Apache Gora™) for big data stores such as Apache Accumulo™, Apache Avro™, Apache Cassandra™, Apache HBase™, HDFS™, an in memory data store and various high profile SQL stores. After some two years of development Nutch v2.0 also offers all of the mainstream Nutch functionality and it builds on Apac [...] diff --git a/content/news/nutch-2.1-release.md b/content/news/nutch-2.1-release.md new file mode 100644 index 00000000..d1529ded --- /dev/null +++ b/content/news/nutch-2.1-release.md @@ -0,0 +1,11 @@ ++++ +date = "2012-10-05T00:00:00+00:00" +title = "Apache Nutch v2.1 Released" +tags = ["2.1","release"] +categories = ["releases"] +draft = false +description = "05 October 2012 - Apache Nutch v2.1 Released" +weight = 10 ++++ + +The Apache Nutch PMC are very pleased to announce the release of Apache Nutch v2.1. This release continues to provide Nutch users with a simplified Nutch distribution building on the 2.x development drive which is growing in popularity amongst the community. As well as addressing ~20 bugs this release also offers improved properties for better [Solr](http://lucene.apache.org/solr/) configuration, upgrades to various [Gora](https://gora.apache.org/) dependencies and the introduction of th [...] diff --git a/content/news/nutch-2.2-release.md b/content/news/nutch-2.2-release.md new file mode 100644 index 00000000..77a626ab --- /dev/null +++ b/content/news/nutch-2.2-release.md @@ -0,0 +1,11 @@ ++++ +date = "2013-06-08T00:00:00+00:00" +title = "Apache Nutch v2.2 Released" +tags = ["2.2","release"] +categories = ["releases"] +draft = false +description = "08 June 2013 - Apache Nutch v2.2 Released" +weight = 10 ++++ + +The Apache Nutch PMC are extremely pleased to announce the immediate release of Apache Nutch v2.2. This release includes over 30 bug fixes and over 25 improvements representing the third release of increasingly popular 2.x Nutch series. This release features inclusion of [Crawler-Commons](http://code.google.com/p/crawler-commons/) which Nutch now utilizes for improved robots.txt parsing, library upgrades to [Apache Hadoop](https://hadoop.apache.org/) 1.1.1, [Apache Gora](https://gora.apa [...] diff --git a/content/news/nutch-2.2.1-release.md b/content/news/nutch-2.2.1-release.md new file mode 100644 index 00000000..ff55d99f --- /dev/null +++ b/content/news/nutch-2.2.1-release.md @@ -0,0 +1,11 @@ ++++ +date = "2013-07-02T00:00:00+00:00" +title = "Apache Nutch v2.2.1 Released" +tags = ["2.2.1","release"] +categories = ["releases"] +draft = false +description = "02 July 2013 - Apache Nutch v2.2.1 Released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v2.2.1, we advise all current users and developers of the 2.X series to upgrade to this release ASAP. Although this release includes library upgrades to [Apache Hadoop](https://hadoop.apache.org/) 1.2.0 and [Apache Tika](https://tika.apache.org/) 1.3, it is predominantly a bug fix for [NUTCH-1591 - Incorrect conversion of ByteBuffer to String](https://issues.apache.org/jira/browse/NUTCH-1591). Please see t [...] diff --git a/content/news/nutch-2.3-release.md b/content/news/nutch-2.3-release.md new file mode 100644 index 00000000..e78aa3c6 --- /dev/null +++ b/content/news/nutch-2.3-release.md @@ -0,0 +1,34 @@ ++++ +date = "2015-01-22T00:00:00+00:00" +title = "Nutch 2.3 Release" +tags = ["2.3","release"] +categories = ["releases"] +draft = false +description = "22 January 2015 - Nutch 2.3 Release" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v2.3, we advise all +current users and developers of the 2.X series to upgrade to this release. +After successful completion of the first [Nutch Google Summer of Code project](https://issues.apache.org/jira/browse/NUTCH-841) +we are pleased to announce that Nutch 2.3 release now comes packaged with a self +contained [Apache Wicket](https://wicket.apache.org/)-based Web Application. + +This release is the result of many months of work and 143 issues addressed. For a complete overview of these issues please see the +[release report](https://s.apache.org/nutch_2.3). + +As usual in the 2.x series, this release is made available only as source, but is also available within +[Maven Central](https://search.maven.org/) as a Maven dependency. +The release is available from our [DOWNLOADS PAGE](/downloads.html). + +The supported [Apache Gora](https://gora.apache.org/) v0.5 backends are; + + * [Apache Hadoop](https://hadoop.apache.org/) 1.0.1 & 2.4.0 + * [Apache Cassandra](https://cassandra.apache.org/) 2.0.2 + * [Apache HBase](https://hbase.apache.org/) 0.94.14 + * [Apache Accumulo](https://accumulo.apache.org/) 1.5.1 + * [MongoDB](https://mongodb.org/) 2.12.2 + * [Apache Solr](https://lucene.apache.org/solr) 4.8.1 + * [Apache Avro](https://avro.apache.org/) 1.7.6 + +Please note that the SQL backend for Gora has been deprecated. diff --git a/content/news/nutch-2.3.1-release.md b/content/news/nutch-2.3.1-release.md new file mode 100644 index 00000000..0c3dfe41 --- /dev/null +++ b/content/news/nutch-2.3.1-release.md @@ -0,0 +1,32 @@ ++++ +date = "2016-01-21T00:00:00+00:00" +title = "Nutch 2.3.1 Release" +tags = ["2.3.1","release"] +categories = ["releases"] +draft = false +description = "21 January 2016 - Apache Nutch 2.3.1 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v2.3.1, we advise all +current users and developers of the 2.X series to upgrade to this release. + +This bug fix release contains around 40 issues addressed. For a complete overview of these issues please see the +[release report](https://s.apache.org/nutch_2.3.1). + +As usual in the 2.X series, release artifacts are made available as only source and also available within +[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch&core=gav) as a Maven dependency. +The release is available from our [DOWNLOADS PAGE](/downloads.html). + +The recommended Gora backends for this Nutch release are + + - Apache Avro 1.7.6 + - Apache Hadoop 1.2.1 and 2.5.2 + - Apache HBase 0.98.8-hadoop2 (although also tested with 1.X) + - Apache Cassandra 2.0.2 + - Apache Solr 4.10.3 + - MongoDB 2.6.X + - Apache Accumlo 1.5.1 + - Apache Spark 1.4.1 + +Thank you to everyone that contributed towards this release. diff --git a/content/news/nutch-2.4-release.md b/content/news/nutch-2.4-release.md new file mode 100644 index 00000000..754a85b8 --- /dev/null +++ b/content/news/nutch-2.4-release.md @@ -0,0 +1,22 @@ ++++ +date = "2019-10-11T00:00:00+00:00" +title = "Nutch 2.4 Release" +tags = ["2.4","release"] +categories = ["releases"] +draft = false +description = "11 October 2019 - Apache Nutch 2.4 released" +weight = 10 ++++ + +The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v2.4, we advise all +current users and developers of the 2.X series to upgrade to this release. + +This release contains 81 issues addressed. For a complete overview of these +issues please see the [release report](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=10680&version=12324540). + +As usual in the 2.X series, release artifacts are made available as only source and also available within +[Maven Central](https://search.maven.org/search?q=g:org.apache.nutch%20AND%20a:nutch%20AND%20v:2.4) as a Maven dependency. +The release is available from our [downloads page](/downloads.html). + +We expect that v2.4 is the last release on the 2.X series. We've decided to freeze the development on the 2.X branch for now, as no committer is +actively working on it. diff --git a/content/news/nutch-at-apachecon-eu-2009.md b/content/news/nutch-at-apachecon-eu-2009.md new file mode 100644 index 00000000..9520f1c6 --- /dev/null +++ b/content/news/nutch-at-apachecon-eu-2009.md @@ -0,0 +1,20 @@ ++++ +date = "2009-02-09T00:00:00+00:00" +title = "Lucene at ApacheCon Europe 2009 in Amsterdam" +tags = ["ApacheCon"] +categories = ["news"] +draft = false +description = "09 February 2009 - Lucene at ApacheCon Europe 2009 in Amsterdam" +weight = 10 ++++ + + [](http://www.eu.apachecon.com/c/aceu2009/ "ApacheCon EU 2009") Lucene will be extremely well represented at [ApacheCon EU 2009](http://www.eu.apachecon.com/c/aceu2009/) in Amsterdam, Netherlands this March 23-27, 2009: + +* [Lucene Boot Camp](http://eu.apachecon.com/c/aceu2009/sessions/197) - A two day training session, March 23 & 24th +* [Solr Boot Camp](http://eu.apachecon.com/c/aceu2009/sessions/201) - A one day training session, March 24th +* [Introducing Apache Mahout](http://eu.apachecon.com/c/aceu2009/sessions/136) - Grant Ingersoll. March 25th @ 10:30 +* [Lucene/Solr Case Studies](http://eu.apachecon.com/c/aceu2009/sessions/137) - Erik Hatcher. March 25th @ 11:30 +* [Advanced Indexing Techniques with Apache Lucene](http://eu.apachecon.com/c/aceu2009/sessions/138) - Michael Busch. March 25th @ 14:00 +* [Apache Solr - A Case Study](http://eu.apachecon.com/c/aceu2009/sessions/251) - Uri Boness. March 26th @ 17:30 +* [Best of breed - httpd, forrest, solr and droids](http://eu.apachecon.com/c/aceu2009/sessions/250) - Thorsten Scherler. March 27th @ 17:30 +* [Apache Droids - an intelligent standalone robot framework](http://eu.apachecon.com/c/aceu2009/sessions/165) - Thorsten Scherler. March 26th @ 15:00 diff --git a/content/news/nutch-at-apachecon-eu-2014.md b/content/news/nutch-at-apachecon-eu-2014.md new file mode 100644 index 00000000..7db87f2c --- /dev/null +++ b/content/news/nutch-at-apachecon-eu-2014.md @@ -0,0 +1,13 @@ ++++ +date = "2014-07-31T00:00:00+00:00" +title = "Nutch tutorial at upcoming ApacheCon Europe in Budapest" +tags = ["ApacheCon", "Nutch tutorial"] +categories = ["news"] +draft = false +description = "31 July 2014 - Nutch tutorial at upcoming ApacheCon Europe in Budapest" +weight = 10 ++++ + + + +The upcoming [ApacheCon Europe](http://events17.linuxfoundation.org/events/archive/2014/apachecon-europe) in Budapest, November 17 - 21, 2014, will offer a one-day [Nutch tutorial](http://sched.co/1pbE15n). Topics will span from Nutch installation and configuration up to plugin development. Both Nutch 1.x and 2.x are covered. The conference is a good opportunity to bring together both users and committers of Nutch and related projects. diff --git a/content/news/nutch-at-apachecon-na-2009.md b/content/news/nutch-at-apachecon-na-2009.md new file mode 100644 index 00000000..47f2d401 --- /dev/null +++ b/content/news/nutch-at-apachecon-na-2009.md @@ -0,0 +1,32 @@ ++++ +date = "2009-08-14T00:00:00+00:00" +title = "Lucene at US ApacheCon" +tags = ["ApacheCon"] +categories = ["news"] +draft = false +description = "14 August 2009 - Lucene at US ApacheCon" +weight = 10 ++++ + + [](http://www.us.apachecon.com/c/acus2009/ "ApacheCon US 2009") ApacheCon US is once again in the Bay Area and Lucene is coming along for the ride! The Lucene community has planned two full days of talks, plus a meetup and the usual bevy of training. With a well-balanced mix of first time and veteran ApacheCon speakers, the [Lucene track](http://www.us.apachecon.com/c/acus2009/schedule#lucene) at ApacheCon US prom [...] + +Training: + +* [Lucene Boot Camp](http://www.us.apachecon.com/c/acus2009/sessions/437) - A two day training session, Nov. 2nd & 3rd +* [Solr Day](http://www.us.apachecon.com/c/acus2009/sessions/375) - A one day training session, Nov. 2nd + +Thursday, Nov. 5th + +* [Introduction to the Lucene Ecosystem](http://www.us.apachecon.com/c/acus2009/sessions/428) \- Grant Ingersoll @ 9:00 +* [Lucene Basics and New Features](http://www.us.apachecon.com/c/acus2009/sessions/461) - Michael Busch @ 10:00 +* [Apache Solr: Out of the Box](http://www.us.apachecon.com/c/acus2009/sessions/331) - Chris Hostetter @ 14:00 +* [Introduction to Nutch](http://www.us.apachecon.com/c/acus2009/sessions/427) - Andrzej Bialecki @ 15:00 +* [Lucene and Solr Performance Tuning](http://www.us.apachecon.com/c/acus2009/sessions/430) - Mark Miller @ 16:30 + +Friday, Nov. 6th + +* [Implementing an Information Retrieval Framework for an Organizational Repository](http://www.us.apachecon.com/c/acus2009/sessions/332) - Sithu D Sudarsan @ 9:00 +* [Apache Mahout - Going from raw data to Information](http://www.us.apachecon.com/c/acus2009/sessions/333) - Isabel Drost @ 10:00 +* [MIME Magic with Apache Tika](http://www.us.apachecon.com/c/acus2009/sessions/334) - Jukka Zitting @ 11:30 +* [Building Intelligent Search Applications with the Lucene Ecosystem](http://www.us.apachecon.com/c/acus2009/sessions/335) - Ted Dunning @ 14:00 +* [Realtime Search](http://www.us.apachecon.com/c/acus2009/sessions/462) - Jason Rutherglen @ 15:00 diff --git a/content/news/nutch-at-apachecon-na-2014.md b/content/news/nutch-at-apachecon-na-2014.md new file mode 100644 index 00000000..250e7022 --- /dev/null +++ b/content/news/nutch-at-apachecon-na-2014.md @@ -0,0 +1,13 @@ ++++ +date = "2014-04-07T00:00:00+00:00" +title = "Nutch at ApacheCon 2014, Denver Colorado" +tags = ["ApacheCon"] +categories = ["news"] +draft = false +description = "07-09 April 2014 - Nutch at ApacheCon 2014, Denver Colorado" +weight = 10 ++++ + + [](http://events.linuxfoundation.org/events/apachecon-north-america "ApacheCon NA 2014") + +Lots of talk and loads of exposure for this at ApacheCon NA 2014 in the beautiful city of Denver, CO. This year one presentation focused on [Building your Big Data Search Stack with Apache Nutch 2.x](http://sched.co/1pav9xl). You can see presentation slides below and follow the audio (sorry no video) [here](https://www.youtube.com/watch?v=rIv3Js-zBpE) diff --git a/content/news/nutch-dev-focus-1.x.md b/content/news/nutch-dev-focus-1.x.md new file mode 100644 index 00000000..3eb6e227 --- /dev/null +++ b/content/news/nutch-dev-focus-1.x.md @@ -0,0 +1,11 @@ ++++ +date = "2011-09-23T00:00:00+00:00" +title = "Apache Nutch focuses on 1.x series for main development" +tags = ["development"] +categories = ["news"] +draft = false +description = "23 September 2011 - Apache Nutch focuses on 1.x series for main development" +weight = 10 ++++ + +After some [discussion](http://www.mail-archive.com/[email protected]/msg03581.html) and a [vote](http://www.mail-archive.com/[email protected]/msg04348.html) about the issue, the Nutch development community decided to focus their efforts on maintaining and releasing the 1.x series of Nutch, and to branch the now former Nutch trunk based on Gora, allowing others to try and improve it, while the mainline development goes on. diff --git a/content/news/nutch-graduates-from-incubator.md b/content/news/nutch-graduates-from-incubator.md new file mode 100644 index 00000000..ac70e134 --- /dev/null +++ b/content/news/nutch-graduates-from-incubator.md @@ -0,0 +1,11 @@ ++++ +date = "2005-06-15T00:00:00+00:00" +title = "Nutch graduates from Incubator" +tags = ["ASF", "incubator"] +categories = ["news"] +draft = false +description = "June 2005 - Nutch graduates from Incubator" +weight = 10 ++++ + +Nutch has now graduated from the Apache incubator, and is now a Subproject of Lucene. diff --git a/content/news/nutch-graduates-tlp.md b/content/news/nutch-graduates-tlp.md new file mode 100644 index 00000000..919080ed --- /dev/null +++ b/content/news/nutch-graduates-tlp.md @@ -0,0 +1,11 @@ ++++ +date = "2010-04-21T00:00:00+00:00" +title = "Apache Nutch graduates to TLP" +tags = ["ASF","top-level project"] +categories = ["news"] +draft = false +description = "21 April 2010 - Apache Nutch graduates to TLP" +weight = 10 ++++ + +[Passed by unanimous approval of the Apache Board](http://www.apache.org/foundation/records/minutes/2010/board_minutes_2010_04_21.txt), Nutch graduated to TLP status. We are in the process of updating the website, and moving things around, so if you notice anything out of place, [please let us know.](./mailing_lists.html) diff --git a/content/news/nutch-gsoc-2014.md b/content/news/nutch-gsoc-2014.md new file mode 100644 index 00000000..fd4c4735 --- /dev/null +++ b/content/news/nutch-gsoc-2014.md @@ -0,0 +1,15 @@ ++++ +date = "2014-05-01T00:00:00+00:00" +title = "Apache Nutch Participates in Google Summer of Code" +tags = ["GSoC"] +categories = ["news"] +draft = false +description = "01 May 2014 - Apache Nutch Participates in Google Summer of Code" +weight = 10 ++++ + + + +For the first time in Nutch project history, we are participating as part of Apache's mentoring efforts in the ever popular [Google Summer of Code](https://www.google-melange.com/gsoc/homepage/google/gsoc2014) program. This years project involves the [creation of a Apache Wicket-based Web Application](https://issues.apache.org/jira/browse/NUTCH-841) for Nutch 2.X branch. + +Keep your eyes peeled and check here for updates as the project progresses throughout the summer. diff --git a/content/news/nutch-joins-incubator.md b/content/news/nutch-joins-incubator.md new file mode 100644 index 00000000..8b522eba --- /dev/null +++ b/content/news/nutch-joins-incubator.md @@ -0,0 +1,11 @@ ++++ +date = "2005-01-15T00:00:00+00:00" +title = "Nutch Joins Apache Incubator" +tags = ["ASF", "incubator"] +categories = ["news"] +draft = false +description = "January 2005 - Nutch Joins Apache Incubator" +weight = 10 ++++ + +Nutch is a two-year-old open source project, previously hosted at Sourceforge and backed by its own non-profit organization. The non-profit was founded in order to assign copyright, so that we could retain the right to change the license. We have now determined that the Apache license is the appropriate license for Nutch and no longer require the overhead of an independent non-profit organization. Nutch's board of directors and its developers were both polled and supported the move to th [...] diff --git a/content/news/nutch-search-creative-commons.md b/content/news/nutch-search-creative-commons.md new file mode 100644 index 00000000..1e685364 --- /dev/null +++ b/content/news/nutch-search-creative-commons.md @@ -0,0 +1,13 @@ ++++ +date = "2004-09-15T00:00:00+00:00" +title = "Creative Commons launches Nutch-based Search" +tags = ["search", "creative commons"] +categories = ["news"] +draft = false +description = "September 2004 - Creative Commons launches Nutch-based Search" +weight = 10 ++++ + +Creative Commons unveiled a beta version of its search engine, which scours the web for text, images, audio, and video free to re-use on certain terms a search refinement offered by no other company or organization. + +See the [Creative Commons Press Release](http://creativecommons.org/press-releases/entry/5064) for more details. diff --git a/content/news/nutch-search-osu.md b/content/news/nutch-search-osu.md new file mode 100644 index 00000000..434241b8 --- /dev/null +++ b/content/news/nutch-search-osu.md @@ -0,0 +1,13 @@ ++++ +date = "2004-09-15T00:00:00+00:00" +title = "Oregon State University switches to Nutch" +tags = ["search", "OSU"] +categories = ["news"] +draft = false +description = "September 2004 - Oregon State University switches to Nutch" +weight = 10 ++++ + +Oregon State University is converting its searching infrastructure from Googletm to the open source project Nutch. The effort to replace the Googletm will realize significant cost savings for Oregon State University, while promoting both the Nutch Search Engine and transparency in search engine use and management. + +For more details see the announcement by OSU's [Open Source Lab](http://osuosl.org/news_folder/nutch). \ No newline at end of file diff --git a/content/news/nutch-wiki-migrated.md b/content/news/nutch-wiki-migrated.md new file mode 100644 index 00000000..9efda5ed --- /dev/null +++ b/content/news/nutch-wiki-migrated.md @@ -0,0 +1,11 @@ ++++ +date = "2019-07-26T00:00:00+00:00" +title = "Nutch Wiki Migrated" +tags = ["wiki"] +categories = ["news"] +draft = false +description = "26 July 2019 - Nutch Wiki Migrated" +weight = 10 ++++ + +The [Apache Nutch wiki](https://cwiki.apache.org/confluence/display/NUTCH/Home) has been migrated from MoinMoin to Confluence. diff --git a/content/news/wicket-webapp-gsoc.md b/content/news/wicket-webapp-gsoc.md new file mode 100644 index 00000000..6acbcf78 --- /dev/null +++ b/content/news/wicket-webapp-gsoc.md @@ -0,0 +1,26 @@ ++++ +date = "2014-09-22T00:00:00+00:00" +title = "Wicket WebApp now part of Nutch 2.x Codebase" +tags = ["webapp","wicket","GSoC"] +categories = ["news"] +draft = false +description = "22 September 2014 - Wicket WebApp now part of Nutch 2.x Codebase" +weight = 10 ++++ + +After successful completion of the first [Nutch Google Summer of Code project](https://issues.apache.org/jira/browse/NUTCH-841) +we are pleased to announce that Nutch 2.X branch now comes packaged with a self +contained [Apache Wicket](https://wicket.apache.org/)-based Web Application. + +This not only greatly lowers the barrier for direct interaction with the Nutch 2.X +REST API but also provides a stepping stone from which we intend to backport this +work to the Nutch 1.X (trunk) series. + +Some of the Web Application features include: + + * Functionality to dynamically load seed URLs in order to bootstrap Nutch crawls + * Browsable and dynamic editing of [Configuration overrides](https://cwiki.apache.org/nutch/NutchPropertiesCompleteList) + * Complete [REST API documentation](https://cwiki.apache.org/nutch/NutchRESTAPI) and UML +model describing REST API calls, Administration and Job and Configuration Management. + +The new Web Application feature will be present within the upcoming Nutch 2.3 Release.
