Great work! ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-283, Mailstop: 171-246 Email: [email protected] WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
-----Original Message----- From: Markus Jelsma <[email protected]> Reply-To: "[email protected]" <[email protected]>, Markus Jelsma <[email protected]> Date: Monday, March 17, 2014 2:36 AM To: "[email protected]" <[email protected]> Subject: Re: [ANNONCEMENT] Apache Nutch 1.8 Release > > > >Thanks lewis! >Lewis John Mcgibbney <[email protected]> schreef: >Good Evening, > >The Apache Nutch PMC are pleased to announce the immediate release of >Apache Nutch v1.8. > > >Apache Nutch is a highly extensible and scalable open source web crawler >software project. Stemming from Apache Lucene, the project has >diversified and now comprises two codebases, namely: Nutch 1.x: A well >matured, production ready crawler. 1.x enables > fine grained configuration, relying on Apache Hadoop data structures, >which are great for batch processing. Nutch 2.x: An emerging alternative >taking direct inspiration from 1.x, but which differs in one key area; >storage is abstracted away from any specific > underlying data store by using Apache Gora for handling object to >persistent mappings. This means we can implement an extremely flexibile >model/stack for storing everything (fetch time, status, content, parsed >text, outlinks, inlinks, etc.) into a number of > NoSQL storage solutions. >We advise all current users and developers of the 1.X series to upgrade >to this release. Although this release includes library upgrades to >Crawler Commons 0.3 and Apache Tika 1.4, it also provides over 30 bug >fixes as well as 18 improvements. Please see the >list of changes <http://www.apache.org/dist/nutch/1.8/CHANGES.txt> for a >full breakdown, or see the >release report <http://s.apache.org/oHY>. As usual in the 1.X series, >this release is made available both as source and binary. Additionally >developers can find Maven artifacts within >Maven Central <http://search.maven.org/>. The release is available >here <http://www.apache.org/dyn/closer.cgi/nutch/>. > > >Thank you > >Lewis > >(On behalf of the Nutch PMC) > >-- >Lewis > > > > >

