[ANNOUNCE] Apache Tika 0.6 released
(...apologies for the cross posting...) The Apache Lucene project is pleased to announce the release of Apache Tika 0.6. The release contents have been pushed out to the main Apache release site and the m2 ibiblio sync, so the releases should be available as soon as the mirrors get the syncs. Apache Tika, a subproject of Apache Lucene, is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. Apache Tika 0.6 contains a number of improvements and bug fixes. Details can be found in the changes file: http://www.apache.org/dist/lucene/tika/CHANGES-0.6.txt Apache Tika is available in source form from the following download page: http://www.apache.org/dyn/closer.cgi/lucene/tika/apache-tika-0.6-src.zip Apache Tika is also available in binary form or for use using Maven 2 from the Central Maven Repositories: http://repo1.maven.org/maven2/org/apache/tika/ http://mirrors.ibiblio.org/pub/mirrors/maven2/org/apache/tika/ In the initial 48 hours, the release may not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: http://www.apache.org/dist/lucene/tika/KEYS-0.6.txt For more information on Apache Tika, visit the project home page: http://lucene.apache.org/tika -- Chris Mattmann (on behalf of the Apache Lucene community) ++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-266B, Mailstop: 171-246 Email: chris.mattm...@jpl.nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Assistant Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++
[ANNOUNCE] Apache Nutch 1.1 released
(...apologies for the cross posting...) The Apache Nutch project is pleased to announce the release of Apache Nutch 1.1. The release contents have been pushed out to the main Apache release site so the releases should be available as soon as the mirrors get the syncs. Apache Nutch, one of the six new Apache TLPs as a result of the April 2010 Board Meeting, is an extensible framework for building out large-scale web-based search. Layered on top of fellow Apache projects Hadoop, Lucene/Solr, and Tika, Nutch provides an out of the box platform for fetching web pages, pdf files, word documents, and more. Nutch parses the content and its relevant information, indexes its metadata, and makes it available for efficient query and retrieval over modern Internet protocols. Apache Nutch 1.1 contains a number of improvements and bug fixes. Details can be found in the changes file: http://www.apache.org/dist/nutch/CHANGES-1.1.txt Apache Nutch is available in source and binary form from the following download page: http://www.apache.org/dyn/closer.cgi/nutch/ In the initial 48 hours, the release may not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: http://www.apache.org/dist/nutch/KEYS-1.1.txt For more information on Apache Nutch, visit the project home page: http://nutch.apache.org -- Chris Mattmann (on behalf of the Apache Nutch community)
[ANNOUNCE] Apache Nutch 1.2 released
(...apologies for the cross posting...) The Apache Nutch project is pleased to announce the release of Apache Nutch 1.2. The release contents have been pushed out to the main Apache release site so the releases should be available as soon as the mirrors get the syncs. Apache Nutch, one of the six new Apache TLPs as a result of the April 2010 Board Meeting, is an extensible framework for building out large-scale web-based search. Layered on top of fellow Apache projects Hadoop, Lucene/Solr, and Tika, Nutch provides an out of the box platform for fetching web pages, pdf files, word documents, and more. Nutch parses the content and its relevant information, indexes its metadata, and makes it available for efficient query and retrieval over modern Internet protocols. Apache Nutch 1.2 contains a number of improvements and bug fixes. Details can be found in the changes file: http://www.apache.org/dist/nutch/CHANGES-1.2.txt Apache Nutch is available in source and binary form from the following download page: http://www.apache.org/dyn/closer.cgi/nutch/ In the initial 48 hours, the release may not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: http://www.apache.org/dist/nutch/KEYS-1.2.txt For more information on Apache Nutch, visit the project home page: http://nutch.apache.org -- Chris Mattmann (on behalf of the Apache Nutch community) ++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-266B, Mailstop: 171-246 Email: chris.mattm...@jpl.nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Assistant Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++
[ANNOUNCE] Apache Tika 1.6 released
The Apache Tika project is pleased to announce the release of Apache Tika 1.6. The release contents have been pushed out to the main Apache release site and to the Maven Central sync, so the releases should be available as soon as the mirrors get the syncs. Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. Apache Tika 1.6 contains a number of improvements and bug fixes. Details can be found in the changes file: http://www.apache.org/dist/tika/CHANGES-1.6.txt Apache Tika is available in source form from the following download page: http://www.apache.org/dyn/closer.cgi/tika/apache-tika-1.6-src.zip Apache Tika is also available in binary form or for use using Maven 2 from the Central Repository: http://repo1.maven.org/maven2/org/apache/tika/ In the initial 48 hours, the release may not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: https://people.apache.org/keys/group/tika.asc For more information on Apache Tika, visit the project home page: http://tika.apache.org/ -- Chris Mattmann, on behalf of the Apache Tika community
[ANNOUNCE] Apache OODT 0.7 Released
The Apache OODT community is proud to announce the release of Apache OODT 0.7. Apache OODT is a software framework, and an architectural style for the rapid construction of scientific data systems. It provides components for data capture, curation, metadata extraction, workflow management, resource management and data processing. The 0.7 release addresses 79 issues from our JIRA management system: http://issues.apache.org/jira/browse/OODT This release is a core stability release with numerous bug fixes, and a solid baseline for upgrading. It fixes 79 JIRA issues and provides a backwards compatible CAS-PGE upgrade allowing the wrapper to be used with existing config files. 0.7 fixes some bugs in Ganglia support and allows the Resource Manager to work without Ganglia being present. This version also makes significant fixes to File Manager tests, provides a Vagrant version of OODT RADIX and upgrades RADIX to include CAS-PGE support. This version also includes improved support for a new JAX-RS CAS product service, and this version eliminates several bugs in the workflow manager in its support for dynamic workflow instances. The release is available from: http://www.apache.org/dyn/closer.cgi/oodt/ In the first 48 hours, the release will be propagating to the mirrors, so allow time for it to show up in a mirror near you. The release is also made available via Maven Central: http://repo1.maven.org/maven2/org/apache/oodt/ And from PyPi: https://pypi.python.org/pypi/oodt And from PEAR: http://pear.apache.org/oodt/ The Apache OODT website will be updated shortly to reflect the releases. As always if you find anything that you'd like to report (including praise!) please do so on our u...@oodt.apache.org (use of OODT) or d...@oodt.apache.org (architecture/design of OODT) mailing lists. Also please visit our website: http://oodt.apache.org And our wiki: https://cwiki.apache.org/confluence/display/OODT/Home Thanks! Chris Mattmann (on behalf of the Apache OODT PMC)
[ANNOUNCE] Apache Tika 1.11 release
The Apache Tika project is pleased to announce the release of Apache Tika 1.11. The release contents have been pushed out to the main Apache release site and to the Central sync, so the releases should be available as soon as the mirrors get the syncs. Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. Apache Tika 1.11 contains a number of improvements and bug fixes. Details can be found in the changes file: http://www.apache.org/dist/tika/CHANGES-1.11.txt <http://www.apache.org/dist/tika/CHANGES-1.11.txt> Apache Tika is available in source form from the following download page: http://www.apache.org/dyn/closer.cgi/tika/apache-tika-1.11-src.zip <http://www.apache.org/dyn/closer.cgi/tika/apache-tika-1.11-src.zip> Apache Tika is also available in binary form or for use using Maven 2 from the Central Repository: http://repo1.maven.org/maven2/org/apache/tika/ <http://repo1.maven.org/maven2/org/apache/tika/> In the initial 48 hours, the release may not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: https://people.apache.org/keys/group/tika.asc <https://people.apache.org/keys/group/tika.asc> For more information on Apache Tika, visit the project home page: http://tika.apache.org/ <http://tika.apache.org/> — Chris Mattmann, on behalf of the Apache Tika community
[ANNOUNCE] Apache Tika 1.12 release
The Apache Tika project is pleased to announce the release of Apache Tika 1.12. The release contents have been pushed out to the main Apache release site and to the Central sync, so the releases should be available as soon as the mirrors get the syncs. Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. Apache Tika 1.12 contains a number of improvements and bug fixes. Details can be found in the changes file: http://www.apache.org/dist/tika/CHANGES-1.12.txt <http://www.apache.org/dist/tika/CHANGES-1.12.txt> Apache Tika is available in source form from the following download page: http://www.apache.org/dyn/closer.cgi/tika/apache-tika-1.12-src.zip <http://www.apache.org/dyn/closer.cgi/tika/apache-tika-1.12-src.zip> Apache Tika is also available in binary form or for use using Maven 2 from the Central Repository: http://repo1.maven.org/maven2/org/apache/tika/ <http://repo1.maven.org/maven2/org/apache/tika/> In the initial 48 hours, the release may not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: https://people.apache.org/keys/group/tika.asc <https://people.apache.org/keys/group/tika.asc> For more information on Apache Tika, visit the project home page: http://tika.apache.org/ <http://tika.apache.org/> — Chris Mattmann, on behalf of the Apache Tika community
[ANNOUNCE] Apache OODT 1.1 release
The Apache OODT project is pleased to announce the release of Apache OODT 1.1. The release contents have been pushed out to the main Apache release site and to the Maven Central sync, so the releases should be available as soon as the mirrors get the syncs. Apache OODT is a software framework as well as an architectural style for the rapid construction of scientific data systems. It provides components for data capture, curation, metadata extraction, workflow management, resource management, and data processing. Apache OODT 1.1 contains a number of improvements and bug fixes. Details can be found in the changes file: http://www.apache.org/dist/oodt/CHANGES-1.1.txt Apache OODT is available in source form from the following download page: http://www-us.apache.org/dist/oodt/apache-oodt-1.1-src.zip Apache OODT is also available in binary form or for use using Maven 2 from the Central Repository: http://repo1.maven.org/maven2/org/apache/oodt/ In the initial 48 hours, the release may not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: https://people.apache.org/keys/group/oodt.asc For more information on Apache OODT, visit the project home page: http://oodt.apache.org/ -- Chris Mattmann, on behalf of the Apache OODT community
[ANNOUNCE] Apache OODT 1.2.1 release
The Apache OODT project is pleased to announce the release of Apache OODT 1.2.1. The release contents have been pushed out to the main Apache release site and to the Maven Central sync, so the releases should be available as soon as the mirrors get the syncs. Apache OODT is a software framework as well as an architectural style for the rapid construction of scientific data systems. It provides components for data capture, curation, metadata extraction, workflow management, resource management, and data processing. Apache OODT 1.2.1 contains a number of improvements and bug fixes. Details can be found in the changes file: http://www.apache.org/dist/oodt/CHANGES-1.2.1.txt Apache OODT is available in source form from the following download page: http://www.apache.org/dyn/closer.cgi/oodt Apache OODT is also available in binary form or for use using Maven 2 from the Central Repository: http://repo1.maven.org/maven2/org/apache/oodt/ In the initial 48 hours, the release may not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: https://people.apache.org/keys/group/oodt.asc For more information on Apache OODT, visit the project home page: http://oodt.apache.org/ -- Chris Mattmann, on behalf of the Apache OODT community