For future reference, I have also added a solution for the AR misdetection issue that I reported in August (https://issues.apache.org/jira/browse/TIKA-697).
The proposed solution is for the TextDetector.detect() method, which erroneously reports "text/plain" for AR archives.

Congratulations and +1 for the release! :-)

On Fri, Nov 4, 2011 at 5:42 PM, Mattmann, Chris A (388J) <[email protected]> wrote:
> Hi Folks,
>
> A candidate for the Tika 1.0 release is available at:
>
> http://people.apache.org/~mattmann/apache-tika-1.0/rc1/
>
> The release candidate is a zip archive of the sources in:
>
> http://svn.apache.org/repos/asf/tika/tags/1.0/
>
> The SHA1 checksum of the archive is
> 203d84b56c5b8879ce04b496e9b7421387ea386e.
>
> Please vote on releasing this package as Apache Tika 1.0.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.0
> [ ] -1 Do not release this package because...
>
> Thanks!
>
> Cheers,
> Chris
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: [email protected]
> WWW: http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
