The release looks good, but I found a few interesting issues:

 * I built from sources ("mvn install") successfully.

 * I tested 0.3 with the the TikaIndexer test in Lucene in Action 2.
   I used "mvn dependency:copy-dependencies" to gather all the
   necessary JARs.  Then TikaIndexer was able to index our small set
   sample docs just fine.  Though, I did see this new warning printed
   when indexing an MS Word doc (97-2003 format):

[java] A property claimed to start before zero, at -512! Resetting it to zero, and hoping for the best

   Not sure if that's a problem with my document or something with
   the POI beta?

 * Then, I had Tika extract all text for the entire manuscript (MS
   Word 2003 docs) for LIA2, and it did great!

 * The tar file doesn't unpack to "apache-tika-0.3" subdir, which is
   sort of unfriendly.  (It puts a bunch of files into my current
   dir).

 * Signature is good for apache-tika-0.3-src.tar.gz.

 * Could you add the KEYS file to the toplevel release dir?

 * The maven path seems to have an extra "tika" subdirectory?  I see
   resository/org/apache/tika/tika/0.3/*

Mike

Mattmann, Chris A wrote:

Hi Folks,

I have posted a candidate for the Apache Tika 0.3 release at

http://people.apache.org/~mattmann/apache-tika-0.3/rc1/

See the included CHANGES.txt file for details on release contents and latest
changes. The release was made from the 0.3 branch at:

http://svn.apache.org/repos/asf/lucene/tika/branches/0.3/

Please vote on releasing these packages as Apache Tika 0.3. The vote is open for the next 72 hours. Only votes from Lucene PMC are binding, but everyone
is welcome to check the release candidate and voice their approval or
disapproval. The vote passes if at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache Tika 0.3.

[ ] -1 Do not release the packages because...

Thanks!

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.mattm...@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++




Reply via email to