The release looks good, but I found a few interesting issues:
* I built from sources ("mvn install") successfully.
* I tested 0.3 with the the TikaIndexer test in Lucene in Action 2.
I used "mvn dependency:copy-dependencies" to gather all the
necessary JARs. Then TikaIndexer was able to index our small set
sample docs just fine. Though, I did see this new warning printed
when indexing an MS Word doc (97-2003 format):
[java] A property claimed to start before zero, at -512!
Resetting it to zero, and hoping for the best
Not sure if that's a problem with my document or something with
the POI beta?
* Then, I had Tika extract all text for the entire manuscript (MS
Word 2003 docs) for LIA2, and it did great!
* The tar file doesn't unpack to "apache-tika-0.3" subdir, which is
sort of unfriendly. (It puts a bunch of files into my current
dir).
* Signature is good for apache-tika-0.3-src.tar.gz.
* Could you add the KEYS file to the toplevel release dir?
* The maven path seems to have an extra "tika" subdirectory? I see
resository/org/apache/tika/tika/0.3/*
Mike
Mattmann, Chris A wrote:
Hi Folks,
I have posted a candidate for the Apache Tika 0.3 release at
http://people.apache.org/~mattmann/apache-tika-0.3/rc1/
See the included CHANGES.txt file for details on release contents
and latest
changes. The release was made from the 0.3 branch at:
http://svn.apache.org/repos/asf/lucene/tika/branches/0.3/
Please vote on releasing these packages as Apache Tika 0.3. The vote
is open
for the next 72 hours. Only votes from Lucene PMC are binding, but
everyone
is welcome to check the release candidate and voice their approval or
disapproval. The vote passes if at least three binding +1 votes are
cast.
[ ] +1 Release the packages as Apache Tika 0.3.
[ ] -1 Do not release the packages because...
Thanks!
Cheers,
Chris
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.mattm...@jpl.nasa.gov
WWW: http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++