Author: jukka
Date: Wed Dec 16 01:21:49 2009
New Revision: 891096
URL: http://svn.apache.org/viewvc?rev=891096&view=rev
Log:
Update change log, minor readme improvement
Modified:
lucene/tika/trunk/CHANGES.txt
lucene/tika/trunk/README.txt
Modified: lucene/tika/trunk/CHANGES.txt
URL:
http://svn.apache.org/viewvc/lucene/tika/trunk/CHANGES.txt?rev=891096&r1=891095&r2=891096&view=diff
==============================================================================
--- lucene/tika/trunk/CHANGES.txt (original)
+++ lucene/tika/trunk/CHANGES.txt Wed Dec 16 01:21:49 2009
@@ -8,13 +8,32 @@
* Tika now has an additional OSGi bundle packaging that includes all the
required parser libraries. This bundle package makes it easy to use all
- Tika features in an OSGi environment. (TIKA-340)
+ Tika features in an OSGi environment. (TIKA-340, TIKA-342)
+
+ * The Apache POI dependency used for parsing Microsoft Office file formats
+ has been upgraded to version 3.6. The most visible improvement in this
+ version is the notably reduced ooxml jar file size. The tika-app jar size
+ is now down to 15MB from the 25MB in Tika 0.5. (TIKA-353)
* Handling of character encoding information in input metadata and HTML
<meta> tags has been improved. When no applicable encoding information is
available, the encoding is detected by looking at the input data.
(TIKA-332, TIKA-334, TIKA-335, TIKA-341)
+ * Some document types like Excel spreadsheets contain content like
+ numbers or dates whose exact text format depends on the current locale.
+ So far Tika has used the platform default locale in such cases, but
+ clients can now explicitly specify the locale by passing a Locale instance
+ in the parse context. (TIKA-125)
+
+ * The default text output encoding of the tika-app jar is now UTF-8
+ when running on Mac OS X. This is because the default encoding used
+ by Java is not compatible with the console application in Mac OS X.
+ On all other platforms the text output from tika-app still uses
+ the platform default encoding. (TIKA-324)
+
+ * A flash video (video/x-flv) parser has been added. (TIKA-328)
+
Release 0.5 - 11/14/2009
------------------------
Modified: lucene/tika/trunk/README.txt
URL:
http://svn.apache.org/viewvc/lucene/tika/trunk/README.txt?rev=891096&r1=891095&r2=891096&view=diff
==============================================================================
--- lucene/tika/trunk/README.txt (original)
+++ lucene/tika/trunk/README.txt Wed Dec 16 01:21:49 2009
@@ -20,7 +20,7 @@
The build consists of a number of components, including a standalone runnable
jar that you can use to try out Tika features. You can run it like this:
- java -jar tika-app/target/tika-app-0.5-SNAPSHOT.jar --help
+ java -jar tika-app/target/tika-app-*.jar --help
License (see also LICENSE.txt)
==============================