Author: jukka
Date: Wed Dec 16 01:21:49 2009
New Revision: 891096

URL: http://svn.apache.org/viewvc?rev=891096&view=rev
Log:
Update change log, minor readme improvement

Modified:
    lucene/tika/trunk/CHANGES.txt
    lucene/tika/trunk/README.txt

Modified: lucene/tika/trunk/CHANGES.txt
URL: 
http://svn.apache.org/viewvc/lucene/tika/trunk/CHANGES.txt?rev=891096&r1=891095&r2=891096&view=diff
==============================================================================
--- lucene/tika/trunk/CHANGES.txt (original)
+++ lucene/tika/trunk/CHANGES.txt Wed Dec 16 01:21:49 2009
@@ -8,13 +8,32 @@
 
  * Tika now has an additional OSGi bundle packaging that includes all the
    required parser libraries. This bundle package makes it easy to use all
-   Tika features in an OSGi environment. (TIKA-340)
+   Tika features in an OSGi environment. (TIKA-340, TIKA-342)
+
+ * The Apache POI dependency used for parsing Microsoft Office file formats
+   has been upgraded to version 3.6. The most visible improvement in this
+   version is the notably reduced ooxml jar file size. The tika-app jar size
+   is now down to 15MB from the 25MB in Tika 0.5. (TIKA-353)
 
  * Handling of character encoding information in input metadata and HTML
    <meta> tags has been improved. When no applicable encoding information is
    available, the encoding is detected by looking at the input data.
    (TIKA-332, TIKA-334, TIKA-335, TIKA-341) 
 
+ * Some document types like Excel spreadsheets contain content like
+   numbers or dates whose exact text format depends on the current locale.
+   So far Tika has used the platform default locale in such cases, but
+   clients can now explicitly specify the locale by passing a Locale instance
+   in the parse context. (TIKA-125)
+
+ * The default text output encoding of the tika-app jar is now UTF-8
+   when running on Mac OS X. This is because the default encoding used
+   by Java is not compatible with the console application in Mac OS X.
+   On all other platforms the text output from tika-app still uses
+   the platform default encoding. (TIKA-324)
+
+ * A flash video (video/x-flv) parser has been added. (TIKA-328)
+
 Release 0.5 - 11/14/2009
 ------------------------
 

Modified: lucene/tika/trunk/README.txt
URL: 
http://svn.apache.org/viewvc/lucene/tika/trunk/README.txt?rev=891096&r1=891095&r2=891096&view=diff
==============================================================================
--- lucene/tika/trunk/README.txt (original)
+++ lucene/tika/trunk/README.txt Wed Dec 16 01:21:49 2009
@@ -20,7 +20,7 @@
 The build consists of a number of components, including a standalone runnable
 jar that you can use to try out Tika features. You can run it like this:
 
-    java -jar tika-app/target/tika-app-0.5-SNAPSHOT.jar --help
+    java -jar tika-app/target/tika-app-*.jar --help
 
 License (see also LICENSE.txt)
 ==============================


Reply via email to