This is an automated email from the ASF dual-hosted git repository. tallison pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git
commit 941d61a3258d92b8a69ac651586ad0490e313532 Author: tballison <talli...@mitre.org> AuthorDate: Wed Apr 19 21:41:18 2017 -0400 update CHANGES.txt in prep for release. reorder changes to most significant first...changes in default behavior then new parsers...Completely subjective, and I'm open to reordering! --- CHANGES.txt | 42 +++++++++++++++++++++++------------------- 1 file changed, 23 insertions(+), 19 deletions(-) diff --git a/CHANGES.txt b/CHANGES.txt index 610c186..50f2b0e 100644 --- a/CHANGES.txt +++ b/CHANGES.txt @@ -5,25 +5,39 @@ Release 1.15 - ?? Users who wish to parse only the container document should set an EmptyParser as the Parser.class in the ParseContext. - * Add support for the XLSB format (TIKA-1195). - * Change default behavior of Office Parsers to _not_ extract Macros. User needs to setExtractMacros to "true" (TIKA-2302). + * Added tika-eval module (TIKA-1332). + * Unified logging across Tika: SLF4J as logging API, Apache Log4j as implementation with JCL and JUL bridges in standalone tools like tika-app, tika-batch and tika-server (TIKA-2245). - * Extract images and thumbnails from ODT via Sam Bayer (TIKA-2295). + * Add parser for XLSB files (TIKA-1195). + + * Add parsers for EMF/WMF files (TIKA-2246/TIKA-2247). + + * Add parsers for WordPerfect and QuattroPro (.qpw) files. + Contributed by Pascal Essiembre (TIKA-1946 and TIKA-2228). + + * Add experimental SAX parser for .pptx files. To select this parser, + set useSAXPptxExtractor(true) on OfficeParserConfig (TIKA-2210). + + * Add experimental SAX parser for .docx files. To select this parser, + set useSAXDocxExtractor(true) on OfficeParserConfig (TIKA-1321, TIKA-2191). + + * Add mime detection and parser for Word 2006ML format (TIKA-2179). * Enabled configuration of the EncodingDetector used by parsers that extend AbstractEncodingDetectorParser (TIKA-2273). - * Added tika-eval module (TIKA-1332). + * Prevent easily preventable OOMs for both detection and parsing + of some compression formats (TIKA-2330). - * Fix potential NPE in FeedParser via Julien Nioche (TIKA-2269). + * Extract images and thumbnails from ODT via Sam Bayer (TIKA-2295). - * Add parsers for EMF/WMF files (TIKA-2246/TIKA-2247). + * Fix potential NPE in FeedParser via Julien Nioche (TIKA-2269). * Official mime types for BMP, EMF and WMF have been registered with IANA, so switch to these (image/bmp image/emf image/wmf) (TIKA-2250) @@ -45,15 +59,9 @@ Release 1.15 - ?? * Mime magic for the OneNote family (.one / .onetoc / .onepkg), no parser (TIKA-2224). - * Add parsers for WordPerfect and QuattroPro (.qpw) files. - Contributed by Pascal Essiembre (TIKA-1946 and TIKA-2228). - * Add configurability of "preserve-interword-spacing" to TesseractOCRParser (TIKA-2190). - * Added experimental SAX parser for .pptx files. To select this parser, - set useSAXPptxExtractor(true) on OfficeParserConfig (TIKA-2210). - * Upgrade to PDFBox 2.0.5 and JempBox 1.8.13 (TIKA-2209/TIKA-2236). * Refactor MockParser to consolidate service loading @@ -63,16 +71,12 @@ Release 1.15 - ?? footnotes, endnotes and comments in legacy .docx parser (TIKA-2192). * Allow extraction of PDActions (including Javascript) from - PDFs (TIKA-2090). + PDFs (TIKA-2090). This is turned off by default. Users + must setExtractActions(true) on the PDFParserConfig. * Change default behavior in experimental .docx parser to ignore deleted text to align with .doc (TIKA-2187). - * Added experimental SAX parser for .docx files. To select this parser, - set useSAXDocxExtractor(true) on OfficeParserConfig (TIKA-1321, TIKA-2191). - - * Add mime detection and parser for Word 2006ML format (TIKA-2179). - * Upgrade to POI 3.16 (TIKA-2116, TIKA-2181, TIKA-2329). * Allow configuration of timeout for ForkParser (TIKA-2170). @@ -82,7 +86,7 @@ Release 1.15 - ?? * Add .jpx, .jp2, .ppm to formats handled by Tesseract (TIKA-2174). - * Upgrade SQLite "provided" dependency to 3.15.1. + * Upgrade SQLite "provided" dependency to 3.16.1 (TIKA-2334). * Update Apache CXF version to 3.0.12 (TIKA-2292). -- To stop receiving notification emails like this one, please contact "commits@tika.apache.org" <commits@tika.apache.org>.