Author: nick Date: Mon May 8 17:46:33 2017 New Revision: 1794427 URL: http://svn.apache.org/viewvc?rev=1794427&view=rev Log: Document support for additional Microsoft Office formats
Modified: tika/site/src/site/apt/1.15/formats.apt Modified: tika/site/src/site/apt/1.15/formats.apt URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.15/formats.apt?rev=1794427&r1=1794426&r2=1794427&view=diff ============================================================================== --- tika/site/src/site/apt/1.15/formats.apt (original) +++ tika/site/src/site/apt/1.15/formats.apt Mon May 8 17:46:33 2017 @@ -67,6 +67,16 @@ Supported Document Formats Old, pre-OLE2 Excel files (Excel 2, 3 and 4) are handled by the {{{./api/org/apache/tika/parser/microsoft/OldExcelParser.html}OldExcelParser}}. + The older, pre-OOXML pure-XML, office file formats are handled by + {{{./api/org/apache/tika/parser/microsoft/xml/SpreadsheetMLParser.html}SpreadsheetMLParser}}, + {{{./api/org/apache/tika/parser/microsoft/xml/WordMLParser.html}WordMLParser}} + and + {{{./api/org/apache/tika/parser/microsoft/ooxml/xwpf/ml2006/Word2006MLParser.html}Word2006MLParser}}. + + Temporary Office lock files (owner files) are supported for basic metadata + extraction by + {{{./api/org/apache/tika/parser/microsoft/MSOwnerFileParser.html}MSOwnerFileParser}}. + * {OpenDocument Format} The OpenDocument format (ODF) is used most notably as the default format