Author: nick
Date: Mon May 8 17:31:09 2017
New Revision: 1794419
URL: http://svn.apache.org/viewvc?rev=1794419&view=rev
Log:
Compress supported formats update, and other new packaging support
Modified:
tika/site/src/site/apt/1.15/formats.apt
Modified: tika/site/src/site/apt/1.15/formats.apt
URL:
http://svn.apache.org/viewvc/tika/site/src/site/apt/1.15/formats.apt?rev=1794419&r1=1794418&r2=1794419&view=diff
==============================================================================
--- tika/site/src/site/apt/1.15/formats.apt (original)
+++ tika/site/src/site/apt/1.15/formats.apt Mon May 8 17:31:09 2017
@@ -113,12 +113,17 @@ Supported Document Formats
class and its subclasses parse the packaging formats and then pass the
unpacked document streams to a second parsing stage using the parser
instance specified in the parse context. Formats supported include Tar,
- AR, CPIO, Zip, 7Zip, Gzip, BZip2, XZ and Pack200.
+ AR, ARJ, CPIO, Dump, Zip, 7Zip, Gzip, BZip2, XZ, LZMA, Z and Pack200.
Additionally, the
- {{{./api/org/apache/tika/parser/pkg/RARParser.html}RARParser}} class
+ {{{./api/org/apache/tika/parser/pkg/RarParser.html}RarParser}} class
supports the RAR archive format, which isn't supported by Commons Compress.
+ The
+ {{{./api/org/apache/tika/parser/apple/AppleSingleFileParser.html}AppleSingleFileParser}}
+ class supports resources packaged within AppleSingle and AppleDouble
+ files.
+
* {Text formats}
Extracting text content from plain text files seems like a simple task