Author: nick Date: Mon May 9 21:03:58 2016 New Revision: 1743051 URL: http://svn.apache.org/viewvc?rev=1743051&view=rev Log: Update for 1.13 formats
Modified: tika/site/src/site/apt/1.13/formats.apt Modified: tika/site/src/site/apt/1.13/formats.apt URL: http://svn.apache.org/viewvc/tika/site/src/site/apt/1.13/formats.apt?rev=1743051&r1=1743050&r2=1743051&view=diff ============================================================================== --- tika/site/src/site/apt/1.13/formats.apt (original) +++ tika/site/src/site/apt/1.13/formats.apt Mon May 9 21:03:58 2016 @@ -177,6 +177,8 @@ Supported Document Formats extracts simple metadata from BPG (Better Portable Graphics) images. The {{{./api/org/apache/tika/parser/image/WebPParser.html}WebPParser}} class extracts simple metadata from WebP image format. + The {{{./api/org/apache/tika/parser/image/ICNSParser.html}ICNSParser}} + class extracts simple metadata from the Apple ICNS icon image format. When extracting from images, it is also possible to chain in Tesseract via the {{{./api/org/apache/tika/parser/ocr/TesseractOCRParser.html}TesseractOCRParser}} @@ -195,9 +197,13 @@ Supported Document Formats For the Ogg family of video formats, a limited amount of metadata is extracted by the - {{{./api/org/gagravarr/tika/OggParser.html}OggParser}} class. + {{{./api/org/gagravarr/tika/OggParser.html}OggParser}} class. There is + also an experimental + {{{./api/org/gagravarr/tika/TheoraParser.html}TheoraParser}} class which + extracts only limited metadata, pending a consensus on the "right" way + to return metadata for audio streams along with the video metadata. - As an alternative, the + As an alternative to the metadata-focused parsers above, the {{{./api/org/apache/tika/parser/pot/PooledTimeSeriesParser}PooledTimeSeriesParser}} can be used (if the required tool is installed) to generate a numeric representation of the video suitable for similarity searches. More details