Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "FrontPage" page has been changed by ChrisMattmann: https://wiki.apache.org/tika/FrontPage?action=diff&rev1=35&rev2=36 Comment: - add Wiki pages for MIME detection * IntegratingTikaWithExtractingRequestHandler - Building the latest Tika and integrating it with the Extracting Request Handler (Tika) in Solr. * [[TesseractOCRStats|Some stats using Tesseract OCR]] - some stats from a contributing team (Hyperion Gray) about using TesseractOCR (will be updated with Tika). + = MIME identification design/implementation = + + * [[https://wiki.apache.org/tika/BaysianMimeTypeSelector|Bayesian MIME selection]] - Tika's new Bayesian MIME selector. + * [[https://wiki.apache.org/tika/ContentMimeDetection|Content-based MIME selection with Byte histograms]] - Tika's new content/byte histogram MIME detector. + = Design = * MetadataDiscussion - discussions on the design of MIME type detection and parsing for recursive metadata formats (and container formats) like Zip, etc.
