Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "FrontPage" page has been changed by ChrisMattmann:
https://wiki.apache.org/tika/FrontPage?action=diff&rev1=43&rev2=44

   * VirtualMachine - a virtual machine hosted by Rackspace that allows an instance of [[TikaJAXRS|Tika Server]] to run for public testing. Set up by Tim Allison et al.
  = User Notes =
+  * [[API Bindings for Tika]] - Using Tika from additional languages and frameworks.
   * PostingManyFilesToExtractingRequestHandler - How to post many files to the Extracting Request Handler (Tika) in Solr.
   * IntegratingTikaWithExtractingRequestHandler - Building the latest Tika and integrating it with the Extracting Request Handler (Tika) in Solr.
   * [[TesseractOCRStats|Some stats using Tesseract OCR]] - some stats from a contributing team (Hyperion Gray) about using TesseractOCR (will be updated with Tika).
@@ -38, +39 @@
   * Upgrading to [[PDFBOX_2_X_NOTES|PDFBox 2.x]]
  = MIME identification design/implementation =
-
   * [[BaysianMimeTypeSelector|Bayesian MIME selection]] - Tika's new Bayesian MIME selector.
   * [[ContentMimeDetection|Content-based MIME selection with Byte histograms]] - Tika's new content/byte histogram MIME detector.
+ = Advanced Content Extraction with Tika - Integration =
- = Configuration and Integration =
-  * [[API Bindings for Tika]] - Using Tika from additional languages and frameworks
   * [[cTAKESParser|Getting Tika and Running with Apache cTAKES]] - How to use Tika with Apache cTAKES the clinical text biomedical knowledge extraction framework.
   * [[EXIFToolParser|Getting Tika up and Running with EXIFTool]] - How to use Tika with EXIFTool.
   * [[FFMPEGParser|Getting Tika up and Running with FFMPEG]] - How to use Tika with FFMPEG.
