Hi Yuan, You can grab a Snapshot of tika from the repository.apache.org site, and/or you can svn co http://svn.apache.org/repos/asf/tika/trunk tika and build a 1.7-SNAPSHOT version of Tika.
Then, check out this page: https://wiki.apache.org/tika/TikaOCR Hope that helps! Cheers, Chris ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: [email protected] WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ -----Original Message----- From: "[email protected]" <[email protected]> Date: Wednesday, November 5, 2014 at 10:54 AM To: Chris Mattmann <[email protected]> Subject: Question about Tika >Hello, > >Currently, with my team, we are working on a project using tika, but with >the help of OCR tools (such as Tesseract). >I found your response on a forum mentioning that the tika server 1.7 >version works well with Tesseract, but the problem is, I can¹t find it >anywhere, even on the official web site of tika. >Could you please tell me where I can get and test this version? > >Thank you very much. > >Best regards > >Yuan LIN >
