hi,

as i am new to tika, i want to know following things.

1. how to integrate tika within hadoop, so that tika will use map
reduce to implement the parsing.
2. we wanted tika to parse ocr files too...but as tika is not
supporting ocr parsing and also recommending to use tesseract, i want
to
   know how to call tesseract ( command line operation ) through tika
( which in-turn uses map reduce to parse ocr files ).

thanks and regards
chethan

Reply via email to