Hey awesome Tika folks! The reason I'm writing is that I want to disable the TesseractOCRParser. The reason is that it makes Tika take longer to finish, and I don't need the OCRed results.
I can't simply uninstall tesseract from the system because I use it for other things. I thought about sending Tika a custom PATH that excludes /usr/bin so it can't find tesseract, but that seems ugly and likely to break things. Then I thought I could pass a custom config.xml to disable it, but I can't figure out how to write the config file. I would greatly appreciate any help! Thanks, Sergey
