I've tried, without success. There is more to it than it seems. JavaOCR is
not an option in its current state. A temporary solution could be a wrapper
around Tesseract; however, making Tesseract work across multiple platforms
is still quite difficult.
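
To make the wrapper idea concrete, here is a minimal sketch of what it could look like. It is not Tika code: the class name `TesseractWrapper` and its methods are made up for illustration, and it assumes a `tesseract` binary is installed and on the PATH, which is exactly the part that is hard to guarantee across platforms. It relies only on the standard `tesseract <image> <outputbase> -l <lang>` invocation, which writes the recognized text to `<outputbase>.txt`.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper (not part of Tika): shells out to the tesseract CLI.
final class TesseractWrapper {

    // Builds the command line "tesseract <image> <outputBase> -l <lang>".
    // Assumes the tesseract binary can be found on the PATH.
    static List<String> buildCommand(Path image, Path outputBase, String lang) {
        return Arrays.asList(
                "tesseract", image.toString(), outputBase.toString(), "-l", lang);
    }

    // Runs tesseract on the image and returns the recognized text.
    static String recognize(Path image, String lang)
            throws IOException, InterruptedException {
        Path outputBase = Files.createTempFile("ocr", "");
        // tesseract appends ".txt" to the output base name.
        Path textFile = outputBase.resolveSibling(outputBase.getFileName() + ".txt");
        try {
            Process process = new ProcessBuilder(buildCommand(image, outputBase, lang))
                    .inheritIO() // forward tesseract's own messages to our console
                    .start();
            int exitCode = process.waitFor();
            if (exitCode != 0) {
                throw new IOException("tesseract exited with code " + exitCode);
            }
            return new String(Files.readAllBytes(textFile), StandardCharsets.UTF_8);
        } finally {
            Files.deleteIfExists(textFile);
            Files.deleteIfExists(outputBase);
        }
    }
}
```

Calling `TesseractWrapper.recognize(Paths.get("scan.png"), "eng")` would return the extracted text on a machine where Tesseract and the English language data are installed; on any other machine it fails with an `IOException`.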

Best regards,
Oleg




On Fri, Jan 4, 2013 at 3:46 PM, Maciej Lizewski (JIRA) <[email protected]> wrote:

>
>     [
> https://issues.apache.org/jira/browse/TIKA-93?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13543882#comment-13543882]
>
> Maciej Lizewski commented on TIKA-93:
> -------------------------------------
>
> Is there anything new on this topic? Has anyone tried the JavaOCR library
> with success? Does anybody have a working Tika+OCR configuration?
>
> > OCR support
> > -----------
> >
> >                 Key: TIKA-93
> >                 URL: https://issues.apache.org/jira/browse/TIKA-93
> >             Project: Tika
> >          Issue Type: New Feature
> >          Components: parser
> >            Reporter: Jukka Zitting
> >            Priority: Minor
> >
> > I don't know of any decent open source pure Java OCR libraries, but
> > there are command line OCR tools like Tesseract
> > (http://code.google.com/p/tesseract-ocr/) that could be invoked by Tika
> > to extract text content (where available) from image files.
>
