[
https://issues.apache.org/jira/browse/TIKA-1540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14303701#comment-14303701
]
Tyler Palsulich commented on TIKA-1540:
---------------------------------------
Will this feature extraction happen externally or within a Java library? If
it's external, you can probably model this parser after the
{{TesseractOCRParser}} -- it saves the given {{InputStream}} as a file, runs an
external command on that file, and extracts the results. If it's a native Java
library, the project needs to have a liberal license and be hosted on Maven
Central. I'd be happy to help you in either direction. :)
What type of features do you want to extract? Do you think they would be a
better fit as part of the {{Metadata}} or content?
> New Tika plugin for image based feature extraction using computer vision
> techniques
> -----------------------------------------------------------------------------------
>
> Key: TIKA-1540
> URL: https://issues.apache.org/jira/browse/TIKA-1540
> Project: Tika
> Issue Type: Bug
> Environment: cross platform
> Reporter: Aashish Chaudhary
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)