[
https://issues.apache.org/jira/browse/TIKA-1540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319525#comment-14319525
]
Lewis John McGibbney edited comment on TIKA-1540 at 2/13/15 3:35 AM:
---------------------------------------------------------------------
Hi [~aashish24]
* regarding the use of two httpclient libraries, this needs to be cleaned up
* I would suggest that we define a [Configuration-type
object|https://github.com/apache/tika/blob/trunk/tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRConfig.java]
for setting all of the external service characteristics such as URL and
implementation specific parameters as expressed within the service call. An
example can be seen on the
[TesseractOCRParser.java|https://github.com/apache/tika/blob/trunk/tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRParser.java#L89].
On another level, I am thinking if this could be generalized?
I mean from looking at the code it is not obvious that it is a ImageParser,
only you and I know this :) Does this make sense?
Thanks for posting the patch [~aashish24] and I apologize for taking ages to
look at it. I will work harder with you from now on to make sure we get this
right.
was (Author: lewismc):
Hi [~aashish24]
* regarding the use of two httpclient libraries, this needs to be cleaned up
* I would suggest that we define a [Configuration-type
object|https://github.com/apache/tika/blob/trunk/tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRConfig.java]
for setting all of the external service characteristics such as URL and
implementation specific parameters as expressed within the service call. An
example can be seen on the
[TesseractOCRParser.java|https://github.com/apache/tika/blob/trunk/tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRParser.java#L89].
On another level, I am thinking if this could be generalized?
I mean from looking at the code it is not obvious that it is a ImageParser,
only you and I know this :) Does this make sense?
> New Tika plugin for image based feature extraction using computer vision
> techniques
> -----------------------------------------------------------------------------------
>
> Key: TIKA-1540
> URL: https://issues.apache.org/jira/browse/TIKA-1540
> Project: Tika
> Issue Type: New Feature
> Environment: cross platform
> Reporter: Aashish Chaudhary
> Assignee: Lewis John McGibbney
> Labels: gsoc2015
> Fix For: 1.8
>
> Attachments: TIKA-vision.achaudhary.150209.patch.txt
>
>
> This will be a web-service client based parser to perform image feature
> extraction using Computer Vision techniques.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)