[ 
https://issues.apache.org/jira/browse/TIKA-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348047#comment-14348047
 ] 

Tim Allison commented on TIKA-1330:
-----------------------------------

Posted a patch to Review Board: [31758|https://reviews.apache.org/r/31758/]

Some more work is needed, but this is ready for thumbs up/thumbs down and any 
and all review.

Depending on feedback, I'd like to merge this into trunk over the next week or 
two.

The simplest way to run it is from tika-app:

java -jar tika-app---.jar <input-dir> <output-dir>

> Add robust tika-batch code
> --------------------------
>
>                 Key: TIKA-1330
>                 URL: https://issues.apache.org/jira/browse/TIKA-1330
>             Project: Tika
>          Issue Type: Sub-task
>          Components: cli, general, server
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>         Attachments: TIKA-1330v1-patch.zip
>
>
> In my current design plan, I see creating a separate component "tika-batch" 
> that includes a small bit of configurable code to run Tika against a large 
> batch of documents.  This code should be robust against OOM and hangs, and it 
> should have fairly robust logging.
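
To make the "robust against hangs" goal in the description above concrete, here is a minimal sketch of one common approach: run each unit of work under a timeout so that a single stuck document cannot stall the rest of the batch. This is an illustration only and is not taken from the attached patch. The class name TimedTaskSketch, the Outcome enum, and runWithTimeout are all hypothetical names, and the Callable stands in for whatever parses one file.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Hypothetical sketch; none of these names come from the TIKA-1330 patch.
class TimedTaskSketch {

    enum Outcome { OK, TIMED_OUT, FAILED }

    // Runs one unit of work (for example, parsing one file) on a worker
    // thread and gives up after timeoutMillis.
    static Outcome runWithTimeout(Callable<String> task, long timeoutMillis) {
        ExecutorService executor = Executors.newSingleThreadExecutor();
        Future<String> future = executor.submit(task);
        try {
            future.get(timeoutMillis, TimeUnit.MILLISECONDS);
            return Outcome.OK;
        } catch (TimeoutException e) {
            future.cancel(true); // request interruption of the worker thread
            return Outcome.TIMED_OUT;
        } catch (ExecutionException e) {
            return Outcome.FAILED; // the task itself threw
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the caller's interrupt status
            future.cancel(true);
            return Outcome.FAILED;
        } finally {
            executor.shutdownNow(); // do not leave the worker thread behind
        }
    }

    public static void main(String[] args) {
        // A fast task, a task that outlives its timeout, and a task that throws.
        System.out.println(runWithTimeout(() -> "parsed", 1000));
        System.out.println(runWithTimeout(() -> { Thread.sleep(5000); return "late"; }, 100));
        System.out.println(runWithTimeout(() -> { throw new IllegalStateException("bad file"); }, 1000));
    }
}
```

A thread-level timeout like this has limits: it cannot stop a thread that ignores interruption, and it cannot reliably recover from an OutOfMemoryError inside the same JVM. Handling those cases generally means doing the work in a separate process that can be killed and restarted.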



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
