[ https://issues.apache.org/jira/browse/TIKA-3463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Allison updated TIKA-3463:
------------------------------
Description:
It'd be nice to be able to pass in a file-based list of fetch keys and have the
pipes-iterator just work. This would be equivalent to the {{-fileList}} option
in the current tika-batch.
This is useful for processing only a specific subset of files within a
directory or S3 bucket.
There's some overlap with the CSV pipes iterator, but this is simpler and can
live in tika-core because it needs no extra dependencies.
was:
It'd be nice to be able to pass in a file-based list of fetch keys and have the
pipes-iterator just work. This would be equivalent to the {{-fileList}} option
in the current tika-batch.
This is useful for processing only a specific subset of files within a
directory or s3 bucket.
> Add FileListIterator as a pipes-iterator
> ----------------------------------------
>
> Key: TIKA-3463
> URL: https://issues.apache.org/jira/browse/TIKA-3463
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Trivial
>
> It'd be nice to be able to pass in a file-based list of fetch keys and have
> the pipes-iterator just work. This would be equivalent to the {{-fileList}}
> option in the current tika-batch.
> This is useful for processing only a specific subset of files within a
> directory or S3 bucket.
> There's some overlap with the CSV pipes iterator, but this is simpler and can
> live in tika-core because it needs no extra dependencies.
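
For illustration only, here is a minimal sketch of the core idea: read a plain
text file with one fetch key per line, using nothing beyond the JDK. The class
name FileListSketch and the method readFetchKeys are hypothetical and are not
Tika APIs. Skipping blank lines and lines that start with '#' is an assumption,
not something the issue specifies.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Tika: collects one fetch key per line.
final class FileListSketch {

    // Assumption: blank lines and lines starting with '#' are ignored,
    // and surrounding whitespace is trimmed from each key.
    static List<String> readFetchKeys(BufferedReader reader) throws IOException {
        List<String> keys = new ArrayList<>();
        String line;
        while ((line = reader.readLine()) != null) {
            String key = line.trim();
            if (key.isEmpty() || key.startsWith("#")) {
                continue;
            }
            keys.add(key);
        }
        return keys;
    }

    public static void main(String[] args) throws IOException {
        // In-memory stand-in for a file list; a real list would be read from disk.
        String sample = "# subset to process\ndocs/a.pdf\n\ndocs/b.docx\n";
        try (BufferedReader reader = new BufferedReader(new StringReader(sample))) {
            for (String key : readFetchKeys(reader)) {
                System.out.println(key);
            }
        }
    }
}
```

A real pipes-iterator would presumably hand each key to the configured fetcher
rather than print it; that wiring is left out here because the issue does not
describe it.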