[ 
https://issues.apache.org/jira/browse/TIKA-3463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated TIKA-3463:
------------------------------
    Description: 
It'd be nice to be able to pass in a file-based list of fetch keys and have the 
pipes-iterator just work.  This would be equivalent to the {{-fileList}} option 
in the current tika-batch. 

This is useful for processing only a specific subset of files within a 
directory or s3 bucket.

There's some overlap with the CSV pipes iterator, but this is simpler and can 
live in tika-core because of no extra dependencies.

  was:
It'd be nice to be able to pass in a file-based list of fetch keys and have the 
pipes-iterator just work.  This would be equivalent to the {{-fileList}} option 
in the current tika-batch. 

This is useful for processing only a specific subset of files within a 
directory or s3 bucket.


> Add FileListIterator as a pipes-iterator
> ----------------------------------------
>
>                 Key: TIKA-3463
>                 URL: https://issues.apache.org/jira/browse/TIKA-3463
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Trivial
>
> It'd be nice to be able to pass in a file-based list of fetch keys and have 
> the pipes-iterator just work.  This would be equivalent to the {{-fileList}} 
> option in the current tika-batch. 
> This is useful for processing only a specific subset of files within a 
> directory or s3 bucket.
> There's some overlap with the CSV pipes iterator, but this is simpler and can 
> live in tika-core because of no extra dependencies.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to