[
https://issues.apache.org/jira/browse/TIKA-4543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-4543.
-------------------------------
Fix Version/s: 4.0.0
Resolution: Fixed
> Reorganize pipes implementation modules around resource as opposed to task
> --------------------------------------------------------------------------
>
> Key: TIKA-4543
> URL: https://issues.apache.org/jira/browse/TIKA-4543
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
> Fix For: 4.0.0
>
>
> We currently have pipes implementations by task – fetchers, emitters, etc.
> The actual code we have for each is pretty small, and we have a lot of
> modules.
> It would be more efficient to group the modules by resource: tika-pipes-s3,
> tika-pipes-file-system, and then include the fetchers, emitters etc for that
> resource.
> This way, if we're pulling from s3, iterating in a bucket and writing to s3,
> the application only needs the tika-pipes-s3 module, with the heavy s3
> dependencies.
> If we're pulling from s3 and writing to a local file share, the dependencies
> between where we are now and the proposed reorganization wouldn't change.
> This change would only be in 4.x.
> I'm going to draft a PR off the TIKA-4519 branch unless there are objections.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)