[
https://issues.apache.org/jira/browse/TIKA-3370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342176#comment-17342176
]
Hudson commented on TIKA-3370:
------------------------------
UNSTABLE: Integrated in Jenkins build Tika ยป tika-main-jdk8 #224 (See
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/224/])
TIKA-3370 -- add pipes processor and swap out /emit for /pipes (tallison:
[https://github.com/apache/tika/commit/b370d4f4f09ff45a6ee4b619223c468236c77761])
* (edit) tika-core/src/main/java/org/apache/tika/config/ConfigBase.java
* (edit) tika-core/src/main/java/org/apache/tika/pipes/async/AsyncConfig.java
* (delete) tika-core/src/main/java/org/apache/tika/pipes/async/AsyncServer.java
* (add) tika-core/src/main/java/org/apache/tika/pipes/PipesServer.java
* (delete)
tika-core/src/main/java/org/apache/tika/pipes/async/AsyncEmitterConfig.java
* (delete)
tika-server/tika-server-core/src/main/java/org/apache/tika/server/core/resource/EmitterResource.java
* (edit)
tika-pipes/tika-fetch-iterators/tika-fetch-iterator-jdbc/src/test/java/org/apache/tika/pipes/fetchiterator/jdbc/TestJDBCFetchIterator.java
* (delete)
tika-server/tika-server-core/src/test/java/org/apache/tika/server/core/TikaEmitterTest.java
* (edit) tika-core/src/main/java/org/apache/tika/pipes/async/AsyncEmitter.java
* (add) tika-core/src/main/java/org/apache/tika/pipes/PipesClient.java
* (edit)
tika-pipes/tika-fetch-iterators/tika-fetch-iterator-csv/src/main/java/org/apache/tika/pipes/fetchiterator/csv/CSVFetchIterator.java
* (add) tika-core/src/main/java/org/apache/tika/pipes/PipesParser.java
* (add) tika-core/src/main/java/org/apache/tika/pipes/PipesException.java
* (edit)
tika-server/tika-server-core/src/test/java/org/apache/tika/server/core/TikaServerAsyncIntegrationTest.java
* (edit)
tika-pipes/tika-fetch-iterators/tika-fetch-iterator-jdbc/src/main/java/org/apache/tika/pipes/fetchiterator/jdbc/JDBCFetchIterator.java
* (edit)
tika-core/src/main/java/org/apache/tika/pipes/async/AsyncClientConfig.java
* (edit)
tika-core/src/main/java/org/apache/tika/pipes/emitter/AbstractEmitter.java
* (edit)
tika-pipes/tika-fetch-iterators/tika-fetch-iterator-s3/src/main/java/org/apache/tika/pipes/fetchiterator/s3/S3FetchIterator.java
* (edit) tika-core/src/main/java/org/apache/tika/pipes/FetchEmitTuple.java
* (add)
tika-server/tika-server-core/src/main/java/org/apache/tika/server/core/resource/PipesResource.java
* (add)
tika-server/tika-server-core/src/test/java/org/apache/tika/server/core/TikaServerPipesIntegrationTest.java
* (delete)
tika-core/src/main/java/org/apache/tika/pipes/async/AsyncRuntimeException.java
* (edit)
tika-core/src/test/java/org/apache/tika/pipes/async/AsyncProcessorTest.java
* (edit)
tika-server/tika-server-core/src/main/java/org/apache/tika/server/core/resource/AsyncResource.java
* (delete)
tika-server/tika-server-core/src/test/java/org/apache/tika/server/core/TikaServerEmitterIntegrationTest.java
* (add) tika-core/src/main/java/org/apache/tika/pipes/PipesConfig.java
* (edit) tika-core/src/main/java/org/apache/tika/pipes/async/AsyncProcessor.java
* (edit)
tika-core/src/main/java/org/apache/tika/pipes/fetchiterator/FileSystemFetchIterator.java
* (edit)
tika-server/tika-server-core/src/main/java/org/apache/tika/server/core/TikaServerProcess.java
* (delete) tika-core/src/main/java/org/apache/tika/pipes/async/AsyncClient.java
* (add) tika-core/src/main/java/org/apache/tika/pipes/PipesResult.java
* (edit)
tika-core/src/main/java/org/apache/tika/pipes/fetchiterator/FetchIterator.java
* (edit) tika-core/src/main/java/org/apache/tika/pipes/emitter/EmitData.java
* (add)
tika-server/tika-server-core/src/test/java/org/apache/tika/server/core/TikaPipesTest.java
* (delete) tika-core/src/main/java/org/apache/tika/pipes/async/AsyncResult.java
* (add) tika-core/src/main/java/org/apache/tika/pipes/PipesConfigBase.java
> Refactor the AsyncProcessor in 2.x
> ----------------------------------
>
> Key: TIKA-3370
> URL: https://issues.apache.org/jira/browse/TIKA-3370
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Assignee: Tim Allison
> Priority: Major
>
> Yesterday, I finally got back to trying to wire the AsyncProcessor in
> tika-pipes into the AsyncHandler in tika-server. I've now convinced myself
> that the notorious antipattern of using a db as a queue is in fact a really,
> really bad idea -- there's every chance that I wasn't doing it right or that
> H2 isn't a great choice...my $ is on the former.
> Nevertheless, I think removing H2 from that process and going with a
> modification of our ForkParser or a lightweight purpose-built knock-off to
> handle fetchers and emitters will be as robust, a bunch cleaner, have fewer
> dependencies and hopefully be more performant than what I had in the
> AsyncProcessor.
> Immediate term, I'd like to get this running and wired into tika-server.
> Longer term, we can use this instead of tika-batch in tika-app...more use,
> fewer bugs.
> This is the last item I'd like to finish before 2.0.0-BETA.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)