[
https://issues.apache.org/jira/browse/TIKA-4367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17914074#comment-17914074
]
Tim Allison commented on TIKA-4367:
-----------------------------------
Right...if you had followed my
[comment|https://issues.apache.org/jira/browse/TIKA-4186?focusedCommentId=17913735&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17913735],
I would have helped you configure the endpoints you need.
This "issue" is also better a question for the user list.
> Problem with the: org.apache.tika.server.core.ServerStatusWatcher forked
> process observed TIMEOUT and is shutting down
> ----------------------------------------------------------------------------------------------------------------------
>
> Key: TIKA-4367
> URL: https://issues.apache.org/jira/browse/TIKA-4367
> Project: Tika
> Issue Type: Bug
> Components: tika-server
> Affects Versions: 3.0.0
> Reporter: mbiso
> Priority: Major
> Attachments: tika-config.xml, tika-config_new.xml
>
>
> Hi.
> i have this problem on my tika-server running in a docker container.
> Due to large files, i obtain timeout and the tika process down.
> this is the error:
>
> {code:java}
> 2025-01-16T01:29:19.096206347Z INFO [qtp274100821-133] 02:29:19,096
> org.apache.tika.server.core.resource.MetadataResource /meta (application/pdf)
> 2025-01-16T01:29:19.120130385Z INFO [qtp274100821-270] 02:29:19,120
> org.apache.tika.server.core.resource.TikaResource /tika (application/pdf)
> 2025-01-16T01:29:19.213411527Z INFO [qtp274100821-133] 02:29:19,213
> org.apache.tika.server.core.resource.MetadataResource /meta (application/pdf)
> 2025-01-16T01:29:19.230454549Z INFO [qtp274100821-270] 02:29:19,230
> org.apache.tika.server.core.resource.TikaResource /tika (application/pdf)
> 2025-01-16T01:56:18.370380628Z INFO [qtp274100821-284] 02:56:18,370
> org.apache.tika.server.core.resource.MetadataResource /meta (application/pdf)
> 2025-01-16T02:01:18.430280014Z ERROR [Thread-11] 03:01:18,428
> org.apache.tika.server.core.ServerStatusWatcher Timeout task PARSE, millis
> elapsed 300055; consider increasing the allowable time with the
> <taskTimeoutMillis/> parameter or the X-Tika-Timeout-Millis header
> 2025-01-16T02:01:18.437740057Z WARN [Thread-11] 03:01:18,437
> org.apache.tika.server.core.ServerStatusWatcher forked process observed
> TIMEOUT and is shutting down.
> 2025-01-16T02:01:18.439693546Z INFO [Thread-11] 03:01:18,439
> org.apache.tika.server.core.ServerStatusWatcher Shutting down forked process
> with status: TIMEOUT
> 2025-01-16T02:01:19.851234798Z INFO [pool-2-thread-1] 03:01:19,817
> org.apache.tika.server.core.TikaServerWatchDog forked process exited with
> exit value 3
> 2025-01-16T02:01:20.644728948Z INFO [main] 03:01:20,643
> org.apache.tika.server.core.TikaServerProcess Starting Apache Tika 3.0.0
> server
> 2025-01-16T02:01:20.773526359Z INFO [main] 03:01:20,772
> org.apache.tika.server.core.TikaServerProcess Using custom config:
> /tika-config.xml
> 2025-01-16T02:01:21.358160073Z INFO [main] 03:01:21,357
> org.apache.tika.server.core.TikaServerProcess loading resource from SPI:
> class org.apache.tika.server.standard.resource.XMPMetadataResource
> 2025-01-16T02:01:21.527210481Z Jan 16, 2025 3:01:21 AM
> org.apache.cxf.endpoint.ServerImpl initDestination
> 2025-01-16T02:01:21.527237406Z INFO: Setting the server's publish address to
> be http://0.0.0.0:9998/
> 2025-01-16T02:01:21.627014872Z INFO [main] 03:01:21,626
> org.eclipse.jetty.server.Server jetty-11.0.24; built:
> 2024-08-26T18:11:22.448Z; git: 5dfc59a691b748796f922208956bd1f2794bcd16; jvm
> 21.0.5+11-Ubuntu-1ubuntu124.04
> 2025-01-16T02:01:21.685264827Z INFO [main] 03:01:21,684
> org.eclipse.jetty.server.AbstractConnector Started
> ServerConnector@50b1f030{HTTP/1.1, (http/1.1)}
> {0.0.0.0:9998}
> 2025-01-16T02:01:21.687671013Z INFO [main] 03:01:21,687
> org.eclipse.jetty.server.Server Started
> Server@6034e75d{STARTING}[11.0.24,sto=0] @1755ms
> 2025-01-16T02:01:21.711747262Z INFO [main] 03:01:21,711
> org.eclipse.jetty.server.handler.ContextHandler Started
> o.a.c.t.h.JettyContextHandler@56febdc{/,null,AVAILABLE}
> 2025-01-16T02:01:21.716535893Z INFO [main] 03:01:21,716
> org.apache.tika.server.core.TikaServerProcess Started Apache Tika server
> 5598029c-6de7-4b53-8284-0f18814c049f at http://0.0.0.0:9998/
> {code}
>
> My issue is, because ManifoldCF uses tika to parse the files, the ManifoldCF
> job ends with: "Error: Repeated service interruptions - failure processing
> document: The target server failed to respond"
> Is there a way to avoid the shutdown of tika process for timeout?
> In the attachment, you find my tika-config.xml if it could help.
> Thanks a lot
> Mario
>
>
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)