Hi Folks, Using the tika-server, I am having issues parsing the attachment ENVI hdr file at [0] with the EnviHeaderParser [1]. My experiments are as follows
EXPERIMENT 1 Request ----------- curl -T /usr/local/coal-sds-deploy/data/staging/ang20150420t182050_corr_v1e_img.hdr http://localhost:9998/meta --header "Content-Type: application/envi.hdr" Response -------------- INFO meta (application/envi.hdr) WARN meta: Text extraction failed org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.server.resource.TikaResource$1@46d1d140 at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:279) at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143) at org.apache.tika.server.resource.TikaResource.parse(TikaResource.java:384) at org.apache.tika.server.resource.MetadataResource.parseMetadata(MetadataResource.java:126) at org.apache.tika.server.resource.MetadataResource.getMetadata(MetadataResource.java:60) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:498) at org.apache.cxf.service.invoker.AbstractInvoker.performInvocation(AbstractInvoker.java:181) at org.apache.cxf.service.invoker.AbstractInvoker.invoke(AbstractInvoker.java:97) at org.apache.cxf.jaxrs.JAXRSInvoker.invoke(JAXRSInvoker.java:202) at org.apache.cxf.jaxrs.JAXRSInvoker.invoke(JAXRSInvoker.java:101) at org.apache.cxf.interceptor.ServiceInvokerInterceptor$1.run(ServiceInvokerInterceptor.java:59) at org.apache.cxf.interceptor.ServiceInvokerInterceptor.handleMessage(ServiceInvokerInterceptor.java:96) at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:307) at org.apache.cxf.transport.ChainInitiationObserver.onMessage(ChainInitiationObserver.java:121) at org.apache.cxf.transport.http.AbstractHTTPDestination.invoke(AbstractHTTPDestination.java:274) at org.apache.cxf.transport.http_jetty.JettyHTTPDestination.doService(JettyHTTPDestination.java:261) at org.apache.cxf.transport.http_jetty.JettyHTTPHandler.handle(JettyHTTPHandler.java:76) at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1088) at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1024) at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135) at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255) at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116) at org.eclipse.jetty.server.Server.handle(Server.java:370) at org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:494) at org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(AbstractHttpConnection.java:973) at org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.headerComplete(AbstractHttpConnection.java:1035) at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:647) at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:231) at org.eclipse.jetty.server.AsyncHttpConnection.handle(AsyncHttpConnection.java:82) at org.eclipse.jetty.io.nio.SelectChannelEndPoint.handle(SelectChannelEndPoint.java:696) at org.eclipse.jetty.io.nio.SelectChannelEndPoint$1.run(SelectChannelEndPoint.java:53) at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608) at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543) at java.lang.Thread.run(Thread.java:748) Caused by: javax.ws.rs.WebApplicationException: HTTP 415 Unsupported Media Type at org.apache.tika.server.resource.TikaResource$1.parse(TikaResource.java:119) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:277) ... 36 more EXPERIMENT 2 Request ---------- curl -T /usr/local/coal-sds-deploy/data/staging/ang20150420t182050_corr_v1e_img.hdr http://localhost:9998/meta --header "Content-Encoding: application/envi.hdr" Response -------------- "X-Parsed-By","org.apache.tika.parser.DefaultParser","org.apache.tika.parser.txt.TXTParser" "Content-Encoding","ISO-8859-1" "language","en" "Content-Type","text/plain; charset=ISO-8859-1" Is there any way I can explicitly force execution of the EnviHeaderParser? Thanks Lewis [0] https://issues.apache.org/jira/browse/TIKA-2636 [1] https://github.com/apache/tika/blob/9130bbc1fa6d69419b2ad294917260d6b1cced08/tika-parsers/src/main/java/org/apache/tika/parser/envi/EnviHeaderParser.java -- http://home.apache.org/~lewismc/ http://people.apache.org/keys/committer/lewismc
