Hi Folks,
Using the tika-server, I am having issues parsing the attachment ENVI hdr
file at [0] with the EnviHeaderParser [1].
My experiments are as follows

EXPERIMENT 1

Request
-----------
curl -T
/usr/local/coal-sds-deploy/data/staging/ang20150420t182050_corr_v1e_img.hdr
http://localhost:9998/meta --header "Content-Type: application/envi.hdr"

Response
--------------
INFO  meta (application/envi.hdr)
WARN  meta: Text extraction failed
org.apache.tika.exception.TikaException: Unexpected RuntimeException from
org.apache.tika.server.resource.TikaResource$1@46d1d140
    at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:279)
    at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
    at
org.apache.tika.server.resource.TikaResource.parse(TikaResource.java:384)
    at
org.apache.tika.server.resource.MetadataResource.parseMetadata(MetadataResource.java:126)
    at
org.apache.tika.server.resource.MetadataResource.getMetadata(MetadataResource.java:60)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at
org.apache.cxf.service.invoker.AbstractInvoker.performInvocation(AbstractInvoker.java:181)
    at
org.apache.cxf.service.invoker.AbstractInvoker.invoke(AbstractInvoker.java:97)
    at org.apache.cxf.jaxrs.JAXRSInvoker.invoke(JAXRSInvoker.java:202)
    at org.apache.cxf.jaxrs.JAXRSInvoker.invoke(JAXRSInvoker.java:101)
    at
org.apache.cxf.interceptor.ServiceInvokerInterceptor$1.run(ServiceInvokerInterceptor.java:59)
    at
org.apache.cxf.interceptor.ServiceInvokerInterceptor.handleMessage(ServiceInvokerInterceptor.java:96)
    at
org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:307)
    at
org.apache.cxf.transport.ChainInitiationObserver.onMessage(ChainInitiationObserver.java:121)
    at
org.apache.cxf.transport.http.AbstractHTTPDestination.invoke(AbstractHTTPDestination.java:274)
    at
org.apache.cxf.transport.http_jetty.JettyHTTPDestination.doService(JettyHTTPDestination.java:261)
    at
org.apache.cxf.transport.http_jetty.JettyHTTPHandler.handle(JettyHTTPHandler.java:76)
    at
org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1088)
    at
org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1024)
    at
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
    at
org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
    at
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
    at org.eclipse.jetty.server.Server.handle(Server.java:370)
    at
org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:494)
    at
org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(AbstractHttpConnection.java:973)
    at
org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.headerComplete(AbstractHttpConnection.java:1035)
    at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:647)
    at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:231)
    at
org.eclipse.jetty.server.AsyncHttpConnection.handle(AsyncHttpConnection.java:82)
    at
org.eclipse.jetty.io.nio.SelectChannelEndPoint.handle(SelectChannelEndPoint.java:696)
    at
org.eclipse.jetty.io.nio.SelectChannelEndPoint$1.run(SelectChannelEndPoint.java:53)
    at
org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
    at
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
    at java.lang.Thread.run(Thread.java:748)
Caused by: javax.ws.rs.WebApplicationException: HTTP 415 Unsupported Media
Type
    at
org.apache.tika.server.resource.TikaResource$1.parse(TikaResource.java:119)
    at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:277)
    ... 36 more

EXPERIMENT 2

Request
----------
curl -T
/usr/local/coal-sds-deploy/data/staging/ang20150420t182050_corr_v1e_img.hdr
http://localhost:9998/meta --header "Content-Encoding: application/envi.hdr"

Response
--------------
"X-Parsed-By","org.apache.tika.parser.DefaultParser","org.apache.tika.parser.txt.TXTParser"
"Content-Encoding","ISO-8859-1"
"language","en"
"Content-Type","text/plain; charset=ISO-8859-1"

Is there any way I can explicitly force execution of the EnviHeaderParser?
Thanks
Lewis

[0] https://issues.apache.org/jira/browse/TIKA-2636
[1]
https://github.com/apache/tika/blob/9130bbc1fa6d69419b2ad294917260d6b1cced08/tika-parsers/src/main/java/org/apache/tika/parser/envi/EnviHeaderParser.java

-- 
http://home.apache.org/~lewismc/
http://people.apache.org/keys/committer/lewismc

Reply via email to