Thanks, Sergey. Sorry to not include the stack trace -- was worried it would be too long; I've added the stack trace to the ticket now :)
Warm regards,
Alex

-----Original Message-----
From: Sergey Vladimirov [mailto:[email protected]]
Sent: 25 September 2012 10:21 AM
To: POI Users List
Subject: Re: Bug 53380

Alex,

I will take a look into it a bit later. But I need to note that it should be a different bug, i.e. a different reason for the ArrayIndexOutOfBounds, because all previous files are "passed" now. So, please include the stack trace next time :)

Best regards,
Sergey

On Tue, Sep 25, 2012 at 11:07 AM, Alex Cougarman <[email protected]> wrote:

> Hi Sergey,
>
> The bug persists. We've uploaded a Word DOC (blank_2.doc) to the bug
> that generates the ArrayIndexOutOfBounds exception:
> https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> This is using the latest build (#50) from here:
> https://builds.apache.org/job/POI/50/
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Sergey Vladimirov [mailto:[email protected]]
> Sent: 13 September 2012 11:55 AM
> To: POI Users List
> Subject: Re: Bug 53380
>
> Hi,
>
> Try #47, built by Yegor:
>
> https://builds.apache.org/job/POI/47/
>
> Best regards,
> Sergey
>
> On Thu, Sep 13, 2012 at 9:03 AM, Alex Cougarman <[email protected]> wrote:
>
> > Any update on the bug fix for this? There's a Build #46 on this page
> > but it says "Failed" when you roll over the red circle:
> > https://builds.apache.org/job/POI/46/
> >
> > Thank you :)
> >
> > Warm regards,
> > Alex Cougarman
> >
> > Bahá’í World Centre
> > Haifa, Israel
> > Office: +972-4-835-8683
> > Cell: +972-54-241-4742
> > [email protected]
> >
> >
> > -----Original Message-----
> > From: Alex Cougarman [mailto:[email protected]]
> > Sent: 11 September 2012 11:42 AM
> > To: 'POI Users List'
> > Subject: RE: Bug 53380
> >
> > Hi Sergey,
> >
> > Thank you for looking into this issue.
> > It will make a huge difference for us :)
> >
> > Warm regards,
> > Alex
> >
> > -----Original Message-----
> > From: Sergey Vladimirov [mailto:[email protected]]
> > Sent: 10 September 2012 2:13 PM
> > To: POI Users List
> > Subject: Re: Bug 53380
> >
> > Hi
> >
> > I will take a look into it today or tomorrow.
> > Sorry for the long wait.
> >
> > Best regards,
> > Sergey
> >
> > On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <[email protected]> wrote:
> >
> > > Dear Yegor,
> > >
> > > Thank you for your reply. If I knew enough about Java, I'd go in
> > > and fix it :) Just happy to have you guys providing such a great tool.
> > > Thanks and keep up the great work.
> > >
> > > Warm regards,
> > > Alex
> > >
> > > -----Original Message-----
> > > From: Yegor Kozlov [mailto:[email protected]]
> > > Sent: 10 September 2012 10:16 AM
> > > To: POI Users List
> > > Subject: Re: Bug 53380
> > >
> > > We have all pre-requisites for fixing this bug, just need to find
> > > a person to do it :)
> > >
> > > POI is a volunteer project and if this problem is important for
> > > you, please do work on it and submit a patch. Otherwise please wait.
> > > Unfortunately we don't have an active developer working on the DOC/DOCX
> > > modules, so fixing may take some time.
> > >
> > > Yegor
> > >
> > > On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <[email protected]> wrote:
> > > > Hi. I'm having the same issue from this bug with hundreds of our
> > > > DOC files being fed through Solr/Tika:
> > > > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> > > >
> > > > I downloaded the DOC file attached to the ticket and was able to
> > > > generate the same error we've been getting (please see below for
> > > > the exception).
> > > >
> > > > Anyone know of a solution/workaround? Is there a timeline for a fix?
> > > > I commented and voted on the ticket but not sure if it's a priority.
> > > > Thanks.
> > > >
> > > > org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > > org.apache.solr.common.SolrException: org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > >     at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:230)
> > > >     at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74)
> > > >     at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
> > > >     at org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handleRequest(RequestHandlers.java:240)
> > > >     at org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > > >     at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:454)
> > > >     at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:275)
> > > >     at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1337)
> > > >     at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:484)
> > > >     at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:119)
> > > >     at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:524)
> > > >     at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:233)
> > > >     at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1065)
> > > >     at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:413)
> > > >     at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:192)
> > > >     at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:999)
> > > >     at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:117)
> > > >     at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:250)
> > > >     at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:149)
> > > >     at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:111)
> > > >     at org.eclipse.jetty.server.Server.handle(Server.java:351)
> > > >     at org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:454)
> > > >     at org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(BlockingHttpConnection.java:47)
> > > >     at org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(AbstractHttpConnection.java:890)
> > > >     at org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.headerComplete(AbstractHttpConnection.java:944)
> > > >     at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > > >     at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230)
> > > >     at org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpConnection.java:66)
> > > >     at org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(SocketConnector.java:254)
> > > >     at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:599)
> > > >     at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:534)
> > > >     at java.lang.Thread.run(Unknown Source)
> > > > Caused by: org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > >     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
> > > >     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> > > >     at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> > > >     at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:224)
> > > >     ... 31 more
> > > > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > > >     at org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > > >     at org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > > >     at org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstractType.java:56)
> > > >     at org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingDescriptor.java:38)
> > > >     at org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOperation(CharacterSprmUncompressor.java:582)
> > > >     at org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(CharacterSprmUncompressor.java:65)
> > > >     at org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > > >     at org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121)
> > > >     at org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > > >     at org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:77)
> > > >     at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:185)
> > > >     at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:160)
> > > >     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> > > >     ... 34 more
> > > >
> > > >
> > > > Warm regards,
> > > > Alex
> > > >
> > >
> > > ---------------------------------------------------------------------
> > > To unsubscribe, e-mail: [email protected]
> > > For additional commands, e-mail: [email protected]
> > >
> >
> > --
> > Sergey Vladimirov
> >
>
> --
> Sergey Vladimirov

--
Sergey Vladimirov
