How about using a newer TIKA version?

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: [email protected]


> -----Original Message-----
> From: Yatin Baraiya [mailto:[email protected]]
> Sent: Thursday, September 29, 2011 9:35 AM
> To: [email protected]
> Subject: Re: error parsing .XLS file
> 
> Hy roland
> 
> i get same issue when i parse the Microsoft office doc.
> i have poi-3.6 version jar and tika 0.6 file in my project.
> 
> we get the following exception
> Caused by: java.lang.ArrayIndexOutOfBoundsException: 221433 at
> org.apache.poi.util.LittleEndian.getShort(LittleEndian.java:45)
> at org.apache.poi.hwpf.model.ListLevel.<init>(ListLevel.java:120)
> at org.apache.poi.hwpf.model.ListFormatOverrideLevel.<init>
> (ListFormatOverrideLevel.java:48)
> at org.apache.poi.hwpf.model.ListTables.<init>(ListTables.java:88)
> at org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:267)
> at org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:157)
> at
org.apache.poi.hwpf.extractor.WordExtractor.<init>(WordExtractor.java:62)
> at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:87)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:120)
> 
> i had tried with opening the tika-parsers-0.6.jar in winrar and find the
pom.xml
> from the jar and edit the pom.xml as per ur suggestion edited pom.xml
> snippets of the file is below <dependency>
>       <groupId>org.apache.poi</groupId>
>       <artifactId>poi</artifactId>
>       <version>3.6</version>
>     </dependency>
>     <dependency>
>       <groupId>org.apache.poi</groupId>
>       <artifactId>poi-scratchpad</artifactId>
>       <version>3.6</version>
>     </dependency>
>     <dependency>
>       <groupId>org.apache.poi</groupId>
>       <artifactId>poi-ooxml</artifactId>
>       <version>3.6</version>
>       <exclusions>
>         <exclusion>
>           <groupId>stax</groupId>
>           <artifactId>stax-api</artifactId>
>         </exclusion>
>       </exclusions>
>     </dependency>
> 
> can u tell me exactly  how would u get the solution?
> 
> can u help me to solve the said issue?
> 
> how to modify the POEM in order to use POI 3.7 with TIKA?
> 
> Thanks
> Yatin Baraiya
> 


Reply via email to