[ 
https://issues.apache.org/jira/browse/TIKA-3163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17597766#comment-17597766
 ] 

PJ Fanning commented on TIKA-3163:
----------------------------------

I added a POI test - 
https://github.com/apache/poi/commit/30a0a4362ac38d97f5214a3b558486f881ab6958

This seems to indicate that whatever problems older versions of POI had with 
this file are no longer there. 

[~tallison] can this be closed?

> Java null pointer exception thrown while parsing an xlsx file to string even 
> though the xlsx file is working fine in the wps
> ----------------------------------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-3163
>                 URL: https://issues.apache.org/jira/browse/TIKA-3163
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.24.1
>            Reporter: Parikshit Phukan
>            Priority: Minor
>         Attachments: CVLKRA-KYC_Download_File_Structure_V3.1.xlsx
>
>
> I am using tika to extract text and feed it to my lucene indexer. Tika is 
> throwing a null pointer exception for a particular xlsx file. It works fine 
> while testing on other xlsx file and only throws an exception on this 
> particular file. I'll be attaching the xlslx file for you to check out. 
> Kindly help me out. 
> Code :-
> String path = "D:\\CVLKRA-KYC_Download_File_Structure_V3.1.xlsx";String path 
> = "D:\\CVLKRA-KYC_Download_File_Structure_V3.1.xlsx";
> File file = new File(path); 
> System.out.print(tika.parseToString(file));
>  
> Error :-
> Exception in thread "main" org.apache.tika.exception.TikaException: 
> Unexpected RuntimeException from 
> org.apache.tika.parser.microsoft.ooxml.OOXMLParser@54a67a45Exception in 
> thread "main" org.apache.tika.exception.TikaException: Unexpected 
> RuntimeException from 
> org.apache.tika.parser.microsoft.ooxml.OOXMLParser@54a67a45 at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:293) at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) at 
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143) at 
> org.apache.tika.Tika.parseToString(Tika.java:527) at 
> org.apache.tika.Tika.parseToString(Tika.java:642) at 
> poc.please.TikaPoc.main(TikaPoc.java:42)Caused by: 
> java.lang.NullPointerException at 
> org.apache.poi.xssf.usermodel.XSSFTableStyle.<init>(XSSFTableStyle.java:64) 
> at org.apache.poi.xssf.model.StylesTable.readFrom(StylesTable.java:245) at 
> org.apache.poi.xssf.model.StylesTable.<init>(StylesTable.java:138) at 
> org.apache.poi.xssf.eventusermodel.XSSFReader.getStylesTable(XSSFReader.java:127)
>  at 
> org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator.buildXHTML(XSSFExcelExtractorDecorator.java:143)
>  at 
> org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTML(AbstractOOXMLExtractor.java:136)
>  at 
> org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator.getXHTML(XSSFExcelExtractorDecorator.java:126)
>  at 
> org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:210)
>  at 
> org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:113)
>  at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) 
> ... 5 more



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to