Hi,

I am using Tika 0.7. When I try to parse an .XLS file, I get this error:

Could not parse document:class org.apache.tika.exception.TikaException:TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.officepar...@110003
org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.officepar...@110003
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:138)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:99)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:112)
    at metricAv.TikaParser.parse(TikaParser.java:57)
    at metricAv.TikaParser.main(TikaParser.java:39)
Caused by: java.io.IOException: Unable to read entire block; 1 byte read; expected 512 bytes
    at org.apache.poi.poifs.storage.RawDataBlock.<init>(RawDataBlock.java:62)
    at org.apache.poi.poifs.storage.RawDataBlockList.<init>(RawDataBlockList.java:51)
    at org.apache.poi.poifs.filesystem.POIFSFileSystem.<init>(POIFSFileSystem.java:86)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:74)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:132)
    ... 4 more

When the same document is saved as an .ODS file, it parses without problems.

This is the source used:

    private void parse(String resourceLocation)
            throws IOException, SAXException, TikaException {
        InputStream input = new FileInputStream(new File(resourceLocation));
        ContentHandler textHandler = new BodyContentHandler();
        Metadata metadata = new Metadata();
        AutoDetectParser parser = new AutoDetectParser();
        parser.parse(input, textHandler, metadata);
       
        input.close();
    }
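
A quick check that might help narrow this down: the "1 byte read; expected 512 bytes" message suggests that POI reads the file in 512-byte blocks and found a single stray byte at the end. The sketch below is only a diagnostic under that assumption, not a fix. It reports whether the file starts with the OLE2 signature and how many bytes are left over after the last full 512-byte block. The class name XlsBlockCheck and its methods are hypothetical helpers, not part of Tika or POI.

```java
import java.io.DataInputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.util.Arrays;

// Hypothetical diagnostic helper; not part of Tika or POI.
class XlsBlockCheck {
    // Signature that opens an OLE2 compound document (.xls, .doc, .ppt).
    static final byte[] OLE2_SIGNATURE = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // Block size taken from the error message ("expected 512 bytes").
    static final int BLOCK_SIZE = 512;

    // True if the given bytes begin with the OLE2 signature.
    static boolean hasOle2Signature(byte[] header) {
        if (header.length < OLE2_SIGNATURE.length) {
            return false;
        }
        byte[] prefix = Arrays.copyOf(header, OLE2_SIGNATURE.length);
        return Arrays.equals(prefix, OLE2_SIGNATURE);
    }

    // Number of bytes left over after the last complete block.
    static long trailingBytes(long fileLength) {
        return fileLength % BLOCK_SIZE;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java XlsBlockCheck <file.xls>");
            return;
        }
        File file = new File(args[0]);
        byte[] header = new byte[OLE2_SIGNATURE.length];
        // readFully throws EOFException if the file is shorter than 8 bytes.
        try (DataInputStream in = new DataInputStream(new FileInputStream(file))) {
            in.readFully(header);
        }
        System.out.println("OLE2 signature present: " + hasOle2Signature(header));
        System.out.println("File length: " + file.length());
        System.out.println("Bytes past last full block: " + trailingBytes(file.length()));
    }
}
```

If the last number is 1 for the failing file, that would be consistent with the exception above, and re-saving the .XLS file from the spreadsheet application may be worth trying.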

Can anybody explain what is causing this?

Roland
