Hi,
I am using Tika 0.7. When I try to parse an .XLS file, I get this error:
Could not parse document:class org.apache.tika.exception.TikaException:TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.officepar...@110003
org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.officepar...@110003
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:138)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:99)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:112)
    at metricAv.TikaParser.parse(TikaParser.java:57)
    at metricAv.TikaParser.main(TikaParser.java:39)
Caused by: java.io.IOException: Unable to read entire block; 1 byte read; expected 512 bytes
    at org.apache.poi.poifs.storage.RawDataBlock.<init>(RawDataBlock.java:62)
    at org.apache.poi.poifs.storage.RawDataBlockList.<init>(RawDataBlockList.java:51)
    at org.apache.poi.poifs.filesystem.POIFSFileSystem.<init>(POIFSFileSystem.java:86)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:74)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:132)
    ... 4 more
When the same document is saved as an .ODS file, it parses without problems.
This is the source I am using:
private void parse(String resourceLocation)
        throws IOException, SAXException, TikaException {
    InputStream input = new FileInputStream(new File(resourceLocation));
    try {
        ContentHandler textHandler = new BodyContentHandler();
        Metadata metadata = new Metadata();
        AutoDetectParser parser = new AutoDetectParser();
        parser.parse(input, textHandler, metadata);
    } finally {
        // Close the stream even if parsing throws
        input.close();
    }
}
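To help narrow this down, here is a minimal sketch of a check that could be run on the file itself, independent of Tika. It only tests two things: whether the file begins with the 8-byte OLE2 compound document signature that legacy .XLS files carry, and whether its length is a multiple of 512, the block size mentioned in the exception. The class name XlsSanityCheck and its method names are made up for this example, and a failed check is only a hint, not a confirmed cause of the error.

```java
import java.io.DataInputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.util.Arrays;

// Hypothetical helper, not part of Tika or POI.
public class XlsSanityCheck {

    // Signature found at the start of OLE2 compound documents (legacy .xls, .doc, .ppt).
    static final byte[] OLE2_SIGNATURE = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // True if the first 8 bytes of header match the OLE2 signature.
    public static boolean hasOle2Signature(byte[] header) {
        if (header.length < OLE2_SIGNATURE.length) {
            return false;
        }
        byte[] prefix = Arrays.copyOf(header, OLE2_SIGNATURE.length);
        return Arrays.equals(prefix, OLE2_SIGNATURE);
    }

    // True if length is a positive multiple of blockSize.
    public static boolean isBlockAligned(long length, int blockSize) {
        return length > 0 && length % blockSize == 0;
    }

    public static void main(String[] args) throws IOException {
        File file = new File(args[0]);
        byte[] header = new byte[OLE2_SIGNATURE.length];
        DataInputStream in = new DataInputStream(new FileInputStream(file));
        try {
            // readFully throws EOFException if the file is shorter than 8 bytes
            in.readFully(header);
        } finally {
            in.close();
        }
        System.out.println("Size in bytes:          " + file.length());
        System.out.println("OLE2 signature present: " + hasOle2Signature(header));
        System.out.println("Multiple of 512 bytes:  " + isBlockAligned(file.length(), 512));
    }
}
```

It would be run as "java XlsSanityCheck <path to the .XLS file>". If the signature is missing, the file may not really be in the binary Excel format despite its extension. If the size is not a multiple of 512, that at least matches the short final read reported in the trace.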
Can anybody explain what is causing this?
Roland