I'm using POI to extract text from word and excel documents. For word 
documents, I'm using org.apache.poi.hdf.extractor.WordDocument.

The problem comes if I try to open a password protected word document - it 
breaks POI with an out of memory error

Exception in thread "main" java.lang.OutOfMemoryError:
        at 
org.apache.poi.hdf.extractor.WordDocument.initDocProperties(WordDocument.java:575)
        at 
org.apache.poi.hdf.extractor.WordDocument.processComplexFile(WordDocument.java:329)
        at org.apache.poi.hdf.extractor.WordDocument.readFIB(WordDocument.java:287)
        at org.apache.poi.hdf.extractor.WordDocument.<init>(WordDocument.java(Compiled 
Code))

I don't want to extract text from these, but I would like to be able to 
detect them so I can skip them. Is there a way to discover if a Word 
document is protected?

Thanks
Nick


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to