https://bz.apache.org/bugzilla/show_bug.cgi?id=60284
Bug ID: 60284 Summary: OldExcelExtractor should throw an EncryptedDocumentException Product: POI Version: unspecified Hardware: PC OS: Windows NT Status: NEW Severity: normal Priority: P2 Component: HSSF Assignee: dev@poi.apache.org Reporter: talli...@mitre.org On TIKA-2118, Seva Alekseyev shared a document that causes an UnsupportedCodePage exception. The file is an old xls (BIFF5) that is encrypted. After looking through https://www.openoffice.org/sc/excelfileformat.pdf and experimenting with some files that cause similar exceptions in Tika's regression corpus, it appears that all records after a file pass record are encrypted, even the contents of the code page record. Let's throw an EncryptedDocumentException (Encryption not supported for old excel files). I did find one file that doesn't appear to be encrypted (attached to TIKA-2118). I can open it, but it is write protected...so even though I can open it and copy and paste contents out of it, the inner contents are encrypted. -- You are receiving this mail because: You are the assignee for the bug. --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org For additional commands, e-mail: dev-h...@poi.apache.org