On Tue, 14 May 2019, 13:52 Tim Allison, <[email protected]> wrote:

> Hi Tucker,
> I know only a little about this area, but I think password protected
> xlsx files (and ooxml generally) are encrypted inside an OLE package
> so you can't even get to the underlying ooxml/zip file until you've
> decrypted the file.

That is my understanding as well, and I can confirm it based on the OfficeParser code paths for x-tika-ooxml-protected.

> Do you have the passwords to these files?

In most cases they use the default password. So I might need to write a custom mimetype detector and add it to the composite detector to handle the case where the default password works.

> On Tue, May 14, 2019 at 1:00 PM Tucker B <[email protected]> wrote:
> >
> > I have a password protected xlsx file. The default mime type detection
> > returns a mime type of "application/x-tika-ooxml-protected". Is it
> > possible to configure the mime type detection to return the underlying
> > content type, e.g.
> > "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"? I
> > didn't see any configuration options available to override in
> > custom-mimetypes.xml.
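
For illustration only, here is a minimal stdlib-only sketch of why detection never reaches the zip: a password protected xlsx starts with the OLE2 compound file signature, while a plain xlsx starts with the zip signature. The class and method names (ContainerSniffer, sniff) are hypothetical and are not part of Tika or POI. Note that the OLE2 signature alone does not prove the file is encrypted OOXML, since legacy .xls and .doc files use the same container. Telling them apart requires looking inside the container, which this sketch does not attempt.

```java
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper, not a Tika or POI class.
final class ContainerSniffer {
    // Signature at the start of every OLE2 compound file.
    private static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };
    // Zip local file header signature, "PK" followed by 0x03 0x04.
    private static final byte[] ZIP_MAGIC = { 'P', 'K', 0x03, 0x04 };

    // Classifies the outer container from the leading bytes of a file.
    static String sniff(byte[] header) {
        if (startsWith(header, OLE2_MAGIC)) {
            return "ole2";
        }
        if (startsWith(header, ZIP_MAGIC)) {
            return "zip";
        }
        return "unknown";
    }

    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data == null || data.length < prefix.length) {
            return false;
        }
        return Arrays.equals(Arrays.copyOf(data, prefix.length), prefix);
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java ContainerSniffer.java <file>");
            return;
        }
        byte[] header = new byte[8];
        int read;
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            // Reads up to 8 bytes; returns fewer only at end of file.
            read = in.readNBytes(header, 0, header.length);
        }
        System.out.println(sniff(Arrays.copyOf(header, read)));
    }
}
```

A custom detector along the lines discussed above would still have to decrypt the package with the default password before it could inspect the zip inside; the sketch only shows the outer container check.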
