Thanks for creating the JIRA ticket.

On Wed, May 29, 2019 at 6:23 PM Tim Allison <[email protected]> wrote:
>
> Ugh.  Thank you!
>
> https://issues.apache.org/jira/browse/TIKA-2886
>
> On Wed, May 29, 2019 at 6:57 AM Tucker B <[email protected]> wrote:
> >
> > After upgrading to Tika 1.21 I have noticed several known XLSX files
> > are detected by Tika as "application/x-tika-ooxml". I think I've
> > narrowed it down to the new StreamingZipContainerDetector. After
> > inspecting the "[Content_Types].xml" of these XLSX files there is no
> > reference to any of the configured content types for XLSX in the
> > OOXML_CONTENT_TYPES in StreamingZipContainerDetector. Specifically,
> >
> > "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet.main+xml"
> > "application/vnd.ms-excel.sheet.macroEnabled.main+xml"
> > "application/vnd.ms-excel.sheet.binary.macroEnabled.main"
> >
> > I do see a content type of
> >
> > "application/vnd.openxmlformats-officedocument.spreadsheetml.template.main+xml"
> >
> > in "[Content_Types].xml". Is the StreamingZipContainerDetector missing
> > the XSSFRelation TEMPLATE_WORKBOOK in OOXML_CONTENT_TYPES?

Reply via email to