I have raised issues TIKA-1281 and TIKA-1282 in the Tika Jira: one for the additional XML type, the second for the 4 additional Gzip types.
Avi.

On Fri, Apr 25, 2014 at 12:04 PM, Nick Burch <[email protected]> wrote:

> On Fri, 25 Apr 2014, אברהם חיון wrote:
>
>> I pitched the team to drop support for unrecognized (by Tika)
>> media-types, and if Tika decides to insert them into its registry then we
>> will support them automatically.
>
> If you want additional types supported, please raise a bug in jira, and
> list them there. Someone'll hopefully review and commit them fairly quickly
> from there!
>
>> The GZIP format is as follows in Wikipedia:
>> http://en.wikipedia.org/wiki/Gzip
>>
>> The MediaType according to Wikipedia is application/gzip, while in the
>> TIKA DB it is: "*application/x-gzip*" and the "*application/gzip*" is
>> totally left out (not even an alias) !?
>
> Looks like those were only added quite recently, from the date of the RFC.
> I've raised TIKA-1280 to track it
>
> Nick
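
For anyone following the thread, here is a minimal sketch of what the missing alias means in practice. It is not Tika's registry or API: the `ALIASES` table and the `normalize` and `same_type` helpers are hypothetical names made up for illustration, and treating "application/gzip" as the canonical name is an assumption of the sketch, not a statement of how Tika resolves it.

```python
# Hypothetical alias table for illustration only; this is NOT Tika's registry.
# Assumption: "application/gzip" is treated as the canonical name.
ALIASES = {
    "application/x-gzip": "application/gzip",
}


def normalize(media_type: str) -> str:
    """Map a known alias to its canonical type; return other types unchanged."""
    key = media_type.strip().lower()
    return ALIASES.get(key, key)


def same_type(a: str, b: str) -> bool:
    """Two media types match if they normalize to the same canonical name."""
    return normalize(a) == normalize(b)


if __name__ == "__main__":
    print(normalize("application/x-gzip"))
    print(same_type("application/gzip", "application/x-gzip"))
```

Without an alias entry of this kind, a lookup for one name simply does not find the other, which is the gap described in the quoted message.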
