Francisco Tolmasky created TIKA-3132: ----------------------------------------
Summary: Many missing sub-class-of application/xml (or at least text/plain) for +xml types Key: TIKA-3132 URL: https://issues.apache.org/jira/browse/TIKA-3132 Project: Tika Issue Type: Bug Components: tika-batch Affects Versions: 1.24.1 Reporter: Francisco Tolmasky I'm not sure if this is by design or not, but many +xml types in tika-mimetypes.xml seem to be missing the sub-class-of tag. At first I thought that maybe Tika was smart enough to infer that +xml types must be subclasses of application/xml, but then I noticed that not *every* xml type is missing this tag. For example, image/svg+xml *does* have the application/xml sub-class-of tag explicitly set. Others include rdf+xml, dif+xml, etc. Would there be any opposition to me creating a PR that adds sub-class-of to a bunch of these (for example, atom+xml)? Or am I missing something here? -- This message was sent by Atlassian Jira (v8.3.4#803005)