[ https://issues.apache.org/jira/browse/TIKA-309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chris A. Mattmann resolved TIKA-309. ------------------------------------ Resolution: Fixed - fixed in r836035: * was able to correctly identify RDF/OWL mime types using magic by changing regex pattern for localName in MimeTypes.java (in the case where only the <ns:localName..... is read, but there is no ">" at the end since we only read N first bytes of the magic header) * added unit tests and URLs from this issue for regression * refactored o.a.tika.mime.MimeDetectionTest to support URLs as InputStreams (as well as Files) * took out <match value="<!--" type="string" offset="0"/> for HTML detection since comments can appear in HTML, XML, etc., and aren't specific to HTML > Mime type application/rdf+xml not correctly detected > ---------------------------------------------------- > > Key: TIKA-309 > URL: https://issues.apache.org/jira/browse/TIKA-309 > Project: Tika > Issue Type: Bug > Components: mime > Affects Versions: 0.5 > Reporter: Yuan-Fang Li > Assignee: Chris A. Mattmann > Priority: Minor > Fix For: 0.5 > > > Mime type detector using AutoDetectParser and Metadata returns > "application/xml" for the URL http://www.w3.org/2002/07/owl#, where it should > be "application/rdf+xml". The correct mime type is also suggested here: > http://www.w3.org/TR/owl-ref/#MIMEType. > P.S., Tika was downloaded from svn and built with Maven last week. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.