[ 
https://issues.apache.org/jira/browse/TIKA-309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris A. Mattmann resolved TIKA-309.
------------------------------------

    Resolution: Fixed

- fixed in r836035:

* was able to correctly identify RDF/OWL mime types using magic by changing 
regex pattern for localName in MimeTypes.java (in the case where only the 
<ns:localName..... is read, but there is no ">" at the end since we only read N 
first bytes of the magic header)

* added unit tests and URLs from this issue for regression
* refactored o.a.tika.mime.MimeDetectionTest to support URLs as InputStreams 
(as well as Files)
* took out <match value="&lt;!--" type="string" offset="0"/> for HTML detection 
since comments can appear in HTML, XML, etc., and aren't specific to HTML



> Mime type application/rdf+xml not correctly detected
> ----------------------------------------------------
>
>                 Key: TIKA-309
>                 URL: https://issues.apache.org/jira/browse/TIKA-309
>             Project: Tika
>          Issue Type: Bug
>          Components: mime
>    Affects Versions: 0.5
>            Reporter: Yuan-Fang Li
>            Assignee: Chris A. Mattmann
>            Priority: Minor
>             Fix For: 0.5
>
>
> Mime type detector using AutoDetectParser and Metadata returns 
> "application/xml" for the URL http://www.w3.org/2002/07/owl#, where it should 
> be "application/rdf+xml". The correct mime type is also suggested here: 
> http://www.w3.org/TR/owl-ref/#MIMEType.
> P.S., Tika was downloaded from svn and built with Maven last week.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to