[ 
https://issues.apache.org/jira/browse/TIKA-1208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13866101#comment-13866101
 ] 

Peter Ansell commented on TIKA-1208:
------------------------------------

The patch applies cleanly to the current trunk, but it doesn't compile:

[INFO] Compiling 40 source files to /home/ans025/gitrepos/tika/tika-core/target/test-classes
[INFO] -------------------------------------------------------------
[ERROR] COMPILATION ERROR : 
[INFO] -------------------------------------------------------------
[ERROR] /home/ans025/gitrepos/tika/tika-core/src/test/java/org/apache/tika/detect/Any23DetectorTest.java:[432,66] error: cannot find symbol
[ERROR]  class Any23DetectorTest
/home/ans025/gitrepos/tika/tika-core/src/test/java/org/apache/tika/detect/Any23DetectorTest.java:[448,37] error: cannot find symbol
[INFO] 2 errors 

I am not sure what the two broken lines should be changed to, as I am not 
familiar with the Tika codebase at this point.

I have put the patch on GitHub in case it is easier for you to work on it 
there (you are a collaborator on the repository):

https://github.com/ansell/tika/tree/TIKA-1208

> Migrate Any23 mime contributions to Tika
> ----------------------------------------
>
>                 Key: TIKA-1208
>                 URL: https://issues.apache.org/jira/browse/TIKA-1208
>             Project: Tika
>          Issue Type: Sub-task
>          Components: mime
>            Reporter: Lewis John McGibbney
>             Fix For: 1.5
>
>         Attachments: TIKA-1208.patch
>
>
> We begin with one of the most obvious areas in which there
> is overlap.
> In short, the appeal of this package is the addition of detection 
> for the following types:
>  - text/n3
>  - text/rdf+n3
>  - application/n3
>  - text/x-nquads
>  - text/rdf+nq
>  - text/nq
>  - application/nq
>  - text/turtle
>  - application/x-turtle
>  - application/turtle
>  - application/trix
>  
> Therefore, although both Tika and Any23 perform MIME type-related
> tasks, there is a contribution to be made. This involves the transferral of
> code pertaining to pattern recognition, MIME type XML definitions within 
> tika-mimetypes.xml, and a Purifier implementation that removes any 
> blank characters at the head of a file that might 
> prevent its MIME type detection.
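
For illustration only, the Purifier idea in the description above (dropping 
leading blank characters so that detection patterns anchored at the start 
of the content can match) might be sketched as follows. This is not the code 
in TIKA-1208.patch: the class name LeadingBlankSkipper and the method 
skipLeadingBlanks are hypothetical, and the sketch assumes a stream that 
supports mark/reset and treats "blank" as whatever Character.isWhitespace 
accepts. It does not handle byte order marks.

```java
import java.io.IOException;
import java.io.InputStream;

// Hypothetical sketch; not the Purifier implementation from TIKA-1208.patch.
public class LeadingBlankSkipper {

    /**
     * Advances a mark-supporting stream past leading whitespace bytes,
     * leaving it positioned at the first non-blank byte (or at end of stream).
     */
    public static void skipLeadingBlanks(InputStream in) throws IOException {
        if (!in.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        while (true) {
            in.mark(1);           // remember the position before the next byte
            int b = in.read();
            if (b == -1) {
                return;           // end of stream: only blanks were present
            }
            if (!Character.isWhitespace(b)) {
                in.reset();       // un-read the first non-blank byte
                return;
            }
        }
    }
}
```

A caller would typically wrap the input in a BufferedInputStream (which 
supports mark/reset), call skipLeadingBlanks, and only then hand the stream 
to a detector.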



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
