[
https://issues.apache.org/jira/browse/TIKA-1208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13866101#comment-13866101
]
Peter Ansell commented on TIKA-1208:
------------------------------------
The patch applies cleanly to the current trunk but it doesn't compile:
[INFO] Compiling 40 source files to
/home/ans025/gitrepos/tika/tika-core/target/test-classes
[INFO] -------------------------------------------------------------
[ERROR] COMPILATION ERROR :
[INFO] -------------------------------------------------------------
[ERROR]
/home/ans025/gitrepos/tika/tika-core/src/test/java/org/apache/tika/detect/Any23DetectorTest.java:[432,66]
error: cannot find symbol
[ERROR] class Any23DetectorTest
/home/ans025/gitrepos/tika/tika-core/src/test/java/org/apache/tika/detect/Any23DetectorTest.java:[448,37]
error: cannot find symbol
[INFO] 2 errors
I am not sure what the two broken lines should be changed to, as I am not
familiar with the Tika codebase at this point.
I have put the patch on GitHub to work on it if that is easier for you (you are
a collaborator on the repository):
https://github.com/ansell/tika/tree/TIKA-1208
> Migrate Any23 mime contributions to Tika
> ----------------------------------------
>
> Key: TIKA-1208
> URL: https://issues.apache.org/jira/browse/TIKA-1208
> Project: Tika
> Issue Type: Sub-task
> Components: mime
> Reporter: Lewis John McGibbney
> Fix For: 1.5
>
> Attachments: TIKA-1208.patch
>
>
> We begin with one of the most obvious areas in which there
> is overlap.
> In short, the appeal of this package is the addition of detection
> for the following types:
> - text/n3
> - text/rdf+n3
> - application/n3
> - text/x-nquads
> - text/rdf+nq
> - text/nq
> - application/nq
> - text/turtle
> - application/x-turtle
> - application/turtle
> - application/trix
>
> Therefore although both Tika and Any23 execute the task of Mimetype-related
> tasks, there is a contribution to be made. This involves the trasferral of
> code pertaining to pattern recogition, Mimetype XML defitinions within
> tika-mimetypes.xml and a Purifier implementation that removes all
> the eventual blank characters at the header of a file that might
> prevents its MIME Type detection.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)