[
https://issues.apache.org/jira/browse/TIKA-1484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17572576#comment-17572576
]
Hudson commented on TIKA-1484:
------------------------------
SUCCESS: Integrated in Jenkins build Tika ยป tika-main-jdk8 #720 (See
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/720/])
TIKA-1484 - isolate boilerpipe dependencies to tika-app, tika-bundle-standard
and tika-server-standard (tallison:
[https://github.com/apache/tika/commit/773bf3bf69751602d0e36cf28f342a27df50fd8f])
* (edit) tika-app/pom.xml
* (edit)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-html-module/pom.xml
* (edit)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-html-module/src/test/java/org/apache/tika/parser/html/HtmlParserTest.java
* (add)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-html-commons/README.md
* (edit) tika-bundles/tika-bundle-standard/pom.xml
* (edit) CHANGES.txt
* (add)
tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/test/java/org/apache/tika/sax/BoilerpipeHandlerTest.java
* (edit) tika-server/tika-server-standard/pom.xml
* (edit)
tika-parsers/tika-parsers-standard/tika-parsers-standard-package/pom.xml
> Boilerpipe dependency is evil
> -----------------------------
>
> Key: TIKA-1484
> URL: https://issues.apache.org/jira/browse/TIKA-1484
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.6
> Reporter: Ben McCann
> Priority: Major
> Attachments: TIKA-1484.patch
>
>
> The Boilerpipe project bundles inside it two classes from org.cyberneko.html.
> We're already using NekoHTML in our project. Depending on which library shows
> up on our classpath certain parts of our project will either work or not. I'd
> really love it if Boilerpipe could be fixed or replaced with some other
> library that is a better citizen.
> I see I'm not the first person to run into this as another Tika user has
> filed a bug on the Boilerpipe project:
> https://code.google.com/p/boilerpipe/issues/detail?id=62
--
This message was sent by Atlassian Jira
(v8.20.10#820010)