[ 
https://issues.apache.org/jira/browse/TIKA-1484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated TIKA-1484:
------------------------------
    Attachment: TIKA-1484.patch

> Boilerpipe dependency is evil
> -----------------------------
>
>                 Key: TIKA-1484
>                 URL: https://issues.apache.org/jira/browse/TIKA-1484
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.6
>            Reporter: Ben McCann
>            Priority: Major
>         Attachments: TIKA-1484.patch
>
>
> The Boilerpipe project bundles inside it two classes from org.cyberneko.html. 
> We're already using NekoHTML in our project. Depending on which library shows 
> up on our classpath certain parts of our project will either work or not. I'd 
> really love it if Boilerpipe could be fixed or replaced with some other 
> library that is a better citizen.
> I see I'm not the first person to run into this as another Tika user has 
> filed a bug on the Boilerpipe project: 
> https://code.google.com/p/boilerpipe/issues/detail?id=62



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to