[ https://issues.apache.org/jira/browse/TIKA-673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matt Parker closed TIKA-673.
----------------------------
Resolution: Duplicate

I see this is already added.

> BoilerPipe Integration
> ----------------------
>
> Key: TIKA-673
> URL: https://issues.apache.org/jira/browse/TIKA-673
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Reporter: Matt Parker
>
> Found a library that might be worth considering for integration into your
> package. It provides one of the best open source text extraction algorithms
> to find the main text within an HTML page.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira