[
https://issues.apache.org/jira/browse/DROIDS-8?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647367#action_12647367
]
Javier Puerto commented on DROIDS-8:
------------------------------------
Thank for the code formatting, i think i had the eclipse wrong configured.
>Could the LinkExtractor live in core? It does not appear to depend on tika. If
>so, how does it relate to:
>org.apache.droids.parse.html.HtmlParser?
+1
Not depend on tika and could be use in others task with sax.
Please move it to the package you suggest.
> [Patch] Create tied integration with Apache Tika (for parser and handler)
> -------------------------------------------------------------------------
>
> Key: DROIDS-8
> URL: https://issues.apache.org/jira/browse/DROIDS-8
> Project: Droids
> Issue Type: New Feature
> Components: tika
> Reporter: Thorsten Scherler
> Attachments: DROIDS-8-droids-tika.patch, LinkExtractor.java,
> tikaparser.diff, tikaparser.diff
>
>
> http://incubator.apache.org/tika/
> Apache Tika is a toolkit for detecting and extracting metadata and structured
> text content from various documents using existing parser libraries.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.