2010/9/2 Jukka Zitting (JIRA) <[email protected]>

> Add a ContentHandler for collecting links from parser output
> ------------------------------------------------------------
>
>                 Key: TIKA-503
>                 URL: https://issues.apache.org/jira/browse/TIKA-503
>             Project: Tika
>          Issue Type: New Feature
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>             Fix For: 0.8
>
>
> It would be nice to have a LinkContentHandler class that would
> automatically collect any <a href="...">...</a> elements from the parser
> output.
>

Maybe the Droids LinkExtractor class could help you.
https://svn.apache.org/repos/asf/incubator/droids/trunk/droids-core/src/main/java/org/apache/droids/parse/html/LinkExtractor.java

Salu2.


>
> --
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
>
>

Reply via email to