2010/9/2 Jukka Zitting (JIRA) <[email protected]> > Add a ContentHandler for collecting links from parser output > ------------------------------------------------------------ > > Key: TIKA-503 > URL: https://issues.apache.org/jira/browse/TIKA-503 > Project: Tika > Issue Type: New Feature > Reporter: Jukka Zitting > Assignee: Jukka Zitting > Fix For: 0.8 > > > It would be nice to have a LinkContentHandler class that would > automatically collect any <a href="...">...</a> elements from the parser > output. >
Maybe the Droids LinkExtractor class could help you. https://svn.apache.org/repos/asf/incubator/droids/trunk/droids-core/src/main/java/org/apache/droids/parse/html/LinkExtractor.java Salu2. > > -- > This message is automatically generated by JIRA. > - > You can reply to this email to add a comment to the issue online. > >
