Extract rel attr with LinkContentHandler
----------------------------------------
Key: TIKA-825
URL: https://issues.apache.org/jira/browse/TIKA-825
Project: Tika
Issue Type: Improvement
Components: parser
Reporter: Markus Jelsma
Priority: Minor
For Nutch we need to extract URLs, but we also need each link's rel attribute
so we can check for the nofollow value. I've patched the code to return this
information in the Link object. It's been tested, and I can read the rel value
in Nutch now.
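To make the idea concrete, here is a minimal sketch of the approach. It is not
the attached patch and does not use Tika's own classes. It assumes the handler
sees XHTML SAX events, and it uses only the JDK SAX API. The names
RelLinkSketch, SimpleLink, RelLinkHandler and extract are hypothetical, made up
for this illustration.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

public class RelLinkSketch {

    /** Hypothetical stand-in for a link record that also carries the rel value. */
    public static final class SimpleLink {
        public final String uri;
        public final String rel;

        public SimpleLink(String uri, String rel) {
            this.uri = uri;
            this.rel = rel;
        }

        /** rel is a space-separated token list, so check each token. */
        public boolean isNoFollow() {
            for (String token : rel.trim().split("\\s+")) {
                if (token.equalsIgnoreCase("nofollow")) {
                    return true;
                }
            }
            return false;
        }
    }

    /** Hypothetical handler: records href and rel for every <a> element. */
    public static final class RelLinkHandler extends DefaultHandler {
        public final List<SimpleLink> links = new ArrayList<>();

        @Override
        public void startElement(String uri, String localName, String qName,
                                 Attributes atts) {
            // The default JDK parser is not namespace-aware, so match on qName.
            if ("a".equals(qName)) {
                String href = atts.getValue("href");
                if (href != null) {
                    String rel = atts.getValue("rel");
                    // A missing rel attribute is stored as an empty string.
                    links.add(new SimpleLink(href, rel == null ? "" : rel));
                }
            }
        }
    }

    /** Parses well-formed XHTML and returns the links found in it. */
    public static List<SimpleLink> extract(String xhtml) throws Exception {
        RelLinkHandler handler = new RelLinkHandler();
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xhtml)), handler);
        return handler.links;
    }

    public static void main(String[] args) throws Exception {
        String xhtml = "<html><body>"
                + "<a href=\"http://a.example/\" rel=\"external nofollow\">a</a>"
                + "<a href=\"http://b.example/\">b</a>"
                + "</body></html>";
        for (SimpleLink link : extract(xhtml)) {
            System.out.println(link.uri + " nofollow=" + link.isNoFollow());
        }
    }
}
```

A crawler such as Nutch could then skip any link for which isNoFollow() returns
true. Splitting rel into tokens matters because the attribute can carry several
values at once, for example "external nofollow".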
Thoughts?