Steve Blackmon created STREAMS-345:
--------------------------------------

             Summary: LinkCrawler in streams-processor-urls
                 Key: STREAMS-345
                 URL: https://issues.apache.org/jira/browse/STREAMS-345
             Project: Streams
          Issue Type: Improvement
            Reporter: Steve Blackmon
            Assignee: Steve Blackmon


LinkResolverProcessor can follow links through redirects, tracking status codes 
and other metadata, but does not save the content of the page.

Add a processor to the module that retrieves and saves the content of web pages 
referenced in the links field or activity object url fields.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to