Sub pages are not getting crawled
---------------------------------
Key: NUTCH-927
URL: https://issues.apache.org/jira/browse/NUTCH-927
Project: Nutch
Issue Type: Bug
Components: injector
Affects Versions: 2.0
Reporter: Rameez Raja
In my program the objective is to crawl all the pages and fetch the contents
from it. The category wise fetching the information is done perfectly but the
sub pages are not getting crawled. In the sense, the nextpages are in the form
of links at the bottom of the page.
I have included the code as,
<a href="http://reviews.logitech.com/7061/224/reviews.htm?page=2" title="Next
Page >" name="BV_TrackingTag_Review_Display_NextPage">More Reviews for
Z-5500 Digital 5.1 Speaker System</a>.
Can anyone solve this problem.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.