Dear Praveen,

This is very usefully tipp.

Thanks for it:
          Ferenc

Hi,

There is a parameter called db.max.outlinks.per.page in
nutch-default.xml


<property>
  <name>db.max.outlinks.per.page</name>
  <value>100</value>
  <description>The maximum number of outlinks that we'll
process for a page.
  </description>
</property>

Using this you can control the number of out-links processed
from a page.

So one approach to your problem could becrawling the links
in two
steps and keeping this value according to whether or not you
want to
process out going links and then use SegmentMerge tool to
merge the
segments.

HTH,
Praveen. 


-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to