Am 17.03.2006 um 00:28 schrieb MagRaj:

Is it possible to create a new segment(contains all the pages of that url)
for each url??


You can use the regex-urlfilter.txt to accept only the urls you want. But for every new segment you have to change the regex-urlfilter.txt. A better way is to use the index field "site". You have to generate a new segment (for all urls), fetch and index this. In your webapp you can limit the results with the "site" field.

e.g.
site:www.foo.com bar

this query search the word bar in the content of the urls with the site www.foo.com.

hope this helps
Marko





-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to