Am 17.03.2006 um 00:28 schrieb MagRaj:
Is it possible to create a new segment(contains all the pages of
that url)
for each url??
You can use the regex-urlfilter.txt to accept only the urls you want.
But for every new segment you have to change the regex-urlfilter.txt.
A better way is to use the index field "site". You have to generate a
new segment (for all urls), fetch and index this. In your webapp you
can limit the results with the "site" field.
e.g.
site:www.foo.com bar
this query search the word bar in the content of the urls with the
site www.foo.com.
hope this helps
Marko
-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general