Thanks for your suggestion, Marko. Here is my problem, along with the config files and sample data I have:

urls.txt has 5 URLs (just as an example)
--------------------------------------------------------
http://foo.com/broker/broker_name_1/
http://foo.com/broker/broker_name_2/
http://foo.com/broker/broker_name_3/
http://foo.com/broker/broker_name_4/
http://foo.com/broker/broker_name_5/

crawl-urlfilter.txt contains the following
========================
# accept hosts in MY.DOMAIN.NAME
+^http://([a-z0-9]*\.)*foo.com/

I ran the crawl with the above 5 URLs and everything went fine.

Now, when a search is done from within the broker_name_1 intranet homepage, I want the search text to be matched only against the pages belonging to the broker_name_1 homepage. If the search is done from within foo.com itself, it should run against all the broker homepages.

I tried the query you suggested, but it didn't work:

(site:foo.com/broker/broker_name_1 <search_text>)

How can I implement this requirement? Is there anything I need to configure? Any help would be appreciated.

Thanks.

--
View this message in context: http://www.nabble.com/Searching-specific-domains-t1294857.html#a3462577
Sent from the Nutch - User forum at Nabble.com.
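
P.S. To make the requirement concrete, here is a minimal Python sketch of the scoping I am after. It is only an illustration and not Nutch code: the helper name `in_scope` and the sample page URLs are hypothetical, and it assumes that "belonging to a broker homepage" means same host plus a matching path prefix.

```python
from urllib.parse import urlparse


def in_scope(page_url, scope_url):
    """Return True if page_url is on scope_url's host and under its path.

    Hypothetical helper for illustration; not part of Nutch.
    """
    page, scope = urlparse(page_url), urlparse(scope_url)
    if page.netloc.lower() != scope.netloc.lower():
        return False
    # Normalise the scope path to end with "/" so that broker_name_1
    # does not accidentally match broker_name_10.
    scope_path = scope.path if scope.path.endswith("/") else scope.path + "/"
    return page.path == scope_path.rstrip("/") or page.path.startswith(scope_path)


# Made-up search hits, standing in for whatever the index returns.
hits = [
    "http://foo.com/broker/broker_name_1/about.html",
    "http://foo.com/broker/broker_name_2/contact.html",
    "http://foo.com/broker/broker_name_10/index.html",
]

# Search issued from the broker_name_1 homepage: only its own pages.
broker_1_hits = [u for u in hits if in_scope(u, "http://foo.com/broker/broker_name_1/")]

# Search issued from the top-level site: pages of every broker.
all_hits = [u for u in hits if in_scope(u, "http://foo.com/")]
```

With these sample URLs, `broker_1_hits` would hold only the broker_name_1 page, while `all_hits` would hold all three. What I am looking for is the query or configuration that gives this behaviour at search time.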
