Hi,
I agree, at least in theory current behaviour might expose some unwanted content if the search results were public.
Could you please submit this to jira?
-- Sami Siren
Matthias Jaekle wrote:
For the domains www.tik24.de there is a dns entry 127.0.0.1.
I think nutch should realize that and ignore such domains, if this won't be a problem for intranet crawling.
------------------------------------------------------- SF email is sponsored by - The IT Product Guide Read honest & candid reviews on hundreds of IT Products from real users. Discover which products truly live up to the hype. Start reading now. http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers
