Hi Erlend, "Hosts matching seeds" means that if the domain (in this case www.ibsen.uio.no) is mentioned in a seed, a page with the same domain will be included in the crawl if there is nothing else that excludes it. So it sounds like it is working as designed.
Karl On Tue, May 14, 2013 at 7:45 AM, Erlend Garåsen <[email protected]>wrote: > > I just figured out that even though "Include only hosts matching seeds?" > is enabled, the web crawler continues to fetch everything from the host " > www.ibsen.uio.no" if I have placed the following in the seed list: > http://www.ibsen.uio.no/**forside.xhtml<http://www.ibsen.uio.no/forside.xhtml> > > I expected that only this page would be crawled, but that does not seem to > be the case. > > Erlend > > -- > Erlend Garåsen > Center for Information Technology Services > University of Oslo > P.O. Box 1086 Blindern, N-0317 OSLO, Norway > Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: > 31050 >
