On Thu, Sep 18, 2008 at 5:35 PM, Edward Quick <[EMAIL PROTECTED]> wrote: > > Does anyone know what I can do about these warnings? > I have set db.max.outlinks.per.page to -1 in nutch-site.xml. > > WARN regex.RegexURLNormalizer - can't find rules for scope 'outlink', using > default > WARN regex.RegexURLNormalizer - can't find rules for scope 'partition', > using default > WARN regex.RegexURLNormalizer - can't find rules for scope 'crawldb', using > default > WARN regex.RegexURLNormalizer - can't find rules for scope > 'generate_host_count', using default >
They don't mean anything. A URLNormalizer plugin can define different rules for different 'scopes' such as during generation, parsing, etc. So that warning is just reporting that RegexURLNormalizer does not define different rules for different scopes and always uses the "default" rules. So, it is OK :) > Thanks, > > Ed. > > _________________________________________________________________ > Make a mini you and download it into Windows Live Messenger > http://clk.atdmt.com/UKM/go/111354029/direct/01/ -- Doğacan Güney
