On Fri, 2006-12-08 at 12:54 +0100, Andrzej Bialecki wrote:
> Prefix filter to cut off anything without "http://". And then a
> (non-existent) domain-suffix filter, which considers only domain
> suffixes - this is easy to implement based on the suffix filter that
> ships with Nutch.

Right.. I don't know Java but I'll give it a shot. Thanks :-)

-Rob
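
For reference, the two checks described in the quoted text could be sketched roughly as follows. This is a standalone illustration, not the Nutch plugin itself: the class name DomainSuffixSketch, the example suffix list, and the choice to return null for rejected URLs are all assumptions made for the sketch, and it does not implement any Nutch interface or read any Nutch configuration.

```java
import java.net.URI;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical standalone sketch; it is not wired into Nutch's plugin system.
class DomainSuffixSketch {
    // Assumed example suffixes; a real filter would load these from a config file.
    static final Set<String> ALLOWED_SUFFIXES =
            new HashSet<>(Arrays.asList("org", "example.com"));

    // Returns the URL unchanged if it passes both checks, or null to reject it.
    static String filter(String url) {
        // Check 1: prefix filter, drop anything that does not start with "http://".
        if (url == null || !url.startsWith("http://")) {
            return null;
        }
        // Check 2: domain-suffix filter, look only at the host name.
        String host;
        try {
            host = URI.create(url).getHost();
        } catch (IllegalArgumentException e) {
            return null; // not parseable as a URI
        }
        if (host == null) {
            return null;
        }
        host = host.toLowerCase(Locale.ROOT);
        for (String suffix : ALLOWED_SUFFIXES) {
            // Match the whole host or a suffix on a label boundary,
            // so "notorg" does not match "org".
            if (host.equals(suffix) || host.endsWith("." + suffix)) {
                return url;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        System.out.println(filter("http://lucene.apache.org/nutch/"));
        System.out.println(filter("ftp://apache.org/"));
    }
}
```

Matching on a label boundary (the leading dot) is a design choice of this sketch, so that a host such as badexample.com is not accepted just because it ends with the characters of example.com.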
