> Dear nutch developers,
> 
> Nutch by default takes seeds from file system (-seedDir). Is it possible to
> change it to take seeds from mysql table?

In theory, yes, but I would not recommend it. It would be quite a job to make 
the mapper play nicely with database queries.


> Is "Nutch URL Normalizer (org.apache.nutch.net.URLNormalizer)" the right
> extension point to implement my plugin for this?

No, this is something else. It normalizes URLs to a format you accept, such as 
adding trailing slashes (or not) or removing double occurrences of certain 
characters.
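
To give an idea of what "normalizing" means here, below is a rough standalone
sketch in plain Java. It does not use the Nutch plugin API at all; the class
name SimpleUrlNormalizer and the method normalize are made up for illustration.
It only adds a trailing slash when the path is empty and collapses repeated
slashes in the path, so treat it as an assumption-laden example, not as what
the Nutch normalizers actually do.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper for illustration only; NOT a Nutch URLNormalizer plugin.
public class SimpleUrlNormalizer {

    // Returns the URL with an empty path replaced by "/" and runs of
    // slashes in the path collapsed to a single slash.
    public static String normalize(String url) throws URISyntaxException {
        URI uri = new URI(url.trim());
        String scheme = uri.getScheme();
        String authority = uri.getRawAuthority();
        if (scheme == null || authority == null) {
            throw new IllegalArgumentException("Not an absolute URL: " + url);
        }

        String path = uri.getRawPath();
        if (path == null || path.isEmpty()) {
            path = "/"; // add the trailing slash to a bare host
        }
        path = path.replaceAll("/{2,}", "/"); // remove double occurrences of '/'

        // Rebuild the URL, leaving query and fragment untouched.
        StringBuilder sb = new StringBuilder();
        sb.append(scheme).append("://").append(authority).append(path);
        if (uri.getRawQuery() != null) {
            sb.append('?').append(uri.getRawQuery());
        }
        if (uri.getRawFragment() != null) {
            sb.append('#').append(uri.getRawFragment());
        }
        return sb.toString();
    }

    public static void main(String[] args) throws URISyntaxException {
        System.out.println(normalize("http://example.com"));
        System.out.println(normalize("http://example.com//a///b?x=1"));
    }
}
```

Running main prints http://example.com/ and http://example.com/a/b?x=1. As you
can see, this only rewrites URLs that are already in the crawl; it has nothing
to do with where the seeds come from.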

Seed injection itself is handled by the Injector class, so that is the code to 
look at:

http://svn.apache.org/viewvc/nutch/branches/branch-1.3/src/java/org/apache/nutch/crawl/Injector.java?view=markup

> 
> Regards,
> Zhaidarbek Ayazbayev
