> Dear nutch developers,
>
> Nutch by default takes seeds from file system (-seedDir). Is it possible to
> change it to take seeds from mysql table?

In theory, yes, but I would not recommend it. It would be quite a job to make the mapper play nicely with database queries.

> Is "Nutch URL Normalizer (org.apache.nutch.net.URLNormalizer)" the right
> extension point to implement my plugin for this?

No, this is something else. It normalizes URLs to a format you accept, such as adding trailing slashes (or not) or removing double occurrences of certain characters.

The seeds are read by the Injector, so that is the code to look at:

http://svn.apache.org/viewvc/nutch/branches/branch-1.3/src/java/org/apache/nutch/crawl/Injector.java?view=markup

>
> Regards,
> Zhaidarbek Ayazbayev
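
To make the kind of normalization mentioned above concrete, here is a minimal sketch in plain Java. It is not the Nutch URLNormalizer interface and not how Nutch's own normalizer plugins are written. The class UrlNormalizeSketch and its normalize method are hypothetical names used only for illustration. It adds a trailing slash when the path is empty and collapses repeated slashes in the path.

```java
import java.net.URI;
import java.net.URISyntaxException;

public class UrlNormalizeSketch {

    // Hypothetical helper, NOT part of Nutch: shows two typical
    // normalization steps on the path component of a URL.
    static String normalize(String url) {
        try {
            URI uri = new URI(url);
            String path = uri.getPath();
            // Add a trailing slash when the URL has no path at all.
            if (path == null || path.isEmpty()) {
                path = "/";
            }
            // Collapse runs of two or more slashes into a single slash.
            path = path.replaceAll("/{2,}", "/");
            // Rebuild the URL from its parts with the cleaned-up path.
            return new URI(uri.getScheme(), uri.getAuthority(), path,
                           uri.getQuery(), uri.getFragment()).toString();
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException("Not a valid URL: " + url, e);
        }
    }

    public static void main(String[] args) {
        System.out.println(normalize("http://example.com"));        // http://example.com/
        System.out.println(normalize("http://example.com//a///b")); // http://example.com/a/b
    }
}
```

As the example shows, a normalizer only rewrites URLs that are already in the pipeline. It does not decide where the seed list comes from.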

