> > You can write a custom scoringfilter to track the URL of the source, see > > the one in urlmeta for an example. It should be fairly straightforward to > > do > > > I'm not usind nutch index. Crawler is sending data to Solr. >
ScoringFilters are used at pretty much every step of Nutch and are not directly related to the indexing. Re-read my suggestion and look at the code. J. -- * *Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com http://twitter.com/digitalpebble

