Hi, I'm working on a project in which I want to turn the Nutch crawler into a "focused crawler". By that I mean a topic-specific crawler that decides the topical relevance of every page based on its contents. I'm having trouble identifying which files need to be modified to implement this kind of feature. Could anybody give me some pointers on which files would need to change? Any help would be greatly appreciated!
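
To make the idea concrete, here is a rough sketch of the kind of per-page relevance check I have in mind. It is plain Java and does not use any Nutch classes; the class name TopicRelevance, the topic term list, and the threshold are all made up for illustration, and a real classifier would probably be more sophisticated than a keyword ratio.

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper, NOT part of Nutch: scores a page's text against a topic.
final class TopicRelevance {
    private final Set<String> topicTerms = new HashSet<>();
    private final double threshold;

    // topicTerms and threshold are illustrative assumptions chosen by the caller.
    TopicRelevance(Set<String> topicTerms, double threshold) {
        for (String term : topicTerms) {
            this.topicTerms.add(term.toLowerCase(Locale.ROOT));
        }
        this.threshold = threshold;
    }

    // Fraction of the page's word tokens that are topic terms (0.0 for empty text).
    double score(String pageText) {
        String[] tokens = pageText.toLowerCase(Locale.ROOT).split("[^a-z0-9]+");
        int total = 0;
        int hits = 0;
        for (String token : tokens) {
            if (token.isEmpty()) {
                continue; // split() can yield an empty leading token
            }
            total++;
            if (topicTerms.contains(token)) {
                hits++;
            }
        }
        return total == 0 ? 0.0 : (double) hits / total;
    }

    // A page counts as on-topic when its score reaches the threshold.
    boolean isRelevant(String pageText) {
        return score(pageText) >= threshold;
    }
}
```

The crawler would then only follow links from pages for which isRelevant returns true; where exactly such a check would hook into Nutch is the part I can't figure out.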
Thanks, Rajat Swarup http://www-scf.usc.edu/~swarup/
