Hello all, I have been able to configure Nutch and run it on my PC. I am working on a thesis about a local search engine. As I understand it, Nutch automatically indexes the documents it has crawled. I want to do some preprocessing on the crawled documents before they get indexed. Can you help me with how to go about this?
Thank you in advance. I hope to hear from you soon.

