I would like to delete certain documents from the crawled documents depending on a certain criteria. Is there a way to achieve this? My guess is, nutch downloads all the files before parsing it.
- Deleting documents Rajesh Munavalli
- Re: Deleting documents Dima Mazmanov
- Re: Deleting documents TDLN
- Re: Deleting documents Honda-Search Administrator
