Once I have done a crawl, I need to pass all of the raw HTML and JavaScript that was fetched through a custom parser. During a fetch, does Nutch store all of the raw content, including HTML tags, on disk? Thanks
Kevin