Hi everyone, I'm writing a small program that uses Nutch purely as a crawler, with no search functionality. Given a URL as input, the program should return that page's content. How can I get the page content when all I have is the URL? The webdb only provides a way to look up a page's metadata by URL, while the segments can return content but require a record number.
Any help is greatly appreciated. Best regards, Giang
