Just to make this more clear: I want the url of the page the anchor is on, not the url of the anchor (which, of course, is the page I already have).

Thanks.

-lucas

On May 16, 2005, at 7:46 PM, Lucas Rockwell wrote:

Hi all,

I am fairly new to nutch (but I have been wading through the code, docs and mailing lists) and I am wondering if there is a way to get the url of an anchor as well as the text of an anchor? I have a feeling there is, but I have not pulled things apart enough to really know for sure.

Any help would be much appreciated.

Thanks.

-lucas

p.s. nutch is a first-rate piece of software. Thanks to all who have labored over this amazing tool!




------------------------------------------------------- This SF.Net email is sponsored by Oracle Space Sweepstakes Want to be the first software developer in space? Enter now for the Oracle Space Sweepstakes! http://ads.osdn.com/?ad_id=7412&alloc_id=16344&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to