Just to make this more clear: I want the url of the page the anchor is
on, not the url of the anchor (which, of course, is the page I already
have).
Thanks.
-lucas
On May 16, 2005, at 7:46 PM, Lucas Rockwell wrote:
Hi all,
I am fairly new to nutch (but I have been wading through the code,
docs and mailing lists) and I am wondering if there is a way to get
the url of an anchor as well as the text of an anchor? I have a
feeling there is, but I have not pulled things apart enough to really
know for sure.
Any help would be much appreciated.
Thanks.
-lucas
p.s. nutch is a first-rate piece of software. Thanks to all who have
labored over this amazing tool!
-------------------------------------------------------
This SF.Net email is sponsored by Oracle Space Sweepstakes
Want to be the first software developer in space?
Enter now for the Oracle Space Sweepstakes!
http://ads.osdn.com/?ad_id=7412&alloc_id=16344&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general