Lucas Rockwell wrote:
I am fairly new to nutch (but I have been wading through the code, docs and mailing lists) and I am wondering if there is a way to get the url of an anchor as well as the text of an anchor? I have a feeling there is, but I have not pulled things apart enough to really know for sure.

At present this is not supported. It could be easily added, but would substantially slow things. With a little more work it could be made somewhat efficient. I will attempt to include this feature in the MapReduce rewrite that I'm now starting. So, one way or another, this feature will be added.


Doug


------------------------------------------------------- This SF.Net email is sponsored by Oracle Space Sweepstakes Want to be the first software developer in space? Enter now for the Oracle Space Sweepstakes! http://ads.osdn.com/?ad_id=7412&alloc_id=16344&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to