I am trying to modify Nutch to add level to the website parse data.
What I mean by this is suppose you start parsing a website at its
homepage that would be level one. Any links in the same site from the
homepage would be level two, links from those pages would be level three
and so on. I am only counting links in the same site.
How would I go about modifying Nutch to handle this? I was thinking
that I would have to modify Fetcher to do this, adding the level to the
parse metadata. What I am not gettings is how would I get the link
level initially? I was thinking I would have to modify something in the
generator but didn't know what.
Dennis
-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general