Zitat von Dennis Kubes <[EMAIL PROTECTED]>:

Please explain a little more what you are trying to do.

I'm trying to set up a focused crawler, i.e. I want to decide if a link should be followed, depending on the contents of the page in which the link appears. This already works, but I also want to store the page's score (not the Nutch score, my own score) as a meta datum. Note this is not the score of the outgoing links, but of the page the links are in.

If you are
trying to set meta-data in crawldatum you can call:

crawldatum.getMetaData().put(key, value)

There are two crawldatums in distributeScoreToOutlink, the target one is not the one I'm interested in, the other one is null by default and I'm trying to use it to attach meta data to the fromUrl page (not the toUrl = outgoing page). So I create a new "adjust" with meta data and return it, but the meta data never shows up in the crawl db.

Regards
 Daniel


Reply via email to