On 8/9/07, djames <[EMAIL PROTECTED]> wrote:
>
> Hello,
>
> I got a question about link analysis in nutch...
>
> Is the link analysis in the default configuration of nutch 0.81 and if not
> how can i set it up?

I am not sure what you mean but if you are talking about PageRank,
nutch uses an algorithm called OPIC (which doesn't need a full link
graph) for calculating a page's score.

However, there is an invertlinks command that can be used to extract
inverted link graph from fetched segments it can be run like:

bin/nutch invertlinks crawl/linkdb -dir crawl/segments

You can use LinkDbReader to read it.

> And what is the minimum depth for a performant link analysis

That depends on what you mean by link analysis and your machine configuration.

> --
> View this message in context: 
> http://www.nabble.com/Link-analysis-tool-tf4242325.html#a12071551
> Sent from the Nutch - User mailing list archive at Nabble.com.
>
>


-- 
Doğacan Güney

Reply via email to