You understand correctly but we have both because there is no patch for 
NUTCH-1181 yet, you're more than welcome to provide some!

https://issues.apache.org/jira/browse/NUTCH-1181

On Friday 28 October 2011 15:55:59 Marek Bachmann wrote:
> Hello people,
> 
> can someone explain to me where the differences between the linkdb and
> webgraph's inlink database are?
> 
> As I have understood it, the linkdb holds the sites that are linking to
> a url and the anchor texts.
> 
> But this is nearly the same as on this page is written:
> http://wiki.apache.org/nutch/NewScoring
> "The inlink database is a listing of url and all of its inlinks."
> 
> Why do we have both databases?
> 
> Thanks

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350

Reply via email to