Hi All, I've been learning up Nutch 1.5 from last couple of weeks and so far 
using these links: http://wiki.apache.org/nutch/NutchTutorial and 
http://wiki.apache.org/nutch/NewScoringIndexingExample I'm able to crawl a list 
of sites, with seed list of 1000 urls. I created the webgraphdb using one of 
the segments then dumped the score for link ranking etc. I'm able to see the 
link scores for URLS. I browsed the webgraphdb folders/subfolders which 
contains : inlinks,  loops,  nodes,  outlinks,  routes  etc. I can browse the 
file sitting in these folders but not able to understand anything as they 
contains some URLs and some other related data in some unusual characters. 
Basically, I want to generate a directed graph image or a connectivity graph 
image for the crawled URLs using all the data. I would appreciate any pointers 
in this regard. Is there any third party tool which takes these data as input 
and generates a directed/connectivity graph for the URLs which can be shown to 
give a visual understanding of connectivity between the URLS. Please provide 
inputs in this direction. Thanks in advance. 

Thanks, DW                                        

Reply via email to