Bugs item #982678, was opened at 2004-06-30 11:11
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=491356&aid=982678&group_id=59548

Category: web db
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: MaineC (mainec)
Assigned to: Michael Cafarella (mike_cafarella)
Summary: numLinks() indicates more links than returned by links()

Initial Comment:
The bug was observed on a crawl started with the Intranet 
procedure from the tutorial. Crawling was done starting from a 
weblog directory at depth 3 and 1. The crawler was not restricted to 
only fetch data from the directory but was allowed to follow all links 
found.

After crawling an extended version of the link analysis tool was run to 
get some statistics about the link graph and the communities within. 
In the extended code, at first the method WebDBReader.numLinks() 
was called. Afterwards the method WebDBReader.links() was used to 
get an enumeration of the link objects. I observed that the former 
method returned a number greater than the amount of objects in the 
enumeration returned by the latter method.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=491356&aid=982678&group_id=59548


-------------------------------------------------------
This SF.Net email sponsored by Black Hat Briefings & Training.
Attend Black Hat Briefings & Training, Las Vegas July 24-29 - 
digital self defense, top technical experts, no vendor pitches, 
unmatched networking opportunities. Visit www.blackhat.com
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to