Hi,

I crawled a website that has hundreds or thousands of pages. I asked Nutch
to fetch only the 54 pages I wanted, which it did. When I put the database up
for searching and typed the host name, it reported 54 matching pages, so the
crawl did the right thing.

When I run nutch readdb crawled-database/db/ -stats, I get:
Number of pages: 356
Number of links: 355

What does this mean? Are these the numbers of pages and links on the
website? If so, what is the difference between pages and links?
Does a page represent a link?

Thanks,
Karthik
-- 
View this message in context: 
http://www.nabble.com/Readdb-question-tf3718517.html#a10403539
Sent from the Nutch - User mailing list archive at Nabble.com.

