In the beginning it is approximately 10 to 1. So for every page I crawl
I will get 10 more pages to crawl that are not currently in the index.
As you move towards 50 million pages is becomes more like 6 to 1. If
you seed the entire dmoz, your first crawl will be around 5.5 million
pages. Your second crawl will be around 54 million pages. And a depth
of 3 will give you over 300 million pages. These are the numbers that
we are currently seeing.
Dennis Kubes
bbrown wrote:
This is kind of a generic question. Are there any stats on how many pages
will get crawled based on some initial seed. For example, if you seed the
list from dmoz, how many pages will get indexed? Lets say there are 4
million, will 4 million only get indexed?
Or lets say I have 4000, will I get 30,000 crawled/indexed pages?
--
Berlin Brown
[berlin dot brown at gmail dot com]
http://botspiritcompany.com/botlist/?