No, I'm not familiar with that yet. Can you send any URLs? My question is really whether it's better to go with one or two big boxes or a series of small boxes. I'm also looking for anyone who has 100 million pages in their index and can describe their hardware as a reference point.

Thanks!

Paul

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of VK
Sent: Wednesday, November 28, 2007 9:53 PM
To: [email protected]
Subject: Re: Hardware Planning

Have you considered EC2 + S3? Also Rightscale has some interesting
solutions, which I am currently evaluating.

On Nov 28, 2007 9:38 PM, Paul Stewart <[EMAIL PROTECTED]> wrote:
> Hi folks...
>
> I have read the archives and looking for input specific to my estimated
> requirements:
>
> Want to index about 100 million public webpages. Space and bandwidth
> are not a problem - coming up with the right hardware and keeping the
> cost down is my goal.
>
> I would estimate only 1-2 searches per second at least during the first
> hardware phase.
>
> With that in mind I'm trying to figure out whether to use a couple of
> larger Dell servers or a bunch of small single CPU, 1 Gig RAM, 160 GB
> hard drive type of machines....
>
> Anyone share what they are using for hardware for about 100 million
> webpages and their search result times etc?? Realworld is important to
> me and being able to scale is important....
>
> Thanks,
>
> Paul
>
> ----------------------------------------------------------------------------
>
> "The information transmitted is intended only for the person or entity to
> which it is addressed and contains confidential and/or privileged material.
> If you received this in error, please contact the sender immediately and
> then destroy this transmission, including all attachments, without copying,
> distributing or disclosing same. Thank you."
