Andrzej: Thanks for the link. I am waiting for my EC2 activation :-)
Doug, could you please clarify the following? (A ballpark figure will do; I am trying to estimate my cost for crawling X documents.)

1. How long did you run the instances (days, hours, etc.)?
2. How many pages did you collect using the 19 instances?
3. Did you save the content in S3 or on the 160 GB disk provided with EC2?
4. What kind of page fetch rate did you get?

Regards,
Zaheed

On 11/3/06, Andrzej Bialecki <[EMAIL PROTECTED]> wrote:
> kauu wrote:
> > That's very good if it works.
> >
> > On 11/3/06, Zaheed Haque <[EMAIL PROTECTED]> wrote:
> >>
> >> Hi:
> >>
> >> I am just wondering if any of you have tried running Nutch on Amazon
> >> EC2 and saving crawl data on Amazon S3? Could you please tell us
> >> about your experience? EC2 is in closed beta, so I haven't been able to
> >> try it.
>
> Apparently some people tried it - see here:
>
> http://wiki.apache.org/lucene-hadoop/AmazonEC2
>
> Best regards,
> Andrzej Bialecki
> Information Retrieval, Semantic Web
> Embedded Unix, System Integration
> http://www.sigram.com Contact: info at sigram dot com
