Re: [Nutch-general] Amazon S3 and EC2

Zaheed Haque Tue, 07 Nov 2006 04:39:20 -0800

Andrzej:

Thanks for the link. I am waiting for my EC2 activation :-)


Doug, could you please clarify the following? (A ballpark figure will
do; I am trying to estimate my cost for crawling X documents.)

1. How long did you run the instances (days, hours, etc.)?
2. How many pages did you collect using the 19 instances?
3. Did you save the content in S3 or on the 160 GB disk provided by EC2?
4. What kind of page fetch rate did you get?

Regards,
Zaheed
On 11/3/06, Andrzej Bialecki <[EMAIL PROTECTED]> wrote:
> kauu wrote:
> > That's very good if it works.
> >
> > On 11/3/06, Zaheed Haque <[EMAIL PROTECTED]> wrote:
> >>
> >> Hi:
> >>
> >> I am just wondering whether any of you have tried running Nutch on Amazon
> >> EC2 and saving crawl data to Amazon S3. Could you please tell us
> >> about your experience? EC2 is in closed beta, so I haven't been able to
> >> try it.
>
> Apparently some people tried it - see here:
>
>     http://wiki.apache.org/lucene-hadoop/AmazonEC2

> Best regards,
> Andrzej Bialecki     <><
>  ___. ___ ___ ___ _ _   __________________________________
> [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
> ___|||__||  \|  ||  |  Embedded Unix, System Integration
> http://www.sigram.com  Contact: info at sigram dot com
>
>
>

