[Lucene-hadoop Wiki] Update of "AmazonEC2" by TomWhite

Apache Wiki Thu, 18 Jan 2007 01:43:04 -0800

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for 
change notification.


The following page has been changed by TomWhite:
http://wiki.apache.org/lucene-hadoop/AmazonEC2

------------------------------------------------------------------------------
  [http://www.amazon.com/gp/browse.html?node=201590011 Amazon EC2] (Elastic 
Compute Cloud) is a computing service.  One allocates a set of hosts, runs 
one's application on them, then de-allocates the hosts when done.  Billing 
is hourly per host.  EC2 thus lets one deploy Hadoop on a cluster without 
having to own and operate that cluster, renting it by the hour instead.
  
  This document assumes that you have already followed the steps in 
[http://docs.amazonwebservices.com/AmazonEC2/gsg/2006-06-26/ Amazon's Getting 
Started Guide].
+ 
+ There are now some [#AutomatedScripts scripts] available for running Hadoop 
on EC2.
  
  == Concepts ==
  
@@ -222, +224 @@

  
  Please vote for these issues in Jira if you feel this would help your 
project.  (Anyone can create a Jira account in order to vote on issues, etc.)
  
+ [[Anchor(AutomatedScripts)]]
+ = Automated Scripts =
+ 
+ == Setting up ==
+  * Make sure you've followed the Amazon EC2 
[http://docs.amazonwebservices.com/AmazonEC2/gsg/2006-10-01/ Getting Started 
Guide], sections "Setting up an Account", "Setting up the Tools" and the 
"Generating a Keypair" section of "Running an Instance".
+  * Unpack the Hadoop EC2 scripts distribution available from 
[http://issues.apache.org/jira/browse/HADOOP-884 HADOOP-884].
+  * Edit all relevant variables in `hadoop-ec2-env.sh`.
+    * You need to get a Java download URL by visiting 
[http://java.sun.com/javase/downloads/index_jdk5.jsp here]. Make sure you get 
the JDK (not JRE) labelled "Linux self-extracting file". (The scripts have not 
been tested with Java 6 yet.)
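
Before moving on, it can help to confirm that nothing in `hadoop-ec2-env.sh` was left blank. The following is only a minimal sketch: `check_vars` is a hypothetical helper, not part of the HADOOP-884 scripts, and the variable names in the usage comment are illustrative assumptions, so substitute the names that actually appear in your copy of the file.

```shell
#!/bin/sh
# Hypothetical helper (not part of the Hadoop EC2 scripts):
# print the name of every listed variable that is unset or empty,
# and return non-zero if at least one is missing.
check_vars() {
  missing=0
  for name in "$@"; do
    # Indirect lookup of the variable called $name (plain sh has no ${!name}).
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "$name"
      missing=1
    fi
  done
  return "$missing"
}

# Assumed usage; JAVA_BINARY_URL and KEY_NAME are illustrative names only:
# . ./hadoop-ec2-env.sh
# check_vars JAVA_BINARY_URL KEY_NAME || echo "edit hadoop-ec2-env.sh first"
```

Sourcing the file and listing the variables you edited gives a quick check before the longer image-creation step is started.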
+ 
+ == Creating an image ==
+ You only need to do this once.
+  * Type {{{
+ ./create-hadoop-image
+ }}}
+  * Accept the Java license terms.
+  * The script will create a new image, then bundle, upload and register it. 
This may take some time. Be patient - don't assume it's crashed.
+  * Terminate your instance using the command given by the script.
+ 
+ == Running a job on a cluster ==
+  * Start the cluster. While this script runs, you will be prompted to set 
up DNS (e.g. [http://www.dyndns.com/services/dns/dyndns/ DynDNS]). {{{
+ ./run-hadoop-cluster 
+ }}}
+  * You will then be logged into the master node where you can start your job.
+    * For example, to test your cluster, try {{{
+ cd /usr/local/hadoop-*
+ bin/hadoop jar hadoop-*-examples.jar pi 10 10000000
+ }}}
+  * You can check progress of your job at `http://MASTER_HOST:50030/`.
+  * When you have finished, log out of the master node by typing `exit`, then 
shut down the cluster with {{{
+ ./terminate-hadoop-cluster
+ }}}
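
The progress page mentioned above can also be checked from a script run on your own machine. This is a rough sketch under stated assumptions: `jobtracker_url` and `wait_for_jobtracker` are hypothetical helper names, `curl` is assumed to be installed locally, and the retry count and delay are arbitrary choices. Only the port 50030 address comes from the steps above.

```shell
#!/bin/sh
# Hypothetical helpers (not part of the Hadoop EC2 scripts).

# Build the job progress URL for a given master host.
jobtracker_url() {
  echo "http://$1:50030/"
}

# Poll the progress page until it answers or the attempts run out.
# $1 = master host, $2 = number of attempts (defaults to 30).
wait_for_jobtracker() {
  url=$(jobtracker_url "$1")
  attempts=${2:-30}
  while [ "$attempts" -gt 0 ]; do
    # --fail makes curl exit non-zero on HTTP error responses.
    if curl --silent --fail --max-time 5 --output /dev/null "$url"; then
      echo "Web UI is up at $url"
      return 0
    fi
    attempts=$((attempts - 1))
    sleep 10
  done
  echo "No response from $url" >&2
  return 1
}

# Assumed usage, with MASTER_HOST replaced by your master's host name:
# wait_for_jobtracker MASTER_HOST
```

Port 50030 may need to be reachable from where the script runs for the check to succeed.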
+
