[
https://issues.apache.org/jira/browse/TAJO-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14229484#comment-14229484
]
ASF GitHub Bot commented on TAJO-1199:
--------------------------------------
GitHub user hys9958 opened a pull request:
https://github.com/apache/tajo/pull/275
TAJO-1199: EMR bootstrap script for Tajo
Bootstrap Action Arguments:
==========================
Usage: install-tajo.sh [OPTIONS]
-t [S3_PATH_TO_TAJO_BIN_TARBALL]
Ex: s3://[your_bucket]/[your_path]/tajo-{version}.tar.gz
Default: http://d3kp3z3ppbkcio.cloudfront.net/tajo-0.9.0/tajo-0.9.0.tar.gz
-c [S3_PATH_TO_TAJO_CONF_DIR]
Ex: s3://[your_bucket]/[your_path]/conf
-l [S3_PATH_TO_THIRD_PARTY_JARS_DIR]
Ex: s3://[your_bucket]/[your_path]/lib
-h
Display help message
-T [LOCAL_PATH_TO_TEST_ROOT] (only used for local test)
Ex: /[LOCAL_PATH_TO_TEST_ROOT]
-H [LOCAL_PATH_TO_HADOOP_HOME_FOR_TEST] (only used for local test)
Ex: /[LOCAL_PATH_TO_HADOOP_HOME_FOR_TEST]
All arguments are optional. ``-T`` and ``-H`` are only for local testing.
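For readers who want to see how flags like these are typically handled, here is a minimal sketch using POSIX `getopts`. It is not taken from `install-tajo.sh`: the function name `parse_args` and all variable names are hypothetical, and only the flag letters and the default tarball URL come from the usage text above.

```shell
# Hypothetical sketch, not the actual install-tajo.sh implementation.
# Parses the documented flags into illustrative shell variables.
parse_args() {
  # Default tarball URL as listed in the usage text; the rest default to empty.
  TAJO_TARBALL="http://d3kp3z3ppbkcio.cloudfront.net/tajo-0.9.0/tajo-0.9.0.tar.gz"
  CONF_DIR=""
  LIB_DIR=""
  TEST_ROOT=""
  TEST_HADOOP_HOME=""

  OPTIND=1  # reset so the function can be called more than once
  # The leading ':' enables silent error reporting, handled in the case below.
  while getopts ":t:c:l:hT:H:" opt; do
    case "$opt" in
      t) TAJO_TARBALL="$OPTARG" ;;
      c) CONF_DIR="$OPTARG" ;;
      l) LIB_DIR="$OPTARG" ;;
      T) TEST_ROOT="$OPTARG" ;;
      H) TEST_HADOOP_HOME="$OPTARG" ;;
      h) echo "Usage: install-tajo.sh [OPTIONS]"; return 0 ;;
      :) echo "Option -$OPTARG requires an argument" >&2; return 1 ;;
      \?) echo "Unknown option: -$OPTARG" >&2; return 1 ;;
    esac
  done
}
```

Calling `parse_args "$@"` at the top of a script would leave each variable either at its default or at the value supplied on the command line.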
Sample Commands:
================
Launching a Tajo cluster with the default configuration
-------------------------------------------------------
* It uses EMR HDFS as ```tajo.rootdir```, which includes the warehouse directory.
* It uses the default heap and concurrency settings.
* It is suitable for a simple test.
```
$ aws emr create-cluster \
--name="[CLUSTER_NAME]" \
--ami-version=3.3 \
--ec2-attributes KeyName=[KEY_PAIR_NAME] \
--instance-groups \
InstanceGroupType=MASTER,InstanceCount=1,InstanceType=m3.xlarge \
InstanceGroupType=CORE,InstanceCount=1,InstanceType=c3.xlarge \
--bootstrap-action Name="Install tajo",Path=s3://[your_bucket]/[your_path]/install-tajo.sh
```
Launching a Tajo cluster with additional configurations
-------------------------------------------------------
* To use your own Tajo tarball, pass its S3 URL with ```-t```.
* To change ```tajo.rootdir```, create your own ```tajo-site.xml``` and pass the S3 URL of its config directory with ```-c```.
* You can find suitable config templates in tajo-emr/template.
* To use RDS, you need the appropriate JDBC jar, such as mysql-connector.jar. The ```-l``` option specifies the S3 URL of a directory containing third-party jars.
```
aws emr create-cluster \
--name="[CLUSTER_NAME]" \
--ami-version=3.3 \
--ec2-attributes KeyName=[KEY_PAIR_NAME] \
--instance-groups \
InstanceGroupType=MASTER,InstanceCount=1,InstanceType=m3.xlarge \
InstanceGroupType=CORE,InstanceCount=1,InstanceType=c3.xlarge \
--bootstrap-action Name="Install tajo",Path=s3://[your_bucket]/[your_path]/install-tajo.sh,Args=["-t","s3://[your_bucket]/tajo-0.9.0.tar.gz","-c","s3://[your_bucket]/conf","-l","s3://[your_bucket]/lib"]
```
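Because each of ```-t```, ```-c```, and ```-l``` is optional, the ```Args=[...]``` list changes from launch to launch. The sketch below shows one way to assemble the ```--bootstrap-action``` value in a wrapper script. The function name `bootstrap_action` and its positional parameters are hypothetical; the output matches what the shell passes to `aws` in the command above after quote removal.

```shell
# Hypothetical helper: builds the value passed to --bootstrap-action.
# Usage: bootstrap_action SCRIPT_S3_PATH [TARBALL_URL] [CONF_DIR_URL] [LIB_DIR_URL]
# Pass an empty string to skip an option.
bootstrap_action() {
  script_path="$1"; tarball="$2"; conf_dir="$3"; lib_dir="$4"

  # Collect flag/value pairs only for the options that were supplied.
  args=""
  if [ -n "$tarball" ]; then args="$args,-t,$tarball"; fi
  if [ -n "$conf_dir" ]; then args="$args,-c,$conf_dir"; fi
  if [ -n "$lib_dir" ]; then args="$args,-l,$lib_dir"; fi

  action="Name=Install tajo,Path=$script_path"
  # Strip the leading comma and add Args=[...] only when there is something to pass.
  if [ -n "$args" ]; then action="$action,Args=[${args#,}]"; fi
  printf '%s\n' "$action"
}
```

It would be used as `--bootstrap-action "$(bootstrap_action s3://[your_bucket]/[your_path]/install-tajo.sh "" s3://[your_bucket]/conf)"`, with the double quotes keeping the value as a single argument.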
How to test the bootstrap on a local machine
=======================================
```install-tajo.sh``` lets you test the bootstrap on a local machine without EMR instances. To do so, use the ```-T``` and ```-H``` options.
* ```-T``` - Root directory used temporarily for testing.
* ```-H``` - Hadoop binary directory that stands in for the EMR Hadoop home.
```
$ ./install-tajo.sh \
-t /[your_local_binary_path]/tajo-0.9.0.tar.gz \
-c /[your_test_conf_dir]/conf \
-l /[your_test_lib_dir]/lib \
-T /[LOCAL_PATH_TO_TEST_ROOT] \
-H /[LOCAL_PATH_TO_HADOOP_HOME_FOR_TEST]
```
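Before a local run it can help to check the two test paths up front. The following is a sketch only: `prepare_local_test` is a hypothetical helper, and whether `install-tajo.sh` creates the test root itself is not stated here, so creating it in advance is an assumption.

```shell
# Hypothetical pre-flight check for a local test run.
# Usage: prepare_local_test TEST_ROOT HADOOP_HOME
prepare_local_test() {
  test_root="$1"; hadoop_home="$2"

  if [ -z "$test_root" ] || [ -z "$hadoop_home" ]; then
    echo "usage: prepare_local_test TEST_ROOT HADOOP_HOME" >&2
    return 2
  fi
  # The -H directory must already exist, since it stands in for the EMR Hadoop home.
  if [ ! -d "$hadoop_home" ]; then
    echo "Hadoop home not found: $hadoop_home" >&2
    return 1
  fi
  # Assumption: creating the temporary test root ahead of time is harmless.
  mkdir -p "$test_root"
}
```

A run could then chain the two steps: `prepare_local_test /[LOCAL_PATH_TO_TEST_ROOT] /[LOCAL_PATH_TO_HADOOP_HOME_FOR_TEST] && ./install-tajo.sh ...`.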
Running with AWS RDS
====================
Tajo can use RDS. First make sure you have a running RDS instance, then create your own ```catalog-site.xml```; see the [Catalog configuration documentation](http://tajo.apache.org/docs/current/configuration/catalog_configuration.html) in the Tajo docs.
Pass the config directory containing your custom ```catalog-site.xml``` with the ```-c``` option.
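As a rough illustration, the sketch below generates a ```catalog-site.xml``` for a MySQL-based RDS instance. The helper `write_catalog_site` and its parameters are hypothetical. The property names and the `MySQLStore` class are given as recalled from the Tajo catalog configuration documentation linked above and should be verified there before use. Values are written as-is, without XML escaping.

```shell
# Hypothetical helper: writes a MySQL-backed catalog-site.xml.
# Property names are assumptions to be checked against the Tajo catalog docs.
# Usage: write_catalog_site OUT_FILE RDS_HOST RDS_PORT DB_NAME DB_USER DB_PASSWORD
write_catalog_site() {
  out_file="$1"; rds_host="$2"; rds_port="$3"
  db_name="$4"; db_user="$5"; db_password="$6"

  cat > "$out_file" <<EOF
<?xml version="1.0"?>
<configuration>
  <property>
    <name>tajo.catalog.store.class</name>
    <value>org.apache.tajo.catalog.store.MySQLStore</value>
  </property>
  <property>
    <name>tajo.catalog.jdbc.connection.id</name>
    <value>${db_user}</value>
  </property>
  <property>
    <name>tajo.catalog.jdbc.connection.password</name>
    <value>${db_password}</value>
  </property>
  <property>
    <name>tajo.catalog.jdbc.uri</name>
    <value>jdbc:mysql://${rds_host}:${rds_port}/${db_name}?createDatabaseIfNotExist=true</value>
  </property>
</configuration>
EOF
}
```

The generated file would go into the config directory that is uploaded to S3 and passed with ```-c```, and the matching MySQL JDBC jar into the directory passed with ```-l```.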
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/hys9958/tajo tajo-1199
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/tajo/pull/275.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #275
----
commit 0b4b135c81ca3548e78d622c26027808883b9c9f
Author: hys9958 <[email protected]>
Date: 2014-12-01T07:06:43Z
TAJO-1199: EMR bootstrap script for Tajo
----
> EMR bootstrap script for Tajo
> -----------------------------
>
> Key: TAJO-1199
> URL: https://issues.apache.org/jira/browse/TAJO-1199
> Project: Tajo
> Issue Type: New Feature
> Components: tools
> Affects Versions: 0.9.1
> Reporter: YeonSu Han
> Assignee: YeonSu Han
> Labels: features
> Fix For: 0.9.1
>
>
> Hi guys,
> I'd like to contribute an EMR bootstrap script for Tajo. With this script, you
> can easily launch a Tajo cluster on EMR.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)