See Comments Inline
Thanks

------ Original message ------
From: Krishna
Date: Tue, Sep 9, 2014 5:24 AM
To: user@phoenix.apache.org
Subject: Phoenix on Amazon EMR

Hi,

Does anyone have experience using Amazon EMR with Phoenix? I'm currently evaluating Phoenix for an HBase store on Amazon EMR. EMR provides Phoenix 2.1.2 as the default installation, but I prefer to use 3.x.
- Use 3.x.

Could someone clarify the following with regard to 2.1.2?

1. Does this version support bulk-loading capabilities? We expect to load more than a trillion rows, so a bulk loader is a necessity. Can Phoenix 2.1.2 run on either Hadoop1 or Hadoop2?
- No. Use 3.x for the MapReduce bulk load.

2. Did anyone try installing Phoenix 3.x using EMR's bootstrap action capabilities?
- Yes, it works. You will need to build the client jar against the Hadoop 2 version supported by AWS.

3. In the following arguments to the bulk loader, is the port # required or optional? If I'm using Hadoop2, should the Resource Manager node be substituted for the Job Tracker?
- Yes. You will see the port details when you log in to the EMR cluster.

* -hd <arg> HDFS NameNode IP:<port>
* -mr <arg> MapReduce Job Tracker IP:<port>
* -zk <arg> Zookeeper IP:<port>

Thanks for your inputs.
Krishna
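
To illustrate the answer to question 3, here is a minimal Python sketch that assembles the three `IP:<port>` arguments, passing the Resource Manager address to `-mr` on Hadoop2. The helper names (`host_port`, `bulk_loader_endpoint_args`), the IP addresses and the port numbers are hypothetical placeholders, not values from this thread; read the real ports off the EMR cluster after logging in.

```python
def host_port(host: str, port: int) -> str:
    """Format one endpoint in the IP:<port> form used by -hd, -mr and -zk."""
    if not host or ":" in host:
        raise ValueError(f"expected a bare host or IP, got {host!r}")
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    return f"{host}:{port}"


def bulk_loader_endpoint_args(namenode, resource_manager, zookeeper):
    """Build the endpoint arguments from three (host, port) tuples.

    On Hadoop2 the Resource Manager address is passed where the
    Job Tracker address would go on Hadoop1 (the -mr argument).
    """
    return [
        "-hd", host_port(*namenode),
        "-mr", host_port(*resource_manager),
        "-zk", host_port(*zookeeper),
    ]


if __name__ == "__main__":
    # Placeholder addresses and ports: replace with the values shown on your cluster.
    args = bulk_loader_endpoint_args(
        namenode=("10.0.0.1", 9000),
        resource_manager=("10.0.0.1", 9022),
        zookeeper=("10.0.0.1", 2181),
    )
    print(" ".join(args))
```

The port is always written out explicitly here, so nothing depends on whether the loader would accept a bare IP.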