Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change 
notification.

The following page has been changed by JoydeepSensarma:
http://wiki.apache.org/hadoop/Hive/HiveAws/HivingS3nRemotely

------------------------------------------------------------------------------
  location 's3n://data.s3ndemo.hive/tpch/lineitem';
  }}}
    
- The TPCH DDL statements are slightly modified versions of the original TPCH 
statements (since Hive does not support all the data types used in TPCH).
+ The TPCH DDL statements are slightly modified versions of the original TPCH 
statements (since Hive does not support all the data types used in TPCH). All 
the TPCH DDL statements for Hive can be found 
[[attachment:TpchDdlForHive.sql here]]
  
  == Executing Queries ==
  Hive can execute some queries without a Hadoop cluster. For example:
@@ -117, +117 @@

  The socket-related options allow the Hive CLI to communicate with the Hadoop 
cluster through an SSH tunnel (which will be established later). The job.ugi option is 
specified to avoid issues with permissions on HDFS. The {{{mapred.map.tasks}}} 
setting is a hack that works around 
[[https://issues.apache.org/jira/browse/HADOOP-5861 HADOOP-5861]] and may need 
to be set higher for large clusters. {{{mapred.reduce.tasks}}} is specified to 
let Hive determine the number of reducers (see 
[[https://issues.apache.org/jira/browse/HIVE-490 HIVE-490]]).
  
  === Links ===
-  * [[../../HiveAws Hive and AWS]] presents the general landscape and alternatives 
for running Hive queries in AWS.
+  * [:../: Hive and AWS] presents the general landscape and alternatives for running 
Hive queries in AWS.
   * [[http://jsensarma.com/blog/2009/05/hive-hadoop-s3-ec2-it-works On issues 
and lessons learned during this integration effort]]
  
