what does

/tmp/<username>/hive.log contain?

Ashish

________________________________
From: Eva Tse [mailto:[email protected]]
Sent: Friday, July 17, 2009 11:07 AM
To: [email protected]
Subject: Error in running group-by and join hive query...

Hive version: r786648 w/ HIVE-487 2nd patch.

However, it is working on Hive 0.3.

Thanks,
Eva.

Running the script in this email gives the following errors:

Hive history file=/tmp/dataeng/hive_job_log_dataeng_200907171359_1511035858.txt
OK
Time taken: 3.419 seconds
OK
Time taken: 0.211 seconds
OK
Time taken: 0.364 seconds
OK
Time taken: 0.104 seconds
Total MapReduce jobs = 2
Number of reduce tasks not specified. Estimated from input data size: 1
In order to change the average load for a reducer (in bytes):
  set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
  set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
  set mapred.reduce.tasks=<number>
Job Submission failed with exception 'java.io.IOException(cannot find dir = 
hdfs://ip-10-251-49-188.ec2.internal:9000/tmp/hive-dataeng/1 in 
partToPartitionInfo!)'
FAILED: Execution Error, return code 1 from 
org.apache.hadoop.hive.ql.exec.ExecDriver

Script:

drop table facts_details;
drop table facts;

CREATE TABLE facts
(xid string,
devtype_id int)
PARTITIONED by (dateint int, hour int)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\001' COLLECTION ITEMS TERMINATED BY 
'\004' MAP KEYS TERMINATED BY '\002' stored as SEQUENCEFILE;

CREATE TABLE facts_details
(xid string,
cdn_name string,
utc_ms array<bigint>,
moff array<int>)
PARTITIONED by (dateint int, hour int)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\001' COLLECTION ITEMS TERMINATED BY 
'\004' MAP KEYS TERMINATED BY '\002' stored as SEQUENCEFILE;


select f.devtype_id from facts f join facts_details c on (f.xid = c.xid)
where c.dateint = 20090710 and f.dateint = 20090710
group by f.devtype_id;


Reply via email to