i have executed  below Hive query

create table table_llv_N_C as select
table_line_n_passed.chromosome_number,table_line_n_passed.position,
table_line_c_passed.id from table_line_n_passed join table_line_c_passed on
(table_line_n_passed.chromosome_number=table_line_c_passed.chromosome_number)

and got following error ......
org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while
processing row (tag=1)
{"key":{"joinkey0":"12"},"value":{"_col2":"."},"alias":1} at
org.apache.hadoop.hive.ql.exec.ExecReducer.reduce(ExecReducer.java:258) ...
7 more Caused by: org.apache.hadoop.hive.ql.metadata.HiveException:
org.apache.hadoop.hive.ql.metadata.HiveException:
org.apache.hadoop.ipc.RemoteException(java.io.IOException): File
/tmp/hive-root/hive_2015-03-09_10-03-59_970_3646456754594156815-1/_task_tmp.-ext-10001/_tmp.000000_0
could only be replicated to 0 nodes instead of minReplication (=1). There
are 2 datanode(s) running and no node(s) are excluded in this operation.
......

The root cause may be lack of disk space in the HDFS cluster. details of
disk space are

hdfs dfs -df -h

Filesystem Size Used Available Use%

hdfs://x.y.ab.com:8020 159.7 G 21.9 G 110.7 G 14%.

table_line_n_passed having 4767409 rows and 1.1 G size.

similarly table_line_c_passed having 4717082 rows and 1.0 G size .

Does Hive really require that much space (more then available free space
110 G ) to process data. how to calculate how much free space require
before running query .any way to run query within available free space.

do i need to set any property or value in hive configuration .

PS: if i used LIMIT 10000 in above query its running fine .


-- 
with regards
krish!!!!!!

Reply via email to