Hi,

In my Hive environment I ran the following query, expecting to see rows returned (the data is present). However, after 2339 seconds, the output I get is what is shown below (I've pasted the last 5 to 10 lines of the screen output).
Is there anything I'm missing? Did the process finish correctly? Is there something that could point me to how to debug this?

----------------------------- Query ------------------------------
select substr(CB.EXEC_DATE,1,10), count(CB.ID)
from callbacks CB
JOIN (select * from pages p
      where p.page like '%google.com/search%'
         or p.page like '%google.com/custom%'
         or p.page like '%google.com/#hl%'
         or p.page like '%google.com/cse%'
         or p.page like '%search.yahoo.com/search%'
         or p.page like '%bing.com/search%'
         or p.page like '%google.com/product%'
     ) s ON s.id = cb.id
group by substr(CB.EXEC_DATE,1,10);

================= o/p on screen ==================
10/08/20 11:13:38 INFO mapred.TaskRunner: Task 'attempt_local_0001_r_000000_0' done.
2010-08-20 11:13:39,526 map = 100%, reduce =100%
10/08/20 11:13:39 INFO exec.ExecDriver: 2010-08-20 11:13:39,526 map = 100%, reduce =100%
Ended Job = job_local_0001
10/08/20 11:13:39 INFO exec.ExecDriver: Ended Job = job_local_0001
10/08/20 11:13:39 INFO exec.FileSinkOperator: Moving tmp dir: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001 to: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001.intermediate
10/08/20 11:13:39 INFO exec.FileSinkOperator: Moving tmp dir: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001.intermediate to: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/10001
OK
Time taken: 2339.331 seconds
=====================================================

Thanks,
Ronak
408 504 4847
My Blog: http://ronakbaps.posterous.com