Jerryjung created TAJO-1519:
-------------------------------
Summary: Tajo workers did not work normally.
Key: TAJO-1519
URL: https://issues.apache.org/jira/browse/TAJO-1519
Project: Tajo
Issue Type: Bug
Environment: os : centOS 6.6
hadoop : 2.5.0-cdh5.2.0
tajo : 0.10.0
Reporter: Jerryjung
I tested TPC-H with 1GB. When I executed the q7 query, the workers did not respond.
The query is below.
{noformat}
Running Queries
QueryId: q_1428023002653_0192
Query Master: jup-g07-07
Started: 2015-04-03 15:41:36
Progress: 27%
Time: 1 hrs, 13 mins, 58 sec
Status: QUERY_RUNNING
sql:
select supp_nation, cust_nation, l_year, sum(volume) as revenue
from (
  select supp_nation, cust_nation, substr(l_shipdate, 1, 4) as l_year,
         l_extendedprice * (1 - l_discount) as volume
  from q7_volume_shipping_tmp t
  join (
    select l_shipdate, l_extendedprice, l_discount, c_nationkey, s_nationkey
    from supplier s
    join (
      select l_shipdate, l_extendedprice, l_discount, l_suppkey, c_nationkey
      from customer c
      join (
        select l_shipdate, l_extendedprice, l_discount, l_suppkey, o_custkey
        from orders o
        join lineitem l
          on o.o_orderkey = l.l_orderkey
         and l.l_shipdate >= '1995-01-01'
         and l.l_shipdate <= '1996-12-31'
      ) l1 on c.c_custkey = l1.o_custkey
    ) l2 on s.s_suppkey = l2.l_suppkey
  ) l3 on l3.c_nationkey = t.c_nationkey
      and l3.s_nationkey = t.s_nationkey
) shipping
group by supp_nation, cust_nation, l_year
order by supp_nation, cust_nation, l_year
{noformat}
{noformat}
Status: RUNNING
Started: 2015-04-03 15:46:29 ~ -
# Tasks: 6063 (Local Tasks: 0, Rack Local Tasks: 0)
Progress: 99%
# Shuffles: 0
Input Bytes: 94.4 GiB (101,369,366,283 B)
Actual Processed Bytes: 94.2 GiB (101,176,763,849 B)
Input Rows: 3,288,784,087
Output Bytes: 70.5 GiB (75,663,614,467 B)
Output Rows: 1,801,836,518
{noformat}
The error message is below.
{noformat}
2015-04-03 15:47:40,586 ERROR org.apache.tajo.worker.Task: Fetch failed: http://jup-g07-07:52758/?qid=q_1428023002653_0192&sid=4&p=1053&type=h
java.io.IOException: org.jboss.netty.channel.ConnectTimeoutException: connection timed out: jup-g07-07/50.1.101.87:52758
    at org.apache.tajo.worker.Fetcher.get(Fetcher.java:142)
    at org.apache.tajo.worker.Task$FetchRunner.run(Task.java:609)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
    at java.util.concurrent.FutureTask.run(FutureTask.java:262)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
    at java.lang.Thread.run(Thread.java:745)
Caused by: org.jboss.netty.channel.ConnectTimeoutException: connection timed out: jup-g07-07/50.1.101.87:52758
    at org.jboss.netty.channel.socket.nio.NioClientBoss.processConnectTimeout(NioClientBoss.java:137)
    at org.jboss.netty.channel.socket.nio.NioClientBoss.process(NioClientBoss.java:83)
    at org.jboss.netty.channel.socket.nio.AbstractNioSelector.run(AbstractNioSelector.java:312)
    at org.jboss.netty.channel.socket.nio.NioClientBoss.run(NioClientBoss.java:42)
    ... 3 more
2015-04-03 15:47:40,594 ERROR org.apache.tajo.worker.Fetcher: Fetch failed :
org.jboss.netty.channel.ConnectTimeoutException: connection timed out: jup-g07-07/50.1.101.87:52758
    at org.jboss.netty.channel.socket.nio.NioClientBoss.processConnectTimeout(NioClientBoss.java:137)
    at org.jboss.netty.channel.socket.nio.NioClientBoss.process(NioClientBoss.java:83)
    at org.jboss.netty.channel.socket.nio.AbstractNioSelector.run(AbstractNioSelector.java:312)
    at org.jboss.netty.channel.socket.nio.NioClientBoss.run(NioClientBoss.java:42)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
    at java.lang.Thread.run(Thread.java:745)
{noformat}
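The connect timeout above could be checked independently of Tajo by probing the same host and port from the failing worker node. The following is only a minimal sketch: the class name FetchPortProbe and the 5-second timeout are illustrative assumptions and are not part of Tajo, and the default host and port are simply the ones that appear in the log.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

public class FetchPortProbe {

    /** Returns true if a plain TCP connection to host:port succeeds within timeoutMillis. */
    public static boolean canConnect(String host, int port, int timeoutMillis) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMillis);
            return true;
        } catch (IOException e) {
            // Covers connect timeouts, refused connections and unresolvable hosts.
            return false;
        }
    }

    public static void main(String[] args) {
        // Defaults are the host and port from the log; override them on the command line.
        String host = args.length > 0 ? args[0] : "jup-g07-07";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 52758;
        int timeoutMillis = 5000; // assumed value, chosen only for illustration

        boolean ok = canConnect(host, port, timeoutMillis);
        System.out.println(host + ":" + port + (ok ? " is reachable" : " is NOT reachable"));
    }
}
```

If the probe also fails from the worker node, the problem is more likely at the network or process level (the serving process is gone, overloaded, or blocked) than in the fetch logic itself.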
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)