Hi,
I am trying to create a cube with 3 dimensions (1 normal, 1 derived,
1hierarchy). Out of these, the largest cardinality is about 5M. However
during the #3 Step Name: Build Dimension Dictionary - kylin dies. Web
interface stops responding. I cannot create a new session. What is more
mapreduce does not show any job starting for phase #3. The kylin.log
ends with those entries:
[Thread-9-EventThread]:[2015-04-04
17:57:44,091][INFO][org.apache.curator.framework.state.ConnectionStateManager.postState(ConnectionStateManager.java:228)]
- State change: RECONNECTED
[pool-6-thread-1]:[2015-04-04
17:58:32,755][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)]
- Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04
17:59:36,424][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)]
- Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04
18:00:40,181][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)]
- Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04
18:01:48,590][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)]
- Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04
18:02:36,059][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)]
- Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04
18:03:38,234][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)]
- Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04
18:04:53,317][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)]
- Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04
18:06:01,801][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)]
- Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04
18:07:14,274][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)]
- Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04
18:08:57,283][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)]
- Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[Thread-9-SendThread(amb0.mycorp.kom:2181)]:[2015-04-04
18:13:49,068][INFO][org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1096)]
- Client session timed out, have not heard from server in 30519ms for
sessionid 0x14c
7660580a005d, closing socket connection and attempting reconnect
[Thread-9-EventThread]:[2015-04-04
18:14:03,558][INFO][org.apache.curator.framework.state.ConnectionStateManager.postState(ConnectionStateManager.java:228)]
- State change: SUSPENDED
[Thread-9-SendThread(amb0.mycorp.kom:2181)]:[2015-04-04
18:14:38,679][INFO][org.apache.zookeeper.ClientCnxn$SendThread.logStartConnect(ClientCnxn.java:975)]
- Opening socket connection to server amb0.mycorp.kom/172.17.1.94:2181. Wi
ll not attempt to authenticate using SASL (unknown error)
[Thread-9-SendThread(amb0.mycorp.kom:2181)]:[2015-04-04
18:14:55,993][INFO][org.apache.zookeeper.ClientCnxn$SendThread.primeConnection(ClientCnxn.java:852)]
- Socket connection established to amb0.mycorp.kom/172.17.1.94:2181, initi
ating session
[Thread-9-SendThread(amb0.mycorp.kom:2181)]:[2015-04-04
18:16:03,626][INFO][org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1096)]
- Client session timed out, have not heard from server in 67633ms for
sessionid 0x14c
7660580a005d, closing socket connection and attempting reconnect
I am using one of the latest builds (1-2days old). THere is no obvious
indication of error. Can You please help me find a reason why it crashes ?
regards
Kuba Skuratowicz