Hi,
I am trying to create a cube with 3 dimensions (1 normal, 1 derived, 1 hierarchy). The largest cardinality among them is about 5M. However, during step #3 (Step Name: Build Dimension Dictionary), Kylin dies: the web interface stops responding and I cannot create a new session. What is more, MapReduce does not show any job starting for step #3. The kylin.log ends with these entries:

[Thread-9-EventThread]:[2015-04-04 17:57:44,091][INFO][org.apache.curator.framework.state.ConnectionStateManager.postState(ConnectionStateManager.java:228)] - State change: RECONNECTED
[pool-6-thread-1]:[2015-04-04 17:58:32,755][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)] - Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04 17:59:36,424][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)] - Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04 18:00:40,181][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)] - Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04 18:01:48,590][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)] - Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04 18:02:36,059][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)] - Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04 18:03:38,234][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)] - Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04 18:04:53,317][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)] - Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04 18:06:01,801][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)] - Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04 18:07:14,274][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)] - Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[pool-6-thread-1]:[2015-04-04 18:08:57,283][INFO][org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:117)] - Job Fetcher: 1 running, 1 actual running, 0 ready, 24 others
[Thread-9-SendThread(amb0.mycorp.kom:2181)]:[2015-04-04 18:13:49,068][INFO][org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1096)] - Client session timed out, have not heard from server in 30519ms for sessionid 0x14c7660580a005d, closing socket connection and attempting reconnect
[Thread-9-EventThread]:[2015-04-04 18:14:03,558][INFO][org.apache.curator.framework.state.ConnectionStateManager.postState(ConnectionStateManager.java:228)] - State change: SUSPENDED
[Thread-9-SendThread(amb0.mycorp.kom:2181)]:[2015-04-04 18:14:38,679][INFO][org.apache.zookeeper.ClientCnxn$SendThread.logStartConnect(ClientCnxn.java:975)] - Opening socket connection to server amb0.mycorp.kom/172.17.1.94:2181. Will not attempt to authenticate using SASL (unknown error)
[Thread-9-SendThread(amb0.mycorp.kom:2181)]:[2015-04-04 18:14:55,993][INFO][org.apache.zookeeper.ClientCnxn$SendThread.primeConnection(ClientCnxn.java:852)] - Socket connection established to amb0.mycorp.kom/172.17.1.94:2181, initiating session
[Thread-9-SendThread(amb0.mycorp.kom:2181)]:[2015-04-04 18:16:03,626][INFO][org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1096)] - Client session timed out, have not heard from server in 67633ms for sessionid 0x14c7660580a005d, closing socket connection and attempting reconnect

I am using one of the latest builds (1-2 days old). There is no obvious indication of an error. Can you please help me find the reason why it crashes?

Regards,
Kuba Skuratowicz
