Re: Kylin and Remote Server

DroopyHoo Wed, 26 Aug 2015 02:41:18 -0700

Hi Diego

We met this error stack when deploying in our Hadoop enviroment(notsandbox). The problem we met is the function for checking MR job statusdo not support kerberos auth (our hadoop cluster use kerberos service).So we made some changes to this part of source code.


I'm not sure whether this case could help you to analyse the problem.

在 15/8/26 上午10:46, Diego Pinheiro 写道:

Hi Bin Mahone,

sorry for the late reply. Thank you for your support. I didn't know
about Kylin instances. It is really interesting.

However, let me ask you, I was setting up my hadoop client machine
with Kylin to communicate to my sandbox. But things are not working
well.

I have installed hadoop 2.4.0, hbase 0.98.0 and hive 0.13.1. All them
are working and I can access my "remote server" from my client machine
(actually, I set kylin as sandbox since all my hadoop cli is pointing
to my sandbox). Then, Kylin was built and everything was ok until I
tried to build the cube.

I got the following errors always in the second step of cube build:

[pool-5-thread-2]:[2015-08-25
19:16:03,679][ERROR][org.apache.kylin.job.tools.HadoopStatusChecker.checkStatus(HadoopStatusChecker.java:91)]
- error check status
java.net.ConnectException: Connection refused
     at java.net.PlainSocketImpl.socketConnect(Native Method)
     at 
java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:339)
     at 
java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:198)
     at 
java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:182)
     at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
     at java.net.Socket.connect(Socket.java:579)
     at java.net.Socket.connect(Socket.java:528)
     at java.net.Socket.<init>(Socket.java:425)
     at java.net.Socket.<init>(Socket.java:280)
     at 
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:80)
     at 
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:122)
     at 
org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
     at 
org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387)
     at 
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
     at 
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
     at 
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
     at 
org.apache.kylin.job.tools.HadoopStatusGetter.getHttpResponse(HadoopStatusGetter.java:78)
     at 
org.apache.kylin.job.tools.HadoopStatusGetter.get(HadoopStatusGetter.java:55)
     at 
org.apache.kylin.job.tools.HadoopStatusChecker.checkStatus(HadoopStatusChecker.java:56)
     at 
org.apache.kylin.job.common.MapReduceExecutable.doWork(MapReduceExecutable.java:136)
     at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:106)
     at 
org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:50)
     at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:106)
     at 
org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:133)
     at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
     at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
     at java.lang.Thread.run(Thread.java:745)

org.apache.kylin.job.exception.ExecuteException: java.lang.NullPointerException
     at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:110)
     at 
org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:50)
     at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:106)
     at 
org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:133)
     at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
     at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
     at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.NullPointerException
     at 
org.apache.kylin.job.common.MapReduceExecutable.onExecuteStart(MapReduceExecutable.java:73)
     at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:105)
     ... 6 more

Do you have any thoughts about these errors? (detailed log is attached)


On Fri, Aug 21, 2015 at 3:21 AM, hongbin ma <[email protected]> wrote:

the document is summarized at
http://kylin.incubator.apache.org/docs/install/kylin_cluster.html

On Fri, Aug 21, 2015 at 1:51 PM, hongbin ma <[email protected]> wrote:

hi Diego

the config "kylin.job.run.as.remote.cmd" is somehow ambiguous , it is
enabled when you cannot run Kylin server on the same machine as your hadoop
CLI, for example, if you're starting Kylin from you local IDE, and you
hadoop CLI is a sandbox in another machine, this is the "remote" case.

In most of the production deployments we suggest using '"non-remote" mode,
that is, kylin instance is started on the hadoop CLI. The picture depicts
the scenario:
https://github.com/apache/incubator-kylin/blob/0.7/website/images/install/on_cli_install_scene.png

Kylin instances are stateless,  the runtime state is saved in its
"Metadata Store" in hbase (kylin.metadata.url config in
conf/kylin.properties). For load balance considerations it is possible to
start multiple Kylin instances sharing the same metadata store (thus
sharing the same state on table schemas, job status, cube status, etc.)

Each of the kylin instances has a kylin.server.mode entry in
conf/kylin.properties specifying the runtime mode, it has three options: 1.
"job" for running job engine only 2. "query" for running query engine only
and 3 "all" for running both. Notice that only one server can run the job
engine("all" mode or "job" mode), the others must all be "query" mode.

A typical scenario is depicted in the attachment chart.





--
Regards,

*Bin Mahone | 马洪宾*
Apache Kylin: http://kylin.io
Github: https://github.com/binmahone



--
Regards,

*Bin Mahone | 马洪宾*
Apache Kylin: http://kylin.io
Github: https://github.com/binmahone


--
-------
Wei Hu

Re: Kylin and Remote Server

Reply via email to