Hi all, until now I have been playing with Kylin in a sandbox. After some tests I would like to run it in my production environment. However, how Kylin works with remote server is not clear to me.
In Kylin properties, there is an option: kylin.job.run.as.remote.cmd, if I set it to true, then Kylin will assume that the job engine will run in another server. Only the jobs run in another machine? How about the data? On http://kylin.incubator.apache.org/docs/install/index.html , it says that the most common case is to run Kylin in a Hadoop client machine. So, I assume that Kylin could use Thrift and JDBC clients to communicate to the server. However, looking into ${KYLIN_HOME}/bin/kylin.sh, I saw that Hive and HBase command lines are used to start Kylin. It also takes their classpaths to load other JAR files. My question is, how are you running Kylin? Are you running it always with your server? If not, how are you doing it? I would like to run it in another machine, but all Kylin information, such as intermediate Hive tables, HBase cube data, should be stored in the server one. Am I missing something in the docs? I believe that this is clear for most of you, but I am quite confused in this case. Regards, Diego
