I am a beginner. I installed Apache Kylin on EMR per instructions on the
kylin website and am at my first step in the tutorials which is building the
sample kylin_sales cube.
When I try to build I keep getting this error and the second step
"Redistribute Flat Hive Table":
Thank you very much for your response Yaqian.
So, regarding the question where tables are not showing, I tested this.
I created a hive external table on table data in S3 location (parquet file).
Then I created another table out of it using CTAS so that the new table is
part of the same hive
I have subscribed but none of my posts are going through. How come your post
of subscribe was accepted whereas none of my posts are getting through.
I am not sure if this reply is going to make it.
--
Sent from: http://apache-kylin.74782.x6.nabble.com/
I have attempted to connect to Kylin using jdbc driver for the following
Kylin versions 2.6, 3.0.2 and 3.1 and have been unable to make connection.
The driver files I have used are from the '$KYLIN_HOME/lib' directory:
for version 3.1: kylin-jdbc-3.1.0.jar, jcl-over-slf4j-1.7.21.jar,
Hi,
I am getting this error during cube build, can anyone please help?
Thanks!!!:
#4 Step Name: Build Dimension Dictionary
Duration: 0.04 mins Waiting: 0 seconds
org.apache.kylin.engine.mr.exception.HadoopShellException:
java.lang.ClassCastException: org.apache.hadoop.io.IntWritable cannot be
I am following instructions to build cube with spark from:
http://kylin.apache.org/docs/tutorial/cube_spark.html
I get the following error when building cube:
Cube build failure: org/apache/spark/api/java/function/Function
Can you please help?
Here is the log:
2020-06-06 14:58:38,289 ERROR
Hi Shaofeng,
Thank you very much for your reply.
Can I please know how I can delete the segments? Are the overlapping
segments causing the error? I get the overlapping segments error when for
example I submit two date ranges as follows:
2020-05-04 to 2020-05-06
and
2020-05-05 to 2020-05-06.
And
We are facing the same issue in tableau when we give a filter on the left
join. It is converting to inner join.
Can you please let us know if there is any solution for this?
Thanks!
--
Sent from: http://apache-kylin.74782.x6.nabble.com/
Hi
I am trying to install Kylin in cluster mode on AWS EMRs using the
documentation: http://kylin.apache.org/docs/install/kylin_cluster.html
Can you please help me with example values of kylin.metadata.url and
kylin.server.cluster-servers so I know the format.
thanks!
--
Sent from:
I have a column with a count distinct of about 12million. This is an integer
field (for e.g. call it account with just numbers in it).
This is account drives a lot of reports where the user enters the account in
a search box on tableau and then tableau sends a query like:
SELECT account ,
Can a cube be queried while it is refreshing?
If yes, how does it handle the switch over to the refreshed data? Can that
involve downtime?
Thanks!
--
Sent from: http://apache-kylin.74782.x6.nabble.com/
Per: http://kylin.apache.org/docs/tutorial/cube_migration.html
It says: "Support to migrate cube data (segments data on HBase) from QA env
to PROD env."
I tried the migration but it only migrates the cube metadata without the
data. The source kylin instance has the cube buiilt and in ready
Thanks Zhichao,
I tried that and am now getting this:
[hadoop@ip-172-31-81-255 apache-kylin-3.1.0-bin-hbase1x]$
[hadoop@ip-172-31-81-255 apache-kylin-3.1.0-bin-hbase1x]$ ./bin/kylin.sh
org.apache.kylin.tool.migration.CubeMigrationCrossClusterCLI -kylinUriSrc
Hi Zhichao,
thank you for your reply.
Thanks for providing the file format as it is not available anywhere.
I am not sure on what hadoop jars are missing. I am doing the cube migration
on Kylin 3.1.1 on AWS EMR 5.29.
And I am following the Install Kylin on AWS EMR instructions.
Here is a new
hi Yaqian,
thanks for the response. I am using this method from the documentation: "II.
Migrate across two Hadoop clusters" using Kylin version 3.1.0 on EMR 5.29 on
both source and destination clusters.
I tried in 2 ways:
(1) Migrate from one kylin instance on emr 5.29 to another kylin instance
Per documentation: http://kylin.apache.org/docs/tutorial/cube_migration.html
In the example for "II. Migrate across two Hadoop clusters"
./bin/kylin.sh org.apache.kylin.tool.migration.CubeMigrationCrossClusterCLI
-kylinUriSrc ADMIN:ky...@qa.env:17070 -kylinUriDst
ADMIN:ky...@prod.env:1 -cube
What are the recommended High availability options.
>From the apache kylin website it recommends the cluster mode installation
and then there is an option to use MySql for metadata storage.
MySQL being an RDBMS is not best suited for an HA architecture.
Any other options that work with Apache
thanks!
What about something like dynamo db?
--
Sent from: http://apache-kylin.74782.x6.nabble.com/
Kylin Cluster Mode. I am using Kylin on EMR.
Is a restart of kylin required on all job and query servers when new node is
added. Then there will be downtime. How do we achieve High Availibility.
from the guide:
"
Configure the Kylin node list kylin.server.cluster-servers, including all
nodes (the
I installed kylin on EMR in standalone mode. Ran a bunch of SQLs and they run
fine.
Then I add a query cluster. So I have kylin.server.mode in one cluster set
to "all" and one cluster in "query" mode.
I set kylin.server.cluster-servers=jobnode1:7070,querynode1:7070
on both query and job
I have obeserved that when the "kylin.job.max-concurrent-jobs" properties is
used for a cluster configuration and set to for e.g. lets say value 1. Then
for a given cube building job cluster, during a cube build if another cube
build request is submitted the currently running job is set to pending
21 matches
Mail list logo