MetaException(message:Could not connect to meta store using any of the URIs provided. Most recent failure: org.apache.thrift.transport.TTransportException: java.net.ConnectException: Connection refuse

2020-03-14 Thread xatax
I am a beginner. I installed Apache Kylin on EMR per instructions on the kylin website and am at my first step in the tutorials which is building the sample kylin_sales cube. When I try to build I keep getting this error and the second step "Redistribute Flat Hive Table":

Re: No tables on KYLIN WEB UI and NoSuchObjectException(message:default.kylin_intermediate_* table not found) when building cube

2020-04-10 Thread xatax
Thank you very much for your response Yaqian. So, regarding the question where tables are not showing, I tested this. I created a hive external table on table data in S3 location (parquet file). Then I created another table out of it using CTAS so that the new table is part of the same hive

Re: subscribe kylin

2020-04-05 Thread xatax
I have subscribed but none of my posts are going through. How come your post of subscribe was accepted whereas none of my posts are getting through. I am not sure if this reply is going to make it. -- Sent from: http://apache-kylin.74782.x6.nabble.com/

jdbc connection not working.

2020-07-12 Thread xatax
I have attempted to connect to Kylin using jdbc driver for the following Kylin versions 2.6, 3.0.2 and 3.1 and have been unable to make connection. The driver files I have used are from the '$KYLIN_HOME/lib' directory: for version 3.1: kylin-jdbc-3.1.0.jar, jcl-over-slf4j-1.7.21.jar,

Build Dimension Dictionary: org.apache.kylin.engine.mr.exception.HadoopShellException: java.lang.ClassCastException: org.apache.hadoop.io.IntWritable cannot be cast to org.apache.hadoop.io.LongWritabl

2020-06-14 Thread xatax
Hi, I am getting this error during cube build, can anyone please help? Thanks!!!: #4 Step Name: Build Dimension Dictionary Duration: 0.04 mins Waiting: 0 seconds org.apache.kylin.engine.mr.exception.HadoopShellException: java.lang.ClassCastException: org.apache.hadoop.io.IntWritable cannot be

Cube build failure: org/apache/spark/api/java/function/Function

2020-06-06 Thread xatax
I am following instructions to build cube with spark from: http://kylin.apache.org/docs/tutorial/cube_spark.html I get the following error when building cube: Cube build failure: org/apache/spark/api/java/function/Function Can you please help? Here is the log: 2020-06-06 14:58:38,289 ERROR

Re: Cube build failure: org/apache/spark/api/java/function/Function

2020-06-06 Thread xatax
Hi Shaofeng, Thank you very much for your reply. Can I please know how I can delete the segments? Are the overlapping segments causing the error? I get the overlapping segments error when for example I submit two date ranges as follows: 2020-05-04 to 2020-05-06 and 2020-05-05 to 2020-05-06. And

Re: error while using left join

2021-01-14 Thread xatax
We are facing the same issue in tableau when we give a filter on the left join. It is converting to inner join. Can you please let us know if there is any solution for this? Thanks! -- Sent from: http://apache-kylin.74782.x6.nabble.com/

Cluster mode kylin.metadata.url and kylin.server.cluster-servers examples for AWS EMR

2021-02-03 Thread xatax
Hi I am trying to install Kylin in cluster mode on AWS EMRs using the documentation: http://kylin.apache.org/docs/install/kylin_cluster.html Can you please help me with example values of kylin.metadata.url and kylin.server.cluster-servers so I know the format. thanks! -- Sent from:

High cardinality column search very slow

2021-03-24 Thread xatax
I have a column with a count distinct of about 12million. This is an integer field (for e.g. call it account with just numbers in it). This is account drives a lot of reports where the user enters the account in a search box on tableau and then tableau sends a query like: SELECT account ,

Can cube be queried while it is refreshing

2021-03-24 Thread xatax
Can a cube be queried while it is refreshing? If yes, how does it handle the switch over to the refreshed data? Can that involve downtime? Thanks! -- Sent from: http://apache-kylin.74782.x6.nabble.com/

Cube migration across hadoop clusters

2021-02-24 Thread xatax
Per: http://kylin.apache.org/docs/tutorial/cube_migration.html It says: "Support to migrate cube data (segments data on HBase) from QA env to PROD env." I tried the migration but it only migrates the cube metadata without the data. The source kylin instance has the cube buiilt and in ready

Re: Cube migration across hadoop clusters

2021-02-26 Thread xatax
Thanks Zhichao, I tried that and am now getting this: [hadoop@ip-172-31-81-255 apache-kylin-3.1.0-bin-hbase1x]$ [hadoop@ip-172-31-81-255 apache-kylin-3.1.0-bin-hbase1x]$ ./bin/kylin.sh org.apache.kylin.tool.migration.CubeMigrationCrossClusterCLI -kylinUriSrc

Re: Re:Cube migration across hadoop clusters: What is -updateMappingPath updateTableMapping.json

2021-02-27 Thread xatax
Hi Zhichao, thank you for your reply. Thanks for providing the file format as it is not available anywhere. I am not sure on what hadoop jars are missing. I am doing the cube migration on Kylin 3.1.1 on AWS EMR 5.29. And I am following the Install Kylin on AWS EMR instructions. Here is a new

Re: Cube migration across hadoop clusters

2021-02-25 Thread xatax
hi Yaqian, thanks for the response. I am using this method from the documentation: "II. Migrate across two Hadoop clusters" using Kylin version 3.1.0 on EMR 5.29 on both source and destination clusters. I tried in 2 ways: (1) Migrate from one kylin instance on emr 5.29 to another kylin instance

Cube migration across hadoop clusters: What is -updateMappingPath updateTableMapping.json

2021-02-26 Thread xatax
Per documentation: http://kylin.apache.org/docs/tutorial/cube_migration.html In the example for "II. Migrate across two Hadoop clusters" ./bin/kylin.sh org.apache.kylin.tool.migration.CubeMigrationCrossClusterCLI -kylinUriSrc ADMIN:ky...@qa.env:17070 -kylinUriDst ADMIN:ky...@prod.env:1 -cube

Cluster mode - High availability. Metadata storage options other than MySQL?

2021-02-14 Thread xatax
What are the recommended High availability options. >From the apache kylin website it recommends the cluster mode installation and then there is an option to use MySql for metadata storage. MySQL being an RDBMS is not best suited for an HA architecture. Any other options that work with Apache

Re: Cluster mode - High availability. Metadata storage options other than MySQL?

2021-02-21 Thread xatax
thanks! What about something like dynamo db? -- Sent from: http://apache-kylin.74782.x6.nabble.com/

Kylin Cluster Mode with EMR. Is a restart of kylin required on all job and query servers when new node is added.

2021-02-21 Thread xatax
Kylin Cluster Mode. I am using Kylin on EMR. Is a restart of kylin required on all job and query servers when new node is added. Then there will be downtime. How do we achieve High Availibility. from the guide: " Configure the Kylin node list kylin.server.cluster-servers, including all nodes (the

Kylin Cluster mode. SQL works in standalone mode. Errors in Cluster mode. No dictionary found.

2021-02-21 Thread xatax
I installed kylin on EMR in standalone mode. Ran a bunch of SQLs and they run fine. Then I add a query cluster. So I have kylin.server.mode in one cluster set to "all" and one cluster in "query" mode. I set kylin.server.cluster-servers=jobnode1:7070,querynode1:7070 on both query and job

Active cube run set to pending and a new cube request is run without finishing

2021-04-22 Thread xatax
I have obeserved that when the "kylin.job.max-concurrent-jobs" properties is used for a cluster configuration and set to for e.g. lets say value 1. Then for a given cube building job cluster, during a cube build if another cube build request is submitted the currently running job is set to pending