Re: Apache Kylin Chinese documents updated / Kylin 中文文档已更新

2018-07-08 Thread Jiatao Tao
 --- Regards! Aron Tao 在 2018/7/2 23:20,“ShaoFeng Shi” 写入: Hi Kylin users, The documents of Chinese version are updated for Kylin v2.4 and v2.3. More will be translated in the future. Latest version: https://kylin.apache.org/cn/docs/ v2.3:

Re: LDAP Sync issue - Empty filter; nested exception is javax.naming.directory.InvalidSearchFilterException

2018-10-13 Thread Jiatao Tao
Hi, Can you try command '"ldapsearch" to get the users/groups you wanted first? --- Regards! Aron Tao On [DATE], "[NAME]" <[ADDRESS]> wrote: Hi - Please help us to solve the below issue to sync with LDAP Below is the kylin ldap configuration .

Re: 有关kylin在Hbase中生成的表的处理疑问

2018-10-31 Thread JiaTao Tao
This in FAQ may also help: http://kylin.apache.org/docs/gettingstarted/faq.html What kind of data be left in ‘kylin.env.hdfs-working-dir’ ? We often > execute kylin cleanup storage command, but now our working dir folder is > about 300 GB size, can we delete old data manually? > >- The

Re: [DISCUSS] Columnar storage engine for Apache Kylin

2018-10-26 Thread JiaTao Tao
tly; Thank you jiatao for the comments! > > JiaTao Tao 于2018年10月25日周四 下午6:12写道: > > > As far as I'm concerned, using Parquet as Kylin's storage format is > pretty > > appropriate. From the aspect of integrating Spark, Spark made a lot of > > optimizations for Parquet, e.g

Re: does Apache Kylin need a Apache Derby or Mysql for run the sample cube

2018-10-26 Thread JiaTao Tao
You may need these: http://kylin.apache.org/docs/install/index.html http://kylin.apache.org/docs/tutorial/kylin_sample.html And it might be more appropriate discussing this in user mailing list :). - Regards! Aron Tao ShaoFeng Shi 于2018年10月26日周五 下午10:24写道: > No, derby/mysql is not

Re: [VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-02 Thread JiaTao Tao
 Here is my vote: +1 (binding) ShaoFeng Shi 于2018年11月2日周五 下午2:10写道: > Hi all, > > I have created a build for Apache Kylin 2.5.1, release candidate 1. > > Changes highlights: > > [KYLIN-3531] - Login failed with case-insensitive username > [KYLIN-3604] - Can't build cube with spark in HBase

Re: Unable to connect to Kylin Web UI

2018-10-25 Thread JiaTao Tao
If you suspect this problem is related to port, you may change your Kylin's default port(7070) to any other available ports temporarily just for clarifying your suspicion first? The way to modify this is changing " 于2018年10月26日周五 上午9:41写道: > Here is log. > > kylin.start

Re: [DISCUSS] Columnar storage engine for Apache Kylin

2018-10-25 Thread JiaTao Tao
As far as I'm concerned, using Parquet as Kylin's storage format is pretty appropriate. From the aspect of integrating Spark, Spark made a lot of optimizations for Parquet, e.g. We can enjoy Spark's vectorized reading and lazy dict decoding, etc. And here are my thoughts about integrating Spark

Re: doubt about measure of processedRowCount

2018-11-06 Thread JiaTao Tao
right when I config > "kylin.query.stream-aggregate-enabled=false". > You are right. Records are pre-aggregated by GTStreamAggregateScanner. > > > ------ 原始邮件 -- > *发件人:* "JiaTao Tao"; > *发送时间:* 2018年11月6日(星期二) 晚上10:50 > *收件人:* "

Re: Kylin Cluster Mode Issue Overwriting conflict /user/ADMIN

2018-11-09 Thread JiaTao Tao
Seems the same with this JIRA: https://issues.apache.org/jira/browse/KYLIN-3562. Shrikant Bang 于2018年11月10日周六 上午2:28写道: > Hi Team, > > I have 3 node Kylin (v2.5.0-hbase1.x) Cluster (1all+2query). I am > seeing HTTP response codes for QUERY REST APIs giving error code 500. I see > these

Re: There was no measure column in the fact table after build cube

2018-11-08 Thread JiaTao Tao
Hi Scott Fan, 1. Kylin only stores aggregated values in cubes, you can try to query sum(PRICE) and see the results. 2. It's as expected, "COUNT aggregation" means count(*), it does not need a column. This link may be helpful: http://kylin.apache.org/docs/tutorial/create_cube.html Scott Fan

Re: Kylin Cluster Mode Issue Overwriting conflict /user/ADMIN

2018-11-10 Thread JiaTao Tao
You are welcome, enjoy Kylin :). Shrikant Bang 于2018年11月11日周日 上午2:35写道: > Thanks JiaTao for response. I have upgraded Kylin to v2.5.1-hbase1.x and > issue resolved. > > Regards, > Shrikant Bang. > > On Sat, Nov 10, 2018 at 7:22 AM JiaTao Tao wrote: > >> Seems the

Re: why does not the job complete?

2018-11-16 Thread JiaTao Tao
Hi "Create Intermediate Flat Hive Table" will submit a job on YARN, and you can check this job to see where it sucks. ebrahim zare 于2018年11月16日周五 下午6:42写道: > hi. > I could install Apache Kylin and built a job but doesnt complete it after > 100 minutes and wait in first step (Create Intermediate

Re: WELCOME to dev@kylin.apache.org

2018-11-06 Thread JiaTao Tao
I have the impression that Kylin may return a wrong answer for the raw query(if you do not have raw measure) but not throw an exception. >From your pic, it seems like a POC, have you ever tried Kylin's tutorial about sample cube ( http://kylin.apache.org/docs/tutorial/kylin_sample.html)? Maybe

Re: 生成的cube部分数据缺失

2019-01-21 Thread JiaTao Tao
Hi Can you try "select count(*)" and compare the result with hive? FYI: http://kylin.apache.org/docs/gettingstarted/faq.html (Why I got an error when running a “select * “ query?) 奥威软件 <3513797...@qq.com> 于2019年1月22日周二 上午5:21写道: > 没有group by 也一样能查到数据的 > 例如把 goodsid 改为1137, > select * from

Re: An incorrect result when I used kylin join

2019-01-18 Thread JiaTao Tao
Hi Kylin' cube only has aggregated data, so try some aggregations in SQL like min/max etc. FYI: http://kylin.apache.org/docs/gettingstarted/faq.html (Why I got an error when running a “select * “ query?) 雒智 于2019年1月18日周五 上午1:43写道: > > hello : > My dimension table has a column named

Re: 生成的cube部分数据缺失

2019-01-22 Thread JiaTao Tao
> > > kylin: > select count(*) from ICSTOCKBILL_1W > result:10366 > > > hive: > select count(*) from ICSTOCKBILL_1W > result:10411 > > > > > -- 原始邮件 -- > 发件人: "JiaTao Tao"; > 发送时间: 2019年1月22日(星期二) 下午

Re: [Discuss] Moving toward Apache Kylin 3.0

2019-01-23 Thread JiaTao Tao
+1 -- Regards! Aron Tao ShaoFeng Shi 于2019年1月23日周三 上午7:57写道: > Hi Kylin developers, > > In last week, Kylin released v2.6.0, with the enhanced & distributed query > cache and JDBC data source SDK. After this release, the next batch > candidate features include real-time streaming,

Re: 答复: show the kylin sql for timeout

2018-12-26 Thread JiaTao Tao
At present, Kylin's min "kylin.query.timeout-seconds" is 60s, and you can not set this smaller. If you want to simulate the scenario of timeout, you can take a look at "ITKylinQueryTest#testTimeoutQuery". It uses a hack way, see:

Re: kylin sql query timeout

2018-12-20 Thread JiaTao Tao
HI, you can take a look at org.apache.kylin.common.exceptions.KylinTimeoutException. 黄云尧 于2018年12月20日周四 上午6:39写道: > I want to know excepion class when a sql query was timeout , someone > knows? > > > > > > -- Regards! Aron Tao

Re: [VOTE] Release apache-kylin-2.5.2 (RC2)

2018-11-30 Thread JiaTao Tao
+1 mvn test passed ShaoFeng Shi 于2018年11月30日周五 下午1:57写道: > Hi all, > > I have created a build for Apache Kylin 2.5.2, release candidate 2. > > Changes: > [KYLIN-3187] - JDK APIs using the default locale, time zone or character > set should be avoided > [KYLIN-3636] - Wrong "storage_type" in

Re: [DISCUSS] Stop inserting git diffs to JIRA ticket

2018-12-02 Thread JiaTao Tao
+1 ShaoFeng Shi 于2018年12月3日周一 上午1:46写道: > Hello Kylin developers, > > After we enable the git box for Kylin code repository, when there is a PR > merged, the "ASF Github Bot" will insert the git diff to the associated > JIRA. We noticed this function will make the JIRA very long when the code >

Re: dont complete a job in apache kylin.

2018-11-19 Thread JiaTao Tao
It seems that you are doing POC with your own PC, here's a link for you: http://kylin.apache.org/docs/install/index.html > We recommend you to try out Kylin or develop it using the integrated > sandbox, such as HDP sandbox, and make sure it has at least 10 GB of > memory. When configuring a

Re: [Announce] Welcome new Apache Kylin committer: ChunEn Ni (倪春恩)

2018-11-27 Thread JiaTao Tao
Congratulations! ShaoFeng Shi 于2018年11月27日周二 上午7:59写道: > The Project Management Committee (PMC) for Apache Kylin > has invited ChunEn Ni(倪春恩) to become a committer and we are pleased > to announce that he has accepted. > > Congratulations and welcome, ChunEn! > > Shaofeng Shi > > On behalf of

Re: [VOTE] Release apache-kylin-2.5.2 (RC1)

2018-11-27 Thread JiaTao Tao
+1 Yichen Zhou 于2018年11月27日周二 上午11:48写道: > mvn test passed > +1 > > -Yichen > > Chao Long 于2018年11月27日周二 下午6:37写道: > > > +1 > > mvn test pass > > > > > > -- > > - > > Chao Long > > > > > > > > > > > > > > > > -- 原始邮件 -- > > 发件人:

Re: [VOTE] Release apache-kylin-2.6.0 (RC1)

2019-01-09 Thread JiaTao Tao
+1 mvn test passed Yanghong Zhong 于2019年1月9日周三 上午2:46写道: > Hi all, > > I have created a build for Apache Kylin 2.6.0, release candidate 1. > > Changes highlights: > [KYLIN-2895] - Refine query cache by changing the query cache expiration > strategy by signature checking and introducing

Re: 答复: 请问可以设置多台机器同时构建cube吗?

2019-01-08 Thread JiaTao Tao
Kylin will submit cubing tasks on Yarn, if your Hadoop cluster has multi-nodes, it can use their abilities NoOne <3513797...@qq.com> 于2019年1月8日周二 上午8:04写道: > sorry,我问的是可以设置多台机器同时构建同一个cube吗? > > -- > Sent from: http://apache-kylin.74782.x6.nabble.com/ > -- Regards! Aron Tao

Re: Increment Upload in Kylin

2019-01-08 Thread JiaTao Tao
Seems like "incremental build"? Cube data consists of segments and every building is a new segment and will not refresh the old segs. somu0...@gmail.com 于2019年1月7日周一 上午2:16写道: > Is there any feature in kylin which will do increment update without > refreshing the complete cube. for example if

Re: help kylin

2019-01-04 Thread JiaTao Tao
Hi I cannot see your pic, can you post the pic again, or describe the problem? 王建圆 于2019年1月4日周五 上午10:04写道: > hello,I import kylin project used IDEA accord to > http://kylin.apache.org/cn/development/dev_env.html,I encountered some > errors when I started. > [image: image.png] > Can you give

Re: Re: use single quote in sql ,how to escape

2018-12-19 Thread JiaTao Tao
You are welcome! 黄云尧 于2018年12月19日周三 上午8:33写道: > thanks 。you are right。 Single quotes are escaped by doubling them up. > 发件人:JiaTao Tao > 发送日期:2018-12-19 16:28:48 > 收件人:dev@kylin.apache.org > 主题:Re: use single quote in sql ,how to escape>Hi > >In SQL, Single quotes are

Re: use single quote in sql ,how to escape

2018-12-19 Thread JiaTao Tao
Hi In SQL, Single quotes are escaped by doubling them up. Try this: select * from buzz_info where title like '%hello i'' am kangkan%' By the way, Kylin is not suitable for answer "select *", see this: Why I got an error when running a “select * “ query? (

Re: Re: Evaluate Kylin on Parquet

2018-12-19 Thread JiaTao Tao
Hi Gang In my opinion, segments/partition pruning is actually in the scope of "Index system", we can have an "Index system" in storage level including File index(for segment/partition pruning), page index(for page pruning) etc. We can put all these stuff in such a system and make the separation

Re: [DISCUSS] Kylin 3.0 alpha and beta release before GA

2019-03-26 Thread JiaTao Tao
Seems awesome, looking forward to Kylin 3.0. -- Regards! Aron Tao ShaoFeng Shi 于2019年3月25日周一 上午1:24写道: > Hello, > > About two months ago, we raised the "[Discuss] Moving toward Apache Kylin > 3.0" in the developer group, all agree to use 3.0 as the next major release > version when the

Re: Debug kylin2.6X with CDH5.15.It didn't build cube.

2019-03-28 Thread JiaTao Tao
Hi You can check the maven profile and will find it uses "hdp" profile, and there exists "cdh" profile, but it still has some work to do as I used to try. As Chao Long said, we recommend use HDP sandbox to debug. -- Regards! Aron Tao Lio_Messi 于2019年3月28日周四 下午12:11写道: > I want to debug

Re: [VOTE] Release apache-kylin-2.6.1 (RC1)

2019-03-04 Thread JiaTao Tao
+1 -- Regards! Aron Tao ShaoFeng Shi 于2019年3月4日周一 上午10:35写道: > Hi all, > > I have created a build for Apache Kylin 2.6.1, release candidate 1. > > Changes highlights: > [KYLIN-3494] - Build cube with spark reports ArrayIndexOutOfBoundsException > [KYLIN-3537] - Use Spark to build Cube on

Re: development environment

2019-02-21 Thread JiaTao Tao
Hi Please check Ambari and see whether the HIVE is healthy or not. If not, you can check the HIVE's logs(/var/log/hive). -- Regards! Aron Tao XiaoHui Zhang <18125...@bjtu.edu.cn> 于2019年2月22日周五 上午2:56写道: > Hi, dear team,I am a beginner of Kylin and I am building kylin > development

Re: can not open kylin web ui

2019-02-22 Thread JiaTao Tao
Hi You can check files in "${KYLIN_HOME)/logs" and see if there's something unexpected occurs. -- Regards! Aron Tao hetadesai56 于2019年2月22日周五 下午2:34写道: > Hi, > > I am working on HDP 2.6.5 on virtual box. I have installed > apache-kylin-2.6.0-bin. kylin started successfully. But Web UI is

Re: [Discussion] Enable shrunken dictionary by default

2019-03-17 Thread JiaTao Tao
+1, seems improved a lot. -- Regards! Aron Tao Xiaoxiang Yu 于2019年3月18日周一 上午2:27写道: > Dear all, > I suggest enable "kylin.dictionary.shrunken-from-global-enabled" by > default(it is disabled by default), because I found enable it will speed up > cube build process when cube have count

Re: Cube build failure at Step 2

2019-03-13 Thread JiaTao Tao
You are welcome. nithya.mb4...@gmail.com 于2019年3月13日周三 上午9:19写道: > Thank you. > For now I have changed tez-site.xml in tez-client config and it is working > fine. I will consider this option if it fails again. > > Thanks, > Nithya > > -- > Sent from: http://apache-kylin.74782.x6.nabble.com/ >

Re: [Discuss] Won't ship Spark binary in Kylin binary anymore

2019-03-09 Thread JiaTao Tao
+1 -- Regards! Aron Tao ShaoFeng Shi 于2019年3月8日周五 上午2:43写道: > Hello, > > As we know Kylin ships a Spark in its binary package; The total package > becomes bigger and bigger as the version grows; the latest version (v2.6.1) > is bigger than 350MB, which was rejected by Apache SVN server

Re: kylin top-n query

2019-03-18 Thread JiaTao Tao
And this may also help: http://kylin.apache.org/docs/tutorial/create_cube.html (go to the "TOP_N" Section) -- Regards! Aron Tao 黄云尧 于2019年3月18日周一 下午12:06写道: > someone has documents for top-n query in kylin ? > > > >

Re: Cube build failure at Step 2

2019-03-13 Thread JiaTao Tao
Hi I recommend you use MR instead of Tez, you can add this to your kylin_hive_conf.xml in KYLIN_HOME/conf or direct modify you hive-site.xml > > > hive.execution.engine > > mr > > > > -- Regards! Aron Tao nithya.mb4...@gmail.com 于2019年3月12日周二 下午1:02写道: > Hello, >

Re: Build cube exception: java.io.FileNotFoundException

2019-02-20 Thread JiaTao Tao
As a beginner, I recommend you using the integrated sandbox to deploy Kylin, such as HDP sandbox ( http://hortonworks.com/products/hortonworks-sandbox) or CDH. -- Regards! Aron Tao jiangxiaoma111 <369806...@qq.com> 于2019年2月19日周二 上午10:02写道: > hi, all. > I am a beginner of kylin. My

Re: 查询无结果

2019-01-29 Thread JiaTao Tao
查询结果为0,这是为啥呢 > select > EXCHANGE_NAME > from HUOBI_GLOBAL.HUOBI_YUNYING_DW_KYLIN_USER_INDEX_DAILY > where HUOBI_YUNYING_DW_KYLIN_USER_INDEX_DAILY.EXCHANGE_NAME = 'b11' > > 在 2019/1/30 上午9:45,“JiaTao Tao” 写入: > > Hi, > Can not see your pic, c

Re: 查询无结果

2019-01-29 Thread JiaTao Tao
Hi, Can not see your pic, can you describe your problem? 廉立伟 于2019年1月29日周二 下午2:26写道: > > > 你好 加where查询没有结果 为啥 > > M *lianli...@huobi.com * > > > -- Regards! Aron Tao

Re: Cleaning up hdfs working directory

2019-01-24 Thread JiaTao Tao
Hi Take a look at this: http://kylin.apache.org/docs/howto/howto_cleanup_storage.html kdcool6932 于2019年1月24日周四 上午8:04写道: > Hi Kylin,We are having a a lot of data in our hdfs working directory, > around 10tb , for last one year or so, this is acutally more than the hbase > usage of kylin(around

Re: Kylin+Spark = NoClassDefFoundError

2019-01-30 Thread JiaTao Tao
If you want to use your own Spark, try a binary package with Hadoop. Kamil 于2019年1月30日周三 下午11:40写道: > Hello All, > > I'm new Kylin user. I successfully managed to get everything work with > "Sample Cube" (http://kylin.apache.org/docs/tutorial/kylin_sample.html) > > Now I wanted to make it

Re: [New Document] Kylin SQL reference

2019-01-30 Thread JiaTao Tao
Very helpful! Thanks to Na Zhai. ShaoFeng Shi 于2019年1月30日周三 下午3:13写道: > Hello Kylin users, > > A new document is added to Apache Kylin website for introducing the SQL > grammar, functions and data types that Kylin supports; We believe it will > help new users. Many thanks to Na Zhai, who

Re: [ANNOUNCE] Kaisen Kang joins the Apache Kylin PMC

2019-04-16 Thread JiaTao Tao
Congratulations! -- Regards! Aron Tao Luke Han 于2019年4月16日周二 上午5:09写道: > On behalf of the Apache Kylin PMC I am pleased to announce that Kaisen Kang > has accepted our invitation to become a PMC member on the Apache Kylin > project. We appreciate Kaisen stepping up to take more

Re: [DISCUSSION] Don't need to purge existing segment of cube to add new measures in Kylin

2019-04-19 Thread JiaTao Tao
Hi The idea that supports Kylin adding measures dynamically is impressive. But in my opinion, once you add a measure, the existing segments should also calculate the new measure(just add a new measure column). Users can have many cubes, a cube can have many segments, if measure's view is

Re: 回复: 安装问题

2019-04-19 Thread JiaTao Tao
Hi Maybe you can change the execution engine from TEZ to MR and give it a try. Add this hive.execution.engine mr to kylin_hive_conf.xml or direct change it in hive-site.xml. -- Regards! Aron Tao mingwen@analyticservice.net 于2019年4月19日周五 下午12:43写道: > HI

Re: [VOTE] Release apache-kylin-2.6.2 (RC1)

2019-05-13 Thread JiaTao Tao
+1 -- Regards! Aron Tao ShaoFeng Shi 于2019年5月14日周二 上午1:10写道: > Hi all, > > I have created a build for Apache Kylin 2.6.2, release candidate 1. > > Changes highlights: > [KYLIN-3892] - Set cubing job priority > [KYLIN-3839] - Storage clean up after refreshing or deleting a segment >

Re: 使用left join查询时报错

2019-05-26 Thread JiaTao Tao
Hi What's your join type in your model? -- Regards! Aron Tao Gods_Dusk <197795...@qq.com> 于2019年5月26日周日 下午12:18写道: > 使用下面的语句查询时报错 > select bingrenxingming, >jiaofeibiaoji, >yaowumingcheng, >danjia, >shuliang, >jiuzhenriqi, >kaidankeshi, >

Re: [ANNOUNCE] Gang Ma joins the Apache Kylin PMC

2019-06-05 Thread JiaTao Tao
Congrats ! -- Regards! Aron Tao ShaoFeng Shi 于2019年6月3日周一 上午5:32写道: > On behalf of the Apache Kylin PMC, I am pleased to announce that Gang Ma > (马刚) has accepted our invitation to become a PMC member on the Apache Kylin > project. We appreciate Gang stepping up to take more responsibility

Re: Plan to host the first "Kylin Data Summit" event

2019-05-30 Thread JiaTao Tao
Looking forward to it! -- Regards! Aron Tao ShaoFeng Shi 于2019年5月30日周四 上午6:29写道: > Hello Kylin developers and users, > > > > We (Kyligence Inc) planned to host the first "Kylin Data Summit" event at > Shanghai, China. This event is going to provide a place to share, discuss > the

Re: cube-rowkey排序咨询

2019-06-10 Thread JiaTao Tao
And this link( https://www.slideshare.net/YangLi43/design-cube-in-apache-kylin) that Shaofeng previous shared is also very helpful, see this chapter: "The Order of Dimensions" -- Regards! Aron Tao Xiaoxiang Yu 于2019年6月11日周二 上午2:45写道: > Hi, wangfx > > Kylin converts sql query to two

Re: [ANNOUNCE] New Committer: Jiatao Tao

2019-06-12 Thread JiaTao Tao
est Regards > > > PENG Zhengshuai > > > > > > > On Jun 13, 2019, at 10:46 AM, ShaoFeng Shi > > wrote: > > > > > > > > The Project Management Committee (PMC) for Apache Kylin > > > > has invited Jiatao Tao to become a committe

Re: [VOTE] Release apache-kylin-2.6.3 (RC1)

2019-07-01 Thread JiaTao Tao
+1 -- Regards! Aron Tao ShaoFeng Shi 于2019年7月1日周一 上午1:27写道: > Hi all, > > I have created a build for Apache Kylin 2.6.3, release candidate 1. > > Changes highlights: > - [KYLIN-4024] - Support pushdown to Presto > - [KYLIN-3977] - Avoid mistaken deleting dicts by storage cleanup while >

[jira] [Created] (KYLIN-3669) Add logs to GTStreamAggregateScanner

2018-11-06 Thread Jiatao Tao (JIRA)
Jiatao Tao created KYLIN-3669: - Summary: Add logs to GTStreamAggregateScanner Key: KYLIN-3669 URL: https://issues.apache.org/jira/browse/KYLIN-3669 Project: Kylin Issue Type: Improvement

[jira] [Created] (KYLIN-3834) Add monitor for curator-based scheduler

2019-02-26 Thread Jiatao Tao (JIRA)
Jiatao Tao created KYLIN-3834: - Summary: Add monitor for curator-based scheduler Key: KYLIN-3834 URL: https://issues.apache.org/jira/browse/KYLIN-3834 Project: Kylin Issue Type: Improvement

[jira] [Created] (KYLIN-3960) Only update user when login in LDAP environment

2019-04-17 Thread Jiatao Tao (JIRA)
Jiatao Tao created KYLIN-3960: - Summary: Only update user when login in LDAP environment Key: KYLIN-3960 URL: https://issues.apache.org/jira/browse/KYLIN-3960 Project: Kylin Issue Type