[jira] [Resolved] (KYLIN-3277) Kylin should override hiveconf settings when connecting to hive using jdbc
[ https://issues.apache.org/jira/browse/KYLIN-3277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Billy Liu resolved KYLIN-3277. -- Resolution: Fixed > Kylin should override hiveconf settings when connecting to hive using jdbc > -- > > Key: KYLIN-3277 > URL: https://issues.apache.org/jira/browse/KYLIN-3277 > Project: Kylin > Issue Type: Bug >Reporter: Chuqian Yu >Assignee: Chuqian Yu >Priority: Major > Labels: patch > Fix For: v2.4.0 > > Attachments: 0001-KYLIN-3277.patch > > > Hi, kylin developers. My cube building procedure failing at Step 2 > "Redistribute Flat Hive Table" because Kylin always trying to submit a mr job > to the default yarn queue. > > I have overrided the mapred.job.queue.name property in both > kylin_hive_conf.xml and kylin.properties but it doesn't work. > > kylin.properties > ``` > kylin.source.hive.beeline-params=-n hive -p hive --hiveconf > mapred.job.queue.name=myQueue -u > "jdbc:hive2://myZk:2181/;serviceDiscoveryMode=zooKeeper;" > ``` > > kylin_hive_conf.xml > ``` > > mapred.job.queue.name > myQueue > > ``` > > After digging into the source code ,I found that kylin try to get the row > count of hive table before redistributing it. But it dose not override the > hive configuration when using jdbc to connect to hive server. > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (KYLIN-3311) Segments overlap error (refactor write conflict exception)
[ https://issues.apache.org/jira/browse/KYLIN-3311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Billy Liu resolved KYLIN-3311. -- Resolution: Fixed Assignee: liyang Fix Version/s: v2.4.0 > Segments overlap error (refactor write conflict exception) > -- > > Key: KYLIN-3311 > URL: https://issues.apache.org/jira/browse/KYLIN-3311 > Project: Kylin > Issue Type: Bug > Components: Metadata >Affects Versions: v2.3.0 >Reporter: xujing >Assignee: liyang >Priority: Major > Labels: build > Fix For: v2.4.0 > > Attachments: Segments_Overlap_ErrorLog.txt > > > when "updateCubeWithRetry" method be called at first time , > line newSegs.validate();was passed . > then > cube = crud.save(cube);seem with error throw exception > write conflict to update cube at try 0 ,will retry... > while retry "updateCubeWithRetry" start > line newSegs.validate();was not passed . > throw exception > Segments overlap: [2018031800_2018031900] and > sales_order_channel[2018031800_201803 > 1900] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (KYLIN-3315) allow each project to set its own source in project level override configuration
[ https://issues.apache.org/jira/browse/KYLIN-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Billy Liu resolved KYLIN-3315. -- Resolution: Fixed Assignee: Dong Li Fix Version/s: v2.4.0 > allow each project to set its own source in project level override > configuration > > > Key: KYLIN-3315 > URL: https://issues.apache.org/jira/browse/KYLIN-3315 > Project: Kylin > Issue Type: Improvement > Components: Metadata >Reporter: Dong Li >Assignee: Dong Li >Priority: Major > Fix For: v2.4.0 > > > Currently, all projects connect to the same source which is set in > kylin.properties with kylin.source.default property. > It's better to allow each project to set its own source in project level > override configuration. > As the result, we can have project A connects to JDBC, and project B connects > to Hive. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (KYLIN-3324) NegativeArraySizeException in CreateDictionaryJob$2.getDictionary()
[ https://issues.apache.org/jira/browse/KYLIN-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Billy Liu resolved KYLIN-3324. -- Resolution: Fixed Fix Version/s: v2.4.0 > NegativeArraySizeException in CreateDictionaryJob$2.getDictionary() > --- > > Key: KYLIN-3324 > URL: https://issues.apache.org/jira/browse/KYLIN-3324 > Project: Kylin > Issue Type: Bug >Reporter: liyang >Assignee: liyang >Priority: Major > Fix For: v2.4.0 > > > During cube build, got following exception: > {noformat} > 2018-01-31 09:58:14,982 ERROR [Scheduler 311988476 Job > b931b081-3b44-462b-9bce-e15bf245f1b9-862] common.HadoopShellExecutable : > error execute > HadoopShellExecutable{id=b931b081-3b44-462b-9bce-e15bf245f1b9-03, name=Build > Dimension Dictionary, state=RUNNING} > java.lang.NegativeArraySizeException > at > org.apache.hadoop.io.BytesWritable.setCapacity(BytesWritable.java:144) > at org.apache.hadoop.io.BytesWritable.setSize(BytesWritable.java:123) > at org.apache.hadoop.io.BytesWritable.readFields(BytesWritable.java:179) > at > org.apache.hadoop.io.SequenceFile$Reader.getCurrentValue(SequenceFile.java:2259) > at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:2387) > at > org.apache.kylin.engine.mr.steps.CreateDictionaryJob$2.getDictionary(CreateDictionaryJob.java:87) > at > org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:62) > at > org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:49) > at > org.apache.kylin.engine.mr.steps.CreateDictionaryJob.run(CreateDictionaryJob.java:66) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) > at > org.apache.kylin.engine.mr.common.HadoopShellExecutable.doWork(HadoopShellExecutable.java:62) > at > org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:125) > at > org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:64) > at > org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:125) > at > org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:156) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745){noformat} > Root cause seems to be > https://stackoverflow.com/questions/24127304/negativearraysizeexception-when-creating-a-sequencefile-with-large-1gb-bytesw -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3277) Kylin should override hiveconf settings when connecting to hive using jdbc
[ https://issues.apache.org/jira/browse/KYLIN-3277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16424971#comment-16424971 ] ASF subversion and git services commented on KYLIN-3277: Commit 417c4b646eb28c3f5dd51378e5cac1546e65c7ff in kylin's branch refs/heads/master from [~nichunen] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=417c4b6 ] KYLIN-3277 Code review > Kylin should override hiveconf settings when connecting to hive using jdbc > -- > > Key: KYLIN-3277 > URL: https://issues.apache.org/jira/browse/KYLIN-3277 > Project: Kylin > Issue Type: Bug >Reporter: Chuqian Yu >Assignee: Chuqian Yu >Priority: Major > Labels: patch > Fix For: v2.4.0 > > Attachments: 0001-KYLIN-3277.patch > > > Hi, kylin developers. My cube building procedure failing at Step 2 > "Redistribute Flat Hive Table" because Kylin always trying to submit a mr job > to the default yarn queue. > > I have overrided the mapred.job.queue.name property in both > kylin_hive_conf.xml and kylin.properties but it doesn't work. > > kylin.properties > ``` > kylin.source.hive.beeline-params=-n hive -p hive --hiveconf > mapred.job.queue.name=myQueue -u > "jdbc:hive2://myZk:2181/;serviceDiscoveryMode=zooKeeper;" > ``` > > kylin_hive_conf.xml > ``` > > mapred.job.queue.name > myQueue > > ``` > > After digging into the source code ,I found that kylin try to get the row > count of hive table before redistributing it. But it dose not override the > hive configuration when using jdbc to connect to hive server. > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3315) allow each project to set its own source in project level override configuration
[ https://issues.apache.org/jira/browse/KYLIN-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16424974#comment-16424974 ] ASF subversion and git services commented on KYLIN-3315: Commit 5552164ba09eba989b9ddccdf3f1e4f83ed0b799 in kylin's branch refs/heads/master from [~lidong_sjtu] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=5552164 ] KYLIN-3315 allow each project to set its own source > allow each project to set its own source in project level override > configuration > > > Key: KYLIN-3315 > URL: https://issues.apache.org/jira/browse/KYLIN-3315 > Project: Kylin > Issue Type: Improvement > Components: Metadata >Reporter: Dong Li >Priority: Major > > Currently, all projects connect to the same source which is set in > kylin.properties with kylin.source.default property. > It's better to allow each project to set its own source in project level > override configuration. > As the result, we can have project A connects to JDBC, and project B connects > to Hive. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3277) Kylin should override hiveconf settings when connecting to hive using jdbc
[ https://issues.apache.org/jira/browse/KYLIN-3277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16424970#comment-16424970 ] ASF subversion and git services commented on KYLIN-3277: Commit cfe84a18f3f54d405ceb394ed3b40bd4b09a692b in kylin's branch refs/heads/master from [~yuchuqian] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=cfe84a1 ] KYLIN-3277 Kylin should override hiveconf settings when connecting to hive using jdbc Signed-off-by: nichunen> Kylin should override hiveconf settings when connecting to hive using jdbc > -- > > Key: KYLIN-3277 > URL: https://issues.apache.org/jira/browse/KYLIN-3277 > Project: Kylin > Issue Type: Bug >Reporter: Chuqian Yu >Assignee: Chuqian Yu >Priority: Major > Labels: patch > Fix For: v2.4.0 > > Attachments: 0001-KYLIN-3277.patch > > > Hi, kylin developers. My cube building procedure failing at Step 2 > "Redistribute Flat Hive Table" because Kylin always trying to submit a mr job > to the default yarn queue. > > I have overrided the mapred.job.queue.name property in both > kylin_hive_conf.xml and kylin.properties but it doesn't work. > > kylin.properties > ``` > kylin.source.hive.beeline-params=-n hive -p hive --hiveconf > mapred.job.queue.name=myQueue -u > "jdbc:hive2://myZk:2181/;serviceDiscoveryMode=zooKeeper;" > ``` > > kylin_hive_conf.xml > ``` > > mapred.job.queue.name > myQueue > > ``` > > After digging into the source code ,I found that kylin try to get the row > count of hive table before redistributing it. But it dose not override the > hive configuration when using jdbc to connect to hive server. > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3324) NegativeArraySizeException in CreateDictionaryJob$2.getDictionary()
[ https://issues.apache.org/jira/browse/KYLIN-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16424972#comment-16424972 ] ASF subversion and git services commented on KYLIN-3324: Commit 343d63d19d281905c2900b250c11d5375d68503d in kylin's branch refs/heads/master from Li Yang [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=343d63d ] KYLIN-3324 fix NegativeArraySizeException in CreateDictionaryJob$2.getDictionary() > NegativeArraySizeException in CreateDictionaryJob$2.getDictionary() > --- > > Key: KYLIN-3324 > URL: https://issues.apache.org/jira/browse/KYLIN-3324 > Project: Kylin > Issue Type: Bug >Reporter: liyang >Assignee: liyang >Priority: Major > > During cube build, got following exception: > {noformat} > 2018-01-31 09:58:14,982 ERROR [Scheduler 311988476 Job > b931b081-3b44-462b-9bce-e15bf245f1b9-862] common.HadoopShellExecutable : > error execute > HadoopShellExecutable{id=b931b081-3b44-462b-9bce-e15bf245f1b9-03, name=Build > Dimension Dictionary, state=RUNNING} > java.lang.NegativeArraySizeException > at > org.apache.hadoop.io.BytesWritable.setCapacity(BytesWritable.java:144) > at org.apache.hadoop.io.BytesWritable.setSize(BytesWritable.java:123) > at org.apache.hadoop.io.BytesWritable.readFields(BytesWritable.java:179) > at > org.apache.hadoop.io.SequenceFile$Reader.getCurrentValue(SequenceFile.java:2259) > at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:2387) > at > org.apache.kylin.engine.mr.steps.CreateDictionaryJob$2.getDictionary(CreateDictionaryJob.java:87) > at > org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:62) > at > org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:49) > at > org.apache.kylin.engine.mr.steps.CreateDictionaryJob.run(CreateDictionaryJob.java:66) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) > at > org.apache.kylin.engine.mr.common.HadoopShellExecutable.doWork(HadoopShellExecutable.java:62) > at > org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:125) > at > org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:64) > at > org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:125) > at > org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:156) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745){noformat} > Root cause seems to be > https://stackoverflow.com/questions/24127304/negativearraysizeexception-when-creating-a-sequencefile-with-large-1gb-bytesw -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3311) Segments overlap error (refactor write conflict exception)
[ https://issues.apache.org/jira/browse/KYLIN-3311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16424969#comment-16424969 ] ASF subversion and git services commented on KYLIN-3311: Commit a6bfc9d019b7df4d96d46db2b8e010323fe4fddf in kylin's branch refs/heads/master from Li Yang [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=a6bfc9d ] KYLIN-3311 refactor Resource.checkAndPutResourceImpl() to throw WriteConflictException > Segments overlap error (refactor write conflict exception) > -- > > Key: KYLIN-3311 > URL: https://issues.apache.org/jira/browse/KYLIN-3311 > Project: Kylin > Issue Type: Bug > Components: Metadata >Affects Versions: v2.3.0 >Reporter: xujing >Priority: Major > Labels: build > Attachments: Segments_Overlap_ErrorLog.txt > > > when "updateCubeWithRetry" method be called at first time , > line newSegs.validate();was passed . > then > cube = crud.save(cube);seem with error throw exception > write conflict to update cube at try 0 ,will retry... > while retry "updateCubeWithRetry" start > line newSegs.validate();was not passed . > throw exception > Segments overlap: [2018031800_2018031900] and > sales_order_channel[2018031800_201803 > 1900] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3315) allow each project to set its own source in project level override configuration
[ https://issues.apache.org/jira/browse/KYLIN-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16424973#comment-16424973 ] ASF subversion and git services commented on KYLIN-3315: Commit 709e5640bc1f59bbb7753951fde5f22ac941aee4 in kylin's branch refs/heads/master from [~lidong_sjtu] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=709e564 ] KYLIN-3315 allow each project to set its own source > allow each project to set its own source in project level override > configuration > > > Key: KYLIN-3315 > URL: https://issues.apache.org/jira/browse/KYLIN-3315 > Project: Kylin > Issue Type: Improvement > Components: Metadata >Reporter: Dong Li >Priority: Major > > Currently, all projects connect to the same source which is set in > kylin.properties with kylin.source.default property. > It's better to allow each project to set its own source in project level > override configuration. > As the result, we can have project A connects to JDBC, and project B connects > to Hive. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Comment Edited] (KYLIN-3308) Improvement exception in REST
[ https://issues.apache.org/jira/browse/KYLIN-3308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16407559#comment-16407559 ] Pan, Julian edited comment on KYLIN-3308 at 4/3/18 11:08 AM: - I'm not sure whether the controller2 will replace controller and I should enhance this issue in controller. was (Author: julianpan): I can lead the exception defined if needed. BTW, I'm not sure whether the controller2 will cover it or not. > Improvement exception in REST > - > > Key: KYLIN-3308 > URL: https://issues.apache.org/jira/browse/KYLIN-3308 > Project: Kylin > Issue Type: Improvement > Components: REST Service >Reporter: Pan, Julian >Assignee: Pan, Julian >Priority: Major > > There are kinds of exception throw by rest call, and we need better defined > the exception. > For example: > 1> There are three types of exception for cube equals null in CubeController: > NotFoundException (deleteCube) > InternalErrorException(getCube) > BadRequestException (cloneCube) > 2> And sometimes InternalErrorException will warp the BadRequestException in > CubeController.saveCubeDesc > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3001) Fix the wrong Cache key issue
[ https://issues.apache.org/jira/browse/KYLIN-3001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423810#comment-16423810 ] Pan, Julian commented on KYLIN-3001: It is replace for multiple space to one space, which will reduce the sql length. > Fix the wrong Cache key issue > -- > > Key: KYLIN-3001 > URL: https://issues.apache.org/jira/browse/KYLIN-3001 > Project: Kylin > Issue Type: Bug > Components: Query Engine >Affects Versions: v2.1.0 >Reporter: Pan, Julian >Assignee: Pan, Julian >Priority: Major > Fix For: v2.3.0 > > Attachments: KYLIN_3001_Cache_key_issue.patch > > > Currently the cacheKey generate by SQLRequest getCacheKey: > cacheKey = Lists.newArrayList(sql.replaceAll("\\s+", ""), project, offset, > limit, acceptPartial, > backdoorToggles); > Which will remove all the space and break line. > If I have a sql: > select a, sum(c) from table > --where a > 0 group by a; > and another sql: > select a, sum(c) from table > --where a > 0 > group by a; > The two sql will share same cache key. > But the first sql should be break. If you query second sql, and then query > the first again. It will return result. > And another issue is if remove all space, which will mix a as b and aasb. > I think the better way is just replace multiple space to one space > Should I create a patch? Or just provide the fix: > cacheKey = Lists.newArrayList(sql.replaceAll("[ ]", " "), project, offset, > limit, acceptPartial, > backdoorToggles); -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3311) Segments overlap error (refactor write conflict exception)
[ https://issues.apache.org/jira/browse/KYLIN-3311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423568#comment-16423568 ] ASF subversion and git services commented on KYLIN-3311: Commit 3222de27c508ed3e7576330dc8c40206fe6cd84d in kylin's branch refs/heads/sync from Li Yang [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=3222de2 ] KYLIN-3311 refactor Resource.checkAndPutResourceImpl() to throw WriteConflictException > Segments overlap error (refactor write conflict exception) > -- > > Key: KYLIN-3311 > URL: https://issues.apache.org/jira/browse/KYLIN-3311 > Project: Kylin > Issue Type: Bug > Components: Metadata >Affects Versions: v2.3.0 >Reporter: xujing >Priority: Major > Labels: build > Attachments: Segments_Overlap_ErrorLog.txt > > > when "updateCubeWithRetry" method be called at first time , > line newSegs.validate();was passed . > then > cube = crud.save(cube);seem with error throw exception > write conflict to update cube at try 0 ,will retry... > while retry "updateCubeWithRetry" start > line newSegs.validate();was not passed . > throw exception > Segments overlap: [2018031800_2018031900] and > sales_order_channel[2018031800_201803 > 1900] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3324) NegativeArraySizeException in CreateDictionaryJob$2.getDictionary()
[ https://issues.apache.org/jira/browse/KYLIN-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423571#comment-16423571 ] ASF subversion and git services commented on KYLIN-3324: Commit 35009d465b9c628e2be8ab9bceff7852e20a4d42 in kylin's branch refs/heads/sync from Li Yang [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=35009d4 ] KYLIN-3324 fix NegativeArraySizeException in CreateDictionaryJob$2.getDictionary() > NegativeArraySizeException in CreateDictionaryJob$2.getDictionary() > --- > > Key: KYLIN-3324 > URL: https://issues.apache.org/jira/browse/KYLIN-3324 > Project: Kylin > Issue Type: Bug >Reporter: liyang >Assignee: liyang >Priority: Major > > During cube build, got following exception: > {noformat} > 2018-01-31 09:58:14,982 ERROR [Scheduler 311988476 Job > b931b081-3b44-462b-9bce-e15bf245f1b9-862] common.HadoopShellExecutable : > error execute > HadoopShellExecutable{id=b931b081-3b44-462b-9bce-e15bf245f1b9-03, name=Build > Dimension Dictionary, state=RUNNING} > java.lang.NegativeArraySizeException > at > org.apache.hadoop.io.BytesWritable.setCapacity(BytesWritable.java:144) > at org.apache.hadoop.io.BytesWritable.setSize(BytesWritable.java:123) > at org.apache.hadoop.io.BytesWritable.readFields(BytesWritable.java:179) > at > org.apache.hadoop.io.SequenceFile$Reader.getCurrentValue(SequenceFile.java:2259) > at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:2387) > at > org.apache.kylin.engine.mr.steps.CreateDictionaryJob$2.getDictionary(CreateDictionaryJob.java:87) > at > org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:62) > at > org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:49) > at > org.apache.kylin.engine.mr.steps.CreateDictionaryJob.run(CreateDictionaryJob.java:66) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) > at > org.apache.kylin.engine.mr.common.HadoopShellExecutable.doWork(HadoopShellExecutable.java:62) > at > org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:125) > at > org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:64) > at > org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:125) > at > org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:156) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745){noformat} > Root cause seems to be > https://stackoverflow.com/questions/24127304/negativearraysizeexception-when-creating-a-sequencefile-with-large-1gb-bytesw -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3315) allow each project to set its own source in project level override configuration
[ https://issues.apache.org/jira/browse/KYLIN-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423572#comment-16423572 ] ASF subversion and git services commented on KYLIN-3315: Commit 54bdae63cfca137c02cdf8749f93d152a9ac2010 in kylin's branch refs/heads/sync from [~lidong_sjtu] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=54bdae6 ] KYLIN-3315 allow each project to set its own source > allow each project to set its own source in project level override > configuration > > > Key: KYLIN-3315 > URL: https://issues.apache.org/jira/browse/KYLIN-3315 > Project: Kylin > Issue Type: Improvement > Components: Metadata >Reporter: Dong Li >Priority: Major > > Currently, all projects connect to the same source which is set in > kylin.properties with kylin.source.default property. > It's better to allow each project to set its own source in project level > override configuration. > As the result, we can have project A connects to JDBC, and project B connects > to Hive. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3277) Kylin should override hiveconf settings when connecting to hive using jdbc
[ https://issues.apache.org/jira/browse/KYLIN-3277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423570#comment-16423570 ] ASF subversion and git services commented on KYLIN-3277: Commit 2e58857d8d6903e5a0e5dc92efdaf93e7fe1f479 in kylin's branch refs/heads/sync from [~nichunen] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=2e58857 ] KYLIN-3277 Code review > Kylin should override hiveconf settings when connecting to hive using jdbc > -- > > Key: KYLIN-3277 > URL: https://issues.apache.org/jira/browse/KYLIN-3277 > Project: Kylin > Issue Type: Bug >Reporter: Chuqian Yu >Assignee: Chuqian Yu >Priority: Major > Labels: patch > Fix For: v2.4.0 > > Attachments: 0001-KYLIN-3277.patch > > > Hi, kylin developers. My cube building procedure failing at Step 2 > "Redistribute Flat Hive Table" because Kylin always trying to submit a mr job > to the default yarn queue. > > I have overrided the mapred.job.queue.name property in both > kylin_hive_conf.xml and kylin.properties but it doesn't work. > > kylin.properties > ``` > kylin.source.hive.beeline-params=-n hive -p hive --hiveconf > mapred.job.queue.name=myQueue -u > "jdbc:hive2://myZk:2181/;serviceDiscoveryMode=zooKeeper;" > ``` > > kylin_hive_conf.xml > ``` > > mapred.job.queue.name > myQueue > > ``` > > After digging into the source code ,I found that kylin try to get the row > count of hive table before redistributing it. But it dose not override the > hive configuration when using jdbc to connect to hive server. > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3315) allow each project to set its own source in project level override configuration
[ https://issues.apache.org/jira/browse/KYLIN-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423573#comment-16423573 ] ASF subversion and git services commented on KYLIN-3315: Commit d3ca7ef4a69cb5c957754606140df0566e41b202 in kylin's branch refs/heads/sync from [~lidong_sjtu] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=d3ca7ef ] KYLIN-3315 allow each project to set its own source > allow each project to set its own source in project level override > configuration > > > Key: KYLIN-3315 > URL: https://issues.apache.org/jira/browse/KYLIN-3315 > Project: Kylin > Issue Type: Improvement > Components: Metadata >Reporter: Dong Li >Priority: Major > > Currently, all projects connect to the same source which is set in > kylin.properties with kylin.source.default property. > It's better to allow each project to set its own source in project level > override configuration. > As the result, we can have project A connects to JDBC, and project B connects > to Hive. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3277) Kylin should override hiveconf settings when connecting to hive using jdbc
[ https://issues.apache.org/jira/browse/KYLIN-3277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423569#comment-16423569 ] ASF subversion and git services commented on KYLIN-3277: Commit 25b4b3bc06b30a4030f4949d97a05566bcbddca4 in kylin's branch refs/heads/sync from [~yuchuqian] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=25b4b3b ] KYLIN-3277 Kylin should override hiveconf settings when connecting to hive using jdbc Signed-off-by: nichunen> Kylin should override hiveconf settings when connecting to hive using jdbc > -- > > Key: KYLIN-3277 > URL: https://issues.apache.org/jira/browse/KYLIN-3277 > Project: Kylin > Issue Type: Bug >Reporter: Chuqian Yu >Assignee: Chuqian Yu >Priority: Major > Labels: patch > Fix For: v2.4.0 > > Attachments: 0001-KYLIN-3277.patch > > > Hi, kylin developers. My cube building procedure failing at Step 2 > "Redistribute Flat Hive Table" because Kylin always trying to submit a mr job > to the default yarn queue. > > I have overrided the mapred.job.queue.name property in both > kylin_hive_conf.xml and kylin.properties but it doesn't work. > > kylin.properties > ``` > kylin.source.hive.beeline-params=-n hive -p hive --hiveconf > mapred.job.queue.name=myQueue -u > "jdbc:hive2://myZk:2181/;serviceDiscoveryMode=zooKeeper;" > ``` > > kylin_hive_conf.xml > ``` > > mapred.job.queue.name > myQueue > > ``` > > After digging into the source code ,I found that kylin try to get the row > count of hive table before redistributing it. But it dose not override the > hive configuration when using jdbc to connect to hive server. > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3277) Kylin should override hiveconf settings when connecting to hive using jdbc
[ https://issues.apache.org/jira/browse/KYLIN-3277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423559#comment-16423559 ] ASF subversion and git services commented on KYLIN-3277: Commit a2d3b5f50b4c4c49dfe7b4ead488713f1b1654b8 in kylin's branch refs/heads/sync from [~nichunen] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=a2d3b5f ] KYLIN-3277 Code review > Kylin should override hiveconf settings when connecting to hive using jdbc > -- > > Key: KYLIN-3277 > URL: https://issues.apache.org/jira/browse/KYLIN-3277 > Project: Kylin > Issue Type: Bug >Reporter: Chuqian Yu >Assignee: Chuqian Yu >Priority: Major > Labels: patch > Fix For: v2.4.0 > > Attachments: 0001-KYLIN-3277.patch > > > Hi, kylin developers. My cube building procedure failing at Step 2 > "Redistribute Flat Hive Table" because Kylin always trying to submit a mr job > to the default yarn queue. > > I have overrided the mapred.job.queue.name property in both > kylin_hive_conf.xml and kylin.properties but it doesn't work. > > kylin.properties > ``` > kylin.source.hive.beeline-params=-n hive -p hive --hiveconf > mapred.job.queue.name=myQueue -u > "jdbc:hive2://myZk:2181/;serviceDiscoveryMode=zooKeeper;" > ``` > > kylin_hive_conf.xml > ``` > > mapred.job.queue.name > myQueue > > ``` > > After digging into the source code ,I found that kylin try to get the row > count of hive table before redistributing it. But it dose not override the > hive configuration when using jdbc to connect to hive server. > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3277) Kylin should override hiveconf settings when connecting to hive using jdbc
[ https://issues.apache.org/jira/browse/KYLIN-3277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423558#comment-16423558 ] ASF subversion and git services commented on KYLIN-3277: Commit df597a7f625ea2bb3ea103706f56934608015936 in kylin's branch refs/heads/sync from [~yuchuqian] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=df597a7 ] KYLIN-3277 Kylin should override hiveconf settings when connecting to hive using jdbc Signed-off-by: nichunen> Kylin should override hiveconf settings when connecting to hive using jdbc > -- > > Key: KYLIN-3277 > URL: https://issues.apache.org/jira/browse/KYLIN-3277 > Project: Kylin > Issue Type: Bug >Reporter: Chuqian Yu >Assignee: Chuqian Yu >Priority: Major > Labels: patch > Fix For: v2.4.0 > > Attachments: 0001-KYLIN-3277.patch > > > Hi, kylin developers. My cube building procedure failing at Step 2 > "Redistribute Flat Hive Table" because Kylin always trying to submit a mr job > to the default yarn queue. > > I have overrided the mapred.job.queue.name property in both > kylin_hive_conf.xml and kylin.properties but it doesn't work. > > kylin.properties > ``` > kylin.source.hive.beeline-params=-n hive -p hive --hiveconf > mapred.job.queue.name=myQueue -u > "jdbc:hive2://myZk:2181/;serviceDiscoveryMode=zooKeeper;" > ``` > > kylin_hive_conf.xml > ``` > > mapred.job.queue.name > myQueue > > ``` > > After digging into the source code ,I found that kylin try to get the row > count of hive table before redistributing it. But it dose not override the > hive configuration when using jdbc to connect to hive server. > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3315) allow each project to set its own source in project level override configuration
[ https://issues.apache.org/jira/browse/KYLIN-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423561#comment-16423561 ] ASF subversion and git services commented on KYLIN-3315: Commit 69420aad854a62c137b742f353e35511d5484df7 in kylin's branch refs/heads/sync from [~lidong_sjtu] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=69420aa ] KYLIN-3315 allow each project to set its own source > allow each project to set its own source in project level override > configuration > > > Key: KYLIN-3315 > URL: https://issues.apache.org/jira/browse/KYLIN-3315 > Project: Kylin > Issue Type: Improvement > Components: Metadata >Reporter: Dong Li >Priority: Major > > Currently, all projects connect to the same source which is set in > kylin.properties with kylin.source.default property. > It's better to allow each project to set its own source in project level > override configuration. > As the result, we can have project A connects to JDBC, and project B connects > to Hive. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3324) NegativeArraySizeException in CreateDictionaryJob$2.getDictionary()
[ https://issues.apache.org/jira/browse/KYLIN-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423560#comment-16423560 ] ASF subversion and git services commented on KYLIN-3324: Commit 08a0b9295088a91ac6b34ed3951ed8fb8404ee04 in kylin's branch refs/heads/sync from Li Yang [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=08a0b92 ] KYLIN-3324 fix NegativeArraySizeException in CreateDictionaryJob$2.getDictionary() > NegativeArraySizeException in CreateDictionaryJob$2.getDictionary() > --- > > Key: KYLIN-3324 > URL: https://issues.apache.org/jira/browse/KYLIN-3324 > Project: Kylin > Issue Type: Bug >Reporter: liyang >Assignee: liyang >Priority: Major > > During cube build, got following exception: > {noformat} > 2018-01-31 09:58:14,982 ERROR [Scheduler 311988476 Job > b931b081-3b44-462b-9bce-e15bf245f1b9-862] common.HadoopShellExecutable : > error execute > HadoopShellExecutable{id=b931b081-3b44-462b-9bce-e15bf245f1b9-03, name=Build > Dimension Dictionary, state=RUNNING} > java.lang.NegativeArraySizeException > at > org.apache.hadoop.io.BytesWritable.setCapacity(BytesWritable.java:144) > at org.apache.hadoop.io.BytesWritable.setSize(BytesWritable.java:123) > at org.apache.hadoop.io.BytesWritable.readFields(BytesWritable.java:179) > at > org.apache.hadoop.io.SequenceFile$Reader.getCurrentValue(SequenceFile.java:2259) > at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:2387) > at > org.apache.kylin.engine.mr.steps.CreateDictionaryJob$2.getDictionary(CreateDictionaryJob.java:87) > at > org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:62) > at > org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:49) > at > org.apache.kylin.engine.mr.steps.CreateDictionaryJob.run(CreateDictionaryJob.java:66) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) > at > org.apache.kylin.engine.mr.common.HadoopShellExecutable.doWork(HadoopShellExecutable.java:62) > at > org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:125) > at > org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:64) > at > org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:125) > at > org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:156) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745){noformat} > Root cause seems to be > https://stackoverflow.com/questions/24127304/negativearraysizeexception-when-creating-a-sequencefile-with-large-1gb-bytesw -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3311) Segments overlap error (refactor write conflict exception)
[ https://issues.apache.org/jira/browse/KYLIN-3311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423557#comment-16423557 ] ASF subversion and git services commented on KYLIN-3311: Commit e465a51fab70da301571511adf4ebfb358026aea in kylin's branch refs/heads/sync from Li Yang [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=e465a51 ] KYLIN-3311 refactor Resource.checkAndPutResourceImpl() to throw WriteConflictException > Segments overlap error (refactor write conflict exception) > -- > > Key: KYLIN-3311 > URL: https://issues.apache.org/jira/browse/KYLIN-3311 > Project: Kylin > Issue Type: Bug > Components: Metadata >Affects Versions: v2.3.0 >Reporter: xujing >Priority: Major > Labels: build > Attachments: Segments_Overlap_ErrorLog.txt > > > when "updateCubeWithRetry" method be called at first time , > line newSegs.validate();was passed . > then > cube = crud.save(cube);seem with error throw exception > write conflict to update cube at try 0 ,will retry... > while retry "updateCubeWithRetry" start > line newSegs.validate();was not passed . > throw exception > Segments overlap: [2018031800_2018031900] and > sales_order_channel[2018031800_201803 > 1900] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (KYLIN-3315) allow each project to set its own source in project level override configuration
[ https://issues.apache.org/jira/browse/KYLIN-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423562#comment-16423562 ] ASF subversion and git services commented on KYLIN-3315: Commit a5841b963fa313a73db510f07ebb489fa999e1fe in kylin's branch refs/heads/sync from [~lidong_sjtu] [ https://gitbox.apache.org/repos/asf?p=kylin.git;h=a5841b9 ] KYLIN-3315 allow each project to set its own source > allow each project to set its own source in project level override > configuration > > > Key: KYLIN-3315 > URL: https://issues.apache.org/jira/browse/KYLIN-3315 > Project: Kylin > Issue Type: Improvement > Components: Metadata >Reporter: Dong Li >Priority: Major > > Currently, all projects connect to the same source which is set in > kylin.properties with kylin.source.default property. > It's better to allow each project to set its own source in project level > override configuration. > As the result, we can have project A connects to JDBC, and project B connects > to Hive. -- This message was sent by Atlassian JIRA (v7.6.3#76005)