[jira] [Updated] (HIVE-9343) Fix windowing.q for Spark on trunk

2015-01-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9343: - Assignee: Rui Li Status: Patch Available (was: Open) > Fix windowing.q for Spark on trunk >

[jira] [Commented] (HIVE-9343) Fix windowing.q for Spark on trunk

2015-01-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14273261#comment-14273261 ] Rui Li commented on HIVE-9343: -- OK I'll take a look > Fix windowing.q for Spark on trunk > --

[jira] [Commented] (HIVE-9339) Optimize split grouping for CombineHiveInputFormat [Spark Branch]

2015-01-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14273191#comment-14273191 ] Rui Li commented on HIVE-9339: -- Using listener is fine. We currently use listeners to collect

[jira] [Updated] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9251: - Attachment: HIVE-9251.6-spark.patch Rebase the patch and include more update. > SetSparkReducerParallelism is like

[jira] [Updated] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9251: - Attachment: HIVE-9251.5-spark.patch I missed some update to optimize_nullscan.q Update patch. > SetSparkReducerPar

[jira] [Commented] (HIVE-9290) Make some test results deterministic

2015-01-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14270742#comment-14270742 ] Rui Li commented on HIVE-9290: -- Hi [~szehon], sorry I didn't provide a patch for spark branch

[jira] [Updated] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9251: - Attachment: HIVE-9251.4-spark.patch Update more golden files. > SetSparkReducerParallelism is likely to set too sm

[jira] [Commented] (HIVE-9290) Make some test results deterministic

2015-01-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14270456#comment-14270456 ] Rui Li commented on HIVE-9290: -- The failed test is not related to the patch here. > Make some

[jira] [Updated] (HIVE-9290) Make some test results deterministic

2015-01-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9290: - Attachment: HIVE-9290.1.patch Reload patch to trigger test > Make some test results deterministic > --

[jira] [Commented] (HIVE-9290) Make some test results deterministic

2015-01-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14268843#comment-14268843 ] Rui Li commented on HIVE-9290: -- Thanks [~xuefuz] for the explanation! > Make some test result

[jira] [Updated] (HIVE-9290) Make some test results deterministic

2015-01-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9290: - Status: Patch Available (was: Open) > Make some test results deterministic >

[jira] [Updated] (HIVE-9290) Make some test results deterministic

2015-01-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9290: - Attachment: HIVE-9290.1.patch Not sure if it's correct to make limit_pushdown.q deterministic. cc [~xuefuz] > Make

[jira] [Commented] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14268728#comment-14268728 ] Rui Li commented on HIVE-9251: -- [~xuefuz] - you're right. I think we should fix HIVE-9290 firs

[jira] [Updated] (HIVE-9290) Make some test results deterministic

2015-01-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9290: - Description: {noformat} limit_pushdown.q optimize_nullscan.q ppd_gby_join.q vector_string_concat.q {noformat} wa

[jira] [Updated] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9251: - Attachment: HIVE-9251.3-spark.patch Addressed RB comments and updated golden files. Some notes about the reducer c

[jira] [Created] (HIVE-9290) Make some test results deterministic

2015-01-07 Thread Rui Li (JIRA)
Rui Li created HIVE-9290: Summary: Make some test results deterministic Key: HIVE-9290 URL: https://issues.apache.org/jira/browse/HIVE-9290 Project: Hive Issue Type: Test Reporter: Rui Li

[jira] [Commented] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14267276#comment-14267276 ] Rui Li commented on HIVE-9251: -- That basically means cluster info is not available. So hive wi

[jira] [Commented] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14267229#comment-14267229 ] Rui Li commented on HIVE-9251: -- Hi [~xuefuz], yeah I'll update the golden files if you think t

[jira] [Commented] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14267213#comment-14267213 ] Rui Li commented on HIVE-9251: -- I quickly checked the failed tests. Most of them are in query

[jira] [Updated] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9251: - Status: Patch Available (was: Open) > SetSparkReducerParallelism is likely to set too small number of reducers >

[jira] [Updated] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9251: - Attachment: HIVE-9251.2-spark.patch Thanks [~jxiang] and [~xuefuz]. Upload another patch. I didn't remove the memo

[jira] [Commented] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-04 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14264135#comment-14264135 ] Rui Li commented on HIVE-9251: -- Hi [~xuefuz], I think {{hive.exec.reducers.bytes.per.reducer}}

[jira] [Updated] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-04 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9251: - Attachment: HIVE-9251.1-spark.patch Submit a patch for review. BTW, maybe we don't need memory per task to calculat

[jira] [Assigned] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-04 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-9251: Assignee: Rui Li > SetSparkReducerParallelism is likely to set too small number of reducers > [Spark Branch

[jira] [Updated] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-04 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9251: - Description: This may hurt performance or even lead to task failures. For example, spark's netty-based shuffle limi

[jira] [Commented] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-04 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14263821#comment-14263821 ] Rui Li commented on HIVE-9251: -- Basically the following problems will lead to small number of

[jira] [Created] (HIVE-9251) SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch]

2015-01-04 Thread Rui Li (JIRA)
Rui Li created HIVE-9251: Summary: SetSparkReducerParallelism is likely to set too small number of reducers [Spark Branch] Key: HIVE-9251 URL: https://issues.apache.org/jira/browse/HIVE-9251 Project: Hive

[jira] [Created] (HIVE-9227) Make HiveInputSplit support InputSplitWithLocationInfo

2014-12-30 Thread Rui Li (JIRA)
Rui Li created HIVE-9227: Summary: Make HiveInputSplit support InputSplitWithLocationInfo Key: HIVE-9227 URL: https://issues.apache.org/jira/browse/HIVE-9227 Project: Hive Issue Type: Improvement

[jira] [Commented] (HIVE-8956) Hive hangs while some error/exception happens beyond job execution [Spark Branch]

2014-12-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14260652#comment-14260652 ] Rui Li commented on HIVE-8956: -- Hi [~chirag.aggarwal], which query triggered this error? And y

[jira] [Commented] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat

2014-12-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14259261#comment-14259261 ] Rui Li commented on HIVE-9153: -- Hi [~xuefuz], I don't think it's related. As it's been failing

[jira] [Updated] (HIVE-9216) Avoid redundant clone of JobConf [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9216: - Status: Patch Available (was: Open) > Avoid redundant clone of JobConf [Spark Branch] > --

[jira] [Updated] (HIVE-9216) Avoid redundant clone of JobConf [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9216: - Attachment: HIVE-9216.1-spark.patch > Avoid redundant clone of JobConf [Spark Branch] > ---

[jira] [Updated] (HIVE-9216) Avoid redundant clone of JobConf [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9216: - Priority: Minor (was: Major) > Avoid redundant clone of JobConf [Spark Branch] > -

[jira] [Updated] (HIVE-9216) Avoid redundant clone of JobConf [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9216: - Summary: Avoid redundant clone of JobConf [Spark Branch] (was: Avoid redundant clone of JobConf) > Avoid redundan

[jira] [Created] (HIVE-9216) Avoid redundant clone of JobConf

2014-12-25 Thread Rui Li (JIRA)
Rui Li created HIVE-9216: Summary: Avoid redundant clone of JobConf Key: HIVE-9216 URL: https://issues.apache.org/jira/browse/HIVE-9216 Project: Hive Issue Type: Sub-task Components: Spark

[jira] [Commented] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14258909#comment-14258909 ] Rui Li commented on HIVE-9153: -- Strange thing is that {{Utilities}} is different in trunk and

[jira] [Updated] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9153: - Attachment: HIVE-9153.3.patch Seems the redundant code in {{Utilities.getBasework}} has been taken care of in trun

[jira] [Updated] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9153: - Attachment: HIVE-9153.2.patch Upload trunk patch > Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark B

[jira] [Commented] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14258885#comment-14258885 ] Rui Li commented on HIVE-9153: -- Hi [~brocknoland] and [~xuefuz], Sorry maybe I was being conf

[jira] [Commented] (HIVE-9135) Cache Map and Reduce works in RSC [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14258741#comment-14258741 ] Rui Li commented on HIVE-9135: -- I'm not sure if this is correct: we clone JobConf in {{SparkP

[jira] [Updated] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9153: - Status: Patch Available (was: Open) > Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch] >

[jira] [Updated] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9153: - Attachment: HIVE-9153.1-spark.patch This patch should further improve spark performance by avoid retrieving MapWork

[jira] [Commented] (HIVE-9191) TimeOutException when using RSC with beeline [Spark Branch]

2014-12-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14256825#comment-14256825 ] Rui Li commented on HIVE-9191: -- I also run into this using Cli. > TimeOutException when using

[jira] [Commented] (HIVE-8722) Enhance InputSplitShims to extend InputSplitWithLocationInfo [Spark Branch]

2014-12-22 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14255668#comment-14255668 ] Rui Li commented on HIVE-8722: -- Hi [~brocknoland], I suppose this will be a little tricky for

[jira] [Commented] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-22 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14255650#comment-14255650 ] Rui Li commented on HIVE-9153: -- [~xuefuz] - I was wrong about turning off delay schedule. Actu

[jira] [Commented] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14253164#comment-14253164 ] Rui Li commented on HIVE-9153: -- Investigated a bit about why {{CombineHiveInputFormat.getLocat

[jira] [Commented] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14252874#comment-14252874 ] Rui Li commented on HIVE-9153: -- I think we actually can get location info with {{CombineHiveIn

[jira] [Commented] (HIVE-8722) Enhance InputSplitShims to extend InputSplitWithLocationInfo [Spark Branch]

2014-12-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14252859#comment-14252859 ] Rui Li commented on HIVE-8722: -- Hi [~jxiang], yes I think data locality can have dramatic impa

[jira] [Commented] (HIVE-8722) Enhance InputSplitShims to extend InputSplitWithLocationInfo [Spark Branch]

2014-12-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14252804#comment-14252804 ] Rui Li commented on HIVE-8722: -- I think spark doesn't require the input split to be a {{Input

[jira] [Commented] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14252762#comment-14252762 ] Rui Li commented on HIVE-9153: -- Hi [~xuefuz] - if the spark cluster is the same as the hadoop

[jira] [Commented] (HIVE-8722) Enhance InputSplitShims to extend InputSplitWithLocationInfo [Spark Branch]

2014-12-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14251643#comment-14251643 ] Rui Li commented on HIVE-8722: -- Never mind my last comments. That's because I used hadoop-2.4

[jira] [Updated] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9153: - Attachment: screenshot.PNG > Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch] > --

[jira] [Commented] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14251597#comment-14251597 ] Rui Li commented on HIVE-9153: -- Judging from the results, I think fewer mappers can improve ov

[jira] [Commented] (HIVE-8722) Enhance InputSplitShims to extend InputSplitWithLocationInfo [Spark Branch]

2014-12-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14251557#comment-14251557 ] Rui Li commented on HIVE-8722: -- I got this exception which also seems related: {noformat} 2014

[jira] [Commented] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14251407#comment-14251407 ] Rui Li commented on HIVE-9153: -- I used our cluster B to test this. Results show that CombineHi

[jira] [Assigned] (HIVE-9153) Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch]

2014-12-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-9153: Assignee: Rui Li > Evaluate CombineHiveInputFormat versus HiveInputFormat [Spark Branch] > -

[jira] [Commented] (HIVE-8972) Implement more fine-grained remote client-level events [Spark Branch]

2014-12-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250960#comment-14250960 ] Rui Li commented on HIVE-8972: -- Thanks guys for the review. > Implement more fine-grained rem

[jira] [Updated] (HIVE-8972) Implement more fine-grained remote client-level events [Spark Branch]

2014-12-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8972: - Attachment: HIVE-8972.5-spark.patch Try again. The failures {{union_remove_10}} and {{join10}} are all due to timeo

[jira] [Updated] (HIVE-8972) Implement more fine-grained remote client-level events [Spark Branch]

2014-12-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8972: - Attachment: HIVE-8972.4-spark.patch The latest patch only consists of minor fix and clean up. I talked about this w

[jira] [Commented] (HIVE-9127) Improve CombineHiveInputFormat.getSplit performance

2014-12-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14249481#comment-14249481 ] Rui Li commented on HIVE-9127: -- [~xuefuz] - Oh I see. Thanks for the explanation! > Improve C

[jira] [Commented] (HIVE-9097) Support runtime skew join for more queries [Spark Branch]

2014-12-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14249463#comment-14249463 ] Rui Li commented on HIVE-9097: -- Thanks [~xuefuz] for the review. > Support runtime skew join

[jira] [Updated] (HIVE-9097) Support runtime skew join for more queries [Spark Branch]

2014-12-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9097: - Component/s: Spark > Support runtime skew join for more queries [Spark Branch] > --

[jira] [Updated] (HIVE-9097) Support runtime skew join for more queries [Spark Branch]

2014-12-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9097: - Affects Version/s: spark-branch > Support runtime skew join for more queries [Spark Branch] > -

[jira] [Commented] (HIVE-9127) Improve CombineHiveInputFormat.getSplit performance

2014-12-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14249287#comment-14249287 ] Rui Li commented on HIVE-9127: -- Will this cache Map/Reduce works for spark? Seems changes to U

[jira] [Updated] (HIVE-9097) Support runtime skew join for more queries [Spark Branch]

2014-12-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9097: - Status: Patch Available (was: Open) > Support runtime skew join for more queries [Spark Branch] >

[jira] [Updated] (HIVE-9097) Support runtime skew join for more queries [Spark Branch]

2014-12-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9097: - Attachment: HIVE-9097.1-spark.patch The patch splits the original spark task into two tasks so that conditional map

[jira] [Assigned] (HIVE-9097) Support runtime skew join for more queries [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-9097: Assignee: Rui Li > Support runtime skew join for more queries [Spark Branch] > -

[jira] [Assigned] (HIVE-9098) Check cross product for conditional task [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-9098: Assignee: Rui Li > Check cross product for conditional task [Spark Branch] > ---

[jira] [Updated] (HIVE-9098) Check cross product for conditional task [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9098: - Status: Patch Available (was: Open) > Check cross product for conditional task [Spark Branch] > --

[jira] [Updated] (HIVE-9098) Check cross product for conditional task [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9098: - Attachment: HIVE-9098.1-spark.patch > Check cross product for conditional task [Spark Branch] > ---

[jira] [Created] (HIVE-9098) Check cross product for conditional task [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
Rui Li created HIVE-9098: Summary: Check cross product for conditional task [Spark Branch] Key: HIVE-9098 URL: https://issues.apache.org/jira/browse/HIVE-9098 Project: Hive Issue Type: Sub-task

[jira] [Commented] (HIVE-7816) Enable map-join tests which Tez executes [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14246273#comment-14246273 ] Rui Li commented on HIVE-7816: -- That's OK. No worries, I'll take care of that. > Enable map-j

[jira] [Commented] (HIVE-7816) Enable map-join tests which Tez executes [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14246268#comment-14246268 ] Rui Li commented on HIVE-7816: -- [~xuefuz] - shall we wait a little bit? That's just a couple o

[jira] [Updated] (HIVE-9097) Support runtime skew join for more queries [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9097: - Description: After HIVE-8913, runtime skew join is enabled for spark. But currently the optimization only supports

[jira] [Updated] (HIVE-9097) Support runtime skew join for more queries [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9097: - Issue Type: Improvement (was: Bug) > Support runtime skew join for more queries [Spark Branch] > -

[jira] [Created] (HIVE-9097) Support runtime skew join for more queries [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
Rui Li created HIVE-9097: Summary: Support runtime skew join for more queries [Spark Branch] Key: HIVE-9097 URL: https://issues.apache.org/jira/browse/HIVE-9097 Project: Hive Issue Type: Bug

[jira] [Commented] (HIVE-7816) Enable map-join tests which Tez executes [Spark Branch]

2014-12-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14246213#comment-14246213 ] Rui Li commented on HIVE-7816: -- Hi [~xuefuz] yeah we have to deal with conditional task as wel

[jira] [Updated] (HIVE-7816) Enable map-join tests which Tez executes [Spark Branch]

2014-12-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-7816: - Status: Patch Available (was: Open) > Enable map-join tests which Tez executes [Spark Branch] > --

[jira] [Updated] (HIVE-7816) Enable map-join tests which Tez executes [Spark Branch]

2014-12-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-7816: - Attachment: HIVE-7816.1-spark.patch This patch implements a spark-specific cross product checker {{SparkCrossProdu

[jira] [Commented] (HIVE-7816) Enable map-join tests which Tez executes [Spark Branch]

2014-12-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14243892#comment-14243892 ] Rui Li commented on HIVE-7816: -- We already have golden files for {{filter_join_breaktask.q}} a

[jira] [Updated] (HIVE-8913) Make SparkMapJoinResolver handle runtime skew join [Spark Branch]

2014-12-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8913: - Attachment: HIVE-8913.3-spark.patch Address RB comments > Make SparkMapJoinResolver handle runtime skew join [Spar

[jira] [Commented] (HIVE-7816) Enable map-join tests which Tez executes [Spark Branch]

2014-12-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14243589#comment-14243589 ] Rui Li commented on HIVE-7816: -- OK I'm on it. > Enable map-join tests which Tez executes [Spa

[jira] [Commented] (HIVE-8913) Make SparkMapJoinResolver handle runtime skew join [Spark Branch]

2014-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242145#comment-14242145 ] Rui Li commented on HIVE-8913: -- I think the call path should be {{ExecMapperContext.clear -> I

[jira] [Commented] (HIVE-8913) Make SparkMapJoinResolver handle runtime skew join [Spark Branch]

2014-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242087#comment-14242087 ] Rui Li commented on HIVE-8913: -- Another possible issue is that we have to make sure the thread

[jira] [Commented] (HIVE-8913) Make SparkMapJoinResolver handle runtime skew join [Spark Branch]

2014-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242035#comment-14242035 ] Rui Li commented on HIVE-8913: -- Just quick thought, maybe {{IOContext.inputNameIOContextMap}}

[jira] [Commented] (HIVE-8913) Make SparkMapJoinResolver handle runtime skew join [Spark Branch]

2014-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242020#comment-14242020 ] Rui Li commented on HIVE-8913: -- I think the failure is not related because it passes on my mac

[jira] [Commented] (HIVE-9019) Avoid using SPARK_JAVA_OPTS [Spark Branch]

2014-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241993#comment-14241993 ] Rui Li commented on HIVE-9019: -- OK... let me know if you find other problems with this :-) >

[jira] [Commented] (HIVE-9063) NPE in RemoteSparkJobStatus.getSparkStatistics [Spark Branch]

2014-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241984#comment-14241984 ] Rui Li commented on HIVE-9063: -- [~xuefuz] - that seems to be an auto optimize of IntelliJ. I'l

[jira] [Commented] (HIVE-8913) Make SparkMapJoinResolver handle runtime skew join [Spark Branch]

2014-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241026#comment-14241026 ] Rui Li commented on HIVE-8913: -- Since we only enable runtime skew join for simple cases where

[jira] [Updated] (HIVE-8913) Make SparkMapJoinResolver handle runtime skew join [Spark Branch]

2014-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8913: - Attachment: HIVE-8913.2-spark.patch Fix concurrent modification exception. Update to {{bucket_map_join_tez1.q}} and

[jira] [Updated] (HIVE-8913) Make SparkMapJoinResolver handle runtime skew join [Spark Branch]

2014-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8913: - Status: Patch Available (was: Open) > Make SparkMapJoinResolver handle runtime skew join [Spark Branch] >

[jira] [Updated] (HIVE-8913) Make SparkMapJoinResolver handle runtime skew join [Spark Branch]

2014-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8913: - Attachment: HIVE-8913.1-spark.patch The patch only supports runtime skew join for simple queries where join is the

[jira] [Updated] (HIVE-9063) NPE in RemoteSparkJobStatus.getSparkStatistics [Spark Branch]

2014-12-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9063: - Attachment: HIVE-9063.1-spark.patch cc [~brocknoland] > NPE in RemoteSparkJobStatus.getSparkStatistics [Spark Bran

[jira] [Updated] (HIVE-9063) NPE in RemoteSparkJobStatus.getSparkStatistics [Spark Branch]

2014-12-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9063: - Status: Patch Available (was: Open) > NPE in RemoteSparkJobStatus.getSparkStatistics [Spark Branch] >

[jira] [Created] (HIVE-9063) NPE in RemoteSparkJobStatus.getSparkStatistics [Spark Branch]

2014-12-09 Thread Rui Li (JIRA)
Rui Li created HIVE-9063: Summary: NPE in RemoteSparkJobStatus.getSparkStatistics [Spark Branch] Key: HIVE-9063 URL: https://issues.apache.org/jira/browse/HIVE-9063 Project: Hive Issue Type: Bug

[jira] [Updated] (HIVE-9063) NPE in RemoteSparkJobStatus.getSparkStatistics [Spark Branch]

2014-12-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9063: - Issue Type: Sub-task (was: Bug) Parent: HIVE-7292 > NPE in RemoteSparkJobStatus.getSparkStatistics [Spark

[jira] [Updated] (HIVE-9017) Clean up temp files of RSC [Spark Branch]

2014-12-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9017: - Summary: Clean up temp files of RSC [Spark Branch] (was: Clean up temp files after unit tests [Spark Branch]) > C

[jira] [Updated] (HIVE-9017) Clean up temp files of RSC [Spark Branch]

2014-12-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9017: - Description: Currently RSC will leave a lot of temp files in {{/tmp}}, including {{*_lock}}, {{*_cache}}, {{spark-

[jira] [Commented] (HIVE-9019) Avoid using SPARK_JAVA_OPTS [Spark Branch]

2014-12-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14238795#comment-14238795 ] Rui Li commented on HIVE-9019: -- Thanks [~brocknoland] for the review. > Avoid using SPARK_JAV

[jira] [Commented] (HIVE-9019) Avoid using SPARK_JAVA_OPTS [Spark Branch]

2014-12-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14237833#comment-14237833 ] Rui Li commented on HIVE-9019: -- Hi [~brocknoland], could you help to verify if this patch serv

<    1   2   3   4   5   6   7   >