[jira] [Commented] (HIVE-9697) Hive on Spark is not as aggressive as MR on map join [Spark Branch]

2015-03-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364420#comment-14364420 ] Rui Li commented on HIVE-9697: -- Hi [~xuefuz], I remember there was some discussion about this

[jira] [Commented] (HIVE-9697) Hive on Spark is not as aggressive as MR on map join [Spark Branch]

2015-03-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14365102#comment-14365102 ] Rui Li commented on HIVE-9697: -- [~csun] - I think MR doesn't use rawDataSize even when it's

[jira] [Commented] (HIVE-10006) RSC has memory leak while execute multi queries.[Spark Branch]

2015-03-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14370575#comment-14370575 ] Rui Li commented on HIVE-10006: --- The map work is retrieved/cached in

[jira] [Commented] (HIVE-9697) Hive on Spark is not as aggressive as MR on map join [Spark Branch]

2015-03-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14370634#comment-14370634 ] Rui Li commented on HIVE-9697: -- To get rawDataSize , user can run an analyze table as

[jira] [Commented] (HIVE-10006) RSC has memory leak while execute multi queries.[Spark Branch]

2015-03-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14370547#comment-14370547 ] Rui Li commented on HIVE-10006: --- If CombineHiveInputFormat is leaking, HiveInputFormat

[jira] [Commented] (HIVE-9697) Hive on Spark is not as aggressive as MR on map join [Spark Branch]

2015-03-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14370657#comment-14370657 ] Rui Li commented on HIVE-9697: -- Yep! Hive on Spark is not as aggressive as MR on map join

[jira] [Commented] (HIVE-9855) Runtime skew join doesn't work when skewed data only exists in big table

2015-03-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14352955#comment-14352955 ] Rui Li commented on HIVE-9855: -- Merged into spark. Runtime skew join doesn't work when

[jira] [Updated] (HIVE-9659) 'Error while trying to create table container' occurs during hive query case execution when hive.optimize.skewjoin set to 'true' [Spark Branch]

2015-03-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9659: - Attachment: HIVE-9659.3-spark.patch Address RB comments 'Error while trying to create table container' occurs

[jira] [Commented] (HIVE-9659) 'Error while trying to create table container' occurs during hive query case execution when hive.optimize.skewjoin set to 'true' [Spark Branch]

2015-03-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14354321#comment-14354321 ] Rui Li commented on HIVE-9659: -- I tried to add golden file for MR for the added test. However

[jira] [Commented] (HIVE-9659) 'Error while trying to create table container' occurs during hive query case execution when hive.optimize.skewjoin set to 'true' [Spark Branch]

2015-03-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14354342#comment-14354342 ] Rui Li commented on HIVE-9659: -- Is there a way to enable the test only for spark? Seems I add

[jira] [Commented] (HIVE-9924) Add SORT_QUERY_RESULTS to union12.q

2015-03-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357009#comment-14357009 ] Rui Li commented on HIVE-9924: -- Thanks Xuefu for taking care of this. I realized HIVE-9569

[jira] [Updated] (HIVE-9882) Add jar/file doesn't work with yarn-cluster mode [Spark Branch]

2015-03-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9882: - Description: It seems current fix for HIVE-9425 only uploads the Jar/Files to HDFS, however, they are not

[jira] [Updated] (HIVE-9882) Add jar/file doesn't work with yarn-cluster mode [Spark Branch]

2015-03-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9882: - Attachment: HIVE-9882.1-spark.patch Upload the proper patch for spark branch. Add jar/file doesn't work with

[jira] [Updated] (HIVE-9882) Add jar/file doesn't work with yarn-cluster mode [Spark Branch]

2015-03-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9882: - Issue Type: Sub-task (was: Bug) Parent: HIVE-7292 Add jar/file doesn't work with yarn-cluster mode

[jira] [Updated] (HIVE-9882) Add jar/file doesn't work with yarn-cluster mode [Spark Branch]

2015-03-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9882: - Attachment: HIVE-9882.1.patch Hi Xiaomin, please try this patch. I can run q10 with it on my side. Add jar/file

[jira] [Updated] (HIVE-9882) Add jar/file doesn't work with yarn-cluster mode [Spark Branch]

2015-03-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9882: - Attachment: HIVE-9882.1-spark.patch Upload same patch to trigger test Add jar/file doesn't work with

[jira] [Updated] (HIVE-9659) 'Error while trying to create table container' occurs during hive query case execution when hive.optimize.skewjoin set to 'true' [Spark Branch]

2015-03-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9659: - Attachment: HIVE-9659.4-spark.patch Add golden file for MR 'Error while trying to create table container' occurs

[jira] [Commented] (HIVE-9659) 'Error while trying to create table container' occurs during hive query case execution when hive.optimize.skewjoin set to 'true' [Spark Branch]

2015-03-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14356088#comment-14356088 ] Rui Li commented on HIVE-9659: -- Xuefu - Thanks very much for the explanation! I'll generate

[jira] [Updated] (HIVE-9924) Add SORT_QUERY_RESULT to union12.q

2015-03-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9924: - Priority: Minor (was: Major) Add SORT_QUERY_RESULT to union12.q --

[jira] [Updated] (HIVE-9924) Add SORT_QUERY_RESULTS to union12.q

2015-03-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9924: - Summary: Add SORT_QUERY_RESULTS to union12.q (was: Add SORT_QUERY_RESULT to union12.q) Add SORT_QUERY_RESULTS

[jira] [Commented] (HIVE-9659) 'Error while trying to create table container' occurs during hive query case execution when hive.optimize.skewjoin set to 'true' [Spark Branch]

2015-03-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14356828#comment-14356828 ] Rui Li commented on HIVE-9659: -- {{union12}} needs SORT_QUERY_RESULT label. {{union31}} failed

[jira] [Commented] (HIVE-9860) MapredLocalTask/SecureCmdDoAs leaks local files

2015-03-05 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14349779#comment-14349779 ] Rui Li commented on HIVE-9860: -- Patch looks good to me. Brock, do you think the failure is

[jira] [Updated] (HIVE-9924) Fix union12 and union31 for spark [Spark Branch]

2015-03-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9924: - Summary: Fix union12 and union31 for spark [Spark Branch] (was: Add SORT_QUERY_RESULTS to union12.q [Spark

[jira] [Updated] (HIVE-9924) Add SORT_QUERY_RESULTS to union12.q [Spark Branch]

2015-03-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9924: - Attachment: HIVE-9924.2-spark.patch Add SORT_QUERY_RESULTS to union12.q [Spark Branch]

[jira] [Updated] (HIVE-9969) Avoid Utilities.getMapRedWork for spark [Spark Branch]

2015-03-31 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9969: - Attachment: HIVE-9969.1-spark.patch Avoid Utilities.getMapRedWork for spark [Spark Branch]

[jira] [Commented] (HIVE-9969) Avoid Utilities.getMapRedWork for spark [Spark Branch]

2015-03-31 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389995#comment-14389995 ] Rui Li commented on HIVE-9969: -- Committed to spark. Thanks Xuefu. Avoid

[jira] [Updated] (HIVE-9969) Avoid Utilities.getMapRedWork for spark [Spark Branch]

2015-03-31 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9969: - Release Note: (was: Committed to spark. Thanks Xuefu.) Avoid Utilities.getMapRedWork for spark [Spark Branch]

[jira] [Commented] (HIVE-10006) RSC has memory leak while execute multi queries.[Spark Branch]

2015-03-22 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14375322#comment-14375322 ] Rui Li commented on HIVE-10006: --- +1 RSC has memory leak while execute multi queries.[Spark

[jira] [Commented] (HIVE-9855) Runtime skew join doesn't work when skewed data only exists in big table

2015-03-04 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347924#comment-14347924 ] Rui Li commented on HIVE-9855: -- I'll commit this shortly Runtime skew join doesn't work when

[jira] [Commented] (HIVE-9869) Trunk doesn't build with hadoop-1

2015-03-05 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14348347#comment-14348347 ] Rui Li commented on HIVE-9869: -- cc [~xuefuz], [~navis], [~ashutoshc], [~adrian-wang] Trunk

[jira] [Updated] (HIVE-9659) 'Error while trying to create table container' occurs during hive query case execution when hive.optimize.skewjoin set to 'true' [Spark Branch]

2015-03-04 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9659: - Attachment: HIVE-9659.2-spark.patch Hi Xin, please help to verify if this patch works. Thanks! 'Error while

[jira] [Updated] (HIVE-9855) Runtime skew join doesn't work when skewed data only exists in big table

2015-03-04 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9855: - Attachment: HIVE-9855.1.patch The problem is FileSystem.rename returns false rather than throws FNF when the

[jira] [Commented] (HIVE-10084) Improve common join performance [Spark Branch]

2015-04-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503124#comment-14503124 ] Rui Li commented on HIVE-10084: --- OOO and travelling abroad from 4/14 to 4/22. Please expect

[jira] [Updated] (HIVE-10458) Enable parallel order by for spark [Spark Branch]

2015-04-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10458: -- Attachment: HIVE-10458.1-spark.patch Trigger tests. Enable parallel order by for spark [Spark Branch]

[jira] [Updated] (HIVE-10527) NPE in SparkUtilities::isDedicatedCluster [Spark Branch]

2015-04-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10527: -- Attachment: HIVE-10527.1-spark.patch cc [~jxiang] NPE in SparkUtilities::isDedicatedCluster [Spark Branch]

[jira] [Updated] (HIVE-10527) NPE in SparkUtilities::isDedicatedCluster [Spark Branch]

2015-04-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10527: -- Summary: NPE in SparkUtilities::isDedicatedCluster [Spark Branch] (was: NPE in

[jira] [Commented] (HIVE-10476) Hive query should fail when it fails to initialize a session in SetSparkReducerParallelism [Spark Branch]

2015-04-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516133#comment-14516133 ] Rui Li commented on HIVE-10476: --- One minor: if the session fails to initialize, we'll get

[jira] [Commented] (HIVE-10476) Hive query should fail when it fails to initialize a session in SetSparkReducerParallelism [Spark Branch]

2015-04-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516278#comment-14516278 ] Rui Li commented on HIVE-10476: --- Yeah that looks good to me. Hive query should fail when

[jira] [Commented] (HIVE-10476) Hive query should fail when it fails to initialize a session in SetSparkReducerParallelism [Spark Branch]

2015-04-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14516362#comment-14516362 ] Rui Li commented on HIVE-10476: --- +1 Hive query should fail when it fails to initialize a

[jira] [Assigned] (HIVE-10671) yarn-cluster mode offers a degraded performance from yarn-client [Spark Branch]

2015-05-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-10671: - Assignee: Rui Li yarn-cluster mode offers a degraded performance from yarn-client [Spark Branch]

[jira] [Commented] (HIVE-10671) yarn-cluster mode offers a degraded performance from yarn-client [Spark Branch]

2015-05-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14539055#comment-14539055 ] Rui Li commented on HIVE-10671: --- OK I'll have a look. yarn-cluster mode offers a degraded

[jira] [Commented] (HIVE-10671) yarn-cluster mode offers a degraded performance from yarn-client [Spark Branch]

2015-05-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14539287#comment-14539287 ] Rui Li commented on HIVE-10671: --- Why does each table have 2 sizes? The following is the

[jira] [Commented] (HIVE-10458) Enable parallel order by for spark [Spark Branch]

2015-05-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14539299#comment-14539299 ] Rui Li commented on HIVE-10458: --- Hi [~xuefuz], I've looked at some of the failures. Most of

[jira] [Commented] (HIVE-10671) yarn-cluster mode offers a degraded performance from yarn-client [Spark Branch]

2015-05-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14539114#comment-14539114 ] Rui Li commented on HIVE-10671: --- Hi [~xuefuz], what's the data size is the user using? I

[jira] [Updated] (HIVE-10671) yarn-cluster mode offers a degraded performance from yarn-client [Spark Branch]

2015-05-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10671: -- Attachment: HIVE-10671.1-spark.patch I managed to reproduce this with other queries. The problem turned out to

[jira] [Updated] (HIVE-10458) Enable parallel order by for spark [Spark Branch]

2015-05-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10458: -- Attachment: HIVE-10458.2-spark.patch Have another run. Enable parallel order by for spark [Spark Branch]

[jira] [Updated] (HIVE-10671) yarn-cluster mode offers a degraded performance from yarn-client [Spark Branch]

2015-05-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10671: -- Attachment: HIVE-10671.2-spark.patch Address RB comments. I don't think the failures are related.

[jira] [Commented] (HIVE-10458) Enable parallel order by for spark [Spark Branch]

2015-05-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14550283#comment-14550283 ] Rui Li commented on HIVE-10458: --- Hi [~xuefuz], we won't do double sample for approach a1.

[jira] [Updated] (HIVE-10458) Enable parallel order by for spark [Spark Branch]

2015-05-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10458: -- Attachment: HIVE-10458.3-spark.patch I found that hive already has parallel order by on MR (HIVE-1402), which

[jira] [Updated] (HIVE-10903) Add hive.in.test for HoS tests [Spark Branch]

2015-06-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10903: -- Attachment: (was: HIVE-10903.1-spark.patch) Add hive.in.test for HoS tests [Spark Branch]

[jira] [Updated] (HIVE-10903) Add hive.in.test for HoS tests

2015-06-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10903: -- Summary: Add hive.in.test for HoS tests (was: Add hive.in.test for HoS tests [Spark Branch]) Add

[jira] [Updated] (HIVE-10903) Add hive.in.test for HoS tests [Spark Branch]

2015-06-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10903: -- Attachment: HIVE-10903.2.patch Verified that the changes to golden files are inline with the MR version. This

[jira] [Commented] (HIVE-10855) Make HIVE-10568 work with Spark [Spark Branch]

2015-06-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14578263#comment-14578263 ] Rui Li commented on HIVE-10855: --- Hi [~xuefuz], thanks for taking care of this. We need to

[jira] [Commented] (HIVE-10903) Add hive.in.test for HoS tests [Spark Branch]

2015-06-03 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14570690#comment-14570690 ] Rui Li commented on HIVE-10903: --- Just quickly checked the age 1 failures. All diff is in

[jira] [Updated] (HIVE-10903) Add hive.in.test for HoS tests [Spark Branch]

2015-06-03 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10903: -- Attachment: HIVE-10903.1-spark.patch Add hive.in.test for HoS tests [Spark Branch]

[jira] [Updated] (HIVE-10903) Add hive.in.test for HoS tests [Spark Branch]

2015-06-03 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10903: -- Description: Missing the property can make CBO fails to run during UT. There should be other effects that can

[jira] [Assigned] (HIVE-10903) Add hive.in.test for HoS tests [Spark Branch]

2015-06-03 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-10903: - Assignee: Rui Li Add hive.in.test for HoS tests [Spark Branch]

[jira] [Updated] (HIVE-10903) Add hive.in.test for HoS tests

2015-06-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10903: -- Component/s: Spark Add hive.in.test for HoS tests -- Key:

[jira] [Commented] (HIVE-10903) Add hive.in.test for HoS tests [Spark Branch]

2015-06-03 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14572006#comment-14572006 ] Rui Li commented on HIVE-10903: --- Both MR and tez have this flag in hive-site.xml for tests.

[jira] [Commented] (HIVE-10816) NPE in ExecDriver::handleSampling when submitted via child JVM

2015-06-03 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14571995#comment-14571995 ] Rui Li commented on HIVE-10816: --- Hi [~navis], would you mind take a look at this when you

[jira] [Updated] (HIVE-10903) Add hive.in.test for HoS tests

2015-06-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10903: -- Attachment: HIVE-10903.3.patch Update more outputs. Add hive.in.test for HoS tests

[jira] [Commented] (HIVE-10816) NPE in ExecDriver::handleSampling when submitted via child JVM

2015-06-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14579865#comment-14579865 ] Rui Li commented on HIVE-10816: --- Thanks [~leftylev] and [~xuefuz] for catching this. I

[jira] [Commented] (HIVE-10903) Add hive.in.test for HoS tests

2015-06-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14579846#comment-14579846 ] Rui Li commented on HIVE-10903: --- cc [~xuefuz] Add hive.in.test for HoS tests

[jira] [Updated] (HIVE-10816) NPE in ExecDriver::handleSampling when submitted via child JVM

2015-06-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10816: -- Fix Version/s: 2.0.0 NPE in ExecDriver::handleSampling when submitted via child JVM

[jira] [Assigned] (HIVE-11108) HashTableSinkOperator doesn't support vectorization [Spark Branch]

2015-06-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-11108: - Assignee: Rui Li HashTableSinkOperator doesn't support vectorization [Spark Branch]

[jira] [Updated] (HIVE-11109) Replication factor is not properly set in SparkHashTableSinkOperator [Spark Branch]

2015-06-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-11109: -- Component/s: Spark Replication factor is not properly set in SparkHashTableSinkOperator [Spark Branch]

[jira] [Updated] (HIVE-11109) Replication factor is not properly set in SparkHashTableSinkOperator [Spark Branch]

2015-06-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-11109: -- Attachment: HIVE-11109.1-spark.patch Replication factor is not properly set in SparkHashTableSinkOperator

[jira] [Commented] (HIVE-11032) Enable more tests for grouping by skewed data [Spark Branch]

2015-06-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14602367#comment-14602367 ] Rui Li commented on HIVE-11032: --- Hi [~mohitsabharwal], thanks for the work. For the newly

[jira] [Commented] (HIVE-11109) Replication factor is not properly set in SparkHashTableSinkOperator [Spark Branch]

2015-06-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14602237#comment-14602237 ] Rui Li commented on HIVE-11109: --- Thanks Jimmy for the review. Replication factor is not

[jira] [Commented] (HIVE-11032) Enable more tests for grouping by skewed data [Spark Branch]

2015-06-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14602241#comment-14602241 ] Rui Li commented on HIVE-11032: --- Seems the failed tests need deterministic order. Enable

[jira] [Commented] (HIVE-10999) Upgrade Spark dependency to 1.4 [Spark Branch]

2015-06-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14593118#comment-14593118 ] Rui Li commented on HIVE-10999: --- The failures seem to be related to different jersey

[jira] [Commented] (HIVE-10999) Upgrade Spark dependency to 1.4 [Spark Branch]

2015-06-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14593020#comment-14593020 ] Rui Li commented on HIVE-10999: --- When I tried the patch earlier the downloaded jar was still

[jira] [Updated] (HIVE-10999) Upgrade Spark dependency to 1.4 [Spark Branch]

2015-06-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10999: -- Attachment: HIVE-10999.2-spark.patch Can't reproduce the failures locally. Try again. Upgrade Spark

[jira] [Updated] (HIVE-10999) Upgrade Spark dependency to 1.4 [Spark Branch]

2015-06-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10999: -- Attachment: HIVE-10999.2-spark.patch Talked about this with Chengxiang. We think the reason is that spark

[jira] [Commented] (HIVE-7292) Hive on Spark

2015-06-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589152#comment-14589152 ] Rui Li commented on HIVE-7292: -- [~riomario] - Yes you can. You can follow this

[jira] [Commented] (HIVE-10999) Upgrade Spark dependency to 1.4 [Spark Branch]

2015-06-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14591115#comment-14591115 ] Rui Li commented on HIVE-10999: --- Hi [~xuefuz], seems the new tar is not valid? I download it

[jira] [Commented] (HIVE-10999) Upgrade Spark dependency to 1.4 [Spark Branch]

2015-06-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589289#comment-14589289 ] Rui Li commented on HIVE-10999: --- I think you can use the [release

[jira] [Commented] (HIVE-10999) Upgrade Spark dependency to 1.4 [Spark Branch]

2015-06-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589223#comment-14589223 ] Rui Li commented on HIVE-10999: --- Hi [~xuefuz], the problem seems to be incorrect naming of

[jira] [Updated] (HIVE-10855) Make HIVE-10568 work with Spark [Spark Branch]

2015-06-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10855: -- Attachment: HIVE-10855.2-spark.patch Update golden files. Make HIVE-10568 work with Spark [Spark Branch]

[jira] [Updated] (HIVE-10855) Make HIVE-10568 work with Spark [Spark Branch]

2015-06-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10855: -- Attachment: HIVE-10855.3-spark.patch Update more golden files. They didn't show up previously because of

[jira] [Updated] (HIVE-10855) Make HIVE-10568 work with Spark [Spark Branch]

2015-06-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10855: -- Attachment: (was: HIVE-10855.1-spark.patch) Make HIVE-10568 work with Spark [Spark Branch]

[jira] [Commented] (HIVE-10855) Make HIVE-10568 work with Spark [Spark Branch]

2015-06-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581586#comment-14581586 ] Rui Li commented on HIVE-10855: --- Latest failures are not related. Make HIVE-10568 work

[jira] [Commented] (HIVE-10989) HoS can't control number of map tasks for runtime skew join [Spark Branch]

2015-06-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14585436#comment-14585436 ] Rui Li commented on HIVE-10989: --- Hi [~xuefuz], these flags should only be set for the

[jira] [Updated] (HIVE-10989) HoS can't control number of map tasks for runtime skew join [Spark Branch]

2015-06-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10989: -- Summary: HoS can't control number of map tasks for runtime skew join [Spark Branch] (was: Spark can't control

[jira] [Updated] (HIVE-10989) HoS can't control number of map tasks for runtime skew join [Spark Branch]

2015-06-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10989: -- Attachment: HIVE-10989.1-spark.patch The flags were properly set in the MapWork. We just need to create the RDD

[jira] [Commented] (HIVE-10989) HoS can't control number of map tasks for runtime skew join [Spark Branch]

2015-06-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14585418#comment-14585418 ] Rui Li commented on HIVE-10989: --- Failed tests are not related. HoS can't control number of

[jira] [Updated] (HIVE-10816) NPE in ExecDriver::handleSampling when submitted via child JVM

2015-06-02 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10816: -- Attachment: HIVE-10816.1.patch Attach same patch to have another run NPE in ExecDriver::handleSampling when

[jira] [Commented] (HIVE-10855) Make HIVE-10568 work with Spark [Spark Branch]

2015-05-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14564407#comment-14564407 ] Rui Li commented on HIVE-10855: --- Hi [~xuefuz], to make this work, we need first merge

[jira] [Commented] (HIVE-10855) Make HIVE-10568 work with Spark [Spark Branch]

2015-05-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14564410#comment-14564410 ] Rui Li commented on HIVE-10855: --- Never mind I just saw your merge :) Make HIVE-10568 work

[jira] [Commented] (HIVE-8043) Support merging small files [Spark Branch]

2015-05-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14564569#comment-14564569 ] Rui Li commented on HIVE-8043: -- [~leftylev] - I think that's already handled in HIVE-7810

[jira] [Updated] (HIVE-10855) Make HIVE-10568 work with Spark [Spark Branch]

2015-05-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10855: -- Attachment: HIVE-10855.1-spark.patch Should work out of box. Let the tests run. Make HIVE-10568 work with

[jira] [Commented] (HIVE-11108) HashTableSinkOperator doesn't support vectorization [Spark Branch]

2015-06-30 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14607936#comment-14607936 ] Rui Li commented on HIVE-11108: --- The failures are not related. HashTableSinkOperator

[jira] [Updated] (HIVE-11108) HashTableSinkOperator doesn't support vectorization [Spark Branch]

2015-06-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-11108: -- Attachment: HIVE-11108.1-spark.patch The patch enables vectorization for SparkHashTableSinkOperator. Did some

[jira] [Updated] (HIVE-11182) Enable optimized hash tables for spark [Spark Branch]

2015-07-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-11182: -- Attachment: HIVE-11182.1-spark.patch The optimized table is not a {{MapJoinPersistableTableContainer}}. So in

[jira] [Updated] (HIVE-10816) NPE in ExecDriver::handleSampling when submitted via child JVM

2015-05-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-10816: -- Attachment: HIVE-10816.1.patch NPE in ExecDriver::handleSampling when submitted via child JVM

[jira] [Commented] (HIVE-10816) NPE in ExecDriver::handleSampling when submitted via child JVM

2015-05-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14558609#comment-14558609 ] Rui Li commented on HIVE-10816: --- [~xuefuz] - Yeah, when submitted via child, the ExecDriver

[jira] [Commented] (HIVE-10816) NPE in ExecDriver::handleSampling when submitted via child JVM

2015-05-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14558630#comment-14558630 ] Rui Li commented on HIVE-10816: --- The 4 tests fail on master as well. Don't suppose they're

[jira] [Resolved] (HIVE-11183) Enable optimized hash tables for spark [Spark Branch]

2015-07-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li resolved HIVE-11183. --- Resolution: Duplicate Enable optimized hash tables for spark [Spark Branch]

[jira] [Updated] (HIVE-11182) Enable optimized hash tables for spark [Spark Branch]

2015-07-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-11182: -- Component/s: Spark Enable optimized hash tables for spark [Spark Branch]

[jira] [Commented] (HIVE-11138) Query fails when there isn't a comparator for an operator [Spark Branch]

2015-06-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-11138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605216#comment-14605216 ] Rui Li commented on HIVE-11138: --- cc [~chengxiang li], [~xuefuz] Query fails when there

  1   2   3   4   5   6   7   8   9   10   >