[jira] [Updated] (HIVE-8852) Update new spark progress API for local submitted job monitoring[Spark Branch]

2014-11-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8852: - Attachment: HIVE-8852.2-spark.patch Seems we're still picking up the old spark jar. Add clear library cache and

[jira] [Updated] (HIVE-8852) Update new spark progress API for local submitted job monitoring[Spark Branch]

2014-11-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8852: - Attachment: HIVE-8852.1-spark.patch Update new spark progress API for local submitted job monitoring[Spark

[jira] [Updated] (HIVE-8852) Update new spark progress API for local submitted job monitoring[Spark Branch]

2014-11-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8852: - Attachment: HIVE-8852.2-spark.patch Address some RB comments. Thanks [~chengxiang li] for the review! Update new

[jira] [Updated] (HIVE-8841) Make RDD caching work for multi-insert after HIVE-8793 when map join is involved [Spark Branch]

2014-11-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8841: - Attachment: HIVE-8841.1-spark.patch I think this patch shall serve the purpose. [~xuefuz] - do you know how I can

[jira] [Updated] (HIVE-8841) Make RDD caching work for multi-insert after HIVE-8793 when map join is involved [Spark Branch]

2014-11-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8841: - Status: Patch Available (was: Open) Make RDD caching work for multi-insert after HIVE-8793 when map join is

[jira] [Commented] (HIVE-8841) Make RDD caching work for multi-insert after HIVE-8793 when map join is involved [Spark Branch]

2014-11-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14211680#comment-14211680 ] Rui Li commented on HIVE-8841: -- I see, thanks [~xuefuz] for clarifying. Make RDD caching

[jira] [Commented] (HIVE-8793) Make sure multi-insert works with map join [Spark Branch]

2014-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207790#comment-14207790 ] Rui Li commented on HIVE-8793: -- Hi [~xuefuz], The failed tests are because I changed how

[jira] [Updated] (HIVE-8793) Make sure multi-insert works with map join [Spark Branch]

2014-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8793: - Attachment: HIVE-8793.2-spark.patch Revert changes to ExplainTask. Make sure multi-insert works with map join

[jira] [Commented] (HIVE-8793) Make sure multi-insert works with map join [Spark Branch]

2014-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14207983#comment-14207983 ] Rui Li commented on HIVE-8793: -- Now the result seems better. Another thing I want to clarify

[jira] [Commented] (HIVE-8536) Enable runtime skew join optimization for spark [Spark Branch]

2014-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209083#comment-14209083 ] Rui Li commented on HIVE-8536: -- Runtime skew join should be processed by two resolvers:

[jira] [Commented] (HIVE-8793) Refactor to make splitting SparkWork a physical resolver [Spark Branch]

2014-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209056#comment-14209056 ] Rui Li commented on HIVE-8793: -- Thank you [~xuefuz] for the review! Refactor to make

[jira] [Commented] (HIVE-8840) Print prettier Spark work graph after HIVE-8793 [Spark Branch]

2014-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209061#comment-14209061 ] Rui Li commented on HIVE-8840: -- I think two things need to be done to achieve this purpose: 1.

[jira] [Assigned] (HIVE-8841) Make RDD caching work for multi-insert after HIVE-8793 when map join is involved [Spark Branch]

2014-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-8841: Assignee: Rui Li Make RDD caching work for multi-insert after HIVE-8793 when map join is involved [Spark

[jira] [Commented] (HIVE-7333) Create RDD translator, translating Hive Tables into Spark RDDs [Spark Branch]

2014-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209297#comment-14209297 ] Rui Li commented on HIVE-7333: -- Hi [~klonikar], Thanks for your interest. When I did the

[jira] [Assigned] (HIVE-8852) Update new spark progress API for local submitted job monitoring[Spark Branch]

2014-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-8852: Assignee: Rui Li Update new spark progress API for local submitted job monitoring[Spark Branch]

[jira] [Commented] (HIVE-7333) Create RDD translator, translating Hive Tables into Spark RDDs [Spark Branch]

2014-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14209330#comment-14209330 ] Rui Li commented on HIVE-7333: -- Hi Xuefu Reynold, Thanks a lot for the explanations! Hi

[jira] [Updated] (HIVE-8793) Make sure multi-insert works with map join [Spark Branch]

2014-11-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8793: - Attachment: HIVE-8793.1-spark.patch Refactor to make split spark work as a physical resolver. Make sure

[jira] [Updated] (HIVE-8793) Make sure multi-insert works with map join [Spark Branch]

2014-11-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8793: - Status: Patch Available (was: Open) Make sure multi-insert works with map join [Spark Branch]

[jira] [Updated] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8542: - Attachment: HIVE-8542.5-spark.patch Update patch according to RB comments. Golden file for {{optimize_nullscan.q}}

[jira] [Updated] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8542: - Attachment: HIVE-8542.6-spark.patch Cannot reproduce the failed tests on my side. Upload the same patch to run the

[jira] [Commented] (HIVE-8793) Make sure multi-insert works with map join [Spark Branch]

2014-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14204700#comment-14204700 ] Rui Li commented on HIVE-8793: -- Hi [~xuefuz], do you mean the job here is just to make

[jira] [Commented] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14205838#comment-14205838 ] Rui Li commented on HIVE-8542: -- I still can't reproduce the failed tests. [~xuefuz] - do you

[jira] [Assigned] (HIVE-8793) Make sure multi-insert works with map join [Spark Branch]

2014-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-8793: Assignee: Rui Li Make sure multi-insert works with map join [Spark Branch]

[jira] [Commented] (HIVE-8793) Make sure multi-insert works with map join [Spark Branch]

2014-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14205846#comment-14205846 ] Rui Li commented on HIVE-8793: -- Thanks [~xuefuz], I'll give it a try. Make sure multi-insert

[jira] [Commented] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14205872#comment-14205872 ] Rui Li commented on HIVE-8542: -- [~xuefuz] - thanks for taking care of this. But did you forget

[jira] [Commented] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14205966#comment-14205966 ] Rui Li commented on HIVE-8542: -- Thank you [~xuefuz]! What's your opinion about

[jira] [Commented] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14206001#comment-14206001 ] Rui Li commented on HIVE-8542: -- [~xuefuz] - I tried SORT_QUERY_RESULTS but it doesn't help.

[jira] [Commented] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14206027#comment-14206027 ] Rui Li commented on HIVE-8542: -- Thanks [~xuefuz] for the review! Enable groupby_map_ppr.q

[jira] [Commented] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14204230#comment-14204230 ] Rui Li commented on HIVE-8542: -- Hi [~xuefuz], yeah I'll do that once I figure out the failed

[jira] [Updated] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8542: - Attachment: HIVE-8542.4-spark.patch Rebase the patch to let the test run again. I cannot reproduce

[jira] [Commented] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14204254#comment-14204254 ] Rui Li commented on HIVE-8542: -- Hi [~xuefuz], I added the RB entry. One thing I'm not quite

[jira] [Commented] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14204255#comment-14204255 ] Rui Li commented on HIVE-8542: -- Hi [~csun], thanks for the clarifications! Enable

[jira] [Updated] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8542: - Attachment: HIVE-8542.2-spark.patch Fix some bugs and update golden files. Most of the changes made to the golden

[jira] [Updated] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8542: - Attachment: HIVE-8542.3-spark.patch Rebase patch. Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q

[jira] [Updated] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8542: - Attachment: HIVE-8542.1-spark.patch Submit a patch to let the tests run. I expect many to fail. Enable

[jira] [Updated] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8542: - Issue Type: Bug (was: Test) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

[jira] [Updated] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8542: - Status: Patch Available (was: Open) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

[jira] [Commented] (HIVE-8073) Go thru all operator plan optimizations and disable those that are not suitable for Spark [Spark Branch]

2014-11-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14200023#comment-14200023 ] Rui Li commented on HIVE-8073: -- Hi [~xuefuz], I've investigated all the optimizations in

[jira] [Assigned] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-8542: Assignee: Rui Li Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

[jira] [Commented] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14200162#comment-14200162 ] Rui Li commented on HIVE-8542: -- Hi [~csun], let me take this one. As it seems to be a bug in

[jira] [Commented] (HIVE-8542) Enable groupby_map_ppr.q and groupby_map_ppr_multi_distinct.q [Spark Branch]

2014-11-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201691#comment-14201691 ] Rui Li commented on HIVE-8542: -- I think the problem is that the sorting keys and partition

[jira] [Assigned] (HIVE-8073) Go thru all operator plan optimizations and disable those that are not suitable for Spark [Spark Branch]

2014-11-02 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-8073: Assignee: Rui Li Go thru all operator plan optimizations and disable those that are not suitable for

[jira] [Updated] (HIVE-8610) Compile time skew join optimization doesn't work with auto map join

2014-10-30 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8610: - Attachment: HIVE-8610.3.patch Update patch according to RB comments. Compile time skew join optimization doesn't

[jira] [Updated] (HIVE-8610) Compile time skew join optimization doesn't work with auto map join

2014-10-30 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8610: - Attachment: HIVE-8610.4.patch Update patch. Thanks [~xuefuz] for review. Compile time skew join optimization

[jira] [Updated] (HIVE-8535) Enable compile time skew join optimization for spark [Spark Branch]

2014-10-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8535: - Attachment: HIVE-8535.5-spark.patch Hi [~xuefuz], I didn't see any conflicts rebasing the patch. The new patch

[jira] [Commented] (HIVE-8616) convert joinOp to MapJoinOp and generate MapWorks only [Spark Branch]

2014-10-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14188117#comment-14188117 ] Rui Li commented on HIVE-8616: -- Hi [~ssatish], forgive my ignorance, but why do we need two

[jira] [Commented] (HIVE-8616) convert joinOp to MapJoinOp and generate MapWorks only [Spark Branch]

2014-10-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14188120#comment-14188120 ] Rui Li commented on HIVE-8616: -- Oh, sorry for my last comment. I checked the task plan

[jira] [Updated] (HIVE-8610) Compile time skew join optimization doesn't work with auto map join

2014-10-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8610: - Status: Patch Available (was: Open) Compile time skew join optimization doesn't work with auto map join

[jira] [Updated] (HIVE-8610) Compile time skew join optimization doesn't work with auto map join

2014-10-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8610: - Attachment: HIVE-8610.1.patch This patch adds QBJoinTree and colExprMap for the cloned join operator tree in

[jira] [Commented] (HIVE-8610) Compile time skew join optimization doesn't work with auto map join

2014-10-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14186853#comment-14186853 ] Rui Li commented on HIVE-8610: -- Hi [~xuefuz], do I have to add something to

[jira] [Commented] (HIVE-8535) Enable compile time skew join optimization for spark [Spark Branch]

2014-10-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14187884#comment-14187884 ] Rui Li commented on HIVE-8535: -- Hi [~xuefuz], the failed test vectorization_13 doesn't seem to

[jira] [Updated] (HIVE-8610) Compile time skew join optimization doesn't work with auto map join

2014-10-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8610: - Attachment: HIVE-8610.2.patch Thanks [~xuefuz] for the review. The failed test is because I didn't sync with

[jira] [Commented] (HIVE-8535) Enable compile time skew join optimization for spark [Spark Branch]

2014-10-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14188013#comment-14188013 ] Rui Li commented on HIVE-8535: -- [~xuefuz] - no problem. I'll do the rebase now. Enable

[jira] [Assigned] (HIVE-8610) Compile time skew join optimization doesn't work with auto map join

2014-10-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-8610: Assignee: Rui Li Compile time skew join optimization doesn't work with auto map join

[jira] [Commented] (HIVE-8535) Enable compile time skew join optimization for spark [Spark Branch]

2014-10-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14186216#comment-14186216 ] Rui Li commented on HIVE-8535: -- [~xuefuz] - Sorry I forgot to update this one. I'll rebase the

[jira] [Updated] (HIVE-8535) Enable compile time skew join optimization for spark [Spark Branch]

2014-10-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8535: - Attachment: HIVE-8535.4-spark.patch Rebase patch. Enable compile time skew join optimization for spark [Spark

[jira] [Commented] (HIVE-8602) Add SORT_QUERY_RESULTS for skewjoinopt2

2014-10-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14184752#comment-14184752 ] Rui Li commented on HIVE-8602: -- Thank you [~xuefuz] for the review. Could you please also

[jira] [Created] (HIVE-8610) Compile time skew join optimization doesn't work with auto map join

2014-10-26 Thread Rui Li (JIRA)
Rui Li created HIVE-8610: Summary: Compile time skew join optimization doesn't work with auto map join Key: HIVE-8610 URL: https://issues.apache.org/jira/browse/HIVE-8610 Project: Hive Issue Type:

[jira] [Commented] (HIVE-8535) Enable compile time skew join optimization for spark [Spark Branch]

2014-10-24 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182755#comment-14182755 ] Rui Li commented on HIVE-8535: -- The failed test is because I added SORT_QUERY_RESULTS label to

[jira] [Updated] (HIVE-8406) Research on skewed join [Spark Branch]

2014-10-24 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8406: - Attachment: Skew join background.pdf Upload the doc so it may help people get a better understanding of how skew join is

[jira] [Created] (HIVE-8602) Add SORT_QUERY_RESULTS for skewjoinopt2

2014-10-24 Thread Rui Li (JIRA)
Rui Li created HIVE-8602: Summary: Add SORT_QUERY_RESULTS for skewjoinopt2 Key: HIVE-8602 URL: https://issues.apache.org/jira/browse/HIVE-8602 Project: Hive Issue Type: Test Components:

[jira] [Updated] (HIVE-8602) Add SORT_QUERY_RESULTS for skewjoinopt2

2014-10-24 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8602: - Status: Patch Available (was: Open) Add SORT_QUERY_RESULTS for skewjoinopt2

[jira] [Updated] (HIVE-8602) Add SORT_QUERY_RESULTS for skewjoinopt2

2014-10-24 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8602: - Attachment: HIVE-8602.1.patch Add SORT_QUERY_RESULTS for skewjoinopt2 ---

[jira] [Commented] (HIVE-8602) Add SORT_QUERY_RESULTS for skewjoinopt2

2014-10-24 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14183854#comment-14183854 ] Rui Li commented on HIVE-8602: -- cc [~xuefuz] Add SORT_QUERY_RESULTS for skewjoinopt2

[jira] [Updated] (HIVE-8535) Enable compile time skew join optimization for spark [Spark Branch]

2014-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8535: - Attachment: HIVE-8535.1-spark.patch Since compile time optimization runs in logical layer and is quite independent

[jira] [Updated] (HIVE-8535) Enable compile time skew join optimization for spark [Spark Branch]

2014-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8535: - Status: Patch Available (was: Open) Enable compile time skew join optimization for spark [Spark Branch]

[jira] [Updated] (HIVE-8535) Enable compile time skew join optimization for spark [Spark Branch]

2014-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8535: - Attachment: HIVE-8535.2-spark.patch Enable more tests. Enable compile time skew join optimization for spark

[jira] [Commented] (HIVE-8528) Add remote Spark client to Hive [Spark Branch]

2014-10-22 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179644#comment-14179644 ] Rui Li commented on HIVE-8528: -- Forgive my ignorance, just some high level questions: * Why

[jira] [Commented] (HIVE-8528) Add remote Spark client to Hive [Spark Branch]

2014-10-22 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180919#comment-14180919 ] Rui Li commented on HIVE-8528: -- [~xuefuz] - thanks for pointing me to the detailed info. I

[jira] [Commented] (HIVE-8528) Add remote Spark client to Hive [Spark Branch]

2014-10-22 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180931#comment-14180931 ] Rui Li commented on HIVE-8528: -- Yep, I see. Thanks for explaining :-) Add remote Spark

[jira] [Created] (HIVE-8537) Update to use the stable TaskContext API [Spark Branch]

2014-10-21 Thread Rui Li (JIRA)
Rui Li created HIVE-8537: Summary: Update to use the stable TaskContext API [Spark Branch] Key: HIVE-8537 URL: https://issues.apache.org/jira/browse/HIVE-8537 Project: Hive Issue Type: Bug

[jira] [Updated] (HIVE-8537) Update to use the stable TaskContext API [Spark Branch]

2014-10-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8537: - Issue Type: Task (was: Bug) Update to use the stable TaskContext API [Spark Branch]

[jira] [Assigned] (HIVE-8537) Update to use the stable TaskContext API [Spark Branch]

2014-10-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-8537: Assignee: Rui Li Update to use the stable TaskContext API [Spark Branch]

[jira] [Updated] (HIVE-8537) Update to use the stable TaskContext API [Spark Branch]

2014-10-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8537: - Attachment: HIVE-8537-spark.patch Update to use the stable TaskContext API [Spark Branch]

[jira] [Updated] (HIVE-8537) Update to use the stable TaskContext API [Spark Branch]

2014-10-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8537: - Status: Patch Available (was: Open) Update to use the stable TaskContext API [Spark Branch]

[jira] [Commented] (HIVE-8537) Update to use the stable TaskContext API [Spark Branch]

2014-10-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14178157#comment-14178157 ] Rui Li commented on HIVE-8537: -- [~chengxiang li] - I created this because it breaks the build

[jira] [Assigned] (HIVE-8537) Update to use the stable TaskContext API [Spark Branch]

2014-10-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-8537: Assignee: Chengxiang Li (was: Rui Li) Assign this to you [~chengxiang li], as it's related to HIVE-8520.

[jira] [Commented] (HIVE-8518) Compile time skew join optimization returns duplicated results

2014-10-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179446#comment-14179446 ] Rui Li commented on HIVE-8518: -- Thank you [~xuefuz] for the review! Could you also merge this

[jira] [Commented] (HIVE-8518) Compile time skew join optimization returns duplicated results

2014-10-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179483#comment-14179483 ] Rui Li commented on HIVE-8518: -- OK got it. Compile time skew join optimization returns

[jira] [Commented] (HIVE-8518) Compile time skew join optimization returns duplicated results

2014-10-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179517#comment-14179517 ] Rui Li commented on HIVE-8518: -- Thanks [~xuefuz] and [~brocknoland]! Everything seems fine

[jira] [Created] (HIVE-8518) Compile time skew join optimization returns duplicated results

2014-10-20 Thread Rui Li (JIRA)
Rui Li created HIVE-8518: Summary: Compile time skew join optimization returns duplicated results Key: HIVE-8518 URL: https://issues.apache.org/jira/browse/HIVE-8518 Project: Hive Issue Type: Bug

[jira] [Assigned] (HIVE-8518) Compile time skew join optimization returns duplicated results

2014-10-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-8518: Assignee: Rui Li Compile time skew join optimization returns duplicated results

[jira] [Updated] (HIVE-8518) Compile time skew join optimization returns duplicated results

2014-10-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8518: - Attachment: HIVE-8518.1.patch Compile time skew join optimization returns duplicated results

[jira] [Updated] (HIVE-8518) Compile time skew join optimization returns duplicated results

2014-10-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-8518: - Status: Patch Available (was: Open) Compile time skew join optimization returns duplicated results

[jira] [Commented] (HIVE-8518) Compile time skew join optimization returns duplicated results

2014-10-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14176755#comment-14176755 ] Rui Li commented on HIVE-8518: -- cc [~xuefuz] Compile time skew join optimization returns

[jira] [Commented] (HIVE-8518) Compile time skew join optimization returns duplicated results

2014-10-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14177807#comment-14177807 ] Rui Li commented on HIVE-8518: -- Hi [~xuefuz], I find there're already several unit tests for

[jira] [Created] (HIVE-8535) Enable compile time skew join optimization for spark [Spark Branch]

2014-10-20 Thread Rui Li (JIRA)
Rui Li created HIVE-8535: Summary: Enable compile time skew join optimization for spark [Spark Branch] Key: HIVE-8535 URL: https://issues.apache.org/jira/browse/HIVE-8535 Project: Hive Issue Type:

[jira] [Created] (HIVE-8536) Enable runtime skew join optimization for spark [Spark Branch]

2014-10-20 Thread Rui Li (JIRA)
Rui Li created HIVE-8536: Summary: Enable runtime skew join optimization for spark [Spark Branch] Key: HIVE-8536 URL: https://issues.apache.org/jira/browse/HIVE-8536 Project: Hive Issue Type:

[jira] [Commented] (HIVE-8406) Research on skewed join [Spark Branch]

2014-10-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14177960#comment-14177960 ] Rui Li commented on HIVE-8406: -- Created and linked two sub tasks for compile-time and runtime

[jira] [Resolved] (HIVE-7893) Find a way to get a job identifier when submitting a spark job [Spark Branch]

2014-10-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li resolved HIVE-7893. -- Resolution: Fixed Fixed via HIVE-7439 Find a way to get a job identifier when submitting a spark job [Spark

[jira] [Commented] (HIVE-8456) Support Hive Counter to collect spark job metric[Spark Branch]

2014-10-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14173414#comment-14173414 ] Rui Li commented on HIVE-8456: -- [~chengxiang li] - thanks for the explanation! I agree we

[jira] [Commented] (HIVE-8456) Support Hive Counter to collect spark job metric[Spark Branch]

2014-10-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14173443#comment-14173443 ] Rui Li commented on HIVE-8456: -- I see. That makes sense. +1 The patch looks good to me.

[jira] [Commented] (HIVE-8406) Research on skewed join [Spark Branch]

2014-10-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14173449#comment-14173449 ] Rui Li commented on HIVE-8406: -- Skew join optimization depends on map join. Research on

[jira] [Commented] (HIVE-8456) Support Hive Counter to collect spark job metric[Spark Branch]

2014-10-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14173247#comment-14173247 ] Rui Li commented on HIVE-8456: -- I'm not familiar how the counter/accumulator works. Just a few

[jira] [Commented] (HIVE-7467) When querying HBase table, task fails with exception: java.lang.IllegalAccessError: com/google/protobuf/HBaseZeroCopyByteString

2014-10-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14171844#comment-14171844 ] Rui Li commented on HIVE-7467: -- [~jxiang] - thanks for the update! I know there's workarounds

[jira] [Commented] (HIVE-7893) Find a way to get a job identifier when submitting a spark job [Spark Branch]

2014-10-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14170353#comment-14170353 ] Rui Li commented on HIVE-7893: -- Thanks [~joshrosen]. Yes I've seen your PR and really

[jira] [Commented] (HIVE-7439) Spark job monitoring and error reporting [Spark Branch]

2014-10-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14170387#comment-14170387 ] Rui Li commented on HIVE-7439: -- The async APIs are stabilized in SPARK-3902. Spark job

[jira] [Commented] (HIVE-7439) Spark job monitoring and error reporting [Spark Branch]

2014-10-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14170417#comment-14170417 ] Rui Li commented on HIVE-7439: -- +1 patch looks good to me. Only a minor point: can we print

[jira] [Commented] (HIVE-7439) Spark job monitoring and error reporting [Spark Branch]

2014-10-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14170426#comment-14170426 ] Rui Li commented on HIVE-7439: -- [~brocknoland] - Yep, that'll be fine. Spark job monitoring

[jira] [Assigned] (HIVE-8406) Research on skewed join [Spark Branch]

2014-10-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-8406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-8406: Assignee: Rui Li Research on skewed join [Spark Branch] --
