Re: Review Request 25495: HIVE-7776, enable sample10.q

2014-10-10 Thread chengxiang li

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/25495/
---

(Updated 十月 10, 2014, 9:05 a.m.)


Review request for hive, Brock Noland, Szehon Ho, and Xuefu Zhang.


Changes
---

update sample10.q.output for fixed partition file name.


Bugs: HIVE-7776
https://issues.apache.org/jira/browse/HIVE-7776


Repository: hive-git


Description
---

Hive get task Id through 2 ways in Utilities::getTaskId:
get parameter value of mapred.task.id from configuration.
generate random value while #1 return null.
set mapred.task.id on executor side as we can build it through TaskContext now.


Diffs (updated)
-

  itests/src/test/resources/testconfiguration.properties 468a441 
  ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HiveMapFunction.java 6d1e57b 
  ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HivePairFlatMapFunction.java 
PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HiveReduceFunction.java 
94a6141 
  ql/src/test/results/clientpositive/spark/sample10.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/25495/diff/


Testing
---


Thanks,

chengxiang li



Re: Review Request 25495: HIVE-7776, enable sample10.q

2014-10-10 Thread Xuefu Zhang

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/25495/#review56189
---



ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HiveMapFunction.java
https://reviews.apache.org/r/25495/#comment96513

Can we combine this line with the previous one?



ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HiveReduceFunction.java
https://reviews.apache.org/r/25495/#comment96517

Same as above.


- Xuefu Zhang


On Oct. 10, 2014, 9:05 a.m., chengxiang li wrote:
 
 ---
 This is an automatically generated e-mail. To reply, visit:
 https://reviews.apache.org/r/25495/
 ---
 
 (Updated Oct. 10, 2014, 9:05 a.m.)
 
 
 Review request for hive, Brock Noland, Szehon Ho, and Xuefu Zhang.
 
 
 Bugs: HIVE-7776
 https://issues.apache.org/jira/browse/HIVE-7776
 
 
 Repository: hive-git
 
 
 Description
 ---
 
 Hive get task Id through 2 ways in Utilities::getTaskId:
 get parameter value of mapred.task.id from configuration.
 generate random value while #1 return null.
 set mapred.task.id on executor side as we can build it through TaskContext 
 now.
 
 
 Diffs
 -
 
   itests/src/test/resources/testconfiguration.properties 468a441 
   ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HiveMapFunction.java 
 6d1e57b 
   
 ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HivePairFlatMapFunction.java 
 PRE-CREATION 
   ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HiveReduceFunction.java 
 94a6141 
   ql/src/test/results/clientpositive/spark/sample10.q.out PRE-CREATION 
 
 Diff: https://reviews.apache.org/r/25495/diff/
 
 
 Testing
 ---
 
 
 Thanks,
 
 chengxiang li
 




Re: Review Request 25495: HIVE-7776, enable sample10.q

2014-09-29 Thread chengxiang li

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/25495/
---

(Updated 九月 29, 2014, 9:10 a.m.)


Review request for hive, Brock Noland, Szehon Ho, and Xuefu Zhang.


Bugs: HIVE-7776
https://issues.apache.org/jira/browse/HIVE-7776


Repository: hive-git


Description (updated)
---

Hive get task Id through 2 ways in Utilities::getTaskId:
get parameter value of mapred.task.id from configuration.
generate random value while #1 return null.
set mapred.task.id on executor side as we can build it through TaskContext now.


Diffs
-

  itests/src/test/resources/testconfiguration.properties 155abad 
  ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java 3ff0782 
  ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java 02f9d99 
  ql/src/test/results/clientpositive/spark/sample10.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/25495/diff/


Testing
---


Thanks,

chengxiang li



Re: Review Request 25495: HIVE-7776, enable sample10.q

2014-09-29 Thread chengxiang li

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/25495/
---

(Updated 九月 29, 2014, 9:11 a.m.)


Review request for hive, Brock Noland, Szehon Ho, and Xuefu Zhang.


Bugs: HIVE-7776
https://issues.apache.org/jira/browse/HIVE-7776


Repository: hive-git


Description
---

Hive get task Id through 2 ways in Utilities::getTaskId:
get parameter value of mapred.task.id from configuration.
generate random value while #1 return null.
set mapred.task.id on executor side as we can build it through TaskContext now.


Diffs (updated)
-

  itests/src/test/resources/testconfiguration.properties 89243fc 
  ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HiveMapFunction.java 1674d4b 
  ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HivePairFlatMapFunction.java 
PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HiveReduceFunction.java 
0b8b7c9 
  ql/src/test/results/clientpositive/spark/sample10.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/25495/diff/


Testing
---


Thanks,

chengxiang li



Re: Review Request 25495: HIVE-7776, enable sample10.q

2014-09-29 Thread chengxiang li

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/25495/
---

(Updated 九月 29, 2014, 9:13 a.m.)


Review request for hive, Brock Noland, Szehon Ho, and Xuefu Zhang.


Bugs: HIVE-7776
https://issues.apache.org/jira/browse/HIVE-7776


Repository: hive-git


Description
---

Hive get task Id through 2 ways in Utilities::getTaskId:
get parameter value of mapred.task.id from configuration.
generate random value while #1 return null.
set mapred.task.id on executor side as we can build it through TaskContext now.


Diffs (updated)
-

  itests/src/test/resources/testconfiguration.properties 89243fc 
  ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HiveMapFunction.java 1674d4b 
  ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HivePairFlatMapFunction.java 
PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/exec/spark/HiveReduceFunction.java 
0b8b7c9 
  ql/src/test/results/clientpositive/spark/sample10.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/25495/diff/


Testing
---


Thanks,

chengxiang li



Review Request 25495: HIVE-7776, enable sample10.q

2014-09-10 Thread chengxiang li

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/25495/
---

Review request for hive, Brock Noland and Xuefu Zhang.


Bugs: HIVE-7776
https://issues.apache.org/jira/browse/HIVE-7776


Repository: hive-git


Description
---

Hive get task Id through 2 ways in Utilities::getTaskId:
get parameter value of mapred.task.id from configuration.
generate random value while #1 return null.
Currently, Hive on Spark can't get parameter value of mapred.task.id from 
configuration.
FileSinkOperator use taskid to distinct different bucket file name, 
FileSinkOperator should take taskid as field variable and initiate it only once 
since one FileSinkOperator instance only refered in one task. but 
FileSinkOperator call Utilities::getTaskId to get new taskId each time, for 
this issue, it would cause more bucket files than bucket number, which lead to 
unexpected result of tablesample queries.


Diffs
-

  itests/src/test/resources/testconfiguration.properties 155abad 
  ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java 3ff0782 
  ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java 02f9d99 
  ql/src/test/results/clientpositive/spark/sample10.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/25495/diff/


Testing
---


Thanks,

chengxiang li