[jira] [Created] (HIVE-7428) OrcSplit fails to account for columnar projections in its size estimates

2014-07-16 Thread Gopal V (JIRA)
Gopal V created HIVE-7428: - Summary: OrcSplit fails to account for columnar projections in its size estimates Key: HIVE-7428 URL: https://issues.apache.org/jira/browse/HIVE-7428 Project: Hive Issue

[jira] [Created] (HIVE-7417) select count(1) from ... where true; fails in optimizer

2014-07-15 Thread Gopal V (JIRA)
Gopal V created HIVE-7417: - Summary: select count(1) from ... where true; fails in optimizer Key: HIVE-7417 URL: https://issues.apache.org/jira/browse/HIVE-7417 Project: Hive Issue Type: Bug Affe

[jira] [Commented] (HIVE-7400) count and count distinct not correct

2014-07-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14061672#comment-14061672 ] Gopal V commented on HIVE-7400: --- [~darranl]: I am using hive-14 which is the only branch in d

[jira] [Commented] (HIVE-7400) count and count distinct not correct

2014-07-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14061662#comment-14061662 ] Gopal V commented on HIVE-7400: --- With Tez enabled, I cannot reproduce this {code} Status: Fi

[jira] [Commented] (HIVE-7400) count and count distinct not correct

2014-07-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14061654#comment-14061654 ] Gopal V commented on HIVE-7400: --- Never mind, laggy JIRA updates. Saw the file now. > count a

[jira] [Commented] (HIVE-7400) count and count distinct not correct

2014-07-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14061651#comment-14061651 ] Gopal V commented on HIVE-7400: --- Where is the data-set? Can you attach it to the JIRA? > c

[jira] [Created] (HIVE-7402) add `approx_distinct` & composable nDV UDAFs

2014-07-14 Thread Gopal V (JIRA)
Gopal V created HIVE-7402: - Summary: add `approx_distinct` & composable nDV UDAFs Key: HIVE-7402 URL: https://issues.apache.org/jira/browse/HIVE-7402 Project: Hive Issue Type: New Feature

[jira] [Updated] (HIVE-7394) ORC writer logging fails when the padding is < 0.01

2014-07-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7394: -- Resolution: Fixed Fix Version/s: 0.14.0 Status: Resolved (was: Patch Available) > ORC writer

[jira] [Commented] (HIVE-7394) ORC writer logging fails when the padding is < 0.01

2014-07-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14060289#comment-14060289 ] Gopal V commented on HIVE-7394: --- Committed to trunk, thanks! > ORC writer logging fails when

[jira] [Commented] (HIVE-7394) ORC writer logging fails when the padding is < 0.01

2014-07-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14060288#comment-14060288 ] Gopal V commented on HIVE-7394: --- Test failures unrelated. > ORC writer logging fails when th

[jira] [Updated] (HIVE-7397) Set the default threshold for fetch task conversion to 1Gb

2014-07-12 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7397: -- Attachment: HIVE-7397.3.patch Fix test failures > Set the default threshold for fetch task conversion to 1Gb >

[jira] [Commented] (HIVE-7397) Set the default threshold for fetch task conversion to 1Gb

2014-07-12 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14059690#comment-14059690 ] Gopal V commented on HIVE-7397: --- [~navis]: changed the defaults and disabled it for tests. A

[jira] [Updated] (HIVE-7397) Set the default threshold for fetch task conversion to 1Gb

2014-07-12 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7397: -- Attachment: HIVE-7397.2.patch Handle the partition filter only + limit case. > Set the default threshold for fe

[jira] [Commented] (HIVE-7397) Set the default threshold for fetch task conversion to 1Gb

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14059678#comment-14059678 ] Gopal V commented on HIVE-7397: --- Yes, we can try to get it in as default. My only concern wi

[jira] [Commented] (HIVE-7397) Set the default threshold for fetch task conversion to 1Gb

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14059671#comment-14059671 ] Gopal V commented on HIVE-7397: --- [~navis]: Could you review this change? > Set the default t

[jira] [Updated] (HIVE-7397) Set the default threshold for fetch task conversion to 1Gb

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7397: -- Release Note: Reduce the default threshold for fetch task conversion to 1Gb Status: Patch Available (w

[jira] [Updated] (HIVE-7397) Set the default threshold for fetch task conversion to 1Gb

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7397: -- Attachment: HIVE-7397.1.patch > Set the default threshold for fetch task conversion to 1Gb > ---

[jira] [Commented] (HIVE-7397) Set the default threshold for fetch task conversion to 1Gb

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14059667#comment-14059667 ] Gopal V commented on HIVE-7397: --- Candidate queries for my test were {code} select * from st

[jira] [Updated] (HIVE-7397) Set the default threshold for fetch task conversion to 1Gb

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7397: -- Summary: Set the default threshold for fetch task conversion to 1Gb (was: Set the defaults for fetch task conve

[jira] [Created] (HIVE-7397) Set the defaults for fetch task conversion to 1Gb

2014-07-11 Thread Gopal V (JIRA)
Gopal V created HIVE-7397: - Summary: Set the defaults for fetch task conversion to 1Gb Key: HIVE-7397 URL: https://issues.apache.org/jira/browse/HIVE-7397 Project: Hive Issue Type: Bug Affects Ve

[jira] [Commented] (HIVE-7396) BucketingSortingReduceSinkOptimizer throws NullPointException during ETL

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14059657#comment-14059657 ] Gopal V commented on HIVE-7396: --- This is not just a simple NULL check missing. I made modifi

[jira] [Updated] (HIVE-7396) BucketingSortingReduceSinkOptimizer throws NullPointException during ETL

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7396: -- Assignee: (was: Gopal V) > BucketingSortingReduceSinkOptimizer throws NullPointException during ETL > --

[jira] [Created] (HIVE-7396) BucketingSortingReduceSinkOptimizer throws NullPointException during ETL

2014-07-11 Thread Gopal V (JIRA)
Gopal V created HIVE-7396: - Summary: BucketingSortingReduceSinkOptimizer throws NullPointException during ETL Key: HIVE-7396 URL: https://issues.apache.org/jira/browse/HIVE-7396 Project: Hive Issue

[jira] [Updated] (HIVE-6988) Hive changes for tez-0.5.x compatibility

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-6988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-6988: -- Attachment: HIVE-6988.3.patch Rebase to trunk and update for TEZ-1130, TEZ-1076 > Hive changes for tez-0.5.x co

[jira] [Updated] (HIVE-7394) ORC writer logging fails when the padding is < 0.01

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7394: -- Release Note: Fix logging of ORC padding percentages during inserts Status: Patch Available (was: Open

[jira] [Updated] (HIVE-7394) ORC writer logging fails when the padding is < 0.01

2014-07-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7394: -- Attachment: HIVE-7394.1.patch > ORC writer logging fails when the padding is < 0.01 > --

[jira] [Created] (HIVE-7394) ORC writer logging fails when the padding is < 0.01

2014-07-11 Thread Gopal V (JIRA)
Gopal V created HIVE-7394: - Summary: ORC writer logging fails when the padding is < 0.01 Key: HIVE-7394 URL: https://issues.apache.org/jira/browse/HIVE-7394 Project: Hive Issue Type: Bug Co

[jira] [Commented] (HIVE-7364) Trunk cannot be built on -Phadoop1 after HIVE-7144

2014-07-09 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14056754#comment-14056754 ] Gopal V commented on HIVE-7364: --- Thanks, [~szehon]. Will follow that JIRA. > Trunk cannot

[jira] [Commented] (HIVE-7364) Trunk cannot be built on -Phadoop1 after HIVE-7144

2014-07-09 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14056365#comment-14056365 ] Gopal V commented on HIVE-7364: --- Thanks [~navis] for fixing this. Is there some change happe

[jira] [Updated] (HIVE-6988) Hive changes for tez-0.5.x compatibility

2014-07-07 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-6988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-6988: -- Attachment: HIVE-6988.1.patch Merged in change-sets of HIVE-6993 + HIVE-7350 + pom.xml upgrade to 0.5.x tez ver

[jira] [Updated] (HIVE-6988) Hive changes for tez-0.5.x compatibility

2014-07-07 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-6988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-6988: -- Status: Patch Available (was: Open) Run unit tests against tez-0.5.x > Hive changes for tez-0.5.x compatibilit

[jira] [Created] (HIVE-7359) Stats based compute query replies fail to do simple column transforms

2014-07-07 Thread Gopal V (JIRA)
Gopal V created HIVE-7359: - Summary: Stats based compute query replies fail to do simple column transforms Key: HIVE-7359 URL: https://issues.apache.org/jira/browse/HIVE-7359 Project: Hive Issue Typ

[jira] [Updated] (HIVE-7144) GC pressure during ORC StringDictionary writes

2014-07-07 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7144: -- Resolution: Fixed Fix Version/s: 0.14.0 Status: Resolved (was: Patch Available) Committed to

[jira] [Updated] (HIVE-7243) Print padding information in ORC file dump

2014-07-06 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7243: -- Status: Patch Available (was: Open) > Print padding information in ORC file dump >

[jira] [Updated] (HIVE-7243) Print padding information in ORC file dump

2014-07-06 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7243: -- Attachment: HIVE-7243.3.patch Rebase patch & unit-tests after HIVE-7231 > Print padding information in ORC file

[jira] [Updated] (HIVE-7231) Improve ORC padding

2014-07-06 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7231: -- Resolution: Fixed Fix Version/s: 0.14.0 Release Note: HIVE-7231 : Improve ORC padding (Prasanth J, re

[jira] [Commented] (HIVE-7231) Improve ORC padding

2014-07-06 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14053253#comment-14053253 ] Gopal V commented on HIVE-7231: --- Test failures unrelated. > Improve ORC padding > --

[jira] [Updated] (HIVE-7231) Improve ORC padding

2014-07-06 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7231: -- Attachment: HIVE-7231.8.patch > Improve ORC padding > --- > > Key: HIVE-7231 >

[jira] [Updated] (HIVE-7350) Changes related to TEZ-692, TEZ-1169, TEZ-1234

2014-07-05 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7350: -- Attachment: HIVE-7350.2.patch Remove all pom.xml references from the patch. > Changes related to TEZ-692, TEZ-1

[jira] [Commented] (HIVE-3990) Provide input threshold for direct-fetcher (HIVE-2925)

2014-07-05 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14052984#comment-14052984 ] Gopal V commented on HIVE-3990: --- [~leftylev]: I have it on my next week's schedule to revisit

[jira] [Commented] (HIVE-3990) Provide input threshold for direct-fetcher (HIVE-2925)

2014-07-05 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14052977#comment-14052977 ] Gopal V commented on HIVE-3990: --- [~leftylev]: I saw usability issues with the threshold setti

[jira] [Updated] (HIVE-7343) Update committer list

2014-07-03 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7343: -- Attachment: HIVE-7343.2.patch > Update committer list > - > > Key: HIVE-7343

[jira] [Updated] (HIVE-7343) Update committer list

2014-07-03 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7343: -- Attachment: (was: HIVE-7343.2.patch) > Update committer list > - > > Key

[jira] [Updated] (HIVE-7343) Update committer list

2014-07-03 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7343: -- Attachment: HIVE-7343.2.patch I can confirm that's the right name and org information. But the mark-down file t

[jira] [Updated] (HIVE-7231) Improve ORC padding

2014-07-02 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7231: -- Attachment: HIVE-7231.7.patch Rebase to trunk and update size correction code to reset all corrections to strip

[jira] [Commented] (HIVE-7144) GC pressure during ORC StringDictionary writes

2014-07-02 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14050635#comment-14050635 ] Gopal V commented on HIVE-7144: --- Re-run tests with trunk. > GC pressure during ORC StringDic

[jira] [Updated] (HIVE-7144) GC pressure during ORC StringDictionary writes

2014-07-02 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7144: -- Attachment: HIVE-7144.3.patch > GC pressure during ORC StringDictionary writes > --

[jira] [Commented] (HIVE-7231) Improve ORC padding

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14048531#comment-14048531 ] Gopal V commented on HIVE-7231: --- Tests on 1Tb proving that this does cut down on padding, but

[jira] [Updated] (HIVE-7105) Enable ReduceRecordProcessor to generate VectorizedRowBatches

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7105: -- Resolution: Fixed Release Note: Tez vectorized shuffle record reader (was: Tez shuffle vectorized ReduceR

[jira] [Commented] (HIVE-7105) Enable ReduceRecordProcessor to generate VectorizedRowBatches

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14048510#comment-14048510 ] Gopal V commented on HIVE-7105: --- Committed to trunk, thanks [~mmccline], [~jnp] & [~rajesh.ba

[jira] [Commented] (HIVE-7105) Enable ReduceRecordProcessor to generate VectorizedRowBatches

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14048504#comment-14048504 ] Gopal V commented on HIVE-7105: --- Test failure unrelated to Tez. > Enable ReduceRecordProcess

[jira] [Commented] (HIVE-7304) Transitive Predicate Propagation doesn't happen properly after HIVE-7159

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14048500#comment-14048500 ] Gopal V commented on HIVE-7304: --- [~ashutoshc]: I agree that the SARGs and filters shouldn't h

[jira] [Updated] (HIVE-7231) Improve ORC padding

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7231: -- Attachment: HIVE-7231.6.patch Removed the tez-0.4.1 reference and the "&" from the documentation XML. > Improve

[jira] [Commented] (HIVE-7231) Improve ORC padding

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14048431#comment-14048431 ] Gopal V commented on HIVE-7231: --- Looks like I jumped the gun and rebased my patch to also use

[jira] [Updated] (HIVE-7105) Enable ReduceRecordProcessor to generate VectorizedRowBatches

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7105: -- Attachment: HIVE-7105.3.patch > Enable ReduceRecordProcessor to generate VectorizedRowBatches >

[jira] [Updated] (HIVE-7105) Enable ReduceRecordProcessor to generate VectorizedRowBatches

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7105: -- Status: Patch Available (was: Open) > Enable ReduceRecordProcessor to generate VectorizedRowBatches > -

[jira] [Updated] (HIVE-7105) Enable ReduceRecordProcessor to generate VectorizedRowBatches

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7105: -- Status: Open (was: Patch Available) > Enable ReduceRecordProcessor to generate VectorizedRowBatches > -

[jira] [Commented] (HIVE-7231) Improve ORC padding

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14048404#comment-14048404 ] Gopal V commented on HIVE-7231: --- [~leftylev]: HIVE-7231.5.patch has that clarified in hive-de

[jira] [Updated] (HIVE-7231) Improve ORC padding

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7231: -- Attachment: HIVE-7231.5.patch Update docs in patch to match [~leftylev]'s RB comments. > Improve ORC padding >

[jira] [Commented] (HIVE-7231) Improve ORC padding

2014-06-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14048327#comment-14048327 ] Gopal V commented on HIVE-7231: --- With rebase & update to docs, it LGTM - +1 > Improve ORC pa

[jira] [Updated] (HIVE-7231) Improve ORC padding

2014-06-28 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7231: -- Status: Patch Available (was: Open) > Improve ORC padding > --- > > Key: HIVE-7

[jira] [Updated] (HIVE-7231) Improve ORC padding

2014-06-28 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7231: -- Status: Open (was: Patch Available) > Improve ORC padding > --- > > Key: HIVE-7

[jira] [Updated] (HIVE-7231) Improve ORC padding

2014-06-28 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7231: -- Attachment: HIVE-7231.4.patch Rebase to trunk > Improve ORC padding > --- > > K

[jira] [Created] (HIVE-7313) Allow session-level temp-tables to be marked as in-memory tables

2014-06-27 Thread Gopal V (JIRA)
Gopal V created HIVE-7313: - Summary: Allow session-level temp-tables to be marked as in-memory tables Key: HIVE-7313 URL: https://issues.apache.org/jira/browse/HIVE-7313 Project: Hive Issue Type: Im

[jira] [Created] (HIVE-7302) Allow Auto-reducer parallelism to be turned off by a logical optimizer

2014-06-26 Thread Gopal V (JIRA)
Gopal V created HIVE-7302: - Summary: Allow Auto-reducer parallelism to be turned off by a logical optimizer Key: HIVE-7302 URL: https://issues.apache.org/jira/browse/HIVE-7302 Project: Hive Issue Ty

[jira] [Updated] (HIVE-7232) VectorReduceSink is emitting incorrect JOIN keys

2014-06-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7232: -- Fix Version/s: 0.14.0 > VectorReduceSink is emitting incorrect JOIN keys > -

[jira] [Commented] (HIVE-7220) Empty dir in external table causes issue (root_dir_external_table.q failure)

2014-06-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14045424#comment-14045424 ] Gopal V commented on HIVE-7220: --- Is the de-dup of locations only to work around the FileInput

[jira] [Commented] (HIVE-7232) VectorReduceSink is emitting incorrect JOIN keys

2014-06-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14045308#comment-14045308 ] Gopal V commented on HIVE-7232: --- Committed to trunk, thanks! > VectorReduceSink is emitting

[jira] [Updated] (HIVE-7232) VectorReduceSink is emitting incorrect JOIN keys

2014-06-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7232: -- Resolution: Fixed Release Note: VectorReduceSink is emitting incorrect JOIN keys (Navis, via Gopal V)

[jira] [Updated] (HIVE-7232) VectorReduceSink is emitting incorrect JOIN keys

2014-06-25 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7232: -- Status: Patch Available (was: Open) > VectorReduceSink is emitting incorrect JOIN keys > --

[jira] [Updated] (HIVE-7232) VectorReduceSink is emitting incorrect JOIN keys

2014-06-25 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7232: -- Attachment: HIVE-7232.2.patch.txt > VectorReduceSink is emitting incorrect JOIN keys > -

[jira] [Updated] (HIVE-7232) VectorReduceSink is emitting incorrect JOIN keys

2014-06-25 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7232: -- Status: Open (was: Patch Available) Need to rebase patch to match recent qtest changes made in HIVE-7258 > Vector

[jira] [Updated] (HIVE-7293) Hive-trunk does not build against JDK8 with generic class checks

2014-06-25 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7293: -- Labels: Vectorization (was: ) > Hive-trunk does not build against JDK8 with generic class checks >

[jira] [Updated] (HIVE-7293) Hive-trunk does not build against JDK8 with generic class checks

2014-06-25 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7293: -- Component/s: Query Processor > Hive-trunk does not build against JDK8 with generic class checks > --

[jira] [Created] (HIVE-7293) Hive-trunk does not build against JDK8 with generic class checks

2014-06-25 Thread Gopal V (JIRA)
Gopal V created HIVE-7293: - Summary: Hive-trunk does not build against JDK8 with generic class checks Key: HIVE-7293 URL: https://issues.apache.org/jira/browse/HIVE-7293 Project: Hive Issue Type: Bu

[jira] [Updated] (HIVE-7144) GC pressure during ORC StringDictionary writes

2014-06-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7144: -- Release Note: Use Text writables directly in ORC dictionaries to avoid String allocations. (was: Use Text field

[jira] [Updated] (HIVE-7144) GC pressure during ORC StringDictionary writes

2014-06-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7144: -- Attachment: HIVE-7144.2.patch Address rb comments - renamed methods for clarity, refactored duplicate String/Te

[jira] [Updated] (HIVE-7144) GC pressure during ORC StringDictionary writes

2014-06-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7144: -- Status: Open (was: Patch Available) > GC pressure during ORC StringDictionary writes > ---

[jira] [Commented] (HIVE-7232) VectorReduceSink is emitting incorrect JOIN keys

2014-06-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14041785#comment-14041785 ] Gopal V commented on HIVE-7232: --- [~navis]: LGTM +1. I will commit this. As part of my review

[jira] [Commented] (HIVE-7277) how to decide reduce numbers according to the input size of reduce stage rather than the input size of map stage?

2014-06-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14041745#comment-14041745 ] Gopal V commented on HIVE-7277: --- [~wangmeng]: The hive code changes to mark the logical plan

[jira] [Commented] (HIVE-7277) how to decide reduce numbers according to the input size of reduce stage rather than the input size of map stage?

2014-06-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14041733#comment-14041733 ] Gopal V commented on HIVE-7277: --- Yes, this is how HIVE-7158 works. The reducer counts are es

[jira] [Commented] (HIVE-7232) VectorReduceSink is emitting incorrect JOIN keys

2014-06-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14041575#comment-14041575 ] Gopal V commented on HIVE-7232: --- [~ashutoshc]: Yes, I will review this today. > VectorReduce

[jira] [Updated] (HIVE-7266) Optimized HashTable with vectorized map-joins results in String columns extending

2014-06-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7266: -- Assignee: Matt McCline (was: Jitendra Nath Pandey) > Optimized HashTable with vectorized map-joins results in S

[jira] [Commented] (HIVE-5775) Introduce Cost Based Optimizer to Hive

2014-06-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14040986#comment-14040986 ] Gopal V commented on HIVE-5775: --- [~xuefuz]: The CBO model rewrites queries using cardinality

[jira] [Updated] (HIVE-7144) GC pressure during ORC StringDictionary writes

2014-06-21 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7144: -- Release Note: Use Text fields for ORC dictionaries to prevent superfluous String allocations. Status:

[jira] [Commented] (HIVE-7144) GC pressure during ORC StringDictionary writes

2014-06-21 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14039732#comment-14039732 ] Gopal V commented on HIVE-7144: --- Previous comment, all table items are in seconds. > GC pres

[jira] [Commented] (HIVE-7144) GC pressure during ORC StringDictionary writes

2014-06-21 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14039731#comment-14039731 ] Gopal V commented on HIVE-7144: --- Benchmark insert of TPC-H 1Tb scale data || Table || Before

[jira] [Updated] (HIVE-7144) GC pressure during ORC StringDictionary writes

2014-06-20 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7144: -- Attachment: HIVE-7144.1.patch > GC pressure during ORC StringDictionary writes > --

[jira] [Commented] (HIVE-7236) Tez progress monitor should indicate running/failed tasks

2014-06-20 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14039616#comment-14039616 ] Gopal V commented on HIVE-7236: --- [~leftylev]: Not sure. The (+n,-m) needs explaining, but I

[jira] [Updated] (HIVE-7266) Optimized HashTable with vectorized map-joins results in String columns extending

2014-06-20 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7266: -- Description: The following query returns different results when both vectorized mapjoin and the new optimized h

[jira] [Updated] (HIVE-7266) Optimized HashTable with vectorized map-joins results in String columns extending

2014-06-20 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7266: -- Component/s: Tez > Optimized HashTable with vectorized map-joins results in String columns > extending > --

[jira] [Updated] (HIVE-7266) Optimized HashTable with vectorized map-joins results in String columns extending

2014-06-20 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7266: -- Attachment: hive-7266-small-test.tgz > Optimized HashTable with vectorized map-joins results in String columns

[jira] [Created] (HIVE-7266) Optimized HashTable with vectorized map-joins results in String columns extending

2014-06-20 Thread Gopal V (JIRA)
Gopal V created HIVE-7266: - Summary: Optimized HashTable with vectorized map-joins results in String columns extending Key: HIVE-7266 URL: https://issues.apache.org/jira/browse/HIVE-7266 Project: Hive

[jira] [Created] (HIVE-7265) BINARY columns use BytesWritable::getBytes() without ::getLength()

2014-06-20 Thread Gopal V (JIRA)
Gopal V created HIVE-7265: - Summary: BINARY columns use BytesWritable::getBytes() without ::getLength() Key: HIVE-7265 URL: https://issues.apache.org/jira/browse/HIVE-7265 Project: Hive Issue Type:

[jira] [Commented] (HIVE-7250) Adaptive compression buffer size for wide tables in ORC

2014-06-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14036726#comment-14036726 ] Gopal V commented on HIVE-7250: --- LGTM +1 (NB) > Adaptive compression buffer size for wide ta

[jira] [Updated] (HIVE-7232) VectorReduceSink is emitting incorrect JOIN keys

2014-06-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-7232: -- Description: After HIVE-7121, tpc-h query5 has resulted in incorrect results. Thanks to [~navis], it has been t

[jira] [Commented] (HIVE-7232) ReduceSink is emitting NULL keys due to failed keyEval

2014-06-17 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14034869#comment-14034869 ] Gopal V commented on HIVE-7232: --- [~navis]: I tested this with git commit id 50f517a3930 - it

[jira] [Assigned] (HIVE-7232) ReduceSink is emitting NULL keys due to failed keyEval

2014-06-17 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V reassigned HIVE-7232: - Assignee: Gopal V (was: Navis) > ReduceSink is emitting NULL keys due to failed keyEval > ---

[jira] [Commented] (HIVE-7231) Improve ORC padding

2014-06-17 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14034434#comment-14034434 ] Gopal V commented on HIVE-7231: --- The approach results in stray writes across the stripe bound

[jira] [Commented] (HIVE-7232) ReduceSink is emitting NULL keys due to failed keyEval

2014-06-17 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14034322#comment-14034322 ] Gopal V commented on HIVE-7232: --- [~navis]: I found out that there are indeed o_orderkey entri
