[jira] [Commented] (HIVE-4486) FetchOperator slows down SMB map joins by 50% when there are many partitions

2013-05-16 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13659785#comment-13659785 ] Gopal V commented on HIVE-4486: --- I have already run all of tests in ql/ against svn (Wed May

[jira] [Commented] (HIVE-4486) FetchOperator slows down SMB map joins by 50% when there are many partitions

2013-05-17 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13660745#comment-13660745 ] Gopal V commented on HIVE-4486: --- [~owen.omalley] - all test pass on

[jira] [Commented] (HIVE-4436) hive.exec.parallel=true doesn't work on hadoop-2

2013-05-17 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13660747#comment-13660747 ] Gopal V commented on HIVE-4436: --- all test pass on (git

[jira] [Commented] (HIVE-4486) FetchOperator slows down SMB map joins by 50% when there are many partitions

2013-05-17 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13661276#comment-13661276 ] Gopal V commented on HIVE-4486: --- HBase errors in trunk tests? Caused by:

[jira] [Created] (HIVE-4608) Vectorized UDFs for Timestamp in nanoseconds

2013-05-24 Thread Gopal V (JIRA)
Gopal V created HIVE-4608: - Summary: Vectorized UDFs for Timestamp in nanoseconds Key: HIVE-4608 URL: https://issues.apache.org/jira/browse/HIVE-4608 Project: Hive Issue Type: New Feature

[jira] [Updated] (HIVE-4608) Vectorized UDFs for Timestamp in nanoseconds

2013-05-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4608: -- Attachment: 0001-Vectorized-UDFs-for-timestamp-functions-which-accept.patch Vectorized UDFs for Timestamp

[jira] [Commented] (HIVE-3562) Some limit can be pushed down to map stage

2013-05-28 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13668536#comment-13668536 ] Gopal V commented on HIVE-3562: --- I have implemented a similar fix as a Combiner in MR

[jira] [Commented] (HIVE-4436) hive.exec.parallel=true doesn't work on hadoop-2

2013-05-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13670154#comment-13670154 ] Gopal V commented on HIVE-4436: --- [~navis] will add a clientpositive testcase and will update

[jira] [Updated] (HIVE-4436) hive.exec.parallel=true doesn't work on hadoop-2

2013-05-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4436: -- Attachment: parallel_sorted.q Wrote a test-case, but the bug is not reproducible in TestCliDriver needs a

[jira] [Updated] (HIVE-4608) Vectorized UDFs for Timestamp in nanoseconds

2013-05-31 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4608: -- Attachment: 0002-Update-patch-to-the-review-comments-in-https-reviews.patch Patch on top of

[jira] [Updated] (HIVE-4608) Vectorized UDFs for Timestamp in nanoseconds

2013-06-03 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4608: -- Attachment: 0003-rebased-to-apache-hive.patch Rebased patch onto the apache/vectorization branch

[jira] [Created] (HIVE-4681) Fix ORC TimestampTreeReader.nextVector() to handle milli-nano math corectly

2013-06-06 Thread Gopal V (JIRA)
Gopal V created HIVE-4681: - Summary: Fix ORC TimestampTreeReader.nextVector() to handle milli-nano math corectly Key: HIVE-4681 URL: https://issues.apache.org/jira/browse/HIVE-4681 Project: Hive

[jira] [Updated] (HIVE-4681) Fix ORC TimestampTreeReader.nextVector() to handle milli-nano math corectly

2013-06-06 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4681: -- Attachment: HIVE-4160-001.patch Refactored TimestampTreeReader.nextVector() with bug fix. Fix

[jira] [Updated] (HIVE-4681) Fix ORC TimestampTreeReader.nextVector() to handle milli-nano math corectly

2013-06-06 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4681: -- Release Note: Fix ORC TimestampTreeReader to sum the second and nanosecond fraction timestamp vectors correctly

[jira] [Updated] (HIVE-4681) Fix ORC TimestampTreeReader.nextVector() to handle milli-nano math corectly

2013-06-06 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4681: -- Attachment: HIVE-4681-001.patch Fix ORC TimestampTreeReader.nextVector() to handle milli-nano math

[jira] [Updated] (HIVE-4681) Fix ORC TimestampTreeReader.nextVector() to handle milli-nano math corectly

2013-06-06 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4681: -- Attachment: (was: HIVE-4160-001.patch) Fix ORC TimestampTreeReader.nextVector() to handle milli-nano

[jira] [Created] (HIVE-4694) Fix ORC TestVectorizedORCReader testcase for Timestamps

2013-06-10 Thread Gopal V (JIRA)
Gopal V created HIVE-4694: - Summary: Fix ORC TestVectorizedORCReader testcase for Timestamps Key: HIVE-4694 URL: https://issues.apache.org/jira/browse/HIVE-4694 Project: Hive Issue Type: Bug

[jira] [Updated] (HIVE-4694) Fix ORC TestVectorizedORCReader testcase for Timestamps

2013-06-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4694: -- Attachment: HIVE-4694.patch Test-case patch Fix ORC TestVectorizedORCReader testcase for

[jira] [Updated] (HIVE-4694) Fix ORC TestVectorizedORCReader testcase for Timestamps

2013-06-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4694: -- Release Note: Fix ORC TestVectorizedORCReader for Timestamps containing sub-second timings Status:

[jira] [Updated] (HIVE-4694) Fix ORC TestVectorizedORCReader testcase for Timestamps

2013-06-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4694: -- Affects Version/s: vectorization-branch Fix ORC TestVectorizedORCReader testcase for Timestamps

[jira] [Updated] (HIVE-4699) Integrate basic UDFs for Timesamp

2013-06-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4699: -- Attachment: HIVE-4160.patch Integration patch Integrate basic UDFs for Timesamp

[jira] [Updated] (HIVE-4699) Integrate basic UDFs for Timesamp

2013-06-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4699: -- Attachment: HIVE-4699.patch Integrate basic UDFs for Timesamp -

[jira] [Updated] (HIVE-4699) Integrate basic UDFs for Timesamp

2013-06-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4699: -- Release Note: Integrate vectorized Year, Month, Week, Day, Hour, Minute, Second UnixTimestamp UDFs for

[jira] [Updated] (HIVE-4699) Integrate basic UDFs for Timesamp

2013-06-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4699: -- Attachment: (was: HIVE-4160.patch) Integrate basic UDFs for Timesamp

[jira] [Resolved] (HIVE-4722) MIN on timestamp column gives incorrect result.

2013-06-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V resolved HIVE-4722. --- Resolution: Duplicate MIN on timestamp column gives incorrect result.

[jira] [Created] (HIVE-4757) LazyTimestamp goes into irretrievable NULL mode once inited with NULL once

2013-06-19 Thread Gopal V (JIRA)
Gopal V created HIVE-4757: - Summary: LazyTimestamp goes into irretrievable NULL mode once inited with NULL once Key: HIVE-4757 URL: https://issues.apache.org/jira/browse/HIVE-4757 Project: Hive

[jira] [Updated] (HIVE-4757) LazyTimestamp goes into irretrievable NULL mode once inited with NULL once

2013-06-19 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4757: -- Component/s: Serializers/Deserializers Affects Version/s: 0.12.0 Assignee: Gopal V

[jira] [Created] (HIVE-4758) NULLs and record separators broken with vectorization branch intermediate outputs

2013-06-19 Thread Gopal V (JIRA)
Gopal V created HIVE-4758: - Summary: NULLs and record separators broken with vectorization branch intermediate outputs Key: HIVE-4758 URL: https://issues.apache.org/jira/browse/HIVE-4758 Project: Hive

[jira] [Updated] (HIVE-4758) NULLs and record separators broken with vectorization branch intermediate outputs

2013-06-19 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4758: -- Attachment: HIVE-4758-001.patch This patch assumes the LazySimpleSerDe's default NULL indicator to be \N.

[jira] [Updated] (HIVE-4758) NULLs and record separators broken with vectorization branch intermediate outputs

2013-06-19 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4758: -- Release Note: Fix the NULL serialization and record separator insertion for VectorizedRowBatch.toString()

[jira] [Updated] (HIVE-4757) LazyTimestamp goes into irretrievable NULL mode once inited with NULL once

2013-06-19 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4757: -- Attachment: HIVE-4757-001.patch 1 liner patch + test-case to reproduce issue LazyTimestamp

[jira] [Updated] (HIVE-4757) LazyTimestamp goes into irretrievable NULL mode once inited with NULL once

2013-06-19 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4757: -- Release Note: Fix bad isNull state for LazyTimestamp when encountering valid input after a NULL record.

[jira] [Commented] (HIVE-4436) hive.exec.parallel=true doesn't work on hadoop-2

2013-07-01 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13696849#comment-13696849 ] Gopal V commented on HIVE-4436: --- I have replied to Navis on RB. I tried to add more logging

[jira] [Commented] (HIVE-4436) hive.exec.parallel=true doesn't work on hadoop-2

2013-07-01 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13697397#comment-13697397 ] Gopal V commented on HIVE-4436: --- Thanks [~navis]! I wanted to fix the issue with the minimum

[jira] [Created] (HIVE-4884) ORC TimestampTreeReader.nextVector() off by a second when time in fractional

2013-07-18 Thread Gopal V (JIRA)
Gopal V created HIVE-4884: - Summary: ORC TimestampTreeReader.nextVector() off by a second when time in fractional Key: HIVE-4884 URL: https://issues.apache.org/jira/browse/HIVE-4884 Project: Hive

[jira] [Work started] (HIVE-4884) ORC TimestampTreeReader.nextVector() off by a second when time in fractional

2013-07-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-4884 started by Gopal V. ORC TimestampTreeReader.nextVector() off by a second when time in fractional

[jira] [Updated] (HIVE-4884) ORC TimestampTreeReader.nextVector() off by a second when time in fractional

2013-07-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4884: -- Attachment: HIVE-4884-.patch Adjust milliseconds down when encountering a negative second value, with a

[jira] [Updated] (HIVE-4884) ORC TimestampTreeReader.nextVector() off by a second when time in fractional

2013-07-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4884: -- Release Note: Adjust milliseconds down when encountering a negative second value with a fractional second

[jira] [Created] (HIVE-4926) Queries which specify clustered-by keys as constants will still scan all buckets

2013-07-24 Thread Gopal V (JIRA)
Gopal V created HIVE-4926: - Summary: Queries which specify clustered-by keys as constants will still scan all buckets Key: HIVE-4926 URL: https://issues.apache.org/jira/browse/HIVE-4926 Project: Hive

[jira] [Updated] (HIVE-4926) Queries which specify clustered-by keys as constants will still scan all buckets

2013-07-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4926: -- Attachment: HIVE-4926-test.tgz Simple self-contained test-case Queries which specify

[jira] [Updated] (HIVE-4926) Queries which specify clustered-by keys as constants will still scan all buckets

2013-07-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4926: -- Component/s: Query Processor Labels: perfomance (was: ) Queries which specify clustered-by keys

[jira] [Work started] (HIVE-4971) Unit test failure in TestVectorTimestampExpressions

2013-08-01 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-4971 started by Gopal V. Unit test failure in TestVectorTimestampExpressions --- Key:

[jira] [Updated] (HIVE-4971) Unit test failure in TestVectorTimestampExpressions

2013-08-01 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4971: -- Affects Version/s: vectorization-branch Unit test failure in TestVectorTimestampExpressions

[jira] [Updated] (HIVE-4971) Unit test failure in TestVectorTimestampExpressions

2013-08-01 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4971: -- Component/s: UDF Tests Unit test failure in TestVectorTimestampExpressions

[jira] [Updated] (HIVE-4971) Unit test failure in TestVectorTimestampExpressions

2013-08-01 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4971: -- Attachment: HIVE-4971.patch Fix the timestamp math, to match the rounding mechanism for java (all integer

[jira] [Updated] (HIVE-4971) Unit test failure in TestVectorTimestampExpressions

2013-08-01 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4971: -- Release Note: Fix vectorized TIMESTAMP() to handle negative timestamps with fractional seconds

[jira] [Updated] (HIVE-4971) Unit test failure in TestVectorTimestampExpressions

2013-08-02 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4971: -- Status: Open (was: Patch Available) Unit test failure in TestVectorTimestampExpressions

[jira] [Commented] (HIVE-4246) Implement predicate pushdown for ORC

2013-08-07 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13731681#comment-13731681 ] Gopal V commented on HIVE-4246: --- The IN() implementation does a linear search on the

[jira] [Commented] (HIVE-4423) Improve RCFile::sync(long) 10x

2013-08-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13740051#comment-13740051 ] Gopal V commented on HIVE-4423: --- Good catch [~taguswang], it is in fact missing 1 byte at the

[jira] [Created] (HIVE-5093) Use a combiner for LIMIT with GROUP BY and ORDER BY operators

2013-08-14 Thread Gopal V (JIRA)
Gopal V created HIVE-5093: - Summary: Use a combiner for LIMIT with GROUP BY and ORDER BY operators Key: HIVE-5093 URL: https://issues.apache.org/jira/browse/HIVE-5093 Project: Hive Issue Type: Bug

[jira] [Commented] (HIVE-5093) Use a combiner for LIMIT with GROUP BY and ORDER BY operators

2013-08-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13740258#comment-13740258 ] Gopal V commented on HIVE-5093: --- Quick test-results {code} hive set

[jira] [Updated] (HIVE-5093) Use a combiner for LIMIT with GROUP BY and ORDER BY operators

2013-08-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5093: -- Attachment: HIVE-5093-WIP-01.patch Rebased to trunk WIP. This does mix up a mapred.Reducer ref into MapWork,

[jira] [Commented] (HIVE-3562) Some limit can be pushed down to map stage

2013-08-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13740269#comment-13740269 ] Gopal V commented on HIVE-3562: --- I have created HIVE-5093 to hold my limit combiner patch

[jira] [Assigned] (HIVE-5100) RCFile::sync(long) missing 1 byte in System.arraycopy()

2013-08-17 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V reassigned HIVE-5100: - Assignee: Gopal V RCFile::sync(long) missing 1 byte in System.arraycopy()

[jira] [Commented] (HIVE-5093) Use a combiner for LIMIT with GROUP BY and ORDER BY operators

2013-08-19 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13743866#comment-13743866 ] Gopal V commented on HIVE-5093: --- [~thejas] You are right, the systems slows down adds

[jira] [Commented] (HIVE-5093) Use a combiner for LIMIT with GROUP BY and ORDER BY operators

2013-08-19 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13744063#comment-13744063 ] Gopal V commented on HIVE-5093: --- [~appodictic]: HIVE-3562 is actually doing this in the

[jira] [Updated] (HIVE-5100) RCFile::sync(long) missing 1 byte in System.arraycopy()

2013-08-22 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5100: -- Attachment: HIVE-5100-001.patch Fix from the bug report, with an associated test-case. Before fix, the new

[jira] [Updated] (HIVE-5100) RCFile::sync(long) missing 1 byte in System.arraycopy()

2013-08-22 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5100: -- Labels: regression (was: ) Affects Version/s: 0.12.0 Release Note: Fix off-by-one

[jira] [Updated] (HIVE-5100) RCFile::sync(long) missing 1 byte in System.arraycopy()

2013-08-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5100: -- Attachment: HIVE-5100.01.patch Reuploading patch to match the PreCommit patch format.

[jira] [Updated] (HIVE-5100) RCFile::sync(long) missing 1 byte in System.arraycopy()

2013-08-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5100: -- Status: Open (was: Patch Available) RCFile::sync(long) missing 1 byte in System.arraycopy()

[jira] [Updated] (HIVE-5100) RCFile::sync(long) missing 1 byte in System.arraycopy()

2013-08-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5100: -- Status: Patch Available (was: Open) RCFile::sync(long) missing 1 byte in System.arraycopy()

[jira] [Updated] (HIVE-5144) HashTableSink allocates empty new Object[] arrays OOMs - use a static emptyRow instead

2013-08-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5144: -- Attachment: HIVE-5144.01.patch With the attached patch, the memory usage drops from 199 Mb per million rows to

[jira] [Updated] (HIVE-5144) HashTableSink allocates empty new Object[] arrays OOMs - use a static emptyRow instead

2013-08-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5144: -- Environment: Ubuntu LXC + -Xmx512m client opts (was: Ubuntu LXC + -Xmx4096m client opts) HashTableSink

[jira] [Created] (HIVE-5144) HashTableSink allocates empty new Object[] arrays OOMs - use a static emptyRow instead

2013-08-23 Thread Gopal V (JIRA)
Gopal V created HIVE-5144: - Summary: HashTableSink allocates empty new Object[] arrays OOMs - use a static emptyRow instead Key: HIVE-5144 URL: https://issues.apache.org/jira/browse/HIVE-5144 Project: Hive

[jira] [Updated] (HIVE-5144) HashTableSink allocates empty new Object[] arrays OOMs - use a static emptyRow instead

2013-08-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5144: -- Labels: perfomance (was: ) Release Note: Save memory in HashTableSink for key-only map-joins, by

[jira] [Commented] (HIVE-5144) HashTableSink allocates empty new Object[] arrays OOMs - use a static emptyRow instead

2013-08-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13748846#comment-13748846 ] Gopal V commented on HIVE-5144: --- I followed the pattern from MapJoinKey and

[jira] [Updated] (HIVE-5144) HashTableSink allocates empty new Object[] arrays OOMs - use a static emptyRow instead

2013-08-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5144: -- Status: Open (was: Patch Available) Test case run finished, auto_join30.q fails. Cancelling patch.

[jira] [Updated] (HIVE-5144) HashTableSink allocates empty new Object[] arrays OOMs - use a static emptyRow instead

2013-08-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5144: -- Attachment: HIVE-5144.02.patch Bad merge in patch. {code} -if((hasFilter(alias) joinFilters[alias].size()

[jira] [Updated] (HIVE-5144) HashTableSink allocates empty new Object[] arrays OOMs - use a static emptyRow instead

2013-08-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5144: -- Status: Patch Available (was: Open) HashTableSink allocates empty new Object[] arrays OOMs - use a

[jira] [Commented] (HIVE-3562) Some limit can be pushed down to map stage

2013-08-28 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13752490#comment-13752490 ] Gopal V commented on HIVE-3562: --- Good work Navis. Let me mark HIVE-5093 as obsoleted by this

[jira] [Created] (HIVE-5169) Sorted Bucketed Partitioned Insert does not sort by dynamic partition column causing reducer OOMs/lease-expiry errors

2013-08-29 Thread Gopal V (JIRA)
Gopal V created HIVE-5169: - Summary: Sorted Bucketed Partitioned Insert does not sort by dynamic partition column causing reducer OOMs/lease-expiry errors Key: HIVE-5169 URL:

[jira] [Updated] (HIVE-5169) Sorted Bucketed Partitioned Insert does not sort by dynamic partition column causing reducer OOMs/lease-expiry errors

2013-08-29 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5169: -- Attachment: orc2.sql Scale=2 ORC loader. To generate TPC-h text tables, you can use

[jira] [Created] (HIVE-5170) Sorted Bucketed Partitioned Insert hard-codes the reducer count == bucket count

2013-08-29 Thread Gopal V (JIRA)
Gopal V created HIVE-5170: - Summary: Sorted Bucketed Partitioned Insert hard-codes the reducer count == bucket count Key: HIVE-5170 URL: https://issues.apache.org/jira/browse/HIVE-5170 Project: Hive

[jira] [Commented] (HIVE-5170) Sorted Bucketed Partitioned Insert hard-codes the reducer count == bucket count

2013-09-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13774655#comment-13774655 ] Gopal V commented on HIVE-5170: --- Tried to do this, unfortunately the FileSinkOperator uses

[jira] [Created] (HIVE-5345) Operator::close() leaks Operator::out, holding reference to buffers

2013-09-23 Thread Gopal V (JIRA)
Gopal V created HIVE-5345: - Summary: Operator::close() leaks Operator::out, holding reference to buffers Key: HIVE-5345 URL: https://issues.apache.org/jira/browse/HIVE-5345 Project: Hive Issue

[jira] [Updated] (HIVE-5345) Operator::close() leaks Operator::out, holding reference to buffers

2013-09-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5345: -- Attachment: HIVE-5345.01.patch One liner fix {code} @@ -613,6 +613,8 @@ public void close(boolean abort)

[jira] [Updated] (HIVE-5345) Operator::close() leaks Operator::out, holding reference to buffers

2013-09-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-5345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-5345: -- Release Note: Prevent OutputCollector leaks from an Operator by clearing the Operator::out reference on

[jira] [Created] (HIVE-3992) Hive RCFile::sync(long) does a sub-sequence linear search for sync blocks

2013-02-06 Thread Gopal V (JIRA)
Gopal V created HIVE-3992: - Summary: Hive RCFile::sync(long) does a sub-sequence linear search for sync blocks Key: HIVE-3992 URL: https://issues.apache.org/jira/browse/HIVE-3992 Project: Hive

[jira] [Updated] (HIVE-3992) Hive RCFile::sync(long) does a sub-sequence linear search for sync blocks

2013-02-06 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-3992: -- Attachment: select-join-limit.html Profile view of RCFile::sync(long) Hive RCFile::sync(long)

[jira] [Commented] (HIVE-3992) Hive RCFile::sync(long) does a sub-sequence linear search for sync blocks

2013-02-07 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13573298#comment-13573298 ] Gopal V commented on HIVE-3992: --- We can't fix it when the map-splits are properly distributed

[jira] [Updated] (HIVE-3992) Hive RCFile::sync(long) does a sub-sequence linear search for sync blocks

2013-02-07 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-3992: -- Attachment: HIVE-3992.patch Cuts down on Sync calls in 2 ways. 1) Does not do sync if ExecMapper.getDone() ==

[jira] [Commented] (HIVE-3992) Hive RCFile::sync(long) does a sub-sequence linear search for sync blocks

2013-02-07 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13573402#comment-13573402 ] Gopal V commented on HIVE-3992: --- Testing dummy query (to simulate a col in (select ...) style

[jira] [Created] (HIVE-3997) Use distributed cache to cache/localize dimension table filter it in map task setup

2013-02-07 Thread Gopal V (JIRA)
Gopal V created HIVE-3997: - Summary: Use distributed cache to cache/localize dimension table filter it in map task setup Key: HIVE-3997 URL: https://issues.apache.org/jira/browse/HIVE-3997 Project: Hive

[jira] [Commented] (HIVE-3997) Use distributed cache to cache/localize dimension table filter it in map task setup

2013-02-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13577884#comment-13577884 ] Gopal V commented on HIVE-3997: --- Added dist-cache support to the join example at

[jira] [Commented] (HIVE-3997) Use distributed cache to cache/localize dimension table filter it in map task setup

2013-02-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13577893#comment-13577893 ] Gopal V commented on HIVE-3997: --- Benchmarks are not promising SCALE=1 data-set || run ||

[jira] [Commented] (HIVE-3997) Use distributed cache to cache/localize dimension table filter it in map task setup

2013-02-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13577920#comment-13577920 ] Gopal V commented on HIVE-3997: --- Tested with 6x12 slots for the tasks, the results remain

[jira] [Commented] (HIVE-3997) Use distributed cache to cache/localize dimension table filter it in map task setup

2013-02-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13579018#comment-13579018 ] Gopal V commented on HIVE-3997: --- All map-tasks happening in a single wave, but some of the

[jira] [Updated] (HIVE-3992) Hive RCFile::sync(long) does a sub-sequence linear search for sync blocks

2013-02-20 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-3992: -- Assignee: Gopal V Release Note: Rely on previous sync-points when syncing within the same RCFile and

[jira] [Created] (HIVE-4051) Hive's metastore suffers from 1+N queries when querying partitions is slow

2013-02-21 Thread Gopal V (JIRA)
Gopal V created HIVE-4051: - Summary: Hive's metastore suffers from 1+N queries when querying partitions is slow Key: HIVE-4051 URL: https://issues.apache.org/jira/browse/HIVE-4051 Project: Hive

[jira] [Created] (HIVE-4078) Remove the serialize-deserialize pair in CommonJoinResolver

2013-02-26 Thread Gopal V (JIRA)
Gopal V created HIVE-4078: - Summary: Remove the serialize-deserialize pair in CommonJoinResolver Key: HIVE-4078 URL: https://issues.apache.org/jira/browse/HIVE-4078 Project: Hive Issue Type: Bug

[jira] [Updated] (HIVE-4078) Remove the serialize-deserialize pair in CommonJoinResolver

2013-02-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4078: -- Attachment: HIVE-4078.patch Over six runs of an explain query with 3 joins || pre-patch || post-patch ||

[jira] [Commented] (HIVE-4078) Remove the serialize-deserialize pair in CommonJoinResolver

2013-02-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13587902#comment-13587902 ] Gopal V commented on HIVE-4078: --- Checked the plans generated by both -Hive history

[jira] [Commented] (HIVE-4078) Remove the serialize-deserialize pair in CommonJoinResolver

2013-02-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13587917#comment-13587917 ] Gopal V commented on HIVE-4078: --- Up for review https://reviews.apache.org/r/9642/

[jira] [Updated] (HIVE-4078) Remove the serialize-deserialize pair in CommonJoinResolver

2013-02-27 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4078: -- Status: Open (was: Patch Available) Updating patch to match review comments. Remove the

[jira] [Updated] (HIVE-4078) Remove the serialize-deserialize pair in CommonJoinResolver

2013-02-27 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4078: -- Attachment: HIVE-4078-20130227.patch Updated to throw an exception if the cloner throws a SemanticException

[jira] [Updated] (HIVE-4078) Remove the serialize-deserialize pair in CommonJoinResolver

2013-02-27 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4078: -- Status: Patch Available (was: Open) Remove the serialize-deserialize pair in CommonJoinResolver

[jira] [Updated] (HIVE-4078) Remove the serialize-deserialize pair in CommonJoinResolver

2013-02-27 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4078: -- Status: Open (was: Patch Available) cloneBean() only clones part of the data, does not do a true deep-copy.

[jira] [Created] (HIVE-4103) Remove System.gc() call from the map-join local-task loop

2013-03-01 Thread Gopal V (JIRA)
Gopal V created HIVE-4103: - Summary: Remove System.gc() call from the map-join local-task loop Key: HIVE-4103 URL: https://issues.apache.org/jira/browse/HIVE-4103 Project: Hive Issue Type: Bug

[jira] [Updated] (HIVE-4103) Remove System.gc() call from the map-join local-task loop

2013-03-01 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4103: -- Priority: Minor (was: Major) Remove System.gc() call from the map-join local-task loop

[jira] [Updated] (HIVE-4103) Remove System.gc() call from the map-join local-task loop

2013-03-01 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-4103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated HIVE-4103: -- Attachment: HIVE-4103.patch Remove the thread-stopping System.gc() calls from isAbort()

  1   2   3   4   5   6   7   8   9   10   >