[jira] [Resolved] (HIVE-24710) Optimise PTF iteration for count(*) to reduce CPU and IO cost

2021-02-17 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan resolved HIVE-24710. - Fix Version/s: 4.0.0 Resolution: Fixed Thanks for the review [~ashutoshc]. Merged

[jira] [Commented] (HIVE-24774) Reduce FS listing during dynamic partition loading

2021-02-15 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284958#comment-17284958 ] Rajesh Balamohan commented on HIVE-24774: - Thanks [~pvargacl], I will go through

[jira] [Commented] (HIVE-24710) Optimise PTF iteration for count(*) to reduce CPU and IO cost

2021-02-03 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17277853#comment-17277853 ] Rajesh Balamohan commented on HIVE-24710: - Updated the subject and description of

[jira] [Updated] (HIVE-24710) Optimise PTF iteration for count(*) to reduce CPU and IO cost

2021-02-03 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24710: Description: E.g. query {noformat} select x, y, count(*) over (partition by x order by y ra

[jira] [Updated] (HIVE-24710) Optimise PTF iteration for count(*) to reduce CPU and IO cost

2021-02-03 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24710: Summary: Optimise PTF iteration for count(*) to reduce CPU and IO cost (was: PTFRowContain

[jira] [Resolved] (HIVE-24695) Clean up session resources, if TezSession is unable to start

2021-02-02 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan resolved HIVE-24695. - Resolution: Fixed Thanks for the review [~ashutoshc]. Merged the PR. > Clean up session

[jira] [Assigned] (HIVE-24695) Clean up session resources, if TezSession is unable to start

2021-02-02 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan reassigned HIVE-24695: --- Assignee: Rajesh Balamohan > Clean up session resources, if TezSession is unable to

[jira] [Resolved] (HIVE-24443) Optimise VectorSerializeRow for primitives

2021-02-01 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan resolved HIVE-24443. - Resolution: Fixed Fixed via HIVE-24503 > Optimise VectorSerializeRow for primitives > --

[jira] [Updated] (HIVE-24710) PTFRowContainer could be reading more number of blocks than needed

2021-01-31 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24710: Description: PTFRowContainer could be reading the same block repeatedly for the first block

[jira] [Updated] (HIVE-24695) Clean up session resources, if TezSession is unable to start

2021-01-28 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24695: Description: There are cases when TezSessionState would not be able to start. (e.g. resource

[jira] [Updated] (HIVE-24695) Clean up session resources, if TezSession is unable to start

2021-01-28 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24695: Description: There are cases when TezSessionState would not be able to start. (e.g. resource

[jira] [Comment Edited] (HIVE-24596) Explain ddl for debugging

2021-01-06 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17260223#comment-17260223 ] Rajesh Balamohan edited comment on HIVE-24596 at 1/7/21, 4:34 AM: -

[jira] [Commented] (HIVE-24596) Explain ddl for debugging

2021-01-06 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17260223#comment-17260223 ] Rajesh Balamohan commented on HIVE-24596: - HIVE-24596 is on similar lines, but wi

[jira] [Updated] (HIVE-24546) Avoid unwanted cloud storage call during dynamic partition load

2020-12-16 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24546: Attachment: simple_test.sql > Avoid unwanted cloud storage call during dynamic partition lo

[jira] [Resolved] (HIVE-24520) Fix stackoverflow error in HiveMetaStore::get_partitions_by_names

2020-12-14 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan resolved HIVE-24520. - Fix Version/s: 4.0.0 Resolution: Fixed Merged the PR. Thanks [~kishendas] . > Fix

[jira] [Updated] (HIVE-24519) Optimize MV: Materialized views should not rebuild when tables are not modified

2020-12-10 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24519: Parent: HIVE-22253 Issue Type: Sub-task (was: Improvement) > Optimize MV: Material

[jira] [Commented] (HIVE-24472) Optimize LlapTaskSchedulerService::preemptTasksFromMap

2020-12-02 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17242961#comment-17242961 ] Rajesh Balamohan commented on HIVE-24472: - Ref: Q14 in tpcds > Optimize LlapTask

[jira] [Commented] (HIVE-24443) Optimise VectorSerializeRow for primitives

2020-11-29 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17240471#comment-17240471 ] Rajesh Balamohan commented on HIVE-24443: - This showed up as a part of Q67. > Op

[jira] [Commented] (HIVE-24409) Use LazyBinarySerDe2 in PlanUtils::getReduceValueTableDesc

2020-11-22 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17237154#comment-17237154 ] Rajesh Balamohan commented on HIVE-24409: - {noformat} e.g query @10TB scale ins

[jira] [Commented] (HIVE-24368) Optimise AcidUtils::getAcidFilesForStats for ACID tables

2020-11-11 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17230399#comment-17230399 ] Rajesh Balamohan commented on HIVE-24368: - Appears that this is handled in master

[jira] [Comment Edited] (HIVE-24109) Load partitions in batches for managed tables in the bootstrap phase

2020-11-01 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17224390#comment-17224390 ] Rajesh Balamohan edited comment on HIVE-24109 at 11/2/20, 4:11 AM:

[jira] [Comment Edited] (HIVE-24109) Load partitions in batches for managed tables in the bootstrap phase

2020-11-01 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17224390#comment-17224390 ] Rajesh Balamohan edited comment on HIVE-24109 at 11/2/20, 4:10 AM:

[jira] [Commented] (HIVE-24109) Load partitions in batches for managed tables in the bootstrap phase

2020-11-01 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17224417#comment-17224417 ] Rajesh Balamohan commented on HIVE-24109: - Just had a closer look at the patch. R

[jira] [Commented] (HIVE-24109) Load partitions in batches for managed tables in the bootstrap phase

2020-11-01 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17224390#comment-17224390 ] Rajesh Balamohan commented on HIVE-24109: - Sorry for noticing this patch late. Th

[jira] [Commented] (HIVE-23190) LLAP: modify IndexCache to pass filesystem object to TezSpillRecord

2020-10-18 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17216380#comment-17216380 ] Rajesh Balamohan commented on HIVE-23190: - Patch looks good. +1 pending tests. C

[jira] [Resolved] (HIVE-24234) Improve checkHashModeEfficiency in VectorGroupByOperator

2020-10-12 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan resolved HIVE-24234. - Fix Version/s: 4.0.0 Resolution: Fixed > Improve checkHashModeEfficiency in Vector

[jira] [Commented] (HIVE-24234) Improve checkHashModeEfficiency in VectorGroupByOperator

2020-10-12 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17212779#comment-17212779 ] Rajesh Balamohan commented on HIVE-24234: - Thanks [~ashutoshc], [~mustafaiman]. M

[jira] [Updated] (HIVE-24262) Optimise NullScanTaskDispatcher for cloud storage

2020-10-12 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24262: Description: {noformat} select count(DISTINCT ss_sold_date_sk) from store_sales; -

[jira] [Assigned] (HIVE-24234) Improve checkHashModeEfficiency in VectorGroupByOperator

2020-10-09 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan reassigned HIVE-24234: --- Assignee: Rajesh Balamohan > Improve checkHashModeEfficiency in VectorGroupByOperato

[jira] [Commented] (HIVE-24234) Improve checkHashModeEfficiency in VectorGroupByOperator

2020-10-06 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17209327#comment-17209327 ] Rajesh Balamohan commented on HIVE-24234: - Thanks [~mustafaiman]. >> (outputRec

[jira] [Updated] (HIVE-24234) Improve checkHashModeEfficiency in VectorGroupByOperator

2020-10-06 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24234: Attachment: HIVE-24234.wip.patch > Improve checkHashModeEfficiency in VectorGroupByOperator

[jira] [Commented] (HIVE-24205) Optimise CuckooSetBytes

2020-10-05 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17207989#comment-17207989 ] Rajesh Balamohan commented on HIVE-24205: - Thanks [~mustafaiman]. With repeated r

[jira] [Updated] (HIVE-24212) Refactor to take advantage of list* optimisations in cloud storage connectors

2020-09-30 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24212: Summary: Refactor to take advantage of list* optimisations in cloud storage connectors (wa

[jira] [Updated] (HIVE-24205) Optimise CuckooSetBytes

2020-09-28 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24205: Description: {{FilterStringColumnInList, StringColumnInList}} etc. use CuckooSetBytes for

[jira] [Resolved] (HIVE-24116) LLAP: Provide an opportunity for preempted tasks to get better locality in next iteration

2020-09-04 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan resolved HIVE-24116. - Fix Version/s: 4.0.0 Resolution: Fixed > LLAP: Provide an opportunity for preempte

[jira] [Commented] (HIVE-24116) LLAP: Provide an opportunity for preempted tasks to get better locality in next iteration

2020-09-04 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17190634#comment-17190634 ] Rajesh Balamohan commented on HIVE-24116: - Thanks [~gopalv] . Committed to master

[jira] [Updated] (HIVE-24116) LLAP: Provide an opportunity for preempted tasks to get better locality in next iteration

2020-09-03 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-24116: Description: In certain DAGs, tasks get preempted as higher priority tasks need to be exec

[jira] [Assigned] (HIVE-24116) LLAP: Provide an opportunity for preempted tasks to get better locality in next iteration

2020-09-03 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan reassigned HIVE-24116: --- > LLAP: Provide an opportunity for preempted tasks to get better locality in > next ite

[jira] [Assigned] (HIVE-24061) Improve llap task scheduling for better cache hit rate

2020-08-25 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-24061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan reassigned HIVE-24061: --- Assignee: Rajesh Balamohan > Improve llap task scheduling for better cache hit rate

[jira] [Updated] (HIVE-23917) Reset key access count during eviction in VectorGroupByOperator

2020-07-28 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23917: Fix Version/s: 4.0.0 Resolution: Fixed Status: Resolved (was: Patch Avail

[jira] [Commented] (HIVE-23917) Reset key access count during eviction in VectorGroupByOperator

2020-07-28 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17166844#comment-17166844 ] Rajesh Balamohan commented on HIVE-23917: - Thanks [~hashutosh] for the review. Co

[jira] [Updated] (HIVE-23917) Reset key access count during eviction in VectorGroupByOperator

2020-07-28 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23917: Status: Patch Available (was: Open) > Reset key access count during eviction in VectorGrou

[jira] [Assigned] (HIVE-23917) Reset key access count during eviction in VectorGroupByOperator

2020-07-28 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan reassigned HIVE-23917: --- Assignee: Rajesh Balamohan > Reset key access count during eviction in VectorGroupBy

[jira] [Commented] (HIVE-23936) Provide approximate number of input records to be processed in broadcast reader

2020-07-27 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165491#comment-17165491 ] Rajesh Balamohan commented on HIVE-23936: - E.g. in Hive, where approximate input r

[jira] [Updated] (HIVE-23870) Optimise multiple text conversions in WritableHiveCharObjectInspector.getPrimitiveJavaObject / HiveCharWritable

2020-07-20 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23870: Summary: Optimise multiple text conversions in WritableHiveCharObjectInspector.getPrimitive

[jira] [Updated] (HIVE-23878) Aggregate after join throws off MV rewrite

2020-07-19 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23878: Attachment: q81_eg.txt > Aggregate after join throws off MV rewrite >

[jira] [Assigned] (HIVE-23843) Improve key evictions in VectorGroupByOperator

2020-07-13 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan reassigned HIVE-23843: --- Assignee: Rajesh Balamohan > Improve key evictions in VectorGroupByOperator > --

[jira] [Commented] (HIVE-23764) Remove unnecessary getLastFlushLength when checking delete delta files

2020-07-02 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17150614#comment-17150614 ] Rajesh Balamohan commented on HIVE-23764: - [~pvary] : We can get this fix committ

[jira] [Commented] (HIVE-23764) Remove unnecessary getLastFlushLength when checking delete delta files

2020-06-28 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17147553#comment-17147553 ] Rajesh Balamohan commented on HIVE-23764: - Related ticket : https://issues.apache

[jira] [Commented] (HIVE-23738) DBLockManager::lock() : Move lock request to debug level

2020-06-23 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17143547#comment-17143547 ] Rajesh Balamohan commented on HIVE-23738: - Attaching a large lock request [^q78_3

[jira] [Updated] (HIVE-23738) DBLockManager::lock() : Move lock request to debug level

2020-06-23 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23738: Attachment: q78_30tb_lock_request.log > DBLockManager::lock() : Move lock request to debug

[jira] [Updated] (HIVE-23754) LLAP: Add LoggingHandler in ShuffleHandler pipeline for better debuggability

2020-06-23 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23754: Description: [https://github.com/apache/hive/blob/master/llap-server/src/java/org/apache/ha

[jira] [Updated] (HIVE-23754) LLAP: Add LoggingHandler in ShuffleHandler pipeline for better debuggability

2020-06-23 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23754: Environment: was: [https://github.com/apache/hive/blob/master/llap-server/src/java/

[jira] [Updated] (HIVE-23735) Reducer misestimate for export command

2020-06-22 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23735: Status: Open (was: Patch Available) > Reducer misestimate for export command > ---

[jira] [Updated] (HIVE-23735) Reducer misestimate for export command

2020-06-22 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23735: Status: Patch Available (was: Open) > Reducer misestimate for export command > ---

[jira] [Updated] (HIVE-23735) Reducer misestimate for export command

2020-06-21 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23735: Attachment: HIVE-23735.1.wip.patch > Reducer misestimate for export command > -

[jira] [Updated] (HIVE-23499) REPL: Immutable repl dumps should be reusable across multiple repl loads

2020-06-15 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23499: Fix Version/s: 4.0.0 Resolution: Fixed Status: Resolved (was: Patch Avail

[jira] [Updated] (HIVE-23499) REPL: Immutable repl dumps should be reusable across multiple repl loads

2020-06-10 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23499: Description: "{{hive.repl.dump.metadata.only=true}}" is not currently honored during "{{rep

[jira] [Updated] (HIVE-23499) REPL: Immutable repl dumps should be reusable across multiple repl loads

2020-06-10 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23499: Summary: REPL: Immutable repl dumps should be reusable across multiple repl loads (was: RE

[jira] [Updated] (HIVE-23499) REPL: Immutable repl dumps should be reusable across multiple repl loads

2020-06-10 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23499: Description: "{{hive.repl.dump.metadata.only=true}}" is not currently honored during "{{rep

[jira] [Updated] (HIVE-23499) REPL: repl load should honor metadata only loads

2020-06-10 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23499: Summary: REPL: repl load should honor metadata only loads (was: REPL: repl load should hon

[jira] [Updated] (HIVE-23499) REPL: repl load should honor hive.repl.dump.skip.immutable.data.copy

2020-06-10 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23499: Summary: REPL: repl load should honor hive.repl.dump.skip.immutable.data.copy (was: REPL:

[jira] [Updated] (HIVE-23499) REPL: repl load should honor "hive.repl.dump.metadata.only=true"

2020-06-10 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23499: Attachment: HIVE-23499.2.patch > REPL: repl load should honor "hive.repl.dump.metadata.only

[jira] [Commented] (HIVE-23499) REPL: repl load should honor "hive.repl.dump.metadata.only=true"

2020-06-10 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132917#comment-17132917 ] Rajesh Balamohan commented on HIVE-23499: - Given that we have {{hive.repl.dump.sk

[jira] [Updated] (HIVE-23521) REPL: Optimise partition loading during bootstrap

2020-06-10 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23521: Fix Version/s: 4.0.0 Resolution: Fixed Status: Resolved (was: Patch Avail

[jira] [Updated] (HIVE-23520) REPL: repl dump could add support for immutable dataset

2020-06-10 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23520: Fix Version/s: 4.0.0 Resolution: Fixed Status: Resolved (was: Patch Avail

[jira] [Updated] (HIVE-23551) Acid: Update queries should treat dirCache as read-only in AcidUtils

2020-06-09 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23551: Fix Version/s: 4.0.0 Assignee: Rajesh Balamohan Resolution: Fixed

[jira] [Commented] (HIVE-23597) VectorizedOrcAcidRowBatchReader::ColumnizedDeleteEventRegistry reads delete delta directories multiple times

2020-06-08 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17128877#comment-17128877 ] Rajesh Balamohan commented on HIVE-23597: - Initial version of PR: https://github.

[jira] [Updated] (HIVE-23551) Acid: Update queries should treat dirCache as read-only in AcidUtils

2020-06-08 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23551: Attachment: HIVE-23551.6.patch > Acid: Update queries should treat dirCache as read-only in

[jira] [Commented] (HIVE-23551) Acid: Update queries should treat dirCache as read-only in AcidUtils

2020-06-08 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17128716#comment-17128716 ] Rajesh Balamohan commented on HIVE-23551: - PR: [https://github.com/apache/hive/pu

[jira] [Updated] (HIVE-23520) REPL: repl dump could add support for immutable dataset

2020-06-07 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23520: Attachment: HIVE-23520.2.patch > REPL: repl dump could add support for immutable dataset >

[jira] [Commented] (HIVE-23520) REPL: repl dump could add support for immutable dataset

2020-06-05 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126558#comment-17126558 ] Rajesh Balamohan commented on HIVE-23520: - Haven't added that, since we would be

[jira] [Commented] (HIVE-23521) REPL: Optimise partition loading during bootstrap

2020-06-05 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126515#comment-17126515 ] Rajesh Balamohan commented on HIVE-23521: - [~aasha]: Actually it would be good to

[jira] [Commented] (HIVE-23277) HiveProtoLogger should carry out JSON conversion in its own thread

2020-06-03 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125528#comment-17125528 ] Rajesh Balamohan commented on HIVE-23277: - This is to avoid JSON serialization be

[jira] [Updated] (HIVE-23551) Acid: Update queries should treat dirCache as read-only in AcidUtils

2020-05-31 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23551: Attachment: HIVE-23551.5.patch > Acid: Update queries should treat dirCache as read-only in

[jira] [Updated] (HIVE-23551) Acid: Update queries should treat dirCache as read-only in AcidUtils

2020-05-28 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23551: Attachment: HIVE-23551.4.patch > Acid: Update queries should treat dirCache as read-only in

[jira] [Updated] (HIVE-23551) Acid: Update queries should treat dirCache as read-only in AcidUtils

2020-05-28 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23551: Attachment: HIVE-23551.3.patch > Acid: Update queries should treat dirCache as read-only in

[jira] [Updated] (HIVE-23559) Optimise Hive::moveAcidFiles for cloud storage

2020-05-27 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23559: Issue Type: Improvement (was: Bug) > Optimise Hive::moveAcidFiles for cloud storage >

[jira] [Updated] (HIVE-23468) LLAP: Optimise OrcEncodedDataReader to avoid FS init to NN

2020-05-27 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23468: Attachment: HIVE-23468.6.patch > LLAP: Optimise OrcEncodedDataReader to avoid FS init to NN

[jira] [Updated] (HIVE-23488) Optimise PartitionManagementTask::Msck::repair

2020-05-27 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23488: Attachment: HIVE-23488.3.patch > Optimise PartitionManagementTask::Msck::repair > -

[jira] [Updated] (HIVE-23551) Acid: Update queries should treat dirCache as read-only in AcidUtils

2020-05-26 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23551: Attachment: HIVE-23551.2.patch > Acid: Update queries should treat dirCache as read-only in

[jira] [Updated] (HIVE-23551) Acid: Update queries should treat dirCache as read-only in AcidUtils

2020-05-26 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23551: Summary: Acid: Update queries should treat dirCache as read-only in AcidUtils (was: Acid:

[jira] [Updated] (HIVE-23551) Acid: Update queries should purge dir cache entry in AcidUtils

2020-05-26 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23551: Status: Patch Available (was: Open) > Acid: Update queries should purge dir cache entry in

[jira] [Updated] (HIVE-23551) Acid: Update queries should purge dir cache entry in AcidUtils

2020-05-26 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23551: Attachment: HIVE-23551.1.patch > Acid: Update queries should purge dir cache entry in AcidU

[jira] [Commented] (HIVE-23551) Acid: Update queries should purge dir cache entry in AcidUtils

2020-05-26 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17117192#comment-17117192 ] Rajesh Balamohan commented on HIVE-23551: - \cc [~gopalv] > Acid: Update queries

[jira] [Updated] (HIVE-23468) LLAP: Optimise OrcEncodedDataReader to avoid FS init to NN

2020-05-25 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23468: Attachment: HIVE-23468.5.patch > LLAP: Optimise OrcEncodedDataReader to avoid FS init to NN

[jira] [Updated] (HIVE-23487) Optimise PartitionManagementTask

2020-05-25 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23487: Attachment: HIVE-23487.2.patch > Optimise PartitionManagementTask > ---

[jira] [Updated] (HIVE-23521) REPL: Optimise partition loading during bootstrap

2020-05-25 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23521: Attachment: HIVE-23521.2.patch > REPL: Optimise partition loading during bootstrap > --

[jira] [Updated] (HIVE-23488) Optimise PartitionManagementTask::Msck::repair

2020-05-25 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23488: Attachment: HIVE-23488.2.patch > Optimise PartitionManagementTask::Msck::repair > -

[jira] [Updated] (HIVE-21971) HS2 leaks classloader due to `ReflectionUtils::CONSTRUCTOR_CACHE` with temporary functions + GenericUDF

2020-05-25 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-21971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-21971: Attachment: HIVE-21971.5.patch > HS2 leaks classloader due to `ReflectionUtils::CONSTRUCTOR

[jira] [Updated] (HIVE-23468) LLAP: Optimise OrcEncodedDataReader to avoid FS init to NN

2020-05-25 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23468: Attachment: HIVE-23468.4.patch > LLAP: Optimise OrcEncodedDataReader to avoid FS init to NN

[jira] [Commented] (HIVE-23521) REPL: Optimise partition loading during bootstrap

2020-05-21 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17113746#comment-17113746 ] Rajesh Balamohan commented on HIVE-23521: - Batching is one option, but need to st

[jira] [Updated] (HIVE-23521) REPL: Optimise partition loading during bootstrap

2020-05-21 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23521: Assignee: Rajesh Balamohan Status: Patch Available (was: Open) > REPL: Optimise part

[jira] [Updated] (HIVE-23521) REPL: Optimise partition loading during bootstrap

2020-05-21 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23521: Attachment: HIVE-23521.1.patch > REPL: Optimise partition loading during bootstrap > --

[jira] [Updated] (HIVE-23520) REPL: repl dump could add support for immutable dataset

2020-05-20 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23520: Attachment: HIVE-23520.1.patch > REPL: repl dump could add support for immutable dataset >

[jira] [Updated] (HIVE-23520) REPL: repl dump could add support for immutable dataset

2020-05-20 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23520: Assignee: Rajesh Balamohan Status: Patch Available (was: Open) > REPL: repl dump cou

[jira] [Updated] (HIVE-23520) REPL: repl dump could add support for immutable dataset

2020-05-20 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23520: Description: Currently, "REPL DUMP" ends up copying entire dataset along with partition inf

[jira] [Updated] (HIVE-23499) REPL: repl load should honor "hive.repl.dump.metadata.only=true"

2020-05-19 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23499: Attachment: HIVE-23499.1.patch > REPL: repl load should honor "hive.repl.dump.metadata.only

[jira] [Updated] (HIVE-23499) REPL: repl load should honor "hive.repl.dump.metadata.only=true"

2020-05-19 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23499: Assignee: Rajesh Balamohan Status: Patch Available (was: Open) > REPL: repl load sho

[jira] [Updated] (HIVE-23488) Optimise PartitionManagementTask::Msck::repair

2020-05-17 Thread Rajesh Balamohan (Jira)
[ https://issues.apache.org/jira/browse/HIVE-23488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-23488: Attachment: HIVE-23488.1.patch > Optimise PartitionManagementTask::Msck::repair > -
