[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Damien Carol updated HIVE-7876: --- Description: The previous solution HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter {noformat} org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out {noformat} We try to remove more of transaction overhead was: The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Fix For: 0.14.0 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, HIVE-7876.4.patch, HIVE-7876.5.patch The previous solution HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter {noformat} org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out {noformat} We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Damien Carol updated HIVE-7876: --- Component/s: Statistics further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Components: Statistics Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Fix For: 0.14.0 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, HIVE-7876.4.patch, HIVE-7876.5.patch The previous solution HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter {noformat} org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out {noformat} We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated HIVE-7876: --- Resolution: Fixed Fix Version/s: 0.14.0 Status: Resolved (was: Patch Available) Committed to trunk. Thanks, Pengcheng! further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Fix For: 0.14.0 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, HIVE-7876.4.patch, HIVE-7876.5.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated HIVE-7876: --- Status: Open (was: Patch Available) Left comments on RB. I don't get why you need to retrieve the object before persisting it. You want to persist regardless whether it exists or not in DB. further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, HIVE-7876.4.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengcheng xiong updated HIVE-7876: -- Status: Patch Available (was: Open) address ashutosh's comments, remove unnecessary txns further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, HIVE-7876.4.patch, HIVE-7876.5.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengcheng xiong updated HIVE-7876: -- Attachment: HIVE-7876.5.patch address ashutosh's comments, remove unnecessary txns further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, HIVE-7876.4.patch, HIVE-7876.5.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengcheng xiong updated HIVE-7876: -- Attachment: HIVE-7876.4.patch address QA issues further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengcheng xiong updated HIVE-7876: -- Attachment: HIVE-7876.4.patch further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, HIVE-7876.4.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengcheng xiong updated HIVE-7876: -- Status: Open (was: Patch Available) further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, HIVE-7876.4.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengcheng xiong updated HIVE-7876: -- Status: Patch Available (was: Open) further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, HIVE-7876.4.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated HIVE-7876: --- Status: Patch Available (was: Open) further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated HIVE-7876: --- Status: Open (was: Patch Available) Left couple of comments on RB. Seems, like there is a further scope of improvement here. further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengcheng xiong updated HIVE-7876: -- Status: Patch Available (was: Open) further reduce writepath time, set partition=null, now it is just read stats, then insert stats(if null) or update stats (if not null) further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengcheng xiong updated HIVE-7876: -- Attachment: HIVE-7876.3.patch further reduce writepath time, set partition=null, now it is just read stats, then insert stats(if null) or update stats (if not null) further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table
[ https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengcheng xiong updated HIVE-7876: -- Attachment: HIVE-7876.1.patch remove transaction overhead further improve the columns stats update speed for all the partitions of a table Key: HIVE-7876 URL: https://issues.apache.org/jira/browse/HIVE-7876 Project: Hive Issue Type: Improvement Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Attachments: HIVE-7876.1.patch The previous solution https://issues.apache.org/jira/browse/HIVE-7736 is not enough for the case when there are too many columns/partitions. The user will encounter org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out We try to remove more of transaction overhead -- This message was sent by Atlassian JIRA (v6.2#6252)