[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-09-04 Thread Damien Carol (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Damien Carol updated HIVE-7876:
---
Description: 
The previous solution HIVE-7736 is not enough for the case when there are too 
many columns/partitions.

The user will encounter 
{noformat}
org.apache.thrift.transport.TTransportException: 
java.net.SocketTimeoutException: Read timed out
{noformat}
We try to remove more of transaction overhead

  was:
The previous solution https://issues.apache.org/jira/browse/HIVE-7736
is not enough for the case when there are too many columns/partitions.

The user will encounter 

org.apache.thrift.transport.TTransportException: 
java.net.SocketTimeoutException: Read timed out

We try to remove more of transaction overhead


 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Fix For: 0.14.0

 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, 
 HIVE-7876.4.patch, HIVE-7876.5.patch


 The previous solution HIVE-7736 is not enough for the case when there are too 
 many columns/partitions.
 The user will encounter 
 {noformat}
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 {noformat}
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-09-04 Thread Damien Carol (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Damien Carol updated HIVE-7876:
---
Component/s: Statistics

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
  Components: Statistics
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Fix For: 0.14.0

 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, 
 HIVE-7876.4.patch, HIVE-7876.5.patch


 The previous solution HIVE-7736 is not enough for the case when there are too 
 many columns/partitions.
 The user will encounter 
 {noformat}
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 {noformat}
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-30 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-7876:
---

   Resolution: Fixed
Fix Version/s: 0.14.0
   Status: Resolved  (was: Patch Available)

Committed to trunk. Thanks, Pengcheng!

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Fix For: 0.14.0

 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, 
 HIVE-7876.4.patch, HIVE-7876.5.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-28 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-7876:
---

Status: Open  (was: Patch Available)

Left comments on RB. I don't get why you need to retrieve the object before 
persisting it. You want to persist regardless whether it exists or not in DB.

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, 
 HIVE-7876.4.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-28 Thread pengcheng xiong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengcheng xiong updated HIVE-7876:
--

Status: Patch Available  (was: Open)

address ashutosh's comments, remove unnecessary txns

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, 
 HIVE-7876.4.patch, HIVE-7876.5.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-28 Thread pengcheng xiong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengcheng xiong updated HIVE-7876:
--

Attachment: HIVE-7876.5.patch

address ashutosh's comments, remove unnecessary txns

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, 
 HIVE-7876.4.patch, HIVE-7876.5.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-27 Thread pengcheng xiong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengcheng xiong updated HIVE-7876:
--

Attachment: HIVE-7876.4.patch

address QA issues

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-27 Thread pengcheng xiong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengcheng xiong updated HIVE-7876:
--

Attachment: HIVE-7876.4.patch

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, 
 HIVE-7876.4.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-27 Thread pengcheng xiong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengcheng xiong updated HIVE-7876:
--

Status: Open  (was: Patch Available)

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, 
 HIVE-7876.4.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-27 Thread pengcheng xiong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengcheng xiong updated HIVE-7876:
--

Status: Patch Available  (was: Open)

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch, HIVE-7876.4.patch, 
 HIVE-7876.4.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-26 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-7876:
---

Status: Patch Available  (was: Open)

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-26 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-7876:
---

Status: Open  (was: Patch Available)

Left couple of comments on RB. Seems, like there is a further scope of 
improvement here.

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-26 Thread pengcheng xiong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengcheng xiong updated HIVE-7876:
--

Status: Patch Available  (was: Open)

further reduce writepath time, set partition=null, now it is just read stats, 
then insert stats(if null) or update stats (if not null)

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-26 Thread pengcheng xiong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengcheng xiong updated HIVE-7876:
--

Attachment: HIVE-7876.3.patch

further reduce writepath time, set partition=null, now it is just read stats, 
then insert stats(if null) or update stats (if not null)

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.2.patch, HIVE-7876.3.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7876) further improve the columns stats update speed for all the partitions of a table

2014-08-25 Thread pengcheng xiong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengcheng xiong updated HIVE-7876:
--

Attachment: HIVE-7876.1.patch

remove transaction overhead

 further improve the columns stats update speed for all the partitions of a 
 table
 

 Key: HIVE-7876
 URL: https://issues.apache.org/jira/browse/HIVE-7876
 Project: Hive
  Issue Type: Improvement
Reporter: pengcheng xiong
Assignee: pengcheng xiong
Priority: Minor
 Attachments: HIVE-7876.1.patch


 The previous solution https://issues.apache.org/jira/browse/HIVE-7736
 is not enough for the case when there are too many columns/partitions.
 The user will encounter 
 org.apache.thrift.transport.TTransportException: 
 java.net.SocketTimeoutException: Read timed out
 We try to remove more of transaction overhead



--
This message was sent by Atlassian JIRA
(v6.2#6252)