[
https://issues.apache.org/jira/browse/HIVE-12411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15025967#comment-15025967
]
Lefty Leverenz commented on HIVE-12411:
---------------------------------------
Doc note: This changes *hive.stats.dbclass* (removing counter as a value) and
removes *hive.stats.key.prefix.reserve.length* so the wiki needs to be updated
for release 2.0.0.
* [Configuration Properties -- hive.stats.dbclass |
https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-hive.stats.dbclass]
* [Configuration Properties -- hive.stats.key.prefix.reserve.length |
https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-hive.stats.key.prefix.reserve.length]
The Statistics doc does not mention counter-based stats so no update is
required, although an explanation of collection mechanisms would be a helpful
addition. *hive.stats.dbclass* is discussed in the Usage section.
* [Statistics in Hive |
https://cwiki.apache.org/confluence/display/Hive/StatsDev]
** [Implementation |
https://cwiki.apache.org/confluence/display/Hive/StatsDev#StatsDev-Implementation]
** [Usage |
https://cwiki.apache.org/confluence/display/Hive/StatsDev#StatsDev-Usage]
> Remove counter based stats collection mechanism
> -----------------------------------------------
>
> Key: HIVE-12411
> URL: https://issues.apache.org/jira/browse/HIVE-12411
> Project: Hive
> Issue Type: Task
> Components: Statistics
> Affects Versions: 1.2.0, 1.2.1
> Reporter: Pengcheng Xiong
> Assignee: Pengcheng Xiong
> Labels: TODOC2.0
> Fix For: 2.0.0
>
> Attachments: HIVE-12411.01.patch, HIVE-12411.02.patch
>
>
> Following HIVE-12005, HIVE-12164, we have removed jdbc and hbase stats
> collection mechanism. Now we are targeting counter based stats collection
> mechanism. The main advantages are as follows (1) counter based stats has
> limitation on the length of the counter itself, if it is too long, MD5 will
> be applied. (2) when there are a large number of partitions and columns, we
> need to create a large number of counters in memory. This will put a heavy
> load on the M/R AM or Tez AM etc. FS based stats will do a better job.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)