[
https://issues.apache.org/jira/browse/HIVE-10503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Stamatis Zampetakis updated HIVE-10503:
---------------------------------------
I cleared the fixVersion field since this ticket is not resolved. Please review
this ticket and if the fix is already committed to a specific version please
set the version accordingly and mark the ticket as RESOLVED.
According to the JIRA guidelines
(https://cwiki.apache.org/confluence/display/Hive/HowToContribute) the
fixVersion should be set only when the issue is resolved/closed.
> Aggregate stats cache: follow up optimizations
> ----------------------------------------------
>
> Key: HIVE-10503
> URL: https://issues.apache.org/jira/browse/HIVE-10503
> Project: Hive
> Issue Type: Improvement
> Components: Metastore
> Affects Versions: 1.2.0
> Reporter: Vaibhav Gumashta
> Priority: Major
> Fix For: 1.3.0
>
>
> Some follow up work items:
> 1. Estimate cache nodes from memory size - currently the user needs to
> specify size based on #nodes.
> 2. Make the AggregateStatsCache#add method asynchronous - adding to cache can
> happen in a new thread.
> 3. Based on perf testing, explore an alternate data structure for the node
> list per cache key.
> 4. Explore ideas to reduce locking granularity of the value list per cache
> key.
> 5. There is an O(n*n) loop while finding the match - that should go away.
> 6. Single call to DB to get aggregate for columns not in cache.
> 7. Organize metrics capturing in a better way.
> 8. Address concerns on TTL causing stale data in cache.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)