Ádám Szita created HIVE-22284:
---------------------------------

             Summary: Improve LLAP CacheContentsTracker to collect and display 
correct statistics
                 Key: HIVE-22284
                 URL: https://issues.apache.org/jira/browse/HIVE-22284
             Project: Hive
          Issue Type: Improvement
          Components: llap
            Reporter: Ádám Szita
            Assignee: Ádám Szita


When keeping track of which buffers correspond to what Hive objects, 
CacheContentsTracker relies on cache tags.

Currently a tag is a simple String that ideally holds the DB name, the table 
name and a partition spec, joined by '.' and '/'. This information is derived 
from the Path of the file that is being cached, so it sometimes produces a 
wrong tag, especially for external tables.
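
To illustrate why path-derived tags are fragile, here is a minimal sketch. It 
is not Hive's actual code: the class PathTagSketch, the method tagFromPath, the 
example paths and the assumed warehouse layout (db.db/table/key=value/file) are 
all hypothetical.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical sketch, not the real CacheContentsTracker logic.
class PathTagSketch {
    // Builds "db.table/part1/part2" by walking the path from the file upwards.
    static String tagFromPath(String filePath) {
        String[] parts = filePath.split("/");
        int i = parts.length - 2; // skip the file name itself

        // Trailing "key=value" directories are treated as the partition spec.
        Deque<String> partitionSpec = new ArrayDeque<>();
        while (i >= 0 && parts[i].contains("=")) {
            partitionSpec.addFirst(parts[i]);
            i--;
        }

        // Assumption: the next two directories are the table and the database.
        String table = i >= 0 ? parts[i] : "";
        String db = i >= 1 ? parts[i - 1] : "";
        if (db.endsWith(".db")) {
            db = db.substring(0, db.length() - 3);
        }

        StringBuilder tag = new StringBuilder(db).append('.').append(table);
        for (String p : partitionSpec) {
            tag.append('/').append(p);
        }
        return tag.toString();
    }
}
```

Under these assumptions, a managed-table path such as 
/warehouse/sales.db/orders/year=2019/month=10/000000_0 yields 
sales.orders/year=2019/month=10, while an external-table path such as 
/data/landing/events/2019/part-0000.orc yields events.2019, which names neither 
the real database nor the real table.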

There is also a bug in how aggregated stats are calculated for a 'parent' tag 
(the table that a partition belongs to): the overall maxCount and maxSize do 
not equal the sum of the corresponding values in the partitions. This happens 
when buffers are removed from the cache.
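
The sketch below shows one way such a mismatch can arise. It is an 
illustration under assumptions, not the actual CacheContentsTracker code: the 
class TagStatsSketch and its methods are hypothetical, and it assumes the 
parent tag tracks its own peak independently of the partitions.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of per-tag statistics with an independently tracked parent.
class TagStatsSketch {
    static final class Stats {
        long count;    // buffers currently cached under this tag
        long maxCount; // highest value 'count' has ever reached

        void add(long n) {
            count += n;
            maxCount = Math.max(maxCount, count);
        }

        void remove(long n) {
            count -= n; // the peak is intentionally left untouched
        }
    }

    final Map<String, Stats> partitions = new HashMap<>();
    final Stats parent = new Stats(); // the table-level ('parent') tag

    void cache(String partition, long buffers) {
        partitions.computeIfAbsent(partition, k -> new Stats()).add(buffers);
        parent.add(buffers);
    }

    void evict(String partition, long buffers) {
        partitions.get(partition).remove(buffers);
        parent.remove(buffers);
    }

    long sumOfPartitionMaxCounts() {
        return partitions.values().stream().mapToLong(s -> s.maxCount).sum();
    }
}
```

If 3 buffers are cached for one partition and then evicted, and 2 buffers are 
afterwards cached for a second partition, the partition peaks sum to 5 but the 
parent's own peak is only 3. The same reasoning applies to maxSize.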

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
