[GitHub] [carbondata] Zhangshunyu commented on issue #3701: [WIP] improve pure partition count star performance

GitBox Sun, 12 Apr 2020 19:04:26 -0700

Zhangshunyu commented on issue #3701: [WIP] improve pure partition count star
performance
URL: https://github.com/apache/carbondata/pull/3701#issuecomment-612715844
 
 
   @ajantha-bhat 
   We find that select count(*) on some partitions is slow, and slower than
Parquet. Currently, a count(*) with a filter whose columns are all partition
columns loads all datamaps of the matching partitions, including block info and
min/max info. There is no need to load them: the rowCount is stored inside the
index files, so we can use partition pruning to select the valid index files,
read the counts from them directly, and cache this information. For a no-sort
partition table, min/max info is of almost no use but still costs time to load.
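
   A minimal sketch of the idea, not the actual CarbonData code: it assumes
each valid index file can be summarized as a record holding its partition
values and its rowCount. The names IndexFileInfo, PartitionRowCountCache and
count_star are hypothetical and only illustrate the flow (prune by partition,
sum the stored row counts, cache the per-partition totals).

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Tuple

# Hypothetical stand-in for the metadata read from one valid index file.
PartitionKey = Tuple[Tuple[str, str], ...]  # e.g. (("dt", "2020-04-12"),)


@dataclass(frozen=True)
class IndexFileInfo:
    path: str
    partition: PartitionKey
    row_count: int  # the rowCount stored inside the index file


class PartitionRowCountCache:
    """Answers count(*) with partition-only filters from index file row counts."""

    def __init__(self, index_files: Iterable[IndexFileInfo]) -> None:
        self._index_files = list(index_files)
        self._cache: Dict[PartitionKey, int] = {}

    def _partition_count(self, partition: PartitionKey) -> int:
        # Sum the row counts of one partition once, then serve it from the cache.
        if partition not in self._cache:
            self._cache[partition] = sum(
                f.row_count for f in self._index_files if f.partition == partition
            )
        return self._cache[partition]

    def count_star(self, partition_filter: Dict[str, str]) -> int:
        # Partition pruning: keep only partitions matching every filter column.
        partitions = {f.partition for f in self._index_files}
        matched = [
            p for p in partitions
            if all(dict(p).get(col) == val for col, val in partition_filter.items())
        ]
        # No block info or min/max info is touched, only the stored counts.
        return sum(self._partition_count(p) for p in matched)


files = [
    IndexFileInfo("part0.carbonindex", (("dt", "2020-04-11"),), 100),
    IndexFileInfo("part1.carbonindex", (("dt", "2020-04-12"),), 30),
    IndexFileInfo("part2.carbonindex", (("dt", "2020-04-12"),), 20),
]
cache = PartitionRowCountCache(files)
total_for_day = cache.count_star({"dt": "2020-04-12"})
```

   With the three sample files above, the filter dt = 2020-04-12 matches two
index files, so the count is answered from their stored row counts alone.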


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services
