Yes. the log is as following.

2018-11-02 22:25:34,980 DEBUG [Query 
03ea4f21-29ed-4b74-8faa-c57ecd44f412-198914] 
gtrecord.StorageResponseGTScatter:88 : Using SortMergedPartitionResultIterator 
to merge 103 partition results
2018-11-02 22:25:34,982 INFO  [Query 
03ea4f21-29ed-4b74-8faa-c57ecd44f412-198914] 
gtrecord.SequentialCubeTupleIterator:73 : Using Iterators.concat to merge 
segment results
2018-11-02 22:25:34,982 DEBUG [Query 
03ea4f21-29ed-4b74-8faa-c57ecd44f412-198914] enumerator.OLAPEnumerator:122 : 
return TupleIterator...
2018-11-02 22:25:34,991 INFO  [Query 
03ea4f21-29ed-4b74-8faa-c57ecd44f412-198914] service.QueryService:897 : 
Processed rows for each storageContext: 366
2018-11-02 22:25:34,991 INFO  [Query 
03ea4f21-29ed-4b74-8faa-c57ecd44f412-198914] service.QueryService:422 : Stats 
of SQL response: isException: false, duration: 20, total scan count 1552



Acoording the log,  valueA = 366. valueB= (total scan count) 1552 - (total 
Agrrated/filterd in hbase)270 = 1282
 valueB is much larger than valueA .



             


------------------ ???????? ------------------
??????: "JiaTao Tao"<[email protected]>;
????????: 2018??11??5??(??????) ????2:41
??????: "user"<[email protected]>;

????: Re: doubt about measure of processedRowCount



Can you grep logs like "to merge segment results" in that scenario?


cheney <[email protected]> ??2018??11??3?????? ????4:15??????

Thank your repling, .but I  am sure there's only one OlapContext in the quey in 
my scenario.
 
---Original---
From: "JiaTao Tao"<[email protected]>
Date: Sat, Nov 3, 2018 10:42 AM
To: "user"<[email protected]>;
Subject: Re: doubt about measure of processedRowCount


Maybe count all the valueA would be more appropriate, cuz maybe there's more 
than one OlapContext in the query ( one OlapContext correspond one 
storageContext ).


There are two good blogs about Kylin's query engine, you may take a look :).


https://blog.csdn.net/yu616568/article/details/50838504



https://zhuanlan.zhihu.com/p/30613434







cheney <[email protected]> ??2018??11??2?????? ????11:10??????

Hi, guys   
        When I executed a sql in kylin, kylin server will log some log about 
query statics. for example, The log is as following:
       
       "Processed rows for each storageContext: valueA". valueA is 
processedRowCount.   
       What I understand is processedRowCount is the record rows numbers 
returned by hbase. 


       Hbase corprocessor will log region stats, including:  "Total scanned 
row","Total filtered/aggred row".
       
        For  one region,  final records returned by hbase = Total scanned row - 
Total filtered/aggred row;
       Suppose this query need to scan 10 region in hbase, we can get every 
region stats. we can get all records  valueB returned by hbase by
       suming every final records in 10 region. 
       
      In general, valueA is equal to  valueB, but valueB is much larger than 
valueA in sometimes. Why?
       
       




-- 




Regards!

Aron Tao





 




-- 




Regards!

Aron Tao

Reply via email to