1. can you tell me how you gonna read the in max? I mean to say, are you
going to store the segment level min max for all the column or since you
said blocklevel, it means for every carbondata file
you are going to store it? If it is block level,in case of more file,
segment file size might increase. Can you please explain more about this?

>>  yes. I am planning to store MinMax for all columns in segment file. I
>> agree that, 
segment file may increase, in case of more file. For solving this, i think
we could 
store minMax only for sort columns. We can add a table property to control
it.
what do you think?


2. How are you going to get the min max in driver? its obvious that you are
not planning to read the file.

>> During writing index file, will get minMax info for each block and store
>> it into an 
SegmentMinMax object. When InsertionTaskCompletion listener is called, will
add this 
segmentMinMax info into an accumulator. Later, while writing the segment
file, will get 
minMax Info from accmulator and serilaize and store into it. During query,
will read minmax 
from segment file and cache it and use for segment level pruning.



--
Sent from: 
http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/

Reply via email to