Hi Manhua,

Even at page level, the row count will probably not be available from the next version onward. Page boundaries will be decided by size, not by row count. The code for this has already been merged, and we are keeping the count-based page configuration only temporarily, for backward compatibility.
So at no point will we know the cardinality of the column beforehand. We either need to estimate the count from history or take an approximate value from the user. One more thing: bloom generation should follow the datamap interfaces rather than being part of the min/max generation flow. We can, however, change the datamap interfaces so that the generated datamap index is written into the carbondata file and index file instead of into separate files. Otherwise, we will lose the capabilities of the index interface and make our code more complex. This was already discussed earlier with Jacky. @Jacky, please comment on it.

Regards,
Ravindra

--
Sent from: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/
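
To illustrate why the cardinality estimate matters: a bloom filter is normally sized from an expected number of distinct values and a target false-positive rate, so one of the two estimates mentioned above (from history or from the user) has to be supplied before generation starts. The sketch below uses the standard bloom filter sizing formulas and is not CarbonData code; the class name BloomSizing, its method names, and the sample numbers are hypothetical.

```java
// Hypothetical helper, not part of CarbonData: sizes a bloom filter from an
// estimated cardinality using the standard formulas
//   bits   m = -n * ln(p) / (ln 2)^2
//   hashes k = (m / n) * ln 2
final class BloomSizing {

    // Number of bits needed for n expected distinct values at false-positive rate p.
    public static long optimalNumOfBits(long expectedInsertions, double fpp) {
        if (expectedInsertions <= 0) {
            throw new IllegalArgumentException("expectedInsertions must be positive");
        }
        if (fpp <= 0.0 || fpp >= 1.0) {
            throw new IllegalArgumentException("fpp must be in (0, 1)");
        }
        double ln2 = Math.log(2.0);
        return (long) Math.ceil(-expectedInsertions * Math.log(fpp) / (ln2 * ln2));
    }

    // Number of hash functions that minimizes the false-positive rate for m bits.
    public static int optimalNumOfHashFunctions(long expectedInsertions, long numBits) {
        double k = ((double) numBits / expectedInsertions) * Math.log(2.0);
        return Math.max(1, (int) Math.round(k));
    }

    public static void main(String[] args) {
        // Assumed inputs: cardinality estimated from history or given by the user.
        long estimatedCardinality = 1_000_000L;
        double targetFpp = 0.01;

        long bits = optimalNumOfBits(estimatedCardinality, targetFpp);
        int hashes = optimalNumOfHashFunctions(estimatedCardinality, bits);
        System.out.println("bits=" + bits + ", hashFunctions=" + hashes);
    }
}
```

If the estimate is too low, the filter fills up and the false-positive rate rises above the target; if it is too high, the index takes more space than needed. That is the trade-off behind choosing between a history-based estimate and a user-provided value.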
