One option is to standardize on the schema of Hive's ColStatistics object and
use it in Iceberg:

class ColStatistics {

  static class Range {
    Number minValue;
    Number maxValue;
  }
  
  String colName;
  String colType;
  long countDistinct;
  long numNulls;
  double avgColLen;
  long numTrues;
  long numFalses;
  Range range;
  boolean isPrimaryKey;
  boolean isEstimated;
  boolean isFilteredColumn;
  byte[] bitVectors;
  byte[] histogram;
}
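
To make the intended use of these fields a bit more concrete, here is a rough sketch of how a planner might turn them into selectivity estimates. It is only an illustration under stated assumptions: the class ColStatsSketch, the helper methods nullFraction, equalsSelectivity and lessThanSelectivity, and the rowCount parameter are all hypothetical names and are not part of Hive or Iceberg. It also assumes values are uniformly distributed, which real data often is not.

```java
// Hypothetical sketch: none of these helper names exist in Hive or Iceberg.
class ColStatsSketch {

  static class Range {
    final Number minValue;
    final Number maxValue;

    Range(Number minValue, Number maxValue) {
      this.minValue = minValue;
      this.maxValue = maxValue;
    }
  }

  // Subset of the fields from the schema above that the estimates need.
  static class ColStatistics {
    String colName;
    String colType;
    long countDistinct;
    long numNulls;
    Range range;
  }

  /** Fraction of rows whose value is null, clamped to [0, 1]. */
  static double nullFraction(ColStatistics s, long rowCount) {
    if (rowCount <= 0) {
      return 0.0;
    }
    return Math.min(1.0, (double) s.numNulls / rowCount);
  }

  /** Estimate for "col = literal", assuming rows are spread evenly over distinct values. */
  static double equalsSelectivity(ColStatistics s, long rowCount) {
    double nonNull = 1.0 - nullFraction(s, rowCount);
    if (s.countDistinct <= 0) {
      return nonNull; // no distinct-count information: assume nothing is filtered out
    }
    return nonNull / s.countDistinct;
  }

  /** Estimate for "col < literal", by linear interpolation over [min, max]. */
  static double lessThanSelectivity(ColStatistics s, long rowCount, double literal) {
    double nonNull = 1.0 - nullFraction(s, rowCount);
    if (s.range == null || s.range.minValue == null || s.range.maxValue == null) {
      return nonNull; // no range information: assume nothing is filtered out
    }
    double min = s.range.minValue.doubleValue();
    double max = s.range.maxValue.doubleValue();
    if (literal <= min) {
      return 0.0;
    }
    if (literal > max) {
      return nonNull;
    }
    // Here min < literal <= max, so max - min is strictly positive.
    return nonNull * (literal - min) / (max - min);
  }
}
```

For example, with 1,000 rows, 200 nulls, 100 distinct values and a range of [0, 1000], equalsSelectivity gives roughly 0.008, and lessThanSelectivity with a literal of 250 gives roughly 0.2.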
