wgtmac opened a new pull request, #194:
URL: https://github.com/apache/parquet-format/pull/194

   The spec currently provides only `bloom_filter_offset` to locate the bloom 
filter. The reader cannot load the bloom filter in a single read, because it 
must first parse the bloom filter header to learn the total size.
   ```thrift
   struct ColumnMetaData {
     /** Byte offset from beginning of file to Bloom filter data. **/
     14: optional i64 bloom_filter_offset;
   }
   ```
   This patch proposes adding an optional `bloom_filter_length` field that 
records the size of the bloom filter to facilitate I/O scheduling. The spec 
already does the same for the column index and offset index.
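
   To illustrate the I/O difference, here is a minimal sketch in Python. It is 
not taken from the patch or from any Parquet reader. The function name 
`read_bloom_filter`, the `parse_total_size` callback, and the `header_guess` 
default are hypothetical, and the usage example uses a toy 4-byte size prefix 
in place of the real Thrift-encoded bloom filter header.

```python
import io
from typing import BinaryIO, Callable, Optional


def read_bloom_filter(
    f: BinaryIO,
    offset: int,
    length: Optional[int],
    parse_total_size: Callable[[bytes], int],
    header_guess: int = 64,
) -> bytes:
    """Return the raw bloom filter bytes (header plus bitset).

    `length` stands in for the proposed bloom_filter_length field.
    `parse_total_size` is a hypothetical helper that inspects the leading
    bytes and returns the total size of header plus bitset.
    """
    f.seek(offset)
    if length is not None:
        # Size known from the column metadata: a single read suffices.
        return f.read(length)

    # Size unknown: read a guessed prefix, parse the header to learn the
    # total size, then issue a second read for whatever is still missing.
    head = f.read(header_guess)
    total = parse_total_size(head)
    if total <= len(head):
        return head[:total]
    return head + f.read(total - len(head))


# Toy layout (an assumption for illustration only): a 4-byte little-endian
# total size followed by the payload.
def toy_parse(prefix: bytes) -> int:
    return int.from_bytes(prefix[:4], "little")


blob = b"x" * 10 + (104).to_bytes(4, "little") + bytes(100) + b"tail"
one_shot = read_bloom_filter(io.BytesIO(blob), 10, 104, toy_parse)
two_step = read_bloom_filter(io.BytesIO(blob), 10, None, toy_parse)
```

   Both calls return the same bytes. With the length available the reader can 
schedule one contiguous read up front, while without it the second read depends 
on the result of the first.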

