wgtmac opened a new pull request, #194:
URL: https://github.com/apache/parquet-format/pull/194
The specs has only added `bloom_filter_offset` to locate the bloom filter.
The reader cannot load the bloom filter in a single shot until it parses the
bloom filter header to get the total size.
```thrift
struct ColumnMetaData {
/** Byte offset from beginning of file to Bloom filter data. **/
14: optional i64 bloom_filter_offset;
}
```
This patch proposes to add an optional bloom_filter_length field to track
the size of bloom filter to facilitate I/O scheduling. The specs already do the
similar things for column index and offset index.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]