bithw1 opened a new issue, #17767:
URL: https://github.com/apache/hudi/issues/17767
### Describe the problem you faced
Hi,
I am using Hudi 0.15.0, I have following sql:
```
set hoodie.spark.sql.insert.into.operation=insert;
CREATE TABLE IF NOT EXISTS hudi_cow_20260102_05 (
a INT,
b INT,
c INT
)
USING hudi
tblproperties(
type='cow',
primaryKey='a',
hoodie.datasource.write.precombine.field='c',
hoodie.index.type='BLOOM',
hoodie.index.bloom.num_entries='20',
hoodie.bloom.index.filter.dynamic.max.entries='25'
);
insert into hudi_cow_20260102_05(a,b,c)
values(1,2,3),(4,5,6),(5,8,9),(10,11,12);
```
I am using `insert` operation, I have thought that no index will be created
after insertion, but when I look at the parquet footer, I see the bloom filter
in the parquet footer(this is `hoodie_bloom_filter_type_code: DYNAMIC_V0` in
the footer). I am using the following command
```
[hadoop@hadoop ~]$ hadoop jar software/parquet-cli-1.14.1-runtime.jar meta
hdfs:///user/hive/warehouse/hudi_cow_20260102_05/5abe00e7-a601-44ee-a535-900876beca31-0_0-587-588_20260102155319811.parquet
File path:
hdfs:///user/hive/warehouse/hudi_cow_20260102_05/5abe00e7-a601-44ee-a535-900876beca31-0_0-587-588_20260102155319811.parquet
Created by: parquet-mr version 1.12.2 (build
77e30c8093386ec52c3cfa6c34b7ef3321322c94)
Properties:
hoodie_bloom_filter_type_code: DYNAMIC_V0
org.apache.hudi.bloomfilter:
/////wAAAB4BAAADXwAAABQAAAAEAAAAAf////8AAAAeAQAAA18QAIABEAAJEMAAAQZCEMEABACAAwRCBAIAVAYAIBAAAAAIAAEYIASAACAAAABC2KMggAIIAAEAAAwAQIAAAJBABAACAdAYAQQAMgEQAwJAAABIARAAgACAAAACQaggAAIAAgAgCAFAAiwAUAA=
hoodie_min_record_key: 1
parquet.avro.schema:
{"type":"record","name":"hudi_cow_20260102_05_record","namespace":"hoodie.hudi_cow_20260102_05","fields":[{"name":"_hoodie_commit_time","type":["null","string"],"doc":"","default":null},{"name":"_hoodie_commit_seqno","type":["null","string"],"doc":"","default":null},{"name":"_hoodie_record_key","type":["null","string"],"doc":"","default":null},{"name":"_hoodie_partition_path","type":["null","string"],"doc":"","default":null},{"name":"_hoodie_file_name","type":["null","string"],"doc":"","default":null},{"name":"a","type":["null","int"],"default":null},{"name":"b","type":["null","int"],"default":null},{"name":"c","type":["null","int"],"default":null}]}
writer.model.name: avro
hoodie_max_record_key: 7
Schema:
message hoodie.hudi_cow_20260102_05.hudi_cow_20260102_05_record {
optional binary _hoodie_commit_time (STRING);
optional binary _hoodie_commit_seqno (STRING);
optional binary _hoodie_record_key (STRING);
optional binary _hoodie_partition_path (STRING);
optional binary _hoodie_file_name (STRING);
optional int32 a;
optional int32 b;
optional int32 c;
}
Row group 0: count: 4 171.25 B records start: 4 total(compressed): 685 B
total(uncompressed):570 B
--------------------------------------------------------------------------------
type encodings count avg size nulls min
/ max
_hoodie_commit_time BINARY G _ R 4 27.75 B 0
"20260102155319811" / "20260102155319811"
_hoodie_commit_seqno BINARY G _ 4 20.75 B 0
"20260102155319811_0_0" / "20260102155319811_0_3"
_hoodie_record_key BINARY G _ 4 15.25 B 0 "1"
/ "7"
_hoodie_partition_path BINARY G _ R 4 23.50 B 0 ""
/ ""
_hoodie_file_name BINARY G _ R 4 39.75 B 0
"5abe00e7-a601-44ee-a535-9..." / "5abe00e7-a601-44ee-a535-9..."
a INT32 G _ 4 14.75 B 0 "1"
/ "10"
b INT32 G _ 4 14.75 B 0 "2"
/ "11"
c INT32 G _ 4 14.75 B 0 "3"
/ "12"
```
### To Reproduce
1
### Expected behavior
1
### Environment Description
1
### Additional context
_No response_
### Stacktrace
```shell
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]