loveShyBoy commented on issue #5911:
URL: https://github.com/apache/hudi/issues/5911#issuecomment-1161130862

   > @loveShyBoy Are you using any particular [index type](https://hudi.apache.org/docs/configurations/#indextype)? Can you share all your write configs? cc @danny0405 @yuzhaojing
   
   The Delta Streamer job used the default index type, and the Flink job also used hoodie.index.type=BLOOM.
   0.10.1 does not have the index.type config; it has existed only since 0.11.0.
   
   `--hoodie.table.name=tablename_hudi
   --table.type=COPY_ON_WRITE
   --hoodie.datasource.write.recordkey.field=primary_id
   --write.precombine.field=update_time
   --hoodie.datasource.write.partitionpath.field=dt
   --hive_sync.db=ods
   --write.tasks=4
   --hoodie.datasource.write.keygenerator.type=SIMPLE
   --write.bucket_assign.tasks=6
   --max.poll.records=100
   --max.partition.fetch.bytes=524288
   --hoodie.datasource.write.hive_style_partitioning=true
   --index.bootstrap.enabled=false
   --write.index_bootstrap.tasks=6
   --index.state.ttl=0.2
   --prop:hoodie.index.type=BLOOM`
   
   
   In 0.10.1, the default index type for Flink is HoodieIndex.IndexType.INMEMORY, so I changed it to BLOOM. But the docs say BLOOM is the default, so I'm a little confused. Is that right?
   
   `
   public static final ConfigProperty<String> INDEX_TYPE = ConfigProperty
       .key("hoodie.index.type")
       .noDefaultValue()
       .withDocumentation("Type of index to use. Default is Bloom filter. "
           + "Possible options are [BLOOM | GLOBAL_BLOOM |SIMPLE | GLOBAL_SIMPLE | INMEMORY | HBASE | BUCKET]. "
           + "Bloom filters removes the dependency on a external system "
           + "and is stored in the footer of the Parquet Data Files");

   private String getDefaultIndexType(EngineType engineType) {
     switch (engineType) {
       case SPARK:
         return HoodieIndex.IndexType.SIMPLE.name();
       case FLINK:
       case JAVA:
         return HoodieIndex.IndexType.INMEMORY.name();
       default:
         throw new HoodieNotSupportedException("Unsupported engine " + engineType);
     }
   }`
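
To make the mismatch concrete, here is a minimal sketch of how I understand the resolution to work, assuming the writer falls back to the engine default only when hoodie.index.type is not set. The class, enum, and method names below are hypothetical stand-ins written for this example, not Hudi classes; the per-engine defaults simply mirror the snippet above.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;

public class IndexTypeResolution {

    // Hypothetical stand-in for Hudi's engine type enum.
    public enum EngineType { SPARK, FLINK, JAVA }

    // Mirrors the per-engine defaults from the quoted getDefaultIndexType.
    public static String defaultIndexType(EngineType engine) {
        switch (engine) {
            case SPARK:
                return "SIMPLE";
            case FLINK:
            case JAVA:
                return "INMEMORY";
            default:
                throw new UnsupportedOperationException("Unsupported engine " + engine);
        }
    }

    // Assumption: an explicit hoodie.index.type wins; otherwise use the engine default.
    public static String resolveIndexType(Map<String, String> props, EngineType engine) {
        String explicit = props.get("hoodie.index.type");
        return explicit != null ? explicit : defaultIndexType(engine);
    }

    public static void main(String[] args) {
        Map<String, String> noIndexConfig = new HashMap<>();
        Map<String, String> bloomConfig =
            Collections.singletonMap("hoodie.index.type", "BLOOM");

        // No explicit index type on Flink: the engine default applies.
        System.out.println(resolveIndexType(noIndexConfig, EngineType.FLINK)); // INMEMORY
        // Explicit --prop:hoodie.index.type=BLOOM overrides the engine default.
        System.out.println(resolveIndexType(bloomConfig, EngineType.FLINK));   // BLOOM
    }
}
```

If this reading is right, the "Default is Bloom filter" text in the config documentation does not describe what a Flink job gets when the property is left unset.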
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
