kepplertreet commented on issue #7628:
URL: https://github.com/apache/hudi/issues/7628#issuecomment-1376709063

   Hi @alexeykudinkin  
   
   We are using an integer id column as  the 
`hoodie.datasource.write.recordkey.field` 
   Listing a few sample values
   ```
   1263633528
   1263633530
   1263633531 
   ``` 
   
   As for the Hud configurations we are using : 
   ```
   0                                hoodie.table.version                        
                          5
   1                   hoodie.datasource.write.operation                        
                     upsert
   2     hoodie.datasource.write.hive_style_partitioning                        
                       true
   3            hoodie.datasource.write.precombine.field                        
            _commit_time_ms
   4       hoodie.datasource.write.commitmeta.key.prefix                        
                          _
   5      hoodie.datasource.write.insert.drop.duplicates                        
                       true
   6                  hoodie.datasource.hive_sync.enable                        
                       true
   7                hoodie.datasource.hive_sync.use_jdbc                        
                       true
   8    hoodie.datasource.hive_sync.auto_create_database                        
                       true
   9       hoodie.datasource.hive_sync.support_timestamp                        
                      false
   10         hoodie.datasource.hive_sync.skip_ro_suffix                        
                       true
   11                   hoodie.parquet.compression.codec                        
                     snappy
   12                                  hoodie.metrics.on                        
                      false
   13                       hoodie.metrics.reporter.type                        
     PROMETHEUS_PUSHGATEWAY
   14                    hoodie.metrics.pushgateway.host                        
               <ip_address>
   15                    hoodie.metrics.pushgateway.port                        
               <port_number>
   16  hoodie.metrics.pushgateway.random.job.name.suffix                        
                      false
   17   hoodie.metrics.pushgateway.report.period.seconds                        
                         30
   18                             hoodie.metadata.enable                        
                       true
   19                     hoodie.metadata.metrics.enable                        
                       true
   20                        hoodie.metadata.clean.async                        
                       true
   21          hoodie.metadata.index.column.stats.enable                        
                       true
   22          hoodie.metadata.index.bloom.filter.enable                        
                       true
   23                        hoodie.metadata.index.async                        
                       true
   24                      hoodie.write.concurrency.mode                     
OPTIMISTIC_CONCURRENCY_CONTROL
   25                         hoodie.write.lock.provider  
org.apache.hudi.client.transaction.lock.FileSy...
   26          hoodie.datasource.compaction.async.enable                        
                       true
   27                     hoodie.compact.schedule.inline                        
                      false
   28             hoodie.compact.inline.trigger.strategy                        
                NUM_COMMITS
   29            hoodie.compact.inline.max.delta.commits                        
                          2
   30                                  hoodie.index.type                        
                      BLOOM
   31                hoodie.cleaner.policy.failed.writes                        
                       LAZY
   32                             hoodie.clean.automatic                        
                       true
   33                                 hoodie.clean.async                        
                       true
   34                    hoodie.cleaner.commits.retained                        
                          4
   35               hoodie.write.lock.client.num_retries                        
                         10
   36       hoodie.write.lock.wait_time_ms_between_retry                        
                       1000
   37                      hoodie.write.lock.num_retries                        
                         15
   38                     hoodie.write.lock.wait_time_ms                        
                      60000
   39  hoodie.write.lock.zookeeper.connection_timeout_ms                        
                      15000
   40                    hoodie.bloom.index.use.metadata                        
                       true
   41                               hoodie.archive.async                        
                       true
   42                       hoodie.parquet.max.file.size                        
                 1073741824
   43                    hoodie.parquet.small.file.limit                        
                 1610612736
   44                                  hoodie.table.name                        
                 <table_name>
   45                 hoodie.datasource.write.table.name                        
                 <table_name>
   46                 hoodie.datasource.write.table.type                        
              MERGE_ON_READ
   47            hoodie.datasource.write.recordkey.field                        
                         id
   48        hoodie.datasource.write.partitionpath.field                        
                _year_month
   49         hoodie.datasource.write.keygenerator.class          
org.apache.hudi.keygen.SimpleKeyGenerator
   50                  hoodie.datasource.hive_sync.table                        
               <table_name>
   51               hoodie.datasource.hive_sync.database                        
             <database_name>
   52                hoodie.metrics.pushgateway.job.name                        
  <database_name>.<table_name>
   53                  hoodie.write.lock.filesystem.path                        
                  <table_name>
   54                  hoodie.insert.shuffle.parallelism                        
                          4
   55                  hoodie.upsert.shuffle.parallelism                        
                          4
   56                  hoodie.delete.shuffle.parallelism                        
                          4
   ```
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to