Kong Wei created HUDI-6259:
------------------------------

             Summary: deltastreamer support configuration hot update
                 Key: HUDI-6259
                 URL: https://issues.apache.org/jira/browse/HUDI-6259
             Project: Apache Hudi
          Issue Type: New Feature
          Components: deltastreamer
            Reporter: Kong Wei
            Assignee: Kong Wei
         Attachments: image-2023-05-24-14-26-45-826.png

In our company, hudi is used as a platform. We provide deltastreamer (run in 
continuous mode) to write to a large number of sources (including mysql & tidb) 
as a long time service. We often need to update the hudi configuration, but we 
don’t want to restart deltastreamer to achieve it, which may be too heavy for 
our job scheduler server based on livy/yarn. Therefore, we provide 
deltastreamer configuration hot update function. It is possible to update some 
common parameters instantly, and these parameters will take effect at the next 
sync of deltastreamer. These parameters include:
 * hoodie.bulkinsert.shuffle.parallelism (used only in bulkinsert)
 * hoodie.upsert.shuffle.parallelism
 * hoodie.deltastreamer.kafka.source.maxEvents
 * hoodie.memory.merge.max.size
 * hoodie.memory.compaction.max.size
 * hoodie.datasource.hive_sync.*
 * hoodie.compact.inline.max.delta.commits
 * hoodie.compaction.strategy
 * hoodie.compaction.target.io
!image-2023-05-24-14-26-45-826.png!

So this JIRA ticket is to add the configuration hot update function to 
deltastreamer



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to