fantapsody opened a new issue #6030: Support bulk write in Elasticsearch Sink
URL: https://github.com/apache/pulsar/issues/6030
 
 
   **Is your feature request related to a problem? Please describe.**
   
   The [current 
implementation](https://github.com/apache/pulsar/blob/master/pulsar-io/elastic-search/src/main/java/org/apache/pulsar/io/elasticsearch/ElasticSearchSink.java#L84)
 of Elasticsearch Sink connector write documents to the cluster individually in 
sync mode, while write documents to Elasticsearch clusters in bulk could 
[improve the performance 
significantly](https://github.com/apache/pulsar/issues/5690) as it minimizes 
network roundtrips.
   
   **Describe the solution you'd like**
   
   Accumulate a limited number of records for a while and write them in bulk to 
the cluster asynchronously.
   
   **Describe alternatives you've considered**
   
   As many sink connectors share the idea of writing in bulk to improve 
performance, maybe it is worth of consideration for a general abstraction of 
"batch mode" to simplify the development of sink connectors.
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to