Your use case seems to be a simple ETL (read from a data source and write to a sink), which Storm handles well. With Storm you don't necessarily need to split the data into batches; you can load the data into Elasticsearch continuously. If your data set is bounded, you can just kill the topology once all the data has been processed.
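
To make the "no batches needed" point concrete, here is a rough sketch in plain Java. It is not Storm code and uses no Storm, Mongo, or Elasticsearch APIs: the class name `StreamingEtlSketch`, the `run` helper, and the in-memory lists standing in for the Mongo source and the Elasticsearch sink are all hypothetical. It only illustrates the record-at-a-time flow, where each record is handed to the sink as soon as it is read and processing simply ends when a bounded source runs out.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.function.Consumer;

public class StreamingEtlSketch {

    // Pulls records from the source one at a time and hands each one to the
    // sink immediately, with no intermediate batching step.
    // Returns the number of records processed.
    static <T> int run(Iterator<T> source, Consumer<? super T> sink) {
        int processed = 0;
        while (source.hasNext()) {
            sink.accept(source.next());
            processed++;
        }
        // A bounded source ends here; an unbounded one would keep the loop alive.
        return processed;
    }

    public static void main(String[] args) {
        // Stand-in for documents read from the data source (assumption: JSON strings).
        List<String> sourceDocs = List.of("{\"id\":1}", "{\"id\":2}", "{\"id\":3}");
        // Stand-in for the sink; a real sink would issue index requests instead.
        List<String> indexed = new ArrayList<>();

        int n = run(sourceDocs.iterator(), indexed::add);
        System.out.println("indexed " + n + " documents");
    }
}
```

In a real topology the reading side would live in a spout and the writing side in a bolt, and Storm would keep the flow running until the topology is killed.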

Thanks,
Arun

From: Gaurav Sehgal <[email protected]>
Reply-To: "[email protected]" <[email protected]>
Date: Thursday, July 5, 2018 at 5:34 AM
To: "[email protected]" <[email protected]>
Subject: Batch processing.

Hello,

Is there another framework like Apache Storm that does batch processing of data? I have been looking at Apache Spark, but the use cases it addresses are more of a MapReduce nature. The use case we are looking at is to read data from a data source such as Mongo in batches and upload it to Elasticsearch.

Regards,
Gaurav
