Samza + MongoDb Sharding

Angelica Garcia-Gutierrez Thu, 06 Jul 2017 12:22:14 -0700

Hello,

I have a Samza job that currently makes remote calls to a MongoDB to get 
additional information about the input stream. For scalability, MongoDB was 
initially partitioned into 4 shards (more shards will be added as needed).
The questions are:


  *   Does it make sense to attempt to partition the input stream into multiple 
partitions such that a given task can consume it and expand the message with 
information retrieved from a specific MongoDB shard?

Can someone please shed some lights?

Thanks,
Angelica.


The information transmitted is intended only for the person or entity to which 
it is addressed
and may contain CONFIDENTIAL material.  If you receive this 
material/information in error,
please contact the sender and delete or destroy the material/information.

Samza + MongoDb Sharding

Reply via email to