Hi All,

I have the following use case for Spark Streaming. There are two streams
of data, say FlightBookings and Ticket.

For each ticket, I need to associate it with relevant Booking info. There
are distinct applications for Booking and Ticket. The Booking streaming
application processes the incoming bookings and writes the data to a NoSQL
store (Cassandra).

The Ticket application processes the tickets, queries the store for Booking
info, associates each ticket with the relevant booking data, and writes the
processed results back.

In this flow, each ticket message triggers at least 20 reads against the DB
store for Booking info.
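
To make the flow concrete, here is a minimal Python sketch of the per-ticket
lookup pattern. It is not the actual application code: the in-memory
BookingStore only stands in for Cassandra, and the names BookingStore,
enrich_ticket and booking_keys are made up for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class BookingStore:
    """In-memory stand-in for the Cassandra booking table (hypothetical)."""
    rows: dict = field(default_factory=dict)
    reads: int = 0  # counts individual lookups issued against the store

    def get(self, key):
        self.reads += 1
        return self.rows.get(key)


def enrich_ticket(ticket, store):
    """Attach booking info to a ticket, issuing one store read per key."""
    bookings = [store.get(key) for key in ticket["booking_keys"]]
    return {**ticket, "bookings": [b for b in bookings if b is not None]}


# Assumed example data: one ticket that references 20 booking rows.
store = BookingStore(rows={f"b{i}": {"booking_id": f"b{i}"} for i in range(20)})
ticket = {"ticket_id": "t1", "booking_keys": [f"b{i}" for i in range(20)]}

enriched = enrich_ticket(ticket, store)
print(store.reads)  # number of reads issued for this single ticket
```

Every ticket pays for all of its lookups individually, so the read load on
the store grows linearly with the ticket rate.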

Is there a better way of doing this? What is the recommended approach for
problems that need to associate records across different streams of data
when those streams arrive with very different network latencies?

--Padma CH
