Hi all,

I am new to the spark streaming and trying to evaluate it and I have couple
of questions.

1. Can setting window sand slide duration to 1 day cause any  problem? My
data size that will  fall to that interval is small.   Do you have other
suggestions ?

2. What is the best way to detect correlation? Suppose that I have 2
different events from the same source. I want to do an action when these 2
events happen in the same day. I thought about having a reducer.

Thanks in advance!

Best Regards,
Cem

Reply via email to