Hi all, I am new to the spark streaming and trying to evaluate it and I have couple of questions.
1. Can setting window sand slide duration to 1 day cause any problem? My data size that will fall to that interval is small. Do you have other suggestions ? 2. What is the best way to detect correlation? Suppose that I have 2 different events from the same source. I want to do an action when these 2 events happen in the same day. I thought about having a reducer. Thanks in advance! Best Regards, Cem
