I've replaced socketStream with kafka and it seems to catch and store all messages now. So i guess it's a problem with either sample PageViewGenerator or socketTextStream. Anyway, i see that pageCounts only contains counts from last batch. Is there a way to aggregate across all batches?
-- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/PageView-streaming-sample-lost-page-views-tp1126p1143.html Sent from the Apache Spark User List mailing list archive at Nabble.com.
