[ 
https://issues.apache.org/jira/browse/SPARK-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16622729#comment-16622729
 ] 

Arun Mahadevan commented on SPARK-10816:
----------------------------------------

[~rxin], "map/flapMapGroupsWithState" looks to suited to mapping the grouped 
values to custom state on the reduce side. Not sure its a general aggregation 
mechanism that can merge the windows at the map side. 

IMO, theres still value in providing a session window operator. For the more 
advanced windowing it could be addressed by exposing Windowing with generic 
"assignWindows" and "mergeWindows" constructs. It could then support different 
types of custom windowing.

K,Values -> AssignWindows (produces [k, v, timestamp, window]) -> GroupByKey 
(shuffle) -> MergeWindows (optional step) -> GroupWindows -> aggregate values.

The windows can also be merged and pre-aggregated at the map side for 
optimization.

> EventTime based sessionization
> ------------------------------
>
>                 Key: SPARK-10816
>                 URL: https://issues.apache.org/jira/browse/SPARK-10816
>             Project: Spark
>          Issue Type: New Feature
>          Components: Structured Streaming
>            Reporter: Reynold Xin
>            Priority: Major
>         Attachments: SPARK-10816 Support session window natively.pdf
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to