[ 
https://issues.apache.org/jira/browse/SPARK-17893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15571582#comment-15571582
 ] 

Raviteja Lokineni edited comment on SPARK-17893 at 10/13/16 10:51 AM:
----------------------------------------------------------------------

[~srowen] No it's not actually streaming, it can be though if we also include 
slideDuration.

Let me explain my use case:
* We get data in batches
* Once the data is available it has a Date column and a few other numeric 
columns
* Now for every unique date that is available I have to look back 7 days and 
aggregate all the numeric columns

I am looking at past records not future.


was (Author: bond_):
[~srowen] No it's not actually streaming.

Let me explain my use case:
* We get data in batches
* Once the data is available it has a Date column and a few other numeric 
columns
* Now for every unique date that is available I have to look back 7 days and 
aggregate all the numeric columns

I am looking at past records not future.

> Window functions should also allow looking back in time
> -------------------------------------------------------
>
>                 Key: SPARK-17893
>                 URL: https://issues.apache.org/jira/browse/SPARK-17893
>             Project: Spark
>          Issue Type: New Feature
>          Components: Spark Core
>    Affects Versions: 2.0.1
>            Reporter: Raviteja Lokineni
>
> This function should allow looking back. The current window(timestamp, 
> duration) seems to be for looking forward in time.
> Example:
> {code}dataFrame.groupBy(window("date", "7 days ago")).agg(min("col1"), 
> max("col1")){code}
> For example, if date: 2013-01-07 then the window should be 2013-01-01 - 
> 2013-01-07



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to