[jira] [Updated] (FLINK-19807) Add documentation for Flink's "pre-flight phase" and "cluster phase"

2020-10-26 Thread Manas Kale (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-19807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Manas Kale updated FLINK-19807:
---
Description: 
A result of 
[this|https://lists.apache.org/thread.html/r496e1c312d4675afd32388f3c1f993e2f6d1c369c98580fd7c436457%40%3Cuser.flink.apache.org%3E]
 thread.
Basically, care has to be taken when dealing with static variables when running 
in a minicluster in your IDE v/s running on a cluster.
As such, Flink's "pre-flight phase" and "cluster phase" should be explained on 
the website along with a warning about this non-obvious behaviour.

  was:
A result of 
[this|https://lists.apache.org/thread.html/r496e1c312d4675afd32388f3c1f993e2f6d1c369c98580fd7c436457%40%3Cuser.flink.apache.org%3E]
 thread.
Basically, care has to be taken when dealing with static variables when running 
in a minicluster in your IDE v/s running on a cluster.
As such, Flink's "pre-flight phase" and "cluster phase" should be explained on 
the website along with a warning about this behaviour.


> Add documentation for Flink's "pre-flight phase" and "cluster phase"
> 
>
> Key: FLINK-19807
> URL: https://issues.apache.org/jira/browse/FLINK-19807
> Project: Flink
>  Issue Type: Improvement
>  Components: Documentation
>Reporter: Manas Kale
>Priority: Major
>
> A result of 
> [this|https://lists.apache.org/thread.html/r496e1c312d4675afd32388f3c1f993e2f6d1c369c98580fd7c436457%40%3Cuser.flink.apache.org%3E]
>  thread.
> Basically, care has to be taken when dealing with static variables when 
> running in a minicluster in your IDE v/s running on a cluster.
> As such, Flink's "pre-flight phase" and "cluster phase" should be explained 
> on the website along with a warning about this non-obvious behaviour.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-19807) Add documentation for Flink's "pre-flight phase" and "cluster phase"

2020-10-26 Thread Manas Kale (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-19807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Manas Kale updated FLINK-19807:
---
Description: 
A result of 
[this|https://lists.apache.org/thread.html/r496e1c312d4675afd32388f3c1f993e2f6d1c369c98580fd7c436457%40%3Cuser.flink.apache.org%3E]
 thread.
Basically, care has to be taken when dealing with static variables when running 
in a minicluster in your IDE v/s running on a cluster.
As such, Flink's "pre-flight phase" and "cluster phase" should be explained on 
the website along with a warning about this behaviour.

  was:
A result of 
[this|https://lists.apache.org/thread.html/r496e1c312d4675afd32388f3c1f993e2f6d1c369c98580fd7c436457%40%3Cuser.flink.apache.org%3E]
 thread.
Basically, care has to be taken when dealing with static variables - the 
behaviour might be different when running in a minicluster in your IDE v/s 
running on a cluster.
As such, Flink's "pre-flight phase" and "cluster phase" should be explained on 
the website along with a warning about this behaviour.


> Add documentation for Flink's "pre-flight phase" and "cluster phase"
> 
>
> Key: FLINK-19807
> URL: https://issues.apache.org/jira/browse/FLINK-19807
> Project: Flink
>  Issue Type: Improvement
>  Components: Documentation
>Reporter: Manas Kale
>Priority: Major
>
> A result of 
> [this|https://lists.apache.org/thread.html/r496e1c312d4675afd32388f3c1f993e2f6d1c369c98580fd7c436457%40%3Cuser.flink.apache.org%3E]
>  thread.
> Basically, care has to be taken when dealing with static variables when 
> running in a minicluster in your IDE v/s running on a cluster.
> As such, Flink's "pre-flight phase" and "cluster phase" should be explained 
> on the website along with a warning about this behaviour.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-19807) Add documentation for Flink's "pre-flight phase" and "cluster phase"

2020-10-26 Thread Manas Kale (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-19807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Manas Kale updated FLINK-19807:
---
Description: 
A result of 
[this|https://lists.apache.org/thread.html/r496e1c312d4675afd32388f3c1f993e2f6d1c369c98580fd7c436457%40%3Cuser.flink.apache.org%3E]
 thread.
Basically, care has to be taken when dealing with static variables - the 
behaviour might be different when running in a minicluster in your IDE v/s 
running on a cluster.
As such, Flink's "pre-flight phase" and "cluster phase" should be explained on 
the website along with a warning about this behaviour.

  was:
This is a result of 
[this|https://lists.apache.org/thread.html/r496e1c312d4675afd32388f3c1f993e2f6d1c369c98580fd7c436457%40%3Cuser.flink.apache.org%3E]
 thread.
Basically, care has to be taken when dealing with static variables - the 
behaviour might be different when running in a minicluster in your IDE v/s 
running on a cluster.
As such, Flink's "pre-flight phase" and "cluster phase" should be explained on 
the website along with a warning about this behaviour.


> Add documentation for Flink's "pre-flight phase" and "cluster phase"
> 
>
> Key: FLINK-19807
> URL: https://issues.apache.org/jira/browse/FLINK-19807
> Project: Flink
>  Issue Type: Improvement
>  Components: Documentation
>Reporter: Manas Kale
>Priority: Major
>
> A result of 
> [this|https://lists.apache.org/thread.html/r496e1c312d4675afd32388f3c1f993e2f6d1c369c98580fd7c436457%40%3Cuser.flink.apache.org%3E]
>  thread.
> Basically, care has to be taken when dealing with static variables - the 
> behaviour might be different when running in a minicluster in your IDE v/s 
> running on a cluster.
> As such, Flink's "pre-flight phase" and "cluster phase" should be explained 
> on the website along with a warning about this behaviour.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (FLINK-19807) Add documentation for Flink's "pre-flight phase" and "cluster phase"

2020-10-26 Thread Manas Kale (Jira)
Manas Kale created FLINK-19807:
--

 Summary: Add documentation for Flink's "pre-flight phase" and 
"cluster phase"
 Key: FLINK-19807
 URL: https://issues.apache.org/jira/browse/FLINK-19807
 Project: Flink
  Issue Type: Improvement
  Components: Documentation
Reporter: Manas Kale


This is a result of 
[this|https://lists.apache.org/thread.html/r496e1c312d4675afd32388f3c1f993e2f6d1c369c98580fd7c436457%40%3Cuser.flink.apache.org%3E]
 thread.
Basically, care has to be taken when dealing with static variables - the 
behaviour might be different when running in a minicluster in your IDE v/s 
running on a cluster.
As such, Flink's "pre-flight phase" and "cluster phase" should be explained on 
the website along with a warning about this behaviour.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-16039) [Discuss] Add API method to get last element in session window

2020-03-25 Thread Manas Kale (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-16039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17067332#comment-17067332
 ] 

Manas Kale edited comment on FLINK-16039 at 3/26/20, 3:49 AM:
--

Hi all, I've started a discussion on the mailing archive 
[here][[https://lists.apache.org/thread.html/rf44c033e20d165105a3adf1d36c1e3857231e7266aa844d4fc4b6e1f%40%3Cdev.flink.apache.org%3E]].


was (Author: manas96):
Hi all, I've started a discussion on the mailing archive 
[here|[https://lists.apache.org/thread.html/rf44c033e20d165105a3adf1d36c1e3857231e7266aa844d4fc4b6e1f%40%3Cdev.flink.apache.org%3E]].

> [Discuss] Add API method to get last element in session window
> --
>
> Key: FLINK-16039
> URL: https://issues.apache.org/jira/browse/FLINK-16039
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.10.0
>Reporter: Manas Kale
>Priority: Minor
>
> Consider the events : 
> [1, event], [2, event]
> where first element is event timestamp in seconds and second element is event 
> code/name.
> Also consider that an Event time session window with inactivityGap = 2 
> seconds is acting on above stream.
> When the first event arrives, a session window should be created that is 
> [1,1].
> When the second event arrives, a new session window should be created that is 
> [2,2]. Since this falls within firstWindowTimestamp+inactivityGap, it should 
> be merged into session window [1,2] and  [2,2] should be deleted.
> This is my understanding of how session windows are created. *Please correct 
> me if wrong.*
> However, Flink does not follow such a definition of windows semantically. If 
> I call the  getEnd() method of the TimeWindow() class, I get back _timestamp 
> + inactivityGap_.
> For the above example, after processing the first element, I would get 1 + 2 
> = 3 seconds as the window "end".
> The actual window end should be the timestamp 1, which is the last event in 
> the session window. 
> A solution would be to change the "end" definition of all windows, but I 
> suppose this would be breaking and would need some debate.
> Therefore, I propose an intermediate solution : add a new API method that 
> keeps track of the last element added in the session window. 
> If there is agreement on this, I would like to start drafting a change 
> document and implement this. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-16039) [Discuss] Add API method to get last element in session window

2020-03-25 Thread Manas Kale (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-16039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17067332#comment-17067332
 ] 

Manas Kale edited comment on FLINK-16039 at 3/26/20, 3:49 AM:
--

Hi all, I've started a discussion on the mailing archive 
[here|[https://lists.apache.org/thread.html/rf44c033e20d165105a3adf1d36c1e3857231e7266aa844d4fc4b6e1f%40%3Cdev.flink.apache.org%3E]]
 


was (Author: manas96):
Hi all, I've started a discussion on the mailing archive 
[here][[https://lists.apache.org/thread.html/rf44c033e20d165105a3adf1d36c1e3857231e7266aa844d4fc4b6e1f%40%3Cdev.flink.apache.org%3E]].

> [Discuss] Add API method to get last element in session window
> --
>
> Key: FLINK-16039
> URL: https://issues.apache.org/jira/browse/FLINK-16039
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.10.0
>Reporter: Manas Kale
>Priority: Minor
>
> Consider the events : 
> [1, event], [2, event]
> where first element is event timestamp in seconds and second element is event 
> code/name.
> Also consider that an Event time session window with inactivityGap = 2 
> seconds is acting on above stream.
> When the first event arrives, a session window should be created that is 
> [1,1].
> When the second event arrives, a new session window should be created that is 
> [2,2]. Since this falls within firstWindowTimestamp+inactivityGap, it should 
> be merged into session window [1,2] and  [2,2] should be deleted.
> This is my understanding of how session windows are created. *Please correct 
> me if wrong.*
> However, Flink does not follow such a definition of windows semantically. If 
> I call the  getEnd() method of the TimeWindow() class, I get back _timestamp 
> + inactivityGap_.
> For the above example, after processing the first element, I would get 1 + 2 
> = 3 seconds as the window "end".
> The actual window end should be the timestamp 1, which is the last event in 
> the session window. 
> A solution would be to change the "end" definition of all windows, but I 
> suppose this would be breaking and would need some debate.
> Therefore, I propose an intermediate solution : add a new API method that 
> keeps track of the last element added in the session window. 
> If there is agreement on this, I would like to start drafting a change 
> document and implement this. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-16039) [Discuss] Add API method to get last element in session window

2020-03-25 Thread Manas Kale (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-16039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17067332#comment-17067332
 ] 

Manas Kale commented on FLINK-16039:


Hi all, I've started a discussion on the mailing archive 
[here|[https://lists.apache.org/thread.html/rf44c033e20d165105a3adf1d36c1e3857231e7266aa844d4fc4b6e1f%40%3Cdev.flink.apache.org%3E]].

> [Discuss] Add API method to get last element in session window
> --
>
> Key: FLINK-16039
> URL: https://issues.apache.org/jira/browse/FLINK-16039
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.10.0
>Reporter: Manas Kale
>Priority: Minor
>
> Consider the events : 
> [1, event], [2, event]
> where first element is event timestamp in seconds and second element is event 
> code/name.
> Also consider that an Event time session window with inactivityGap = 2 
> seconds is acting on above stream.
> When the first event arrives, a session window should be created that is 
> [1,1].
> When the second event arrives, a new session window should be created that is 
> [2,2]. Since this falls within firstWindowTimestamp+inactivityGap, it should 
> be merged into session window [1,2] and  [2,2] should be deleted.
> This is my understanding of how session windows are created. *Please correct 
> me if wrong.*
> However, Flink does not follow such a definition of windows semantically. If 
> I call the  getEnd() method of the TimeWindow() class, I get back _timestamp 
> + inactivityGap_.
> For the above example, after processing the first element, I would get 1 + 2 
> = 3 seconds as the window "end".
> The actual window end should be the timestamp 1, which is the last event in 
> the session window. 
> A solution would be to change the "end" definition of all windows, but I 
> suppose this would be breaking and would need some debate.
> Therefore, I propose an intermediate solution : add a new API method that 
> keeps track of the last element added in the session window. 
> If there is agreement on this, I would like to start drafting a change 
> document and implement this. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-6726) Allow setting Timers in ProcessWindowFunction

2020-02-16 Thread Manas Kale (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-6726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17038084#comment-17038084
 ] 

Manas Kale commented on FLINK-6726:
---

Hi, is there any progress on this issue? I have a similar use case.

> Allow setting Timers in ProcessWindowFunction
> -
>
> Key: FLINK-6726
> URL: https://issues.apache.org/jira/browse/FLINK-6726
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Reporter: Vishnu Viswanath
>Assignee: Vishnu Viswanath
>Priority: Minor
>
> Allow registration of timers in ProcessWindowFunction.
> {code}
> public abstract void registerEventTimeTimer(long time);
> public abstract void registerProcessingTimeTimer(long time);
> {code}
> This is based on one of the use case that I have, where I need to register an 
> EventTimeTimer that will clean the elements in the Window State based on some 
> condition. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-16039) [Discussion] Add API method to get last element in session window

2020-02-13 Thread Manas Kale (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-16039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Manas Kale updated FLINK-16039:
---
Summary: [Discussion] Add API method to get last element in session window  
(was: Add API method to get last element in session window)

> [Discussion] Add API method to get last element in session window
> -
>
> Key: FLINK-16039
> URL: https://issues.apache.org/jira/browse/FLINK-16039
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.10.0
>Reporter: Manas Kale
>Priority: Minor
>
> Consider the events : 
> [1, event], [2, event]
> where first element is event timestamp in seconds and second element is event 
> code/name.
> Also consider that an Event time session window with inactivityGap = 2 
> seconds is acting on above stream.
> When the first event arrives, a session window should be created that is 
> [1,1].
> When the second event arrives, a new session window should be created that is 
> [2,2]. Since this falls within firstWindowTimestamp+inactivityGap, it should 
> be merged into session window [1,2] and  [2,2] should be deleted.
> This is my understanding of how session windows are created. *Please correct 
> me if wrong.*
> However, Flink does not follow such a definition of windows semantically. If 
> I call the  getEnd() method of the TimeWindow() class, I get back _timestamp 
> + inactivityGap_.
> For the above example, after processing the first element, I would get 1 + 2 
> = 3 seconds as the window "end".
> The actual window end should be the timestamp 1, which is the last event in 
> the session window. 
> A solution would be to change the "end" definition of all windows, but I 
> suppose this would be breaking and would need some debate.
> Therefore, I propose an intermediate solution : add a new API method that 
> keeps track of the last element added in the session window. 
> If there is agreement on this, I would like to start drafting a change 
> document and implement this. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-16039) [Discuss] Add API method to get last element in session window

2020-02-13 Thread Manas Kale (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-16039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Manas Kale updated FLINK-16039:
---
Summary: [Discuss] Add API method to get last element in session window  
(was: [Discussion] Add API method to get last element in session window)

> [Discuss] Add API method to get last element in session window
> --
>
> Key: FLINK-16039
> URL: https://issues.apache.org/jira/browse/FLINK-16039
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.10.0
>Reporter: Manas Kale
>Priority: Minor
>
> Consider the events : 
> [1, event], [2, event]
> where first element is event timestamp in seconds and second element is event 
> code/name.
> Also consider that an Event time session window with inactivityGap = 2 
> seconds is acting on above stream.
> When the first event arrives, a session window should be created that is 
> [1,1].
> When the second event arrives, a new session window should be created that is 
> [2,2]. Since this falls within firstWindowTimestamp+inactivityGap, it should 
> be merged into session window [1,2] and  [2,2] should be deleted.
> This is my understanding of how session windows are created. *Please correct 
> me if wrong.*
> However, Flink does not follow such a definition of windows semantically. If 
> I call the  getEnd() method of the TimeWindow() class, I get back _timestamp 
> + inactivityGap_.
> For the above example, after processing the first element, I would get 1 + 2 
> = 3 seconds as the window "end".
> The actual window end should be the timestamp 1, which is the last event in 
> the session window. 
> A solution would be to change the "end" definition of all windows, but I 
> suppose this would be breaking and would need some debate.
> Therefore, I propose an intermediate solution : add a new API method that 
> keeps track of the last element added in the session window. 
> If there is agreement on this, I would like to start drafting a change 
> document and implement this. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (FLINK-16039) Add API method to get last element in session window

2020-02-13 Thread Manas Kale (Jira)
Manas Kale created FLINK-16039:
--

 Summary: Add API method to get last element in session window
 Key: FLINK-16039
 URL: https://issues.apache.org/jira/browse/FLINK-16039
 Project: Flink
  Issue Type: Improvement
  Components: API / DataStream
Affects Versions: 1.10.0
Reporter: Manas Kale


Consider the events : 

[1, event], [2, event]

where first element is event timestamp in seconds and second element is event 
code/name.

Also consider that an Event time session window with inactivityGap = 2 seconds 
is acting on above stream.

When the first event arrives, a session window should be created that is [1,1].

When the second event arrives, a new session window should be created that is 
[2,2]. Since this falls within firstWindowTimestamp+inactivityGap, it should be 
merged into session window [1,2] and  [2,2] should be deleted.

This is my understanding of how session windows are created. *Please correct me 
if wrong.*

However, Flink does not follow such a definition of windows semantically. If I 
call the  getEnd() method of the TimeWindow() class, I get back _timestamp + 
inactivityGap_.

For the above example, after processing the first element, I would get 1 + 2 = 
3 seconds as the window "end".

The actual window end should be the timestamp 1, which is the last event in the 
session window. 

A solution would be to change the "end" definition of all windows, but I 
suppose this would be breaking and would need some debate.

Therefore, I propose an intermediate solution : add a new API method that keeps 
track of the last element added in the session window. 

If there is agreement on this, I would like to start drafting a change document 
and implement this. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)