[jira] [Closed] (BEAM-9573) Watermark hold for timer output timestamp is not computed correctly

2020-03-25 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng closed BEAM-9573.
-
Resolution: Fixed

Close it as the PR is already merged :)

> Watermark hold for timer output timestamp is not computed correctly
> ---
>
> Key: BEAM-9573
> URL: https://issues.apache.org/jira/browse/BEAM-9573
> Project: Beam
>  Issue Type: Bug
>  Components: runner-flink
>Affects Versions: 2.20.0
>Reporter: Maximilian Michels
>Assignee: Maximilian Michels
>Priority: Blocker
> Fix For: 2.20.0
>
>  Time Spent: 5h 20m
>  Remaining Estimate: 0h
>
> With the introduction of timer output timestamp, a new watermark hold had 
> been added to the Flink Runner. The watermark computation works on the keyed 
> state backend which computes a key-scoped watermark hold and not the desired 
> operator-wide watermark hold.
> Computation: 
> https://github.com/apache/beam/blob/b564239081e9351c56fb0e7d263495b95dd3f8f3/runners/flink/src/main/java/org/apache/beam/runners/flink/translation/wrappers/streaming/DoFnOperator.java#L1140
> Key-scoped state: 
> https://github.com/apache/beam/blob/b564239081e9351c56fb0e7d263495b95dd3f8f3/runners/flink/src/main/java/org/apache/beam/runners/flink/translation/wrappers/streaming/DoFnOperator.java#L1130
> We need to change this to operate on all keys. This has to be done before 
> fixing BEAM-9566.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9298) Drop support for Flink 1.7

2020-02-28 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17048192#comment-17048192
 ] 

sunjincheng commented on BEAM-9298:
---

I think there is no need to make it a blocker of 2.20.0. I have removed the 
target version.

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.21.0
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9298) Drop support for Flink 1.7

2020-02-28 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-9298:
--
Fix Version/s: 2.21.0

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.21.0
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8618) Tear down unused DoFns periodically in Python SDK harness

2020-02-25 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17045105#comment-17045105
 ] 

sunjincheng commented on BEAM-8618:
---

Yes,just remove the tag for now. :)

> Tear down unused DoFns periodically in Python SDK harness
> -
>
> Key: BEAM-8618
> URL: https://issues.apache.org/jira/browse/BEAM-8618
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 5h 20m
>  Remaining Estimate: 0h
>
> Per the discussion in the ML, detail can be found [1],  the teardown of DoFns 
> should be supported in the portability framework. It happens at two places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Python SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8618) Tear down unused DoFns periodically in Python SDK harness

2020-02-25 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8618:
--
Fix Version/s: (was: 2.20.0)

> Tear down unused DoFns periodically in Python SDK harness
> -
>
> Key: BEAM-8618
> URL: https://issues.apache.org/jira/browse/BEAM-8618
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 5h 20m
>  Remaining Estimate: 0h
>
> Per the discussion in the ML, detail can be found [1],  the teardown of DoFns 
> should be supported in the portability framework. It happens at two places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Python SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8620) Tear down unused DoFns periodically in Java SDK harness

2020-02-24 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8620:
--
Fix Version/s: (was: 2.20.0)

> Tear down unused DoFns periodically in Java SDK harness
> ---
>
> Key: BEAM-8620
> URL: https://issues.apache.org/jira/browse/BEAM-8620
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>
> Per the discussion in the ML the detail can be found here[1], the teardown of 
> DoFns should be supported in the portability framework. It happens at two 
> places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Java SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8620) Tear down unused DoFns periodically in Java SDK harness

2020-02-24 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17044119#comment-17044119
 ] 

sunjincheng commented on BEAM-8620:
---

Thanks for the reminder, I would like to reset the fix version to 2.21.0, and 
appreciate if you(or some one) can add unrelease version of 2.21.0. :) for now, 
reset it as None. 

> Tear down unused DoFns periodically in Java SDK harness
> ---
>
> Key: BEAM-8620
> URL: https://issues.apache.org/jira/browse/BEAM-8620
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> Per the discussion in the ML the detail can be found here[1], the teardown of 
> DoFns should be supported in the portability framework. It happens at two 
> places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Java SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9298) Drop support for Flink 1.7

2020-02-24 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-9298:
--
Fix Version/s: (was: 2.20.0)

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-24 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-9299:
--
Fix Version/s: (was: 2.20.0)

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-9295) Add Flink 1.10 build target and Make FlinkRunner compatible with Flink 1.10

2020-02-24 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17044074#comment-17044074
 ] 

sunjincheng edited comment on BEAM-9295 at 2/25/20 2:41 AM:


I would like reset the fix version to 2.21. Thank you . [~amaliujia]
Could you please add the un-release version 2.21.0 ?


was (Author: sunjincheng121):
I would like reset the fix version to 2.21. Thank you . [~amaliujia]

> Add Flink 1.10 build target and Make FlinkRunner compatible with Flink 1.10
> ---
>
> Key: BEAM-9295
> URL: https://issues.apache.org/jira/browse/BEAM-9295
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> Apache Flink 1.10 has completed the final release vote, see [1]. So, I would 
> like to add Flink 1.10 build target and make Flink Runner compatible with 
> Flink 1.10.
> And I appreciate it if you can leave your suggestions or comments!
> [1] 
> https://lists.apache.org/thread.html/r97672d4d1e47372cebf23e6643a6cc30a06bfbdf3f277b0be3695b15%40%3Cdev.flink.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9295) Add Flink 1.10 build target and Make FlinkRunner compatible with Flink 1.10

2020-02-24 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-9295:
--
Fix Version/s: (was: 2.20.0)

> Add Flink 1.10 build target and Make FlinkRunner compatible with Flink 1.10
> ---
>
> Key: BEAM-9295
> URL: https://issues.apache.org/jira/browse/BEAM-9295
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> Apache Flink 1.10 has completed the final release vote, see [1]. So, I would 
> like to add Flink 1.10 build target and make Flink Runner compatible with 
> Flink 1.10.
> And I appreciate it if you can leave your suggestions or comments!
> [1] 
> https://lists.apache.org/thread.html/r97672d4d1e47372cebf23e6643a6cc30a06bfbdf3f277b0be3695b15%40%3Cdev.flink.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9295) Add Flink 1.10 build target and Make FlinkRunner compatible with Flink 1.10

2020-02-24 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17044074#comment-17044074
 ] 

sunjincheng commented on BEAM-9295:
---

I would like reset the fix version to 2.21. Thank you . [~amaliujia]

> Add Flink 1.10 build target and Make FlinkRunner compatible with Flink 1.10
> ---
>
> Key: BEAM-9295
> URL: https://issues.apache.org/jira/browse/BEAM-9295
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> Apache Flink 1.10 has completed the final release vote, see [1]. So, I would 
> like to add Flink 1.10 build target and make Flink Runner compatible with 
> Flink 1.10.
> And I appreciate it if you can leave your suggestions or comments!
> [1] 
> https://lists.apache.org/thread.html/r97672d4d1e47372cebf23e6643a6cc30a06bfbdf3f277b0be3695b15%40%3Cdev.flink.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9288) Conscrypt shaded dependency

2020-02-22 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17042601#comment-17042601
 ] 

sunjincheng commented on BEAM-9288:
---

I agree that we should exclude it from the vendored grpc for now. I will submit 
a PR to address this issue.

> Conscrypt shaded dependency
> ---
>
> Key: BEAM-9288
> URL: https://issues.apache.org/jira/browse/BEAM-9288
> Project: Beam
>  Issue Type: Bug
>  Components: build-system
>Reporter: Esun Kim
>Assignee: sunjincheng
>Priority: Critical
> Fix For: 2.20.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Conscrypt is not designed to be shaded properly mainly because of so files. I 
> happened to see BEAM-9030 (*1) creating a new vendored gRPC shading Conscrypt 
> (*2) in it. I think this could make a problem when new Conscrypt is brought 
> by new gcsio depending on gRPC-alts (*4) in a dependency chain. (*5) In this 
> case, it may have a conflict when finding proper so files for Conscrypt. 
> *1: https://issues.apache.org/jira/browse/BEAM-9030
> *2:  
> [https://github.com/apache/beam/blob/e24d1e51cbabe27cb3cc381fd95b334db639c45d/buildSrc/src/main/groovy/org/apache/beam/gradle/GrpcVendoring_1_26_0.groovy#L78]
> *3: https://issues.apache.org/jira/browse/BEAM-6136
> *4: [https://mvnrepository.com/artifact/io.grpc/grpc-alts/1.27.0]
> *5: https://issues.apache.org/jira/browse/BEAM-8889
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-18 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039584#comment-17039584
 ] 

sunjincheng commented on BEAM-9299:
---

Currently we always test the last version for flink runners,and I think it's by 
design for now, and I agree with you [~iemejia], would be better to avoid  test 
all versions for every PR.

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9298) Drop support for Flink 1.7

2020-02-18 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039129#comment-17039129
 ] 

sunjincheng commented on BEAM-9298:
---

I've brought up a community discussion here:  
[https://lists.apache.org/thread.html/rfb5ac9d889d0e3f4400471de3c25000a15352bde879622c899d97581%40%3Cdev.beam.apache.org%3E]

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9298) Drop support for Flink`1.7

2020-02-18 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039079#comment-17039079
 ] 

sunjincheng commented on BEAM-9298:
---

Thanks for the confirm [~mxm]
Because the code of Flink 1.7 is too old, the current existence of Flink runner 
1.7 will affect the upgrade of Flink run.er 1.8x and 1.9x , more detail can be 
found in BEAM-9299, so we need to remove the support of Flink runner 1.7 as 
soon as possible. and I'll bring  up the DISCUSSION soon,and open the PR ASAP. 

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>  ` Priority: Major
> Fix For: 2.20.0
>
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Issue Comment Deleted] (BEAM-9298) Drop support for Flink 1.7

2020-02-18 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-9298:
--
Comment: was deleted

(was: I see, thank you all! I will do it accodingly after add Flink 1.10 build 
target. :))

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-18 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17038898#comment-17038898
 ] 

sunjincheng commented on BEAM-9299:
---

As BEAM-5396 and FLINK-11048 have been completed so far, we have a cleaner 
solution 3, which is to remove `BeamFlinkRemoteStreamEnvironment` class. I have 
updated the PR, but the content does not include the changes of `cover all of 
the version for test` which [~angoenka] mentioned before, I think we can handle 
it in another PR.  What do you think?

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-17 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17038240#comment-17038240
 ] 

sunjincheng commented on BEAM-9299:
---

Thanks for your feedback [~iemejia] [~angoenka]!

I will update PR according to solution 2.

And I agree that it's better to let the test cover all of versions of runner. 
But I'm curious. Why only test the latest one before, Is this by design?

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-13 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17036700#comment-17036700
 ] 

sunjincheng edited comment on BEAM-9299 at 2/14/20 5:42 AM:


I found that the issue FLINK-15844 also exists in Flink 1.8.3. I noticed that 
FLINK-15844 has provided a fix for 1.9.3. However, AFAIK, there will be no new 
1.8 releases any more according to FLink's [release 
policy|#update-policy-for-old-releases]]. There are two solutions in my mind:
 - Solution1: As the signature of `JobWithJars.buildUserCodeClassLoader `has 
changed in both Flink 1.8.3 and 1.9.2 and Flink 1.7.2 still uses the old 
signature, we could drop the Flink 1.7 support firstly and then update the 
implementation of `FlinkExecutionEnvironments` to use the new signature.

 - Solution2:  We could make a copy of `FlinkExecutionEnvironments` in each 
version of Flink runner and update the implementation for each copy according 
to the Flink version. This solution decouples the drop of Flink 1.7 support and 
the upgrades of 1.8 and 1.9. Besides, Flink community has made big change for 
the job submission path(FLINK-13954) in 1.10, e.g. `JobWithJars` has been 
removed in 1.10. It means that the job submission logic will be separate for 
1.8/1.9 and 1.10 anyway.

Personally I prefer solution2 and what's your thought? :) [~iemejia] [~mxm] 
[~thw]

 


was (Author: sunjincheng121):
I found that the issue FLINK-15844 also exists in Flink 1.8.3. I noticed that 
FLINK-15844 has provided a fix for 1.9.3. However, AFAIK, there will be no new 
1.8 releases any more according to FLink's [release 
policy|#update-policy-for-old-releases]]. There are two solutions in my mind:
 - Solution1: As the signature of `JobWithJars.buildUserCodeClassLoader `has 
changed in both Flink 1.8.3 and 1.9.2 and Flink 1.7.2 still uses the old 
signature, we could drop the Flink 1.7 support firstly and then update the 
implementation of `FlinkExecutionEnvironments` to use the new signature.

- Solution2:  We could make a copy of `FlinkExecutionEnvironments` in each 
version of Flink runner and update the implementation for each copy according 
to the Flink version. This solution decouples the drop of Flink 1.7 support and 
the upgrades of 1.8 and 1.9. Besides, Flink community has made big change for 
the job submission path(FLINK-13954) in 1.10, e.g. `JobWithJars` has been 
removed in 1.10. It means that the job submission logic will be separate for 
1.8/1.9 and 1.10 anyway.

Personally I prefer solution2 and what's your thought? :)

 

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-13 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17036700#comment-17036700
 ] 

sunjincheng edited comment on BEAM-9299 at 2/14/20 5:35 AM:


I found that the issue FLINK-15844 also exists in Flink 1.8.3. I noticed that 
FLINK-15844 has provided a fix for 1.9.3. However, AFAIK, there will be no new 
1.8 releases any more according to FLink's [release 
policy|#update-policy-for-old-releases]]. There are two solutions in my mind:
 - Solution1: As the signature of `JobWithJars.buildUserCodeClassLoader `has 
changed in both Flink 1.8.3 and 1.9.2 and Flink 1.7.2 still uses the old 
signature, we could drop the Flink 1.7 support firstly and then update the 
implementation of `FlinkExecutionEnvironments` to use the new signature.

- Solution2:  We could make a copy of `FlinkExecutionEnvironments` in each 
version of Flink runner and update the implementation for each copy according 
to the Flink version. This solution decouples the drop of Flink 1.7 support and 
the upgrades of 1.8 and 1.9. Besides, Flink community has made big change for 
the job submission path(FLINK-13954) in 1.10, e.g. `JobWithJars` has been 
removed in 1.10. It means that the job submission logic will be separate for 
1.8/1.9 and 1.10 anyway.

Personally I prefer solution2 and what's your thought? :)

 


was (Author: sunjincheng121):
I found that the issue FLINK-15844 also exists in Flink 1.8.3. I noticed that 
FLINK-15844 has provided a fix for 1.9.3. However, AFAIK, there will be no new 
1.8 releases any more according to FLink's [release 
policy|[https://flink.apache.org/downloads.html#update-policy-for-old-releases]].
 There are two solutions in my mind:


- Solution1: As the signature of `JobWithJars.buildUserCodeClassLoader `has 
changed in both Flink 1.8.3 and 1.9.2 and Flink 1.7.2 still uses the old 
signature, we could drop the Flink 1.7 support firstly and then update the 
implementation of `FlinkExecutionEnvironments` to use the new signature.


-Solution2:  We could make a copy of `FlinkExecutionEnvironments` in each 
version of Flink runner and update the implementation for each copy according 
to the Flink version. This solution decouples the drop of Flink 1.7 support and 
the upgrades of 1.8 and 1.9. Besides, Flink community has made big change for 
the job submission path(FLINK-13954) in 1.10, e.g. `JobWithJars` has been 
removed in 1.10. It means that the job submission logic will be separate for 
1.8/1.9 and 1.10 anyway.

Personally I prefer solution2 and what's your thought? :)

 

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-13 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17036700#comment-17036700
 ] 

sunjincheng commented on BEAM-9299:
---

I found that the issue FLINK-15844 also exists in Flink 1.8.3. I noticed that 
FLINK-15844 has provided a fix for 1.9.3. However, AFAIK, there will be no new 
1.8 releases any more according to FLink's [release 
policy|[https://flink.apache.org/downloads.html#update-policy-for-old-releases]].
 There are two solutions in my mind:


- Solution1: As the signature of `JobWithJars.buildUserCodeClassLoader `has 
changed in both Flink 1.8.3 and 1.9.2 and Flink 1.7.2 still uses the old 
signature, we could drop the Flink 1.7 support firstly and then update the 
implementation of `FlinkExecutionEnvironments` to use the new signature.


-Solution2:  We could make a copy of `FlinkExecutionEnvironments` in each 
version of Flink runner and update the implementation for each copy according 
to the Flink version. This solution decouples the drop of Flink 1.7 support and 
the upgrades of 1.8 and 1.9. Besides, Flink community has made big change for 
the job submission path(FLINK-13954) in 1.10, e.g. `JobWithJars` has been 
removed in 1.10. It means that the job submission logic will be separate for 
1.8/1.9 and 1.10 anyway.

Personally I prefer solution2 and what's your thought? :)

 

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-9298) Drop support for Flink 1.7

2020-02-13 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17036154#comment-17036154
 ] 

sunjincheng edited comment on BEAM-9298 at 2/13/20 11:46 AM:
-

I see, thank you all! I will do it accodingly after add Flink 1.10 build 
target. :)


was (Author: sunjincheng121):
Thank you all! I will do it accodingly after add Flink 1.10 build target. :)

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9298) Drop support for Flink 1.7

2020-02-13 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17036154#comment-17036154
 ] 

sunjincheng commented on BEAM-9298:
---

Thank you all! I will do it accodingly after add Flink 1.10 build target. :)

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-12 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17035945#comment-17035945
 ] 

sunjincheng edited comment on BEAM-9299 at 2/13/20 6:20 AM:


Thanks for add this input [~iemejia] , I think we can upgrade the 1.9.1 to 
1.9.3 when Flink 1.9.3 released due to this upgrade is not urgent. :) And we 
only upgrade 1.8.x to 1.8.3 in this JIRA. What do you think?


was (Author: sunjincheng121):
Thanks for add this input [~iemejia] , I think we can upgrade the 1.9.1 to 
1.9.3 when Flink 1.9.3 released due to this upgrade is not urgent. :)

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-12 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17035945#comment-17035945
 ] 

sunjincheng commented on BEAM-9299:
---

Thanks for add this input [~iemejia] , I think we can upgrade the 1.9.1 to 
1.9.3 when Flink 1.9.3 released due to this upgrade is not urgent. :)

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Issue Comment Deleted] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-12 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-9299:
--
Comment: was deleted

(was: Thanks for add this input [~iemejia] 

I think we can upgrade the 1.9.1 to 1.9.3 when Flink 1.9.3 released. :)  and do 
not upgrade the 1.9.1 in this JIRA. )

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-12 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17035941#comment-17035941
 ] 

sunjincheng commented on BEAM-9299:
---

Thanks for add this input [~iemejia] 

I think we can upgrade the 1.9.1 to 1.9.3 when Flink 1.9.3 released. :)  and do 
not upgrade the 1.9.1 in this JIRA. 

> Upgrade Flink Runner to 1.8.3 and 1.9.2
> ---
>
> Key: BEAM-9299
> URL: https://issues.apache.org/jira/browse/BEAM-9299
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
> Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 
> What do you think?
> [1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-9298) Drop support for Flink 1.7

2020-02-12 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17035890#comment-17035890
 ] 

sunjincheng edited comment on BEAM-9298 at 2/13/20 4:53 AM:


Thanks for the reminder [~iemejia] and [~thw] !

I prefer maintain at most three(Even maintaining two is reasonable) versions of 
Flink runner for Apache Beam community, because Apache Flink only support the 
current and previous minor release with bugfixes, more detail can be found in 
[1]. So, I think its better to maintain the three versions of 1.8/1.9/1.10 
after add Flink v1.10 build target to Flink runner.

I'm not sure if it deserves a discussion in the mailing list as it seems that 
we already maintain 3 versions of Flink runner in Beam now. Removing Flink 
runner 1.7 seems straightward if we upgrade Flink runner to 1.10. However, I'm 
not sure what's the policy before and it's great to also hear [~mxm] 's 
suggestions? :)

 

[1] [https://flink.apache.org/downloads.html#update-policy-for-old-releases]


was (Author: sunjincheng121):
Thanks for the reminder [~iemejia] and [~thw] !

I prefer maintain at most three versions of Flink runner for Apache Beam 
community, because Apache Flink only support the current and previous minor 
release with bugfixes, more detail can be found in [1]. So, I think its better 
to maintain the three versions of 1.8/1.9/1.10 after add Flink v1.10 build 
target to Flink runner.

I'm not sure if it deserves a discussion in the mailing list as it seems that 
we already maintain 3 versions of Flink runner in Beam now. Removing Flink 
runner 1.7 seems straightward if we upgrade Flink runner to 1.10. However, I'm 
not sure what's the policy before and it's great to also hear [~mxm] 's 
suggestions? :)

 

[1] [https://flink.apache.org/downloads.html#update-policy-for-old-releases]

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-9298) Drop support for Flink 1.7

2020-02-12 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17035890#comment-17035890
 ] 

sunjincheng edited comment on BEAM-9298 at 2/13/20 3:48 AM:


Thanks for the reminder [~iemejia] and [~thw] !

I prefer maintain at most three versions of Flink runner for Apache Beam 
community, because Apache Flink only support the current and previous minor 
release with bugfixes, more detail can be found in [1]. So, I think its better 
to maintain the three versions of 1.8/1.9/1.10 after add Flink v1.10 build 
target to Flink runner.

I'm not sure if it deserves a discussion in the mailing list as it seems that 
we already maintain 3 versions of Flink runner in Beam now. Removing Flink 
runner 1.7 seems straightward if we upgrade Flink runner to 1.10. However, I'm 
not sure what's the policy before and it's great to also hear [~mxm] 's 
suggestions? :)

 

[1] [https://flink.apache.org/downloads.html#update-policy-for-old-releases]


was (Author: sunjincheng121):
Thanks for the reminder [~iemejia] and [~thw] !

I prefer maintain at most three versions of Flink runner for Apache Beam 
community, because Apache Flink only support the current and previous minor 
release with bugfixes, more detail can be found in [1]. So, I think its better 
to maintain the three versions of 1.8/1.9/1.10 after add Flink v1.10 build 
target to Flink runner.

I'm not sure if it deserves a discussion in the mailing list as it seems that 
we always maintain 3 versions of Flink runner in Beam. Removing Flink runner 
1.7 seems straightward if we upgrade Flink runner to 1.10. However, I'm not 
sure what's the policy before and it's great to also hear [~mxm] 's 
suggestions? :)

 

[1] [https://flink.apache.org/downloads.html#update-policy-for-old-releases]

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9298) Drop support for Flink 1.7

2020-02-12 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17035890#comment-17035890
 ] 

sunjincheng commented on BEAM-9298:
---

Thanks for the reminder [~iemejia] and [~thw] !

I prefer maintain at most three versions of Flink runner for Apache Beam 
community, because Apache Flink only support the current and previous minor 
release with bugfixes, more detail can be found in [1]. So, I think its better 
to maintain the three versions of 1.8/1.9/1.10 after add Flink v1.10 build 
target to Flink runner.

I'm not sure if it deserves a discussion in the mailing list as it seems that 
we always maintain 3 versions of Flink runner in Beam. Removing Flink runner 
1.7 seems straightward if we upgrade Flink runner to 1.10. However, I'm not 
sure what's the policy before and it's great to also hear [~mxm] 's 
suggestions? :)

 

[1] [https://flink.apache.org/downloads.html#update-policy-for-old-releases]

> Drop support for Flink 1.7 
> ---
>
> Key: BEAM-9298
> URL: https://issues.apache.org/jira/browse/BEAM-9298
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
> should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
> decrease the build time.
> What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9299) Upgrade Flink Runner to 1.8.3 and 1.9.2

2020-02-11 Thread sunjincheng (Jira)
sunjincheng created BEAM-9299:
-

 Summary: Upgrade Flink Runner to 1.8.3 and 1.9.2
 Key: BEAM-9299
 URL: https://issues.apache.org/jira/browse/BEAM-9299
 Project: Beam
  Issue Type: Task
  Components: runner-flink
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.20.0


I would like to Upgrade Flink Runner to 18.3 and 1.9.2 due to both the Apache 
Flink 1.8.3 and Apache Flink 1.9.2 have been released [1]. 

What do you think?


[1] https://dist.apache.org/repos/dist/release/flink/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9298) Drop support for Flink 1.7

2020-02-11 Thread sunjincheng (Jira)
sunjincheng created BEAM-9298:
-

 Summary: Drop support for Flink 1.7 
 Key: BEAM-9298
 URL: https://issues.apache.org/jira/browse/BEAM-9298
 Project: Beam
  Issue Type: Task
  Components: runner-flink
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.20.0


With Flink 1.10 around the corner, more detail can be found in BEAM-9295, we 
should consider dropping support for Flink 1.7. Then dropping 1.7 will also 
decrease the build time.

What do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9295) Add Flink 1.10 build target and Make FlinkRunner compatible with Flink 1.10

2020-02-11 Thread sunjincheng (Jira)
sunjincheng created BEAM-9295:
-

 Summary: Add Flink 1.10 build target and Make FlinkRunner 
compatible with Flink 1.10
 Key: BEAM-9295
 URL: https://issues.apache.org/jira/browse/BEAM-9295
 Project: Beam
  Issue Type: New Feature
  Components: runner-flink
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.20.0


Apache Flink 1.10 has completed the final release vote, see [1]. So, I would 
like to add Flink 1.10 build target and make Flink Runner compatible with Flink 
1.10.

And I appreciate it if you can leave your suggestions or comments!

[1] 
https://lists.apache.org/thread.html/r97672d4d1e47372cebf23e6643a6cc30a06bfbdf3f277b0be3695b15%40%3Cdev.flink.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8618) Tear down unused DoFns periodically in Python SDK harness

2020-01-20 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8618:
--
Fix Version/s: (was: 2.19.0)
   2.20.0

> Tear down unused DoFns periodically in Python SDK harness
> -
>
> Key: BEAM-8618
> URL: https://issues.apache.org/jira/browse/BEAM-8618
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> Per the discussion in the ML, detail can be found [1],  the teardown of DoFns 
> should be supported in the portability framework. It happens at two places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Python SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8618) Tear down unused DoFns periodically in Python SDK harness

2020-01-20 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17019363#comment-17019363
 ] 

sunjincheng commented on BEAM-8618:
---

Thanks for the reminder, reset the fixversion to 2.20.

> Tear down unused DoFns periodically in Python SDK harness
> -
>
> Key: BEAM-8618
> URL: https://issues.apache.org/jira/browse/BEAM-8618
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> Per the discussion in the ML, detail can be found [1],  the teardown of DoFns 
> should be supported in the portability framework. It happens at two places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Python SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8620) Tear down unused DoFns periodically in Java SDK harness

2020-01-20 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8620:
--
Fix Version/s: (was: 2.19.0)
   2.20.0

> Tear down unused DoFns periodically in Java SDK harness
> ---
>
> Key: BEAM-8620
> URL: https://issues.apache.org/jira/browse/BEAM-8620
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> Per the discussion in the ML the detail can be found here[1], the teardown of 
> DoFns should be supported in the portability framework. It happens at two 
> places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Java SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (BEAM-7951) Allow runner to configure customization WindowedValue coder such as ValueOnlyWindowedValueCoder

2020-01-20 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng resolved BEAM-7951.
---
Fix Version/s: 2.19.0
   Resolution: Fixed

> Allow runner to configure customization WindowedValue coder such as 
> ValueOnlyWindowedValueCoder
> ---
>
> Key: BEAM-7951
> URL: https://issues.apache.org/jira/browse/BEAM-7951
> Project: Beam
>  Issue Type: Sub-task
>  Components: java-fn-execution
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 14h
>  Remaining Estimate: 0h
>
> The coder of WindowedValue cannot be configured and it’s always 
> FullWindowedValueCoder. We don't need to serialize the timestamp, window and 
> pane properties in Flink and so it will be better to make the coder 
> configurable (i.e. allowing to use ValueOnlyWindowedValueCoder)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9153) The heading level of "Deploy source release to dist.apache.org" in release guide is incorrect

2020-01-20 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-9153:
--
Description: 
Deploy source release  to dist.apache.org" in [1] should have the same heading 
level as "Deploy Python artifacts to PyPI.

 

 
[1][https://beam.apache.org/contribute/release-guide/#deploy-source-release-to-distapacheorg]

  was:Deploy source release to dist.apache.org" should have the same heading 
level as "Deploy Python artifacts to PyPI


> The heading level of "Deploy source release to dist.apache.org" in release 
> guide is incorrect
> -
>
> Key: BEAM-9153
> URL: https://issues.apache.org/jira/browse/BEAM-9153
> Project: Beam
>  Issue Type: Bug
>  Components: website
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.20.0
>
>
> Deploy source release  to dist.apache.org" in [1] should have the same 
> heading level as "Deploy Python artifacts to PyPI.
>  
>  
> [1][https://beam.apache.org/contribute/release-guide/#deploy-source-release-to-distapacheorg]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9153) The heading level of "Deploy source release to dist.apache.org" in release guide is incorrect

2020-01-20 Thread sunjincheng (Jira)
sunjincheng created BEAM-9153:
-

 Summary: The heading level of "Deploy source release to 
dist.apache.org" in release guide is incorrect
 Key: BEAM-9153
 URL: https://issues.apache.org/jira/browse/BEAM-9153
 Project: Beam
  Issue Type: Bug
  Components: website
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.20.0


Deploy source release to dist.apache.org" should have the same heading level as 
"Deploy Python artifacts to PyPI



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-9137) PR10338 breaks beam_PostCommit_Py_ValCont

2020-01-16 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17017754#comment-17017754
 ] 

sunjincheng edited comment on BEAM-9137 at 1/17/20 7:14 AM:


I have a quick look at of this issue and it seems that the 
beam_PostCommit_Py_ValCont was broken on 20 Dec, 2019.

We can see that it succeed on [19 Dec 2019 
|http://mail-archives.apache.org/mod_mbox/beam-builds/201912.mbox/%3c100795298.3239.1576775433670.JavaMail.jenkins@jenkins02%3e]and
 failed on [20 Dec, 
2019|http://mail-archives.apache.org/mod_mbox/beam-builds/201912.mbox/%3c165865149.3447.1576823413985.JavaMail.jenkins@jenkins02%3e].

So, maybe we should focus on  the commits of that day.


was (Author: sunjincheng121):
I have a quick look at of this issue and it seems that the 
beam_PostCommit_Py_ValCont was broken on 20 Dec, 2019.

We can see that it succeed on [19 Dec 2019 
|http://mail-archives.apache.org/mod_mbox/beam-builds/201912.mbox/%3c100795298.3239.1576775433670.JavaMail.jenkins@jenkins02%3e]and
 failed on [20 Dec, 
2019|http://mail-archives.apache.org/mod_mbox/beam-builds/201912.mbox/%3c165865149.3447.1576823413985.JavaMail.jenkins@jenkins02%3e].

> PR10338 breaks beam_PostCommit_Py_ValCont
> -
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Affects Versions: 2.19.0
>Reporter: Boyuan Zhang
>Priority: Blocker
> Fix For: 2.19.0
>
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9137) PR10338 breaks beam_PostCommit_Py_ValCont

2020-01-16 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17017754#comment-17017754
 ] 

sunjincheng commented on BEAM-9137:
---

I have a quick look at of this issue and it seems that the 
beam_PostCommit_Py_ValCont was broken on 20 Dec, 2019.

We can see that it succeed on [19 Dec 2019 
|http://mail-archives.apache.org/mod_mbox/beam-builds/201912.mbox/%3c100795298.3239.1576775433670.JavaMail.jenkins@jenkins02%3e]and
 failed on [20 Dec, 
2019|http://mail-archives.apache.org/mod_mbox/beam-builds/201912.mbox/%3c165865149.3447.1576823413985.JavaMail.jenkins@jenkins02%3e].

> PR10338 breaks beam_PostCommit_Py_ValCont
> -
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Affects Versions: 2.19.0
>Reporter: Boyuan Zhang
>Priority: Blocker
> Fix For: 2.19.0
>
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9137) PR10338 breaks beam_PostCommit_Py_ValCont

2020-01-16 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng reassigned BEAM-9137:
-

Assignee: (was: sunjincheng)

> PR10338 breaks beam_PostCommit_Py_ValCont
> -
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Affects Versions: 2.19.0
>Reporter: Boyuan Zhang
>Priority: Blocker
> Fix For: 2.19.0
>
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9137) PR10338 breaks beam_PostCommit_Py_ValCont

2020-01-16 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17017747#comment-17017747
 ] 

sunjincheng commented on BEAM-9137:
---

It seems that the test error is not caused by[1], more detail log can be found 
in [2]. [~boyuanz] 

[1] [https://github.com/apache/beam/pull/10338]

[2] [https://github.com/apache/beam/pull/10338#issuecomment-575482269] 

> PR10338 breaks beam_PostCommit_Py_ValCont
> -
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Affects Versions: 2.19.0
>Reporter: Boyuan Zhang
>Assignee: sunjincheng
>Priority: Blocker
> Fix For: 2.19.0
>
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9055) Unify the config names of Fn Data API across languages

2020-01-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-9055:
--
Fix Version/s: 2.19.0

> Unify the config names of Fn Data API across languages
> --
>
> Key: BEAM-9055
> URL: https://issues.apache.org/jira/browse/BEAM-9055
> Project: Beam
>  Issue Type: Improvement
>  Components: java-fn-execution
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.19.0
>
>
> Currently for time-based cache threshold config, the config key is 
> "data_buffer_time_limit_ms" in the Python SDK harness and 
> "beam_fn_api_data_buffer_time_limit" in the Java SDK harness. As discussed in 
> [https://github.com/apache/beam/pull/10246#discussion_r362572952], we should 
> unify the config names of Fn Data API across languages to have a good user 
> experience.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9055) Unify the config names of Fn Data API across languages

2020-01-05 Thread sunjincheng (Jira)
sunjincheng created BEAM-9055:
-

 Summary: Unify the config names of Fn Data API across languages
 Key: BEAM-9055
 URL: https://issues.apache.org/jira/browse/BEAM-9055
 Project: Beam
  Issue Type: Improvement
  Components: java-fn-execution
Reporter: sunjincheng
Assignee: sunjincheng


Currently for time-based cache threshold config, the config key is 
"data_buffer_time_limit_ms" in the Python SDK harness and 
"beam_fn_api_data_buffer_time_limit" in the Java SDK harness. As discussed in 
[https://github.com/apache/beam/pull/10246#discussion_r362572952], we should 
unify the config names of Fn Data API across languages to have a good user 
experience.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9030) Bump the version of GRPC to 1.22.0+(May be latest 1.26.0, currently 1.21.0)

2019-12-25 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-9030:
--
Summary: Bump the version of GRPC to 1.22.0+(May be latest 1.26.0, 
currently 1.21.0)  (was: Metaspace memory leak when running python jobs with 
flink runner)

> Bump the version of GRPC to 1.22.0+(May be latest 1.26.0, currently 1.21.0)
> ---
>
> Key: BEAM-9030
> URL: https://issues.apache.org/jira/browse/BEAM-9030
> Project: Beam
>  Issue Type: Bug
>  Components: java-fn-execution, runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.19.0
>
>
> When submitting a Python word count job to a Flink session/standalone cluster 
> repeatedly, the meta space usage of the task manager of the Flink cluster 
> will continuously increase (about 40MB each time). The reason is that the 
> Beam classes are loaded with the user class loader in Flink and there are 
> problems with the implementation of `ProcessManager`(from Beam) and 
> `ThreadPoolCache`(from netty) which may cause the user class loader could not 
> be garbage collected even after the job finished which causes the meta space 
> memory leak eventually. You can refer to FLINK-15338[1] for more information.
> Regarding to `ProcessManager`, I have created a JIRA BEAM-9006[2] to track 
> it. Regarding to `ThreadPoolCache`, it is a Netty problem and has been fixed 
> in NETTY#8955[3]. Netty 4.1.35 Final has already included this fix and GRPC 
> 1.22.0 has already dependents on Netty 4.1.35 Final. So we need to bump the 
> version of GRPC to 1.22.0+ (currently 1.21.0).
>  
> What do you think?
> [1] https://issues.apache.org/jira/browse/FLINK-15338
> [2] https://issues.apache.org/jira/browse/BEAM-9006
> [3] [https://github.com/netty/netty/pull/8955]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9030) Metaspace memory leak when running python jobs with flink runner

2019-12-23 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17002682#comment-17002682
 ] 

sunjincheng commented on BEAM-9030:
---

Discussion details can be found here: 
[https://lists.apache.org/thread.html/ef5b24766d94d3d389bee9c03e59003b9cf417c81cde50ede5856ad7%40%3Cdev.beam.apache.org%3E]

> Metaspace memory leak when running python jobs with flink runner
> 
>
> Key: BEAM-9030
> URL: https://issues.apache.org/jira/browse/BEAM-9030
> Project: Beam
>  Issue Type: Bug
>  Components: java-fn-execution, runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.19.0
>
>
> When submitting a Python word count job to a Flink session/standalone cluster 
> repeatedly, the meta space usage of the task manager of the Flink cluster 
> will continuously increase (about 40MB each time). The reason is that the 
> Beam classes are loaded with the user class loader in Flink and there are 
> problems with the implementation of `ProcessManager`(from Beam) and 
> `ThreadPoolCache`(from netty) which may cause the user class loader could not 
> be garbage collected even after the job finished which causes the meta space 
> memory leak eventually. You can refer to FLINK-15338[1] for more information.
> Regarding to `ProcessManager`, I have created a JIRA BEAM-9006[2] to track 
> it. Regarding to `ThreadPoolCache`, it is a Netty problem and has been fixed 
> in NETTY#8955[3]. Netty 4.1.35 Final has already included this fix and GRPC 
> 1.22.0 has already dependents on Netty 4.1.35 Final. So we need to bump the 
> version of GRPC to 1.22.0+ (currently 1.21.0).
>  
> What do you think?
> [1] https://issues.apache.org/jira/browse/FLINK-15338
> [2] https://issues.apache.org/jira/browse/BEAM-9006
> [3] [https://github.com/netty/netty/pull/8955]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9030) Metaspace memory leak when running python jobs with flink runner

2019-12-23 Thread sunjincheng (Jira)
sunjincheng created BEAM-9030:
-

 Summary: Metaspace memory leak when running python jobs with flink 
runner
 Key: BEAM-9030
 URL: https://issues.apache.org/jira/browse/BEAM-9030
 Project: Beam
  Issue Type: Bug
  Components: java-fn-execution, runner-flink
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.19.0


When submitting a Python word count job to a Flink session/standalone cluster 
repeatedly, the meta space usage of the task manager of the Flink cluster will 
continuously increase (about 40MB each time). The reason is that the Beam 
classes are loaded with the user class loader in Flink and there are problems 
with the implementation of `ProcessManager`(from Beam) and 
`ThreadPoolCache`(from netty) which may cause the user class loader could not 
be garbage collected even after the job finished which causes the meta space 
memory leak eventually. You can refer to FLINK-15338[1] for more information.

Regarding to `ProcessManager`, I have created a JIRA BEAM-9006[2] to track it. 
Regarding to `ThreadPoolCache`, it is a Netty problem and has been fixed in 
NETTY#8955[3]. Netty 4.1.35 Final has already included this fix and GRPC 1.22.0 
has already dependents on Netty 4.1.35 Final. So we need to bump the version of 
GRPC to 1.22.0+ (currently 1.21.0).

 

What do you think?

[1] https://issues.apache.org/jira/browse/FLINK-15338
[2] https://issues.apache.org/jira/browse/BEAM-9006
[3] [https://github.com/netty/netty/pull/8955]

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9006) Meta space memory leak caused by the shutdown hook of ProcessManager

2019-12-19 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-9006:
--
Fix Version/s: (was: 2.18.0)
   2.19.0

> Meta space memory leak caused by the shutdown hook of ProcessManager 
> -
>
> Key: BEAM-9006
> URL: https://issues.apache.org/jira/browse/BEAM-9006
> Project: Beam
>  Issue Type: Bug
>  Components: java-fn-execution
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.19.0
>
>
> Currently the class `ProcessManager` will add a shutdown hook to stop all the 
> living processes before JVM exits. The shutdown hook will never be removed. 
> If this class is loaded by the user class loader, it will cause the user 
> class loader could not be garbage collected which causes meta space memory 
> leak eventually.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9006) Meta space memory leak caused by the shutdown hook of ProcessManager

2019-12-19 Thread sunjincheng (Jira)
sunjincheng created BEAM-9006:
-

 Summary: Meta space memory leak caused by the shutdown hook of 
ProcessManager 
 Key: BEAM-9006
 URL: https://issues.apache.org/jira/browse/BEAM-9006
 Project: Beam
  Issue Type: Bug
  Components: java-fn-execution
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.18.0


Currently the class `ProcessManager` will add a shutdown hook to stop all the 
living processes before JVM exits. The shutdown hook will never be removed. If 
this class is loaded by the user class loader, it will cause the user class 
loader could not be garbage collected which causes meta space memory leak 
eventually.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8935) Fail fast if sdk harness startup failed

2019-12-09 Thread sunjincheng (Jira)
sunjincheng created BEAM-8935:
-

 Summary: Fail fast if sdk harness startup failed
 Key: BEAM-8935
 URL: https://issues.apache.org/jira/browse/BEAM-8935
 Project: Beam
  Issue Type: Improvement
  Components: java-fn-execution
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.19.0


Currently the runner waits for the sdk harness to startup blockingly until the 
sdk harness is available or timeout occurs. The timeout is 1 or 2 minutes. If 
the sdk harness startup failed for some reason, the runner may be aware of it 
after 1 or 2 minutes. This is too long.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8929) Remove unnecessary exception handling in FnApiControlClientPoolService

2019-12-09 Thread sunjincheng (Jira)
sunjincheng created BEAM-8929:
-

 Summary: Remove unnecessary exception handling in 
FnApiControlClientPoolService
 Key: BEAM-8929
 URL: https://issues.apache.org/jira/browse/BEAM-8929
 Project: Beam
  Issue Type: Improvement
  Components: java-fn-execution
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.19.0


The exception handling logic in 
[FnApiControlClientPoolService|https://github.com/apache/beam/blob/c2f0d282337f3ae0196a7717712396a5a41fdde1/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClientPoolService.java#L102]
 is unnecessary and could be removed.(Clean up usless code)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8618) Tear down unused DoFns periodically in Python SDK harness

2019-12-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8618:
--
Fix Version/s: (was: 2.18.0)
   2.19.0

> Tear down unused DoFns periodically in Python SDK harness
> -
>
> Key: BEAM-8618
> URL: https://issues.apache.org/jira/browse/BEAM-8618
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.19.0
>
>
> Per the discussion in the ML, detail can be found [1],  the teardown of DoFns 
> should be supported in the portability framework. It happens at two places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Python SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (BEAM-8733) The "KeyError: u'-47'" error from line 305 of sdk_worker.py

2019-12-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng resolved BEAM-8733.
---
Resolution: Fixed

> The "KeyError: u'-47'" error from line 305 of sdk_worker.py
> ---
>
> Key: BEAM-8733
> URL: https://issues.apache.org/jira/browse/BEAM-8733
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>  Time Spent: 5.5h
>  Remaining Estimate: 0h
>
> The issue reported by [~chamikara], error message as follows:
> apache_beam/runners/worker/sdk_worker.py", line 305, in get
> self.fns[bundle_descriptor_id],
> KeyError: u'-47'
> {code}
> at 
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
> at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
> at org.apache.beam.sdk.util.MoreFutures.get(MoreFutures.java:57)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.RegisterAndProcessBundleOperation.finish(RegisterAndProcessBundleOperation.java:330)
> at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:85)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.BeamFnMapTaskExecutor.execute(BeamFnMapTaskExecutor.java:125)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.executeWork(BatchDataflowWorker.java:411)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.doWork(BatchDataflowWorker.java:380)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.getAndPerformWork(BatchDataflowWorker.java:305)
> at 
> org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.start(DataflowRunnerHarness.java:195)
> at 
> org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.main(DataflowRunnerHarness.java:123)
> Suppressed: java.lang.IllegalStateException: Already closed.
>   at 
> org.apache.beam.sdk.fn.data.BeamFnDataBufferingOutboundObserver.close(BeamFnDataBufferingOutboundObserver.java:93)
>   at 
> org.apache.beam.runners.dataflow.worker.fn.data.RemoteGrpcPortWriteOperation.abort(RemoteGrpcPortWriteOperation.java:220)
>   at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:91)
> {code}
> More discussion info can be found here: 
> https://github.com/apache/beam/pull/10004



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8620) Tear down unused DoFns periodically in Java SDK harness

2019-12-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8620:
--
Fix Version/s: (was: 2.18.0)
   2.19.0

> Tear down unused DoFns periodically in Java SDK harness
> ---
>
> Key: BEAM-8620
> URL: https://issues.apache.org/jira/browse/BEAM-8620
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.19.0
>
>
> Per the discussion in the ML the detail can be found here[1], the teardown of 
> DoFns should be supported in the portability framework. It happens at two 
> places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Java SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8847) Handle the registration request synchronously in the Java SDK harness

2019-11-28 Thread sunjincheng (Jira)
sunjincheng created BEAM-8847:
-

 Summary: Handle the registration request synchronously in the Java 
SDK harness
 Key: BEAM-8847
 URL: https://issues.apache.org/jira/browse/BEAM-8847
 Project: Beam
  Issue Type: Bug
  Components: sdk-java-harness
Affects Versions: 2.18.0
Reporter: sunjincheng
Assignee: sunjincheng


Currently the registration request is handled asynchronously in the Java SDK 
harness. As discussed in BEAM-8733, this JIRA. tries to change it to 
synchronous.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8846) Force synchronization of the stream observer in BeamFnControlClient

2019-11-28 Thread sunjincheng (Jira)
sunjincheng created BEAM-8846:
-

 Summary: Force synchronization of the stream observer in 
BeamFnControlClient
 Key: BEAM-8846
 URL: https://issues.apache.org/jira/browse/BEAM-8846
 Project: Beam
  Issue Type: Bug
  Components: sdk-java-harness
Affects Versions: 2.18.0
Reporter: sunjincheng
Assignee: sunjincheng


Currently there is no synchronization to access the stream observer in 
BeamFnControlClient which is not thread safe. We should fix it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (BEAM-7952) Make the input queue of the input buffer in Python SDK Harness size limited.

2019-11-28 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng resolved BEAM-7952.
---
Fix Version/s: 2.18.0
   Resolution: Duplicate

https://issues.apache.org/jira/browse/BEAM-8667

> Make the input queue of the input buffer in Python SDK Harness size limited.
> 
>
> Key: BEAM-7952
> URL: https://issues.apache.org/jira/browse/BEAM-7952
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> At Python SDK harness, the input queue size of the input buffer in Python SDK 
> Harness is not size limited and also not configurable. This may become a 
> problem if the data production rate is more than the data consumption rate.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Closed] (BEAM-8617) Tear down the DoFns upon the control service termination in Python SDK harness

2019-11-24 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng closed BEAM-8617.
-
Resolution: Duplicate

> Tear down the DoFns upon the control service termination in Python SDK harness
> --
>
> Key: BEAM-8617
> URL: https://issues.apache.org/jira/browse/BEAM-8617
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> Per the discussion in the ML can be found [1], the teardown of DoFns should 
> be supported in the portability framework. It happens at two places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support to teardown the DoFns upon the control 
> service termination for Python SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Closed] (BEAM-8557) Clean up useless null check.

2019-11-24 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng closed BEAM-8557.
-
Resolution: Fixed

> Clean up useless null check.
> 
>
> Key: BEAM-8557
> URL: https://issues.apache.org/jira/browse/BEAM-8557
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-core, sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
> I think we do not need null check here:
> [https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]
> Because before the the `onNext` call, the `Future` already put into the queue 
> in `handle` method.
>  
> I found the test as follows:
> {code:java}
>  @Test
>  public void testUnknownResponseIgnored() throws Exception{code}
> I do not know why we need test this case? I think it would be better if we 
> throw the Exception for an UnknownResponse, otherwise, this may hidden a 
> potential bug. 
> Please correct me if there anything I misunderstand @kennknowles
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8557) Clean up useless null check.

2019-11-24 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8557:
--
Fix Version/s: 2.18.0

> Clean up useless null check.
> 
>
> Key: BEAM-8557
> URL: https://issues.apache.org/jira/browse/BEAM-8557
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-core, sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
> I think we do not need null check here:
> [https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]
> Because before the the `onNext` call, the `Future` already put into the queue 
> in `handle` method.
>  
> I found the test as follows:
> {code:java}
>  @Test
>  public void testUnknownResponseIgnored() throws Exception{code}
> I do not know why we need test this case? I think it would be better if we 
> throw the Exception for an UnknownResponse, otherwise, this may hidden a 
> potential bug. 
> Please correct me if there anything I misunderstand @kennknowles
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-7948) Add time-based cache threshold support in the Java data service

2019-11-24 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-7948:
--
Fix Version/s: 2.18.0

> Add time-based cache threshold support in the Java data service
> ---
>
> Key: BEAM-7948
> URL: https://issues.apache.org/jira/browse/BEAM-7948
> Project: Beam
>  Issue Type: Sub-task
>  Components: java-fn-execution
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>  Time Spent: 3h 40m
>  Remaining Estimate: 0h
>
> Currently only size-based cache threshold is supported in data service. It 
> should also support the time-based cache threshold. This is very important, 
> especially for streaming jobs which are sensitive to the delay.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (BEAM-7948) Add time-based cache threshold support in the Java data service

2019-11-24 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng resolved BEAM-7948.
---
Resolution: Fixed

> Add time-based cache threshold support in the Java data service
> ---
>
> Key: BEAM-7948
> URL: https://issues.apache.org/jira/browse/BEAM-7948
> Project: Beam
>  Issue Type: Sub-task
>  Components: java-fn-execution
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>  Time Spent: 3h 40m
>  Remaining Estimate: 0h
>
> Currently only size-based cache threshold is supported in data service. It 
> should also support the time-based cache threshold. This is very important, 
> especially for streaming jobs which are sensitive to the delay.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8733) The "KeyError: u'-47'" error from line 305 of sdk_worker.py

2019-11-22 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16980674#comment-16980674
 ] 

sunjincheng commented on BEAM-8733:
---

Hi [~lcwik], thank you for explaining, then it makes sense to me now. I will 
submit a PR to change both the Python SDK harness and Java SDK harness to 
synchronous. 
Does it make sense to you?

> The "KeyError: u'-47'" error from line 305 of sdk_worker.py
> ---
>
> Key: BEAM-8733
> URL: https://issues.apache.org/jira/browse/BEAM-8733
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> The issue reported by [~chamikara], error message as follows:
> apache_beam/runners/worker/sdk_worker.py", line 305, in get
> self.fns[bundle_descriptor_id],
> KeyError: u'-47'
> {code}
> at 
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
> at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
> at org.apache.beam.sdk.util.MoreFutures.get(MoreFutures.java:57)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.RegisterAndProcessBundleOperation.finish(RegisterAndProcessBundleOperation.java:330)
> at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:85)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.BeamFnMapTaskExecutor.execute(BeamFnMapTaskExecutor.java:125)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.executeWork(BatchDataflowWorker.java:411)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.doWork(BatchDataflowWorker.java:380)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.getAndPerformWork(BatchDataflowWorker.java:305)
> at 
> org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.start(DataflowRunnerHarness.java:195)
> at 
> org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.main(DataflowRunnerHarness.java:123)
> Suppressed: java.lang.IllegalStateException: Already closed.
>   at 
> org.apache.beam.sdk.fn.data.BeamFnDataBufferingOutboundObserver.close(BeamFnDataBufferingOutboundObserver.java:93)
>   at 
> org.apache.beam.runners.dataflow.worker.fn.data.RemoteGrpcPortWriteOperation.abort(RemoteGrpcPortWriteOperation.java:220)
>   at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:91)
> {code}
> More discussion info can be found here: 
> https://github.com/apache/beam/pull/10004



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-7952) Make the input queue of the input buffer in Python SDK Harness size limited.

2019-11-21 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng reassigned BEAM-7952:
-

Assignee: sunjincheng

> Make the input queue of the input buffer in Python SDK Harness size limited.
> 
>
> Key: BEAM-7952
> URL: https://issues.apache.org/jira/browse/BEAM-7952
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>
> At Python SDK harness, the input queue size of the input buffer in Python SDK 
> Harness is not size limited and also not configurable. This may become a 
> problem if the data production rate is more than the data consumption rate.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8733) The "KeyError: u'-47'" error from line 305 of sdk_worker.py

2019-11-21 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16979765#comment-16979765
 ] 

sunjincheng commented on BEAM-8733:
---

I want to confirm a few things with you before making changes as I'm still not 
quite familiar with the Beam. Per my understanding, the registration in the 
Java SDK harness is also 
asynchronous(https://github.com/apache/beam/blob/c2f0d282337f3ae0196a7717712396a5a41fdde1/sdks/java/harness/src/main/java/org/apache/beam/fn/harness/control/BeamFnControlClient.java#L138).
 Have I missed something? (I am not arguing, just want to have the correct 
understanding) :)

> The "KeyError: u'-47'" error from line 305 of sdk_worker.py
> ---
>
> Key: BEAM-8733
> URL: https://issues.apache.org/jira/browse/BEAM-8733
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> The issue reported by [~chamikara], error message as follows:
> apache_beam/runners/worker/sdk_worker.py", line 305, in get
> self.fns[bundle_descriptor_id],
> KeyError: u'-47'
> {code}
> at 
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
> at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
> at org.apache.beam.sdk.util.MoreFutures.get(MoreFutures.java:57)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.RegisterAndProcessBundleOperation.finish(RegisterAndProcessBundleOperation.java:330)
> at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:85)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.BeamFnMapTaskExecutor.execute(BeamFnMapTaskExecutor.java:125)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.executeWork(BatchDataflowWorker.java:411)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.doWork(BatchDataflowWorker.java:380)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.getAndPerformWork(BatchDataflowWorker.java:305)
> at 
> org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.start(DataflowRunnerHarness.java:195)
> at 
> org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.main(DataflowRunnerHarness.java:123)
> Suppressed: java.lang.IllegalStateException: Already closed.
>   at 
> org.apache.beam.sdk.fn.data.BeamFnDataBufferingOutboundObserver.close(BeamFnDataBufferingOutboundObserver.java:93)
>   at 
> org.apache.beam.runners.dataflow.worker.fn.data.RemoteGrpcPortWriteOperation.abort(RemoteGrpcPortWriteOperation.java:220)
>   at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:91)
> {code}
> More discussion info can be found here: 
> https://github.com/apache/beam/pull/10004



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-8733) The "KeyError: u'-47'" error from line 305 of sdk_worker.py

2019-11-21 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng reassigned BEAM-8733:
-

Assignee: sunjincheng

> The "KeyError: u'-47'" error from line 305 of sdk_worker.py
> ---
>
> Key: BEAM-8733
> URL: https://issues.apache.org/jira/browse/BEAM-8733
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> The issue reported by [~chamikara], error message as follows:
> apache_beam/runners/worker/sdk_worker.py", line 305, in get
> self.fns[bundle_descriptor_id],
> KeyError: u'-47'
> {code}
> at 
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
> at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
> at org.apache.beam.sdk.util.MoreFutures.get(MoreFutures.java:57)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.RegisterAndProcessBundleOperation.finish(RegisterAndProcessBundleOperation.java:330)
> at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:85)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.BeamFnMapTaskExecutor.execute(BeamFnMapTaskExecutor.java:125)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.executeWork(BatchDataflowWorker.java:411)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.doWork(BatchDataflowWorker.java:380)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.getAndPerformWork(BatchDataflowWorker.java:305)
> at 
> org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.start(DataflowRunnerHarness.java:195)
> at 
> org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.main(DataflowRunnerHarness.java:123)
> Suppressed: java.lang.IllegalStateException: Already closed.
>   at 
> org.apache.beam.sdk.fn.data.BeamFnDataBufferingOutboundObserver.close(BeamFnDataBufferingOutboundObserver.java:93)
>   at 
> org.apache.beam.runners.dataflow.worker.fn.data.RemoteGrpcPortWriteOperation.abort(RemoteGrpcPortWriteOperation.java:220)
>   at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:91)
> {code}
> More discussion info can be found here: 
> https://github.com/apache/beam/pull/10004



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8733) The "KeyError: u'-47'" error from line 305 of sdk_worker.py

2019-11-17 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16976225#comment-16976225
 ] 

sunjincheng commented on BEAM-8733:
---

Hi, Thanks for the log info, [~chamikara]. 
>From the exception log(the line number of RegisterAndProcessBundleOperation), 
>it seems that the 
>[commit|https://github.com/apache/beam/commit/686833381ecc92f0fbe04e576a582a7640ca7bbd]
> is not included in the DataFlow runner. Could you help to check this as this 
>commit ensures that registration is executed successfully before executing 
>process bundle request?

> The "KeyError: u'-47'" error from line 305 of sdk_worker.py
> ---
>
> Key: BEAM-8733
> URL: https://issues.apache.org/jira/browse/BEAM-8733
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> The issue reported by [~chamikara], error message as follows:
> apache_beam/runners/worker/sdk_worker.py", line 305, in get
> self.fns[bundle_descriptor_id],
> KeyError: u'-47'
> {code}
> at 
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
> at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
> at org.apache.beam.sdk.util.MoreFutures.get(MoreFutures.java:57)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.RegisterAndProcessBundleOperation.finish(RegisterAndProcessBundleOperation.java:330)
> at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:85)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.BeamFnMapTaskExecutor.execute(BeamFnMapTaskExecutor.java:125)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.executeWork(BatchDataflowWorker.java:411)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.doWork(BatchDataflowWorker.java:380)
> at 
> org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.getAndPerformWork(BatchDataflowWorker.java:305)
> at 
> org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.start(DataflowRunnerHarness.java:195)
> at 
> org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.main(DataflowRunnerHarness.java:123)
> Suppressed: java.lang.IllegalStateException: Already closed.
>   at 
> org.apache.beam.sdk.fn.data.BeamFnDataBufferingOutboundObserver.close(BeamFnDataBufferingOutboundObserver.java:93)
>   at 
> org.apache.beam.runners.dataflow.worker.fn.data.RemoteGrpcPortWriteOperation.abort(RemoteGrpcPortWriteOperation.java:220)
>   at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:91)
> {code}
> More discussion info can be found here: 
> https://github.com/apache/beam/pull/10004



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8733) The "KeyError: u'-47'" error from line 305 of sdk_worker.py

2019-11-17 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8733:
--
Description: 
The issue reported by [~chamikara], error message as follows:

apache_beam/runners/worker/sdk_worker.py", line 305, in get
self.fns[bundle_descriptor_id],
KeyError: u'-47'

{code}
at java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
at org.apache.beam.sdk.util.MoreFutures.get(MoreFutures.java:57)
at 
org.apache.beam.runners.dataflow.worker.fn.control.RegisterAndProcessBundleOperation.finish(RegisterAndProcessBundleOperation.java:330)
at 
org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:85)
at 
org.apache.beam.runners.dataflow.worker.fn.control.BeamFnMapTaskExecutor.execute(BeamFnMapTaskExecutor.java:125)
at 
org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.executeWork(BatchDataflowWorker.java:411)
at 
org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.doWork(BatchDataflowWorker.java:380)
at 
org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.getAndPerformWork(BatchDataflowWorker.java:305)
at 
org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.start(DataflowRunnerHarness.java:195)
at 
org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.main(DataflowRunnerHarness.java:123)
Suppressed: java.lang.IllegalStateException: Already closed.
at 
org.apache.beam.sdk.fn.data.BeamFnDataBufferingOutboundObserver.close(BeamFnDataBufferingOutboundObserver.java:93)
at 
org.apache.beam.runners.dataflow.worker.fn.data.RemoteGrpcPortWriteOperation.abort(RemoteGrpcPortWriteOperation.java:220)
at 
org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:91)
{code}

More discussion info can be found here: 
https://github.com/apache/beam/pull/10004

  was:
The issue reported by [~chamikara], error message as follows:

apache_beam/runners/worker/sdk_worker.py", line 305, in get
self.fns[bundle_descriptor_id],
KeyError: u'-47'

{code}
at java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
at org.apache.beam.sdk.util.MoreFutures.get(MoreFutures.java:57)
at 
org.apache.beam.runners.dataflow.worker.fn.control.RegisterAndProcessBundleOperation.finish(RegisterAndProcessBundleOperation.java:330)
at 
org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:85)
at 
org.apache.beam.runners.dataflow.worker.fn.control.BeamFnMapTaskExecutor.execute(BeamFnMapTaskExecutor.java:125)
at 
org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.executeWork(BatchDataflowWorker.java:411)
at 
org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.doWork(BatchDataflowWorker.java:380)
at 
org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.getAndPerformWork(BatchDataflowWorker.java:305)
at 
org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.start(DataflowRunnerHarness.java:195)
at 
org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.main(DataflowRunnerHarness.java:123)
Suppressed: java.lang.IllegalStateException: Already closed.
at 
org.apache.beam.sdk.fn.data.BeamFnDataBufferingOutboundObserver.close(BeamFnDataBufferingOutboundObserver.java:93)
at 
org.apache.beam.runners.dataflow.worker.fn.data.RemoteGrpcPortWriteOperation.abort(RemoteGrpcPortWriteOperation.java:220)
at 
org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:91)
{code}


> The "KeyError: u'-47'" error from line 305 of sdk_worker.py
> ---
>
> Key: BEAM-8733
> URL: https://issues.apache.org/jira/browse/BEAM-8733
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> The issue reported by [~chamikara], error message as follows:
> apache_beam/runners/worker/sdk_worker.py", line 305, in get
> self.fns[bundle_descriptor_id],
> KeyError: u'-47'
> {code}
> at 
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
> at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
> at org.apache.beam.sdk.util.MoreFutures.get(MoreFutures.java:57)
> at 
> org.apache.beam.runners.dataflow.worker.fn.control.RegisterAndProcessBundleOperation.finish(RegisterAndProcessBundleOperation.java:330)
> at 
> org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:85)
> at 
> 

[jira] [Created] (BEAM-8733) The "KeyError: u'-47'" error from line 305 of sdk_worker.py

2019-11-17 Thread sunjincheng (Jira)
sunjincheng created BEAM-8733:
-

 Summary: The "KeyError: u'-47'" error from line 305 of 
sdk_worker.py
 Key: BEAM-8733
 URL: https://issues.apache.org/jira/browse/BEAM-8733
 Project: Beam
  Issue Type: Bug
  Components: sdk-py-harness
Reporter: sunjincheng
 Fix For: 2.18.0


The issue reported by [~chamikara], error message as follows:

apache_beam/runners/worker/sdk_worker.py", line 305, in get
self.fns[bundle_descriptor_id],
KeyError: u'-47'

{code}
at java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
at org.apache.beam.sdk.util.MoreFutures.get(MoreFutures.java:57)
at 
org.apache.beam.runners.dataflow.worker.fn.control.RegisterAndProcessBundleOperation.finish(RegisterAndProcessBundleOperation.java:330)
at 
org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:85)
at 
org.apache.beam.runners.dataflow.worker.fn.control.BeamFnMapTaskExecutor.execute(BeamFnMapTaskExecutor.java:125)
at 
org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.executeWork(BatchDataflowWorker.java:411)
at 
org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.doWork(BatchDataflowWorker.java:380)
at 
org.apache.beam.runners.dataflow.worker.BatchDataflowWorker.getAndPerformWork(BatchDataflowWorker.java:305)
at 
org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.start(DataflowRunnerHarness.java:195)
at 
org.apache.beam.runners.dataflow.worker.DataflowRunnerHarness.main(DataflowRunnerHarness.java:123)
Suppressed: java.lang.IllegalStateException: Already closed.
at 
org.apache.beam.sdk.fn.data.BeamFnDataBufferingOutboundObserver.close(BeamFnDataBufferingOutboundObserver.java:93)
at 
org.apache.beam.runners.dataflow.worker.fn.data.RemoteGrpcPortWriteOperation.abort(RemoteGrpcPortWriteOperation.java:220)
at 
org.apache.beam.runners.dataflow.worker.util.common.worker.MapTaskExecutor.execute(MapTaskExecutor.java:91)
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8620) Tear down unused DoFns periodically in Java SDK harness

2019-11-12 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8620:
--
Issue Type: Improvement  (was: Task)

> Tear down unused DoFns periodically in Java SDK harness
> ---
>
> Key: BEAM-8620
> URL: https://issues.apache.org/jira/browse/BEAM-8620
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> Per the discussion in the ML the detail can be found here[1], the teardown of 
> DoFns should be supported in the portability framework. It happens at two 
> places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Java SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8619) Tear down the DoFns upon the control service termination in Java SDK harness

2019-11-12 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8619:
--
Issue Type: Improvement  (was: Task)

> Tear down the DoFns upon the control service termination in Java SDK harness
> 
>
> Key: BEAM-8619
> URL: https://issues.apache.org/jira/browse/BEAM-8619
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-harness
>Affects Versions: 2.18.0
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>
> Per the discussion in the ML, the detail can be found [1], the teardown of 
> DoFns should be supported in the portability framework. It happens at two 
> places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for teardown the DoFns upon the 
> control service termination in Java SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8618) Tear down unused DoFns periodically in Python SDK harness

2019-11-12 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8618:
--
Issue Type: Improvement  (was: Task)

> Tear down unused DoFns periodically in Python SDK harness
> -
>
> Key: BEAM-8618
> URL: https://issues.apache.org/jira/browse/BEAM-8618
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> Per the discussion in the ML, detail can be found [1],  the teardown of DoFns 
> should be supported in the portability framework. It happens at two places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Python SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8617) Tear down the DoFns upon the control service termination in Python SDK harness

2019-11-12 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8617:
--
Issue Type: Improvement  (was: Task)

> Tear down the DoFns upon the control service termination in Python SDK harness
> --
>
> Key: BEAM-8617
> URL: https://issues.apache.org/jira/browse/BEAM-8617
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> Per the discussion in the ML can be found [1], the teardown of DoFns should 
> be supported in the portability framework. It happens at two places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support to teardown the DoFns upon the control 
> service termination for Python SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8620) Tear down unused DoFns periodically in Java SDK harness

2019-11-12 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8620:
--
Issue Type: Task  (was: Improvement)

> Tear down unused DoFns periodically in Java SDK harness
> ---
>
> Key: BEAM-8620
> URL: https://issues.apache.org/jira/browse/BEAM-8620
> Project: Beam
>  Issue Type: Task
>  Components: sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> Per the discussion in the ML the detail can be found here[1], the teardown of 
> DoFns should be supported in the portability framework. It happens at two 
> places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Java SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8617) Tear down the DoFns upon the control service termination in Python SDK harness

2019-11-12 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8617:
--
Issue Type: Task  (was: Improvement)

> Tear down the DoFns upon the control service termination in Python SDK harness
> --
>
> Key: BEAM-8617
> URL: https://issues.apache.org/jira/browse/BEAM-8617
> Project: Beam
>  Issue Type: Task
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> Per the discussion in the ML can be found [1], the teardown of DoFns should 
> be supported in the portability framework. It happens at two places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support to teardown the DoFns upon the control 
> service termination for Python SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8618) Tear down unused DoFns periodically in Python SDK harness

2019-11-12 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8618:
--
Issue Type: Task  (was: Improvement)

> Tear down unused DoFns periodically in Python SDK harness
> -
>
> Key: BEAM-8618
> URL: https://issues.apache.org/jira/browse/BEAM-8618
> Project: Beam
>  Issue Type: Task
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> Per the discussion in the ML, detail can be found [1],  the teardown of DoFns 
> should be supported in the portability framework. It happens at two places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for tear down the unused DoFns 
> periodically in Python SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8619) Tear down the DoFns upon the control service termination in Java SDK harness

2019-11-12 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8619:
--
Summary: Tear down the DoFns upon the control service termination in Java 
SDK harness  (was: Teardown the DoFns upon the control service termination in 
Java SDK harness)

> Tear down the DoFns upon the control service termination in Java SDK harness
> 
>
> Key: BEAM-8619
> URL: https://issues.apache.org/jira/browse/BEAM-8619
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-harness
>Affects Versions: 2.18.0
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>
> Per the discussion in the ML, the detail can be found [1], the teardown of 
> DoFns should be supported in the portability framework. It happens at two 
> places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for teardown the DoFns upon the 
> control service termination in Java SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8619) Tear down the DoFns upon the control service termination in Java SDK harness

2019-11-12 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8619:
--
Issue Type: Task  (was: Improvement)

> Tear down the DoFns upon the control service termination in Java SDK harness
> 
>
> Key: BEAM-8619
> URL: https://issues.apache.org/jira/browse/BEAM-8619
> Project: Beam
>  Issue Type: Task
>  Components: sdk-java-harness
>Affects Versions: 2.18.0
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>
> Per the discussion in the ML, the detail can be found [1], the teardown of 
> DoFns should be supported in the portability framework. It happens at two 
> places:
> 1) Upon the control service termination
> 2) Tear down the unused DoFns periodically
> The aim of this JIRA is to add support for teardown the DoFns upon the 
> control service termination in Java SDK harness.
> [1] 
> https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8620) Tear down unused DoFns periodically in Java SDK harness

2019-11-12 Thread sunjincheng (Jira)
sunjincheng created BEAM-8620:
-

 Summary: Tear down unused DoFns periodically in Java SDK harness
 Key: BEAM-8620
 URL: https://issues.apache.org/jira/browse/BEAM-8620
 Project: Beam
  Issue Type: Improvement
  Components: sdk-java-harness
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.18.0


Per the discussion in the ML the detail can be found here[1], the teardown of 
DoFns should be supported in the portability framework. It happens at two 
places:
1) Upon the control service termination
2) Tear down the unused DoFns periodically

The aim of this JIRA is to add support for tear down the unused DoFns 
periodically in Java SDK harness.

[1] 
https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8619) Teardown the DoFns upon the control service termination in Java SDK harness

2019-11-12 Thread sunjincheng (Jira)
sunjincheng created BEAM-8619:
-

 Summary: Teardown the DoFns upon the control service termination 
in Java SDK harness
 Key: BEAM-8619
 URL: https://issues.apache.org/jira/browse/BEAM-8619
 Project: Beam
  Issue Type: Improvement
  Components: sdk-java-harness
Affects Versions: 2.18.0
Reporter: sunjincheng
Assignee: sunjincheng


Per the discussion in the ML, the detail can be found [1], the teardown of 
DoFns should be supported in the portability framework. It happens at two 
places:
1) Upon the control service termination
2) Tear down the unused DoFns periodically

The aim of this JIRA is to add support for teardown the DoFns upon the control 
service termination in Java SDK harness.
[1] 
https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8618) Tear down unused DoFns periodically in Python SDK harness

2019-11-12 Thread sunjincheng (Jira)
sunjincheng created BEAM-8618:
-

 Summary: Tear down unused DoFns periodically in Python SDK harness
 Key: BEAM-8618
 URL: https://issues.apache.org/jira/browse/BEAM-8618
 Project: Beam
  Issue Type: Improvement
  Components: sdk-py-harness
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.18.0


Per the discussion in the ML, detail can be found [1],  the teardown of DoFns 
should be supported in the portability framework. It happens at two places:
1) Upon the control service termination
2) Tear down the unused DoFns periodically

The aim of this JIRA is to add support for tear down the unused DoFns 
periodically in Python SDK harness.
[1] 
https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8617) Tear down the DoFns upon the control service termination in Python SDK harness

2019-11-12 Thread sunjincheng (Jira)
sunjincheng created BEAM-8617:
-

 Summary: Tear down the DoFns upon the control service termination 
in Python SDK harness
 Key: BEAM-8617
 URL: https://issues.apache.org/jira/browse/BEAM-8617
 Project: Beam
  Issue Type: Improvement
  Components: sdk-py-harness
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.18.0


Per the discussion in the ML can be found [1], the teardown of DoFns should be 
supported in the portability framework. It happens at two places:
1) Upon the control service termination
2) Tear down the unused DoFns periodically

The aim of this JIRA is to add support to teardown the DoFns upon the control 
service termination for Python SDK harness.

[1] 
https://lists.apache.org/thread.html/0c4a4cf83cf2e35c3dfeb9d906e26cd82d3820968ba6f862f91739e4@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8594) Remove unnecessary error check of the control service accessing in DataFlow Runner

2019-11-07 Thread sunjincheng (Jira)
sunjincheng created BEAM-8594:
-

 Summary: Remove unnecessary error check of the control service 
accessing in DataFlow Runner
 Key: BEAM-8594
 URL: https://issues.apache.org/jira/browse/BEAM-8594
 Project: Beam
  Issue Type: Improvement
  Components: runner-dataflow
Reporter: sunjincheng
Assignee: sunjincheng
 Fix For: 2.18.0


Currently there are a few places in the DataFlow Runner which checks if there 
is error reported when accessing the SDK harness's control service. Actually, 
the error reported by the SDK harness has already been handled in the 
[FnApiControlClient|https://github.com/apache/beam/blob/c2f0d282337f3ae0196a7717712396a5a41fdde1/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L152].
 There is no need to check it anymore.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8565) Update .test-infra/jenkins/README with missing entries and correct wrong entries

2019-11-06 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8565:
--
Description: 
Currently the following tests are missing in the .test-infra/jenkins/README:
{code:java}
 beam_PreCommit_BeamSQL_ZetaSQL
 beam_PostCommit_CrossLanguageValidatesRunner
 beam_PostCommit_Java11_Dataflow_Examples
 beam_PostCommit_Java11_Dataflow_Portability_Examples
 beam_PostCommit_PortableJar_Flink
 beam_PostCommit_Python_MongoDBIO_IT
 beam_PostCommit_Website_Test
 beam_PerformanceTests_KafkaIOIT
 beam_PerformanceTests_MongoDBIOIT
 beam_LoadTests_Python_ParDo_Flink_Batch{code}
 

The following tests are duplicate:
{code:java}
 beam_PreCommit_Go
 beam_PreCommit_JavaPortabilityApi{code}
 

The trigger command for the following items are wrong:
{code:java}
 beam_PreCommit_Java_Examples_Dataflow
 beam_PreCommit_Portable_Python
 beam_PreCommit_Python2_PVR_Flink
 beam_PerformanceTests_TFRecordIOIT

{code}
This JIRA will address these issues.

 

  was:
Currently the following tests are missing in the .test-infra/jenkins/README:
beam_PreCommit_BeamSQL_ZetaSQL
beam_PostCommit_CrossLanguageValidatesRunner
beam_PostCommit_Java11_Dataflow_Examples
beam_PostCommit_Java11_Dataflow_Portability_Examples
beam_PostCommit_PortableJar_Flink
beam_PostCommit_Python_MongoDBIO_IT
beam_PostCommit_Website_Test
beam_PerformanceTests_KafkaIOIT
beam_PerformanceTests_MongoDBIOIT
beam_LoadTests_Python_ParDo_Flink_Batch

The following tests are duplicate:
beam_PreCommit_Go
beam_PreCommit_JavaPortabilityApi

The trigger command for the following items are wrong:
beam_PreCommit_Java_Examples_Dataflow
beam_PreCommit_Portable_Python
beam_PreCommit_Python2_PVR_Flink
beam_PerformanceTests_TFRecordIOIT

This JIRA will address these issues.


> Update .test-infra/jenkins/README with missing entries and correct wrong 
> entries
> 
>
> Key: BEAM-8565
> URL: https://issues.apache.org/jira/browse/BEAM-8565
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> Currently the following tests are missing in the .test-infra/jenkins/README:
> {code:java}
>  beam_PreCommit_BeamSQL_ZetaSQL
>  beam_PostCommit_CrossLanguageValidatesRunner
>  beam_PostCommit_Java11_Dataflow_Examples
>  beam_PostCommit_Java11_Dataflow_Portability_Examples
>  beam_PostCommit_PortableJar_Flink
>  beam_PostCommit_Python_MongoDBIO_IT
>  beam_PostCommit_Website_Test
>  beam_PerformanceTests_KafkaIOIT
>  beam_PerformanceTests_MongoDBIOIT
>  beam_LoadTests_Python_ParDo_Flink_Batch{code}
>  
> The following tests are duplicate:
> {code:java}
>  beam_PreCommit_Go
>  beam_PreCommit_JavaPortabilityApi{code}
>  
> The trigger command for the following items are wrong:
> {code:java}
>  beam_PreCommit_Java_Examples_Dataflow
>  beam_PreCommit_Portable_Python
>  beam_PreCommit_Python2_PVR_Flink
>  beam_PerformanceTests_TFRecordIOIT
> {code}
> This JIRA will address these issues.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8565) Update .test-infra/jenkins/README with missing entries and correct wrong entries

2019-11-06 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8565:
--
Fix Version/s: 2.18.0

> Update .test-infra/jenkins/README with missing entries and correct wrong 
> entries
> 
>
> Key: BEAM-8565
> URL: https://issues.apache.org/jira/browse/BEAM-8565
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.18.0
>
>
> Currently the following tests are missing in the .test-infra/jenkins/README:
> beam_PreCommit_BeamSQL_ZetaSQL
> beam_PostCommit_CrossLanguageValidatesRunner
> beam_PostCommit_Java11_Dataflow_Examples
> beam_PostCommit_Java11_Dataflow_Portability_Examples
> beam_PostCommit_PortableJar_Flink
> beam_PostCommit_Python_MongoDBIO_IT
> beam_PostCommit_Website_Test
> beam_PerformanceTests_KafkaIOIT
> beam_PerformanceTests_MongoDBIOIT
> beam_LoadTests_Python_ParDo_Flink_Batch
> The following tests are duplicate:
> beam_PreCommit_Go
> beam_PreCommit_JavaPortabilityApi
> The trigger command for the following items are wrong:
> beam_PreCommit_Java_Examples_Dataflow
> beam_PreCommit_Portable_Python
> beam_PreCommit_Python2_PVR_Flink
> beam_PerformanceTests_TFRecordIOIT
> This JIRA will address these issues.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8565) Update .test-infra/jenkins/README with missing entries and correct wrong entries

2019-11-06 Thread sunjincheng (Jira)
sunjincheng created BEAM-8565:
-

 Summary: Update .test-infra/jenkins/README with missing entries 
and correct wrong entries
 Key: BEAM-8565
 URL: https://issues.apache.org/jira/browse/BEAM-8565
 Project: Beam
  Issue Type: Bug
  Components: testing
Reporter: sunjincheng
Assignee: sunjincheng


Currently the following tests are missing in the .test-infra/jenkins/README:
beam_PreCommit_BeamSQL_ZetaSQL
beam_PostCommit_CrossLanguageValidatesRunner
beam_PostCommit_Java11_Dataflow_Examples
beam_PostCommit_Java11_Dataflow_Portability_Examples
beam_PostCommit_PortableJar_Flink
beam_PostCommit_Python_MongoDBIO_IT
beam_PostCommit_Website_Test
beam_PerformanceTests_KafkaIOIT
beam_PerformanceTests_MongoDBIOIT
beam_LoadTests_Python_ParDo_Flink_Batch

The following tests are duplicate:
beam_PreCommit_Go
beam_PreCommit_JavaPortabilityApi

The trigger command for the following items are wrong:
beam_PreCommit_Java_Examples_Dataflow
beam_PreCommit_Portable_Python
beam_PreCommit_Python2_PVR_Flink
beam_PerformanceTests_TFRecordIOIT

This JIRA will address these issues.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8557) Clean up useless null check.

2019-11-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8557:
--
Description: 
I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

 

I found the test as follows:
{code:java}
 @Test
 public void testUnknownResponseIgnored() throws Exception{code}
I do not know why we need test this case? I think it would be better if we 
throw the Exception for an UnknownResponse, otherwise, this may hidden a 
potential bug. 

Please correct me if there anything I misunderstand @kennknowles

 

  was:
I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

 

I found the test as follows:
{code:java}
 @Test
 public void testUnknownResponseIgnored() throws Exception{code}
I do not know why we need test this case? I think it would be better if we 
throw the Exception for an UnknownResponse, otherwise, this may hidden a 
potential bug. 

What do you think? @kennknowles

 


> Clean up useless null check.
> 
>
> Key: BEAM-8557
> URL: https://issues.apache.org/jira/browse/BEAM-8557
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-core, sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> I think we do not need null check here:
> [https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]
> Because before the the `onNext` call, the `Future` already put into the queue 
> in `handle` method.
>  
> I found the test as follows:
> {code:java}
>  @Test
>  public void testUnknownResponseIgnored() throws Exception{code}
> I do not know why we need test this case? I think it would be better if we 
> throw the Exception for an UnknownResponse, otherwise, this may hidden a 
> potential bug. 
> Please correct me if there anything I misunderstand @kennknowles
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8557) Clean up useless null check.

2019-11-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8557:
--
Description: 
I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

 

I found the test as follows:
{code:java}
 @Test
 public void testUnknownResponseIgnored() throws Exception{code}
I do not know why we need test this case? I think it would be better if we 
throw the Exception for an UnknownResponse, otherwise, this may hidden a 
potential bug. 

What do you think? @kennknowles

 

  was:
I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

 

I found the test as follows:
{code:java}
 @Test
 public void testUnknownResponseIgnored() throws Exception{code}
I am do not know why we need test this case? I think it would be better if we 
throw the Exception for an UnknownResponse, otherwise, this may hidden a 
potential bug. 

What do you think? @kennknowles

 


> Clean up useless null check.
> 
>
> Key: BEAM-8557
> URL: https://issues.apache.org/jira/browse/BEAM-8557
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-core, sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> I think we do not need null check here:
> [https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]
> Because before the the `onNext` call, the `Future` already put into the queue 
> in `handle` method.
>  
> I found the test as follows:
> {code:java}
>  @Test
>  public void testUnknownResponseIgnored() throws Exception{code}
> I do not know why we need test this case? I think it would be better if we 
> throw the Exception for an UnknownResponse, otherwise, this may hidden a 
> potential bug. 
> What do you think? @kennknowles
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8557) Clean up useless null check.

2019-11-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8557:
--
Description: 
I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

 

I found the test as follows:
{code:java}
 @Test
 public void testUnknownResponseIgnored() throws Exception{code}
I am do not know why we need test this case? I think it would be better if we 
throw the Exception for an UnknownResponse, otherwise, this may hidden a 
potential bug. 

What do you think? @kennknowles

 

  was:
I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

 

I found the test as follows:
{code:java}
 @Test
 public void testUnknownResponseIgnored() throws Exception{code}

 I am do not know why we need test this case? @kennknowles

What do you think?

 


> Clean up useless null check.
> 
>
> Key: BEAM-8557
> URL: https://issues.apache.org/jira/browse/BEAM-8557
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-core, sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> I think we do not need null check here:
> [https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]
> Because before the the `onNext` call, the `Future` already put into the queue 
> in `handle` method.
>  
> I found the test as follows:
> {code:java}
>  @Test
>  public void testUnknownResponseIgnored() throws Exception{code}
> I am do not know why we need test this case? I think it would be better if we 
> throw the Exception for an UnknownResponse, otherwise, this may hidden a 
> potential bug. 
> What do you think? @kennknowles
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8557) Clean up useless null check.

2019-11-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8557:
--
Description: 
I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

 

I found the test as follows:
{code:java}
 @Test
 public void testUnknownResponseIgnored() throws Exception{code}

 I am do not know why we need test this case? @kennknowles

What do you think?

 

  was:
I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

 

I found the test as follows:
```
@Test
 public void testUnknownResponseIgnored() throws Exception {
 String id = "actualInstruction";
 String unknownId = "unknownInstruction";

CompletionStage responseFuture =
 
client.handle(BeamFnApi.InstructionRequest.newBuilder().setInstructionId(id).build());

client
 .asResponseObserver()
 
.onNext(BeamFnApi.InstructionResponse.newBuilder().setInstructionId(unknownId).build());

assertThat(MoreFutures.isDone(responseFuture), is(false));
 assertThat(MoreFutures.isCancelled(responseFuture), is(false));
 }
```
I am do not know why we need test this case? @kennknowles

What do you think?

 


> Clean up useless null check.
> 
>
> Key: BEAM-8557
> URL: https://issues.apache.org/jira/browse/BEAM-8557
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-core, sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> I think we do not need null check here:
> [https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]
> Because before the the `onNext` call, the `Future` already put into the queue 
> in `handle` method.
>  
> I found the test as follows:
> {code:java}
>  @Test
>  public void testUnknownResponseIgnored() throws Exception{code}
>  I am do not know why we need test this case? @kennknowles
> What do you think?
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8557) Clean up useless null check.

2019-11-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8557:
--
Description: 
I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

 

I found the test as follows:
```
@Test
 public void testUnknownResponseIgnored() throws Exception {
 String id = "actualInstruction";
 String unknownId = "unknownInstruction";

CompletionStage responseFuture =
 
client.handle(BeamFnApi.InstructionRequest.newBuilder().setInstructionId(id).build());

client
 .asResponseObserver()
 
.onNext(BeamFnApi.InstructionResponse.newBuilder().setInstructionId(unknownId).build());

assertThat(MoreFutures.isDone(responseFuture), is(false));
 assertThat(MoreFutures.isCancelled(responseFuture), is(false));
 }
```
I am do not know why we need test this case? @kennknowles

What do you think?

 

  was:
I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

What do you think?

 


> Clean up useless null check.
> 
>
> Key: BEAM-8557
> URL: https://issues.apache.org/jira/browse/BEAM-8557
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-core, sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> I think we do not need null check here:
> [https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]
> Because before the the `onNext` call, the `Future` already put into the queue 
> in `handle` method.
>  
> I found the test as follows:
> ```
> @Test
>  public void testUnknownResponseIgnored() throws Exception {
>  String id = "actualInstruction";
>  String unknownId = "unknownInstruction";
> CompletionStage responseFuture =
>  
> client.handle(BeamFnApi.InstructionRequest.newBuilder().setInstructionId(id).build());
> client
>  .asResponseObserver()
>  
> .onNext(BeamFnApi.InstructionResponse.newBuilder().setInstructionId(unknownId).build());
> assertThat(MoreFutures.isDone(responseFuture), is(false));
>  assertThat(MoreFutures.isCancelled(responseFuture), is(false));
>  }
> ```
> I am do not know why we need test this case? @kennknowles
> What do you think?
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8442) Unify bundle register in Python SDK harness

2019-11-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8442:
--
Summary: Unify bundle register in Python SDK harness  (was: Unfiy bundle 
register in Python SDK harness)

> Unify bundle register in Python SDK harness
> ---
>
> Key: BEAM-8442
> URL: https://issues.apache.org/jira/browse/BEAM-8442
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> There are two methods for bundle register in Python SDK harness:
> `SdkHarness._request_register` and `SdkWorker.register.` It should be unfied.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8557) Clean up useless null check.

2019-11-05 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8557:
--
Parent: BEAM-7944
Issue Type: Sub-task  (was: Improvement)

> Clean up useless null check.
> 
>
> Key: BEAM-8557
> URL: https://issues.apache.org/jira/browse/BEAM-8557
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-core, sdk-java-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>
> I think we do not need null check here:
> [https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]
> Because before the the `onNext` call, the `Future` already put into the queue 
> in `handle` method.
> What do you think?
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-8557) Clean up useless null check.

2019-11-05 Thread sunjincheng (Jira)
sunjincheng created BEAM-8557:
-

 Summary: Clean up useless null check.
 Key: BEAM-8557
 URL: https://issues.apache.org/jira/browse/BEAM-8557
 Project: Beam
  Issue Type: Improvement
  Components: runner-core, sdk-java-harness
Reporter: sunjincheng
Assignee: sunjincheng


I think we do not need null check here:

[https://github.com/apache/beam/blob/master/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/FnApiControlClient.java#L151]

Because before the the `onNext` call, the `Future` already put into the queue 
in `handle` method.

What do you think?

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-7951) Allow runner to configure customization WindowedValue coder such as ValueOnlyWindowedValueCoder

2019-10-25 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-7951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16959462#comment-16959462
 ] 

sunjincheng edited comment on BEAM-7951 at 10/25/19 6:19 AM:
-

This is not release blocker, thanks for reset the Fix Version. [~kenn]


was (Author: sunjincheng121):
This is not release blocker, thanks for reset the Fix Version.

> Allow runner to configure customization WindowedValue coder such as 
> ValueOnlyWindowedValueCoder
> ---
>
> Key: BEAM-7951
> URL: https://issues.apache.org/jira/browse/BEAM-7951
> Project: Beam
>  Issue Type: Sub-task
>  Components: java-fn-execution
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>
> The coder of WindowedValue cannot be configured and it’s always 
> FullWindowedValueCoder. We don't need to serialize the timestamp, window and 
> pane properties in Flink and so it will be better to make the coder 
> configurable (i.e. allowing to use ValueOnlyWindowedValueCoder)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-7951) Allow runner to configure customization WindowedValue coder such as ValueOnlyWindowedValueCoder

2019-10-25 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-7951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16959462#comment-16959462
 ] 

sunjincheng commented on BEAM-7951:
---

This is not release blocker, thanks for reset the Fix Version.

> Allow runner to configure customization WindowedValue coder such as 
> ValueOnlyWindowedValueCoder
> ---
>
> Key: BEAM-7951
> URL: https://issues.apache.org/jira/browse/BEAM-7951
> Project: Beam
>  Issue Type: Sub-task
>  Components: java-fn-execution
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>
> The coder of WindowedValue cannot be configured and it’s always 
> FullWindowedValueCoder. We don't need to serialize the timestamp, window and 
> pane properties in Flink and so it will be better to make the coder 
> configurable (i.e. allowing to use ValueOnlyWindowedValueCoder)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-7950) Remove the Python 3 warning as it has already been supported

2019-10-25 Thread sunjincheng (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-7950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16959460#comment-16959460
 ] 

sunjincheng commented on BEAM-7950:
---

Thanks [~altay] and [~tvalentyn] :)

> Remove the Python 3 warning as it has already been supported
> 
>
> Key: BEAM-7950
> URL: https://issues.apache.org/jira/browse/BEAM-7950
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.16.0
>
>
> There are warnings that Python 3 is not fully supported in Beam 
> (beam/sdks/python/setup.py). As mentioned in the ML, we should remove the 
> Python 3 warning as it has already been supported as an effort of 
> https://issues.apache.org/jira/browse/BEAM-1251.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8442) Unfiy bundle register in Python SDK harness

2019-10-22 Thread sunjincheng (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sunjincheng updated BEAM-8442:
--
Parent: BEAM-7944
Issue Type: Sub-task  (was: Improvement)

> Unfiy bundle register in Python SDK harness
> ---
>
> Key: BEAM-8442
> URL: https://issues.apache.org/jira/browse/BEAM-8442
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-harness
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.17.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> There are two methods for bundle register in Python SDK harness:
> `SdkHarness._request_register` and `SdkWorker.register.` It should be unfied.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


  1   2   >