[beam-site] 01/01: Prepare repository for deployment.

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 32712a7a3ff7842575a78a9f7d2f551784d481b6
Author: Mergebot 
AuthorDate: Thu Sep 6 05:43:58 2018 +

Prepare repository for deployment.
---
 content/contribute/portability/index.html | 8 
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/content/contribute/portability/index.html 
b/content/contribute/portability/index.html
index cb90344..a39d239 100644
--- a/content/contribute/portability/index.html
+++ b/content/contribute/portability/index.html
@@ -426,12 +426,13 @@ for details.
 To run a basic Python wordcount (in batch mode) with embedded Flink:
 
 
-  Run once to build the SDK harness container: ./gradlew -p sdks/python/container
-docker
+  Run once to build the SDK harness container: ./gradlew 
:beam-sdks-python-container:docker
   Start the Flink portable JobService endpoint: ./gradlew 
:beam-runners-flink_2.11-job-server:runShadow
-  Submit the wordcount pipeline to above endpoint: ./gradlew 
:beam-sdks-python:portableWordCount
+  Submit the wordcount pipeline to above endpoint: ./gradlew :beam-sdks-python:portableWordCount 
-PjobEndpoint=localhost:8099
 
 
+To run the pipeline in streaming mode: ./gradlew :beam-sdks-python:portableWordCount 
-PjobEndpoint=localhost:8099 -Pstreaming
+
 To run on a separate https://ci.apache.org/projects/flink/flink-docs-release-1.5/quickstart/setup_quickstart.html;>Flink
 cluster:
 
 
@@ -442,7 +443,6 @@ docker
   Submit the pipeline.
 
 
-Note: A subset of the functionality is also supported in streaming mode; 
use --streaming in the command line to 
enable it.
 
   
 



[beam-site] branch asf-site updated (3adb0c6 -> 32712a7)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from 3adb0c6  Prepare repository for deployment.
 add 510b769  Fix portable Flink runner instructions.
 add c54d1bb  This closes #548
 new 32712a7  Prepare repository for deployment.

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/portability/index.html | 8 
 src/contribute/portability.md | 8 
 2 files changed, 8 insertions(+), 8 deletions(-)



[beam-site] 02/02: This closes #548

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit c54d1bb64e53f155ac8aa5b9d3d8d67042a81db0
Merge: 3adb0c6 510b769
Author: Mergebot 
AuthorDate: Thu Sep 6 05:41:07 2018 +

This closes #548

 src/contribute/portability.md | 8 
 1 file changed, 4 insertions(+), 4 deletions(-)



[beam-site] branch mergebot updated (aa437b2 -> c54d1bb)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from aa437b2  This closes #543
 add 3adb0c6  Prepare repository for deployment.
 new 510b769  Fix portable Flink runner instructions.
 new c54d1bb  This closes #548

The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/design-documents/index.html | 2 +-
 src/contribute/portability.md  | 8 
 2 files changed, 5 insertions(+), 5 deletions(-)



[beam-site] 01/02: Fix portable Flink runner instructions.

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 510b7690525db609b3ff2aab4794286defc63507
Author: Thomas Weise 
AuthorDate: Wed Sep 5 22:58:27 2018 +0200

Fix portable Flink runner instructions.
---
 src/contribute/portability.md | 8 
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/src/contribute/portability.md b/src/contribute/portability.md
index c3abaad..4a4d9bb 100644
--- a/src/contribute/portability.md
+++ b/src/contribute/portability.md
@@ -183,10 +183,11 @@ for details.
 The Flink runner is currently the only runner to support portable pipeline 
execution.
 To run a basic Python wordcount (in batch mode) with embedded Flink:
 
-1. Run once to build the SDK harness container: `./gradlew -p 
sdks/python/container
-docker`
+1. Run once to build the SDK harness container: `./gradlew 
:beam-sdks-python-container:docker`
 2. Start the Flink portable JobService endpoint: `./gradlew 
:beam-runners-flink_2.11-job-server:runShadow`
-3. Submit the wordcount pipeline to above endpoint: `./gradlew 
:beam-sdks-python:portableWordCount`
+3. Submit the wordcount pipeline to above endpoint: `./gradlew 
:beam-sdks-python:portableWordCount -PjobEndpoint=localhost:8099`
+
+To run the pipeline in streaming mode: `./gradlew 
:beam-sdks-python:portableWordCount -PjobEndpoint=localhost:8099 -Pstreaming`
 
 To run on a separate [Flink 
cluster](https://ci.apache.org/projects/flink/flink-docs-release-1.5/quickstart/setup_quickstart.html):
 
@@ -196,4 +197,3 @@ To run on a separate [Flink 
cluster](https://ci.apache.org/projects/flink/flink-
 4. Start JobService with Flink web service endpoint: `./gradlew 
:beam-runners-flink_2.11-job-server:runShadow -PflinkMasterUrl=localhost:8081`
 5. Submit the pipeline.
 
-Note: A subset of the functionality is also supported in streaming mode; use 
`--streaming` in the command line to enable it.



[jira] [Commented] (BEAM-5328) Java starter archetype does not contain dependency versions

2018-09-05 Thread Alan Myrvold (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16605231#comment-16605231
 ] 

Alan Myrvold commented on BEAM-5328:


[https://github.com/apache/beam/blob/519e91e6de516a2800b23bec77a78acf8b94f075/release/src/main/groovy/QuickstartArchetype.groovy#L25]

Should be easy to add a test for the starter.

 

[https://github.com/apache/beam/blob/master/sdks/java/maven-archetypes/examples/build.gradle]
 has the replace tokens calls

[https://github.com/apache/beam/blob/master/sdks/java/maven-archetypes/starter/build.gradle]
 does not

> Java starter archetype does not contain dependency versions
> ---
>
> Key: BEAM-5328
> URL: https://issues.apache.org/jira/browse/BEAM-5328
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Affects Versions: 2.6.0, 2.7.0
>Reporter: Luke Cwik
>Priority: Major
>  Labels: newbie, starter
>
> The starter archetype contains resource annotation markers instead of 
> versions:
> {code:java}
> @maven-compiler-plugin.version@
> @maven-exec-plugin.version@
> @slf4j.version@
> {code}
> in the properties block at the top.
>  
> This means that the starter project is broken without the user manually 
> editing the pom.xml that is generated and populating the versions at the top.
>  
> We also lack testing that validates that the starter archetype works.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[beam] 01/01: Merge pull request #6332 from qinyeli/master

2018-09-05 Thread ccy
This is an automated email from the ASF dual-hosted git repository.

ccy pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 519e91e6de516a2800b23bec77a78acf8b94f075
Merge: 0749dc5 c6290c4
Author: Charles Chen 
AuthorDate: Wed Sep 5 20:32:43 2018 -0700

Merge pull request #6332 from qinyeli/master

Interactive Beam -- getting started guide

 .../apache_beam/runners/interactive/README.md  | 266 +
 .../examples/Interactive Beam Example.ipynb| 432 +
 .../Interactive Beam Running on Flink.ipynb| 419 
 3 files changed, 1117 insertions(+)



[beam] branch master updated (0749dc5 -> 519e91e)

2018-09-05 Thread ccy
This is an automated email from the ASF dual-hosted git repository.

ccy pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git.


from 0749dc5  Merge pull request #6335: [BEAM-5308] Correct/cleanup 
DockerOnMac code in DockerJobBundleFactory
 add c6290c4  Interactive Beam -- getting started guide
 new 519e91e  Merge pull request #6332 from qinyeli/master

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 .../apache_beam/runners/interactive/README.md  | 266 +
 .../examples/Interactive Beam Example.ipynb| 432 +
 .../Interactive Beam Running on Flink.ipynb| 419 
 3 files changed, 1117 insertions(+)
 create mode 100644 sdks/python/apache_beam/runners/interactive/README.md
 create mode 100644 
sdks/python/apache_beam/runners/interactive/examples/Interactive Beam 
Example.ipynb
 create mode 100644 
sdks/python/apache_beam/runners/interactive/examples/Interactive Beam Running 
on Flink.ipynb



[jira] [Comment Edited] (BEAM-5296) python beam_integration_benchmark(0) continuously failed.

2018-09-05 Thread Mark Liu (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16605115#comment-16605115
 ] 

Mark Liu edited comment on BEAM-5296 at 9/6/18 12:55 AM:
-

Investigations are made by Lukasz and summarized in 
https://github.com/apache/beam/pull/6321#issuecomment-418339689

If no further progress on the fix, we probably should consider upgrade fastavro 
on Jenkins nodes or rollback relevant 
[PR|https://github.com/apache/beam/pull/6321] since it affects many performance 
benchmarks.


was (Author: markflyhigh):
Investigations are made by Lukasz and summarized in 
https://github.com/apache/beam/pull/6321#issuecomment-418339689

If no further progress on the fix, we probably should consider upgrade fastavro 
on Jenkins nodes or rollback relevant PR 
https://github.com/apache/beam/pull/6321

> python beam_integration_benchmark(0) continuously failed.
> -
>
> Key: BEAM-5296
> URL: https://issues.apache.org/jira/browse/BEAM-5296
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Boyuan Zhang
>Assignee: Mark Liu
>Priority: Major
>
> beam_integration_benchmark0) failed
> https://builds.apache.org/job/beam_PerformanceTests_Python/1377/



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (BEAM-5296) python beam_integration_benchmark(0) continuously failed.

2018-09-05 Thread Mark Liu (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16605115#comment-16605115
 ] 

Mark Liu edited comment on BEAM-5296 at 9/6/18 12:54 AM:
-

Investigations are made by Lukasz and summarized in 
https://github.com/apache/beam/pull/6321#issuecomment-418339689

If no further progress on the fix, we probably should consider upgrade fastavro 
on Jenkins nodes or rollback relevant PR 
https://github.com/apache/beam/pull/6321


was (Author: markflyhigh):
Investigations are made by Lukasz and summarized in 
https://github.com/apache/beam/pull/6321#issuecomment-418339689

We probably should consider upgrade fastavro on Jenkins nodes or rollback 
relevant PR https://github.com/apache/beam/pull/6321

> python beam_integration_benchmark(0) continuously failed.
> -
>
> Key: BEAM-5296
> URL: https://issues.apache.org/jira/browse/BEAM-5296
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Boyuan Zhang
>Assignee: Mark Liu
>Priority: Major
>
> beam_integration_benchmark0) failed
> https://builds.apache.org/job/beam_PerformanceTests_Python/1377/



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (BEAM-5296) python beam_integration_benchmark(0) continuously failed.

2018-09-05 Thread Mark Liu (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16605115#comment-16605115
 ] 

Mark Liu edited comment on BEAM-5296 at 9/6/18 12:52 AM:
-

Investigations are made by Lukasz and summarized in 
https://github.com/apache/beam/pull/6321#issuecomment-418339689

We probably should consider upgrade fastavro on Jenkins nodes or rollback 
relevant PR https://github.com/apache/beam/pull/6321


was (Author: markflyhigh):
Investigations are made by Lukasz and summarized in 
https://github.com/apache/beam/pull/6321#issuecomment-418339689

> python beam_integration_benchmark(0) continuously failed.
> -
>
> Key: BEAM-5296
> URL: https://issues.apache.org/jira/browse/BEAM-5296
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Boyuan Zhang
>Assignee: Mark Liu
>Priority: Major
>
> beam_integration_benchmark0) failed
> https://builds.apache.org/job/beam_PerformanceTests_Python/1377/



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (BEAM-5296) python beam_integration_benchmark(0) continuously failed.

2018-09-05 Thread Mark Liu (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16605115#comment-16605115
 ] 

Mark Liu commented on BEAM-5296:


Investigations are made by Lukasz and summarized in 
https://github.com/apache/beam/pull/6321#issuecomment-418339689

> python beam_integration_benchmark(0) continuously failed.
> -
>
> Key: BEAM-5296
> URL: https://issues.apache.org/jira/browse/BEAM-5296
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Boyuan Zhang
>Assignee: Mark Liu
>Priority: Major
>
> beam_integration_benchmark0) failed
> https://builds.apache.org/job/beam_PerformanceTests_Python/1377/



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141585=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141585
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 06/Sep/18 00:14
Start Date: 06/Sep/18 00:14
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418921593
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141585)
Time Spent: 3h 20m  (was: 3h 10m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 3h 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141584=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141584
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 06/Sep/18 00:13
Start Date: 06/Sep/18 00:13
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418921460
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141584)
Time Spent: 3h 10m  (was: 3h)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141583=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141583
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 06/Sep/18 00:12
Start Date: 06/Sep/18 00:12
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418921172
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141583)
Time Spent: 3h  (was: 2h 50m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 3h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141582=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141582
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 06/Sep/18 00:08
Start Date: 06/Sep/18 00:08
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418920474
 
 
   Run Python PostCommit


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141582)
Time Spent: 2h 50m  (was: 2h 40m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 2h 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141580=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141580
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 06/Sep/18 00:03
Start Date: 06/Sep/18 00:03
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418919687
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141580)
Time Spent: 2h 40m  (was: 2.5h)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 2h 40m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141578=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141578
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 23:56
Start Date: 05/Sep/18 23:56
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418918357
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141578)
Time Spent: 2.5h  (was: 2h 20m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 2.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141571=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141571
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 23:40
Start Date: 05/Sep/18 23:40
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418915687
 
 
   Run Python PostCommit


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141571)
Time Spent: 2h 20m  (was: 2h 10m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141570=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141570
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 23:37
Start Date: 05/Sep/18 23:37
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418915321
 
 
   Run Python PostCommit


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141570)
Time Spent: 2h 10m  (was: 2h)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141569=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141569
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 23:37
Start Date: 05/Sep/18 23:37
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418915227
 
 
   Python SDK PostCommit Tests


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141569)
Time Spent: 2h  (was: 1h 50m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 2h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141566=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141566
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 23:33
Start Date: 05/Sep/18 23:33
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418914603
 
 
   Python SDK PostCommit Tests


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141566)
Time Spent: 1h 50m  (was: 1h 40m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141565=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141565
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 23:30
Start Date: 05/Sep/18 23:30
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418914068
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141565)
Time Spent: 1h 40m  (was: 1.5h)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 1h 40m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141563=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141563
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 23:27
Start Date: 05/Sep/18 23:27
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418913503
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141563)
Time Spent: 1.5h  (was: 1h 20m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141554=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141554
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 23:02
Start Date: 05/Sep/18 23:02
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418909126
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141554)
Time Spent: 1h 20m  (was: 1h 10m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141551=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141551
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 22:57
Start Date: 05/Sep/18 22:57
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418908108
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141551)
Time Spent: 1h 10m  (was: 1h)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141550=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141550
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 22:52
Start Date: 05/Sep/18 22:52
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418907034
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141550)
Time Spent: 1h  (was: 50m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 1h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (BEAM-5329) Increase test coverage of Perfkit benchmark for Python case

2018-09-05 Thread Mark Liu (JIRA)
Mark Liu created BEAM-5329:
--

 Summary: Increase test coverage of Perfkit benchmark for Python 
case
 Key: BEAM-5329
 URL: https://issues.apache.org/jira/browse/BEAM-5329
 Project: Beam
  Issue Type: Test
  Components: testing
Reporter: Mark Liu
Assignee: Jason Kuster


Perfkit benchmark is defined in PerfKitBenchmarker repo (mostly 
[here|https://github.com/GoogleCloudPlatform/PerfKitBenchmarker/blob/master/perfkitbenchmarker/beam_benchmark_helper.py]
 and 
[here|https://github.com/GoogleCloudPlatform/PerfKitBenchmarker/blob/master/perfkitbenchmarker/linux_benchmarks/beam_integration_benchmark.py])
 but not enough test coverage to verify changes to the benchmark or it's utils. 
Breakage can be surfaced only when scheduled Beam Postcommit job runs which 
will directly affect performance measurement.

We want to add more tests starting from unit tests to cover missing branches 
and utils like Python benchmark.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (BEAM-5328) Java starter archetype does not contain dependency versions

2018-09-05 Thread Luke Cwik (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16605008#comment-16605008
 ] 

Luke Cwik commented on BEAM-5328:
-

[~alanmyrvold], can you link to where we test our examples archetypes so that 
they work. Would it be difficult to test the starter archetype as well?

> Java starter archetype does not contain dependency versions
> ---
>
> Key: BEAM-5328
> URL: https://issues.apache.org/jira/browse/BEAM-5328
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Affects Versions: 2.6.0, 2.7.0
>Reporter: Luke Cwik
>Priority: Major
>  Labels: newbie, starter
>
> The starter archetype contains resource annotation markers instead of 
> versions:
> {code:java}
> @maven-compiler-plugin.version@
> @maven-exec-plugin.version@
> @slf4j.version@
> {code}
> in the properties block at the top.
>  
> This means that the starter project is broken without the user manually 
> editing the pom.xml that is generated and populating the versions at the top.
>  
> We also lack testing that validates that the starter archetype works.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (BEAM-5308) JobBundleFactory BindException with FlinkRunner and remote cluster

2018-09-05 Thread Thomas Weise (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16605007#comment-16605007
 ] 

Thomas Weise commented on BEAM-5308:


After the port range fix multiple jobs can run on the cluster. However, the 
second issue of containers not terminating still exists. The docker containers 
remain active after the job has finished and are only removed when the Flink 
cluster is stopped. That's different from behavior in embedded mode, where the 
containers exit after 30s. [~angoenka] any ideas?

> JobBundleFactory BindException with FlinkRunner and remote cluster
> --
>
> Key: BEAM-5308
> URL: https://issues.apache.org/jira/browse/BEAM-5308
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: Thomas Weise
>Assignee: Maximilian Michels
>Priority: Major
>  Labels: portability
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Repeated execution of the same job on remote Flink cluster (not embedded in 
> job server) fails with bind exception. There seem to be 2 issues:
>  * Multiple instances of job bundle factory cannot be created (port conflict)
>  * Job bundle factory is not released after job completes (and Docker 
> container keeps on running). That's not the case in embedded mode).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (BEAM-5328) Java starter archetype does not contain dependency versions

2018-09-05 Thread Luke Cwik (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Luke Cwik updated BEAM-5328:

Labels: newbie starter  (was: )

> Java starter archetype does not contain dependency versions
> ---
>
> Key: BEAM-5328
> URL: https://issues.apache.org/jira/browse/BEAM-5328
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Affects Versions: 2.6.0, 2.7.0
>Reporter: Luke Cwik
>Priority: Major
>  Labels: newbie, starter
>
> The starter archetype contains resource annotation markers instead of 
> versions:
> {code:java}
> @maven-compiler-plugin.version@
> @maven-exec-plugin.version@
> @slf4j.version@
> {code}
> in the properties block at the top.
>  
> This means that the starter project is broken without the user manually 
> editing the pom.xml that is generated and populating the versions at the top.
>  
> We also lack testing that validates that the starter archetype works.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (BEAM-5328) Java starter archetype does not contain dependency versions

2018-09-05 Thread Luke Cwik (JIRA)
Luke Cwik created BEAM-5328:
---

 Summary: Java starter archetype does not contain dependency 
versions
 Key: BEAM-5328
 URL: https://issues.apache.org/jira/browse/BEAM-5328
 Project: Beam
  Issue Type: Bug
  Components: sdk-java-core
Affects Versions: 2.6.0, 2.7.0
Reporter: Luke Cwik


The starter archetype contains resource annotation markers instead of versions:
{code:java}
@maven-compiler-plugin.version@
@maven-exec-plugin.version@
@slf4j.version@
{code}
in the properties block at the top.

 

This means that the starter project is broken without the user manually editing 
the pom.xml that is generated and populating the versions at the top.

 

We also lack testing that validates that the starter archetype works.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5308) JobBundleFactory BindException with FlinkRunner and remote cluster

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5308?focusedWorklogId=141542=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141542
 ]

ASF GitHub Bot logged work on BEAM-5308:


Author: ASF GitHub Bot
Created on: 05/Sep/18 22:14
Start Date: 05/Sep/18 22:14
Worklog Time Spent: 10m 
  Work Description: tweise closed pull request #6335: [BEAM-5308] 
Correct/cleanup DockerOnMac code in DockerJobBundleFactory
URL: https://github.com/apache/beam/pull/6335
 
 
   

This is a PR merged from a forked repository.
As GitHub hides the original diff on merge, it is displayed below for
the sake of provenance:

As this is a foreign pull request (from a fork), the diff is supplied
below (as it won't show otherwise due to GitHub magic):

diff --git 
a/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/DockerJobBundleFactory.java
 
b/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/DockerJobBundleFactory.java
index 3178a2e2b3b..fec29231a97 100644
--- 
a/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/DockerJobBundleFactory.java
+++ 
b/runners/java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution/control/DockerJobBundleFactory.java
@@ -43,11 +43,6 @@
 public class DockerJobBundleFactory extends JobBundleFactoryBase {
   private static final Logger LOG = 
LoggerFactory.getLogger(DockerJobBundleFactory.class);
 
-  // Port offset for MacOS since we don't have host networking and need to use 
published ports
-  private static final int MAC_PORT_START = 8100;
-  private static final int MAC_PORT_END = 8200;
-  private static final AtomicInteger MAC_PORT = new 
AtomicInteger(MAC_PORT_START);
-
   /** Factory that creates {@link JobBundleFactory} for the given {@link 
JobInfo}. */
   public interface JobBundleFactoryFactory {
 JobBundleFactory create(JobInfo jobInfo) throws Exception;
@@ -63,10 +58,6 @@ public JobBundleFactory create(JobInfo jobInfo) throws 
Exception {
 }
   });
 
-  // TODO: This host name seems to change with every other Docker release. Do 
we attempt to keep up
-  // or attempt to document the supported Docker version(s)?
-  private static final String DOCKER_FOR_MAC_HOST = "host.docker.internal";
-
   public static JobBundleFactory create(JobInfo jobInfo) throws Exception {
 return FACTORY.get().create(jobInfo);
   }
@@ -113,21 +104,7 @@ protected ServerFactory getServerFactory() {
   case LINUX:
 return ServerFactory.createDefault();
   case MAC:
-// NOTE: Deployment on Macs is intended for local development. As of 
18.03, Docker-for-Mac
-// does not implement host networking (--networking=host is 
effectively a no-op). Instead,
-// we use a special DNS entry that points to the host:
-// 
https://docs.docker.com/docker-for-mac/networking/#use-cases-and-workarounds
-// The special hostname has historically changed between versions, so 
this is subject to
-// breakages and will likely only support the latest version at any 
time.
-
-// We need to use a fixed port range due to non-existing host 
networking in Docker-for-Mac.
-// The port range needs to be published when bringing up the Docker 
container, see
-// DockerEnvironmentFactory.
-
-return ServerFactory.createWithUrlFactoryAndPortSupplier(
-(host, port) -> HostAndPort.fromParts(DOCKER_FOR_MAC_HOST, 
port).toString(),
-// We only use the published Docker ports 8100-8200 in a 
round-robin fashion
-() -> MAC_PORT.getAndUpdate(val -> val == MAC_PORT_END ? 
MAC_PORT_START : val + 1));
+return DockerOnMac.getServerFactory();
   default:
 LOG.warn("Unknown Docker platform. Falling back to default server 
factory");
 return ServerFactory.createDefault();
@@ -140,7 +117,7 @@ private static Platform getPlatform() {
 // The DOCKER_MAC_CONTAINER environment variable is necessary to detect 
whether we run on
 // a container on MacOs. MacOs internally uses a Linux VM which makes it 
indistinguishable from Linux.
 // We still need to apply port mapping due to missing host networking.
-if (osName.startsWith("mac") || 
"1".equals(System.getenv("DOCKER_MAC_CONTAINER"))) {
+if (osName.startsWith("mac") || DockerOnMac.RUNNING_INSIDE_DOCKER_ON_MAC) {
   return Platform.MAC;
 } else if (osName.startsWith("linux")) {
   return Platform.LINUX;
@@ -154,6 +131,44 @@ private static Platform getPlatform() {
 OTHER,
   }
 
+  /**
+   * NOTE: Deployment on Macs is intended for local development. As of 18.03, 
Docker-for-Mac does
+   * not implement host networking (--networking=host is effectively a no-op). 
Instead, we use a
+   * special DNS entry that points to the host:
+   * 

[beam] 01/01: Merge pull request #6335: [BEAM-5308] Correct/cleanup DockerOnMac code in DockerJobBundleFactory

2018-09-05 Thread thw
This is an automated email from the ASF dual-hosted git repository.

thw pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 0749dc56348880b30d7d6824adf125c88ab895ce
Merge: 96df192 d701a91
Author: Thomas Weise 
AuthorDate: Thu Sep 6 00:14:16 2018 +0200

Merge pull request #6335: [BEAM-5308] Correct/cleanup DockerOnMac code in 
DockerJobBundleFactory

 .../control/DockerJobBundleFactory.java| 65 +-
 1 file changed, 40 insertions(+), 25 deletions(-)



[beam] branch master updated (96df192 -> 0749dc5)

2018-09-05 Thread thw
This is an automated email from the ASF dual-hosted git repository.

thw pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git.


from 96df192  Merge pull request #6290 from 
amaliujia/rui_wang-correct_test_case_of_complex_type
 add d701a91  [BEAM-5308] Correct/cleanup DockerOnMac code in 
DockerJobBundleFactory
 new 0749dc5  Merge pull request #6335: [BEAM-5308] Correct/cleanup 
DockerOnMac code in DockerJobBundleFactory

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 .../control/DockerJobBundleFactory.java| 65 +-
 1 file changed, 40 insertions(+), 25 deletions(-)



[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141541=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141541
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 22:12
Start Date: 05/Sep/18 22:12
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6340: [BEAM-5283] Fixing 
Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340#issuecomment-418898557
 
 
   Run Python Flink PortableValidatesRunner


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141541)
Time Spent: 50m  (was: 40m)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5283?focusedWorklogId=141540=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141540
 ]

ASF GitHub Bot logged work on BEAM-5283:


Author: ASF GitHub Bot
Created on: 05/Sep/18 22:12
Start Date: 05/Sep/18 22:12
Worklog Time Spent: 10m 
  Work Description: angoenka opened a new pull request #6340: [BEAM-5283] 
Fixing Comment in Flink Post commit jenkins task
URL: https://github.com/apache/beam/pull/6340
 
 
   **Please** add a meaningful description for your change here
   
   
   
   Follow this checklist to help us incorporate your contribution quickly and 
easily:
   
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   It will help us expedite review of your Pull Request if you tag someone 
(e.g. `@username`) to look at it.
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Gearpump | Samza | Spark
   --- | --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_GradleBuild/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_GradleBuild/lastCompletedBuild/)
 | --- | --- | --- | --- | --- | ---
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_GradleBuild/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_GradleBuild/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex_Gradle/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex_Gradle/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Gradle/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Gradle/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Gradle/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Gradle/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump_Gradle/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump_Gradle/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza_Gradle/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza_Gradle/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark_Gradle/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark_Gradle/lastCompletedBuild/)
   Python | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Python_Verify/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python_Verify/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCompletedBuild/)
  [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Py_ValCont/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Py_ValCont/lastCompletedBuild/)
 | --- | --- | --- | ---
   
   
   
   
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141540)
Time Spent: 40m  (was: 0.5h)

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 40m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA

[jira] [Work logged] (BEAM-5262) JobState support for Reference Runner

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5262?focusedWorklogId=141539=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141539
 ]

ASF GitHub Bot logged work on BEAM-5262:


Author: ASF GitHub Bot
Created on: 05/Sep/18 21:38
Start Date: 05/Sep/18 21:38
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #6301: [BEAM-5262] Add 
Reference runner support for add state stream
URL: https://github.com/apache/beam/pull/6301#issuecomment-418890184
 
 
   Ping!


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141539)
Time Spent: 0.5h  (was: 20m)

> JobState support for Reference Runner
> -
>
> Key: BEAM-5262
> URL: https://issues.apache.org/jira/browse/BEAM-5262
> Project: Beam
>  Issue Type: Bug
>  Components: runner-direct
>Reporter: Ankur Goenka
>Assignee: Ankur Goenka
>Priority: Minor
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Reference runner does not support getStateStream which is needed by portable 
> SDK



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5301) Migrate integration tests for datastore_wordcount

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5301?focusedWorklogId=141531=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141531
 ]

ASF GitHub Bot logged work on BEAM-5301:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:59
Start Date: 05/Sep/18 20:59
Worklog Time Spent: 10m 
  Work Description: yifanzou commented on issue #6333: [BEAM-5301] migrate 
datastore_worcount integration test to Beam
URL: https://github.com/apache/beam/pull/6333#issuecomment-418879614
 
 
   Run Python PreCommit


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141531)
Time Spent: 20m  (was: 10m)

> Migrate integration tests for datastore_wordcount
> -
>
> Key: BEAM-5301
> URL: https://issues.apache.org/jira/browse/BEAM-5301
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>Reporter: yifan zou
>Assignee: yifan zou
>Priority: Major
>  Time Spent: 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[beam-site] 01/01: Prepare repository for deployment.

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 3adb0c6f504df0fc04477e8dddb36077eeebfc3d
Author: Mergebot 
AuthorDate: Wed Sep 5 20:59:24 2018 +

Prepare repository for deployment.
---
 content/contribute/design-documents/index.html | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/content/contribute/design-documents/index.html 
b/content/contribute/design-documents/index.html
index 505efbb..fb2559e 100644
--- a/content/contribute/design-documents/index.html
+++ b/content/contribute/design-documents/index.html
@@ -275,7 +275,7 @@ limitations under the License.
   Error Reporting Callback (WIP) [https://docs.google.com/document/d/1o2VXwCL97k3G-1BR9RSKNc6XtJTIA6SEKPMne91S67Y;>doc]
   Snapshotting and Updating Beam Pipelines [https://docs.google.com/document/d/1UWhnYPgui0gUYOsuGcCjLuoOUlGA4QaY91n8p3wz9MY;>doc]
   Requiring PTransform to set a coder on its resulting collections [https://lists.apache.org/thread.html/1dde0b5a93c2983cbab5f68ce7c74580102f5bb2baaa816585d7eabb@%3Cdev.beam.apache.org%3E;>mail]
-  [PROPOSAL] Support of @RequiresStableInput annotation [https://docs.google.com/document/d/117yRKbbcEdm3eIKB_26BHOJGmHSZl1YNoF0RqWGtqAM;>doc],
 [https://lists.apache.org/thread.html/ae3c838df060e47148439d1dad818d5e927b2a25ff00cc4153221dff@%3Cdev.beam.apache.org%3E;>mail]
+  Support of @RequiresStableInput annotation [https://docs.google.com/document/d/117yRKbbcEdm3eIKB_26BHOJGmHSZl1YNoF0RqWGtqAM;>doc],
 [https://lists.apache.org/thread.html/ae3c838df060e47148439d1dad818d5e927b2a25ff00cc4153221dff@%3Cdev.beam.apache.org%3E;>mail]
   [PROPOSAL] @onwindowexpiration [https://lists.apache.org/thread.html/1dab7f17c97378e665928b6cbd887dc7be93390ab26c593ee49a@%3Cdev.beam.apache.org%3E;>mail]
 
 



[beam-site] branch asf-site updated (5a00fd9 -> 3adb0c6)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from 5a00fd9  Prepare repository for deployment.
 add 288f55d  Remove the [PROPOSAL] label of the @RequiresStableInput 
design doc
 add aa437b2  This closes #543
 new 3adb0c6  Prepare repository for deployment.

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/design-documents/index.html | 2 +-
 src/contribute/design-documents.md | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)



[beam-site] branch mergebot updated (49563e7 -> aa437b2)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from 49563e7  This closes #545
 add 5a00fd9  Prepare repository for deployment.
 new 288f55d  Remove the [PROPOSAL] label of the @RequiresStableInput 
design doc
 new aa437b2  This closes #543

The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/index.html  | 36 
 src/contribute/design-documents.md |  2 +-
 2 files changed, 37 insertions(+), 1 deletion(-)



[beam-site] 02/02: This closes #543

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit aa437b26935f27c97e0ee3a855bf8b21776c477b
Merge: 5a00fd9 288f55d
Author: Mergebot 
AuthorDate: Wed Sep 5 20:56:26 2018 +

This closes #543

 src/contribute/design-documents.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)



[beam-site] 01/02: Remove the [PROPOSAL] label of the @RequiresStableInput design doc

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 288f55dc1594ed5be14a26f1d8a31732a8279f18
Author: Yueyang Qiu 
AuthorDate: Mon Aug 27 16:03:12 2018 -0700

Remove the [PROPOSAL] label of the @RequiresStableInput design doc
---
 src/contribute/design-documents.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/src/contribute/design-documents.md 
b/src/contribute/design-documents.md
index a37beca..deed450 100644
--- a/src/contribute/design-documents.md
+++ b/src/contribute/design-documents.md
@@ -53,7 +53,7 @@ This is a collection of documents that may or may not be up 
to date.
 - Error Reporting Callback (WIP) 
[[doc](https://docs.google.com/document/d/1o2VXwCL97k3G-1BR9RSKNc6XtJTIA6SEKPMne91S67Y)]
 - Snapshotting and Updating Beam Pipelines 
[[doc](https://docs.google.com/document/d/1UWhnYPgui0gUYOsuGcCjLuoOUlGA4QaY91n8p3wz9MY)]
 - Requiring PTransform to set a coder on its resulting collections 
[[mail](https://lists.apache.org/thread.html/1dde0b5a93c2983cbab5f68ce7c74580102f5bb2baaa816585d7eabb@%3Cdev.beam.apache.org%3E)]
-- [PROPOSAL] Support of @RequiresStableInput annotation 
[[doc](https://docs.google.com/document/d/117yRKbbcEdm3eIKB_26BHOJGmHSZl1YNoF0RqWGtqAM)],
 
[[mail](https://lists.apache.org/thread.html/ae3c838df060e47148439d1dad818d5e927b2a25ff00cc4153221dff@%3Cdev.beam.apache.org%3E)]
+- Support of @RequiresStableInput annotation 
[[doc](https://docs.google.com/document/d/117yRKbbcEdm3eIKB_26BHOJGmHSZl1YNoF0RqWGtqAM)],
 
[[mail](https://lists.apache.org/thread.html/ae3c838df060e47148439d1dad818d5e927b2a25ff00cc4153221dff@%3Cdev.beam.apache.org%3E)]
 - [PROPOSAL] @onwindowexpiration 
[[mail](https://lists.apache.org/thread.html/1dab7f17c97378e665928b6cbd887dc7be93390ab26c593ee49a@%3Cdev.beam.apache.org%3E)]
 
 ### IO / Filesystem



[beam-site] branch asf-site updated (2aca6fa -> 5a00fd9)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from 2aca6fa  Prepare repository for deployment.
 add f1dcace  Contribute: document how to find and add reviewers
 add 49563e7  This closes #545
 new 5a00fd9  Prepare repository for deployment.

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/index.html | 36 
 src/contribute/index.md   | 30 ++
 2 files changed, 66 insertions(+)



[beam-site] 01/01: Prepare repository for deployment.

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 5a00fd9a0d3de646f5a4d4cea554d3930e0231e8
Author: Mergebot 
AuthorDate: Wed Sep 5 20:55:27 2018 +

Prepare repository for deployment.
---
 content/contribute/index.html | 36 
 1 file changed, 36 insertions(+)

diff --git a/content/contribute/index.html b/content/contribute/index.html
index c6f1426..b2767bf 100644
--- a/content/contribute/index.html
+++ b/content/contribute/index.html
@@ -207,6 +207,12 @@
   Developing with the Python 
SDK
 
   
+  Reviews
+
+  Finding reviewers
+  Adding yourself as a 
reviewer
+
+  
   Contributing to the 
website
   Works in progress
 
@@ -479,6 +485,36 @@ $ python setup.py sdist
 
 
 
+Reviews
+
+Reviewers for https://github.com/apache/beam;>apache/beam are 
listed in
+Prow-style OWNERS files. A description of these files can be found
+https://go.k8s.io/owners;>here.
+
+Finding reviewers
+
+Currently this is a manual process. Tracking bug for automating this:
+https://issues.apache.org/jira/browse/BEAM-4790;>BEAM-4790.
+
+For each file to be reviewed, look for an OWNERS file in its directory. 
Pick a
+single reviewer from that file. If the directory doesn’t contain an OWNERS 
file,
+go up a directory. Keep going until you find one. Try to limit the number of
+reviewers to 2 per PR if possible, to minimize reviewer load.
+
+Adding yourself as a reviewer
+
+Find the deepest sub-directory that contains the files you want to be a 
reviewer
+for and add your Github username under reviewers in the OWNERS file (create a
+new OWNERS file if necessary).
+
+The Beam project currently only uses the reviewers key in OWNERS and no other
+features, as reviewer selection is still a manual process.
+
+
+
 Contributing to the website
 
 The Beam website is in the https://github.com/apache/beam-site;>Beam Site GitHub



[jira] [Work logged] (BEAM-4790) Make it easier to select code reviewers for PRs

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4790?focusedWorklogId=141530=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141530
 ]

ASF GitHub Bot logged work on BEAM-4790:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:55
Start Date: 05/Sep/18 20:55
Worklog Time Spent: 10m 
  Work Description: asfgit closed pull request #545: [BEAM-4790] 
Contribute: document how to find and add reviewers
URL: https://github.com/apache/beam-site/pull/545
 
 
   

This is a PR merged from a forked repository.
As GitHub hides the original diff on merge, it is displayed below for
the sake of provenance:

As this is a foreign pull request (from a fork), the diff is supplied
below (as it won't show otherwise due to GitHub magic):

diff --git a/src/contribute/index.md b/src/contribute/index.md
index b202214218..a26ca8939a 100644
--- a/src/contribute/index.md
+++ b/src/contribute/index.md
@@ -228,6 +228,36 @@ $ python setup.py sdist > /dev/null && \
 --sdk_location dist/apache-beam-2.5.0.dev0.tar.gz
 ```
 
+## Reviews
+
+Reviewers for [apache/beam](https://github.com/apache/beam) are listed in
+Prow-style OWNERS files. A description of these files can be found
+[here](https://go.k8s.io/owners).
+
+### Finding reviewers
+
+Currently this is a manual process. Tracking bug for automating this:
+[BEAM-4790](https://issues.apache.org/jira/browse/BEAM-4790).
+
+For each file to be reviewed, look for an OWNERS file in its directory. Pick a
+single reviewer from that file. If the directory doesn't contain an OWNERS 
file,
+go up a directory. Keep going until you find one. Try to limit the number of
+reviewers to 2 per PR if possible, to minimize reviewer load.
+
+### Adding yourself as a reviewer
+
+Find the deepest sub-directory that contains the files you want to be a 
reviewer
+for and add your Github username under `reviewers` in the OWNERS file (create a
+new OWNERS file if necessary).
+
+The Beam project currently only uses the `reviewers` key in OWNERS and no other
+features, as reviewer selection is still a manual process.
+
+
+
 ## Contributing to the website
 
 The Beam website is in the [Beam Site GitHub


 


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141530)
Time Spent: 3h 40m  (was: 3.5h)

> Make it easier to select code reviewers for PRs
> ---
>
> Key: BEAM-4790
> URL: https://issues.apache.org/jira/browse/BEAM-4790
> Project: Beam
>  Issue Type: Improvement
>  Components: project-management
>Reporter: Udi Meiri
>Assignee: Udi Meiri
>Priority: Major
>  Time Spent: 3h 40m
>  Remaining Estimate: 0h
>
> The original idea was to use Github's CODEOWNERS feature, but that turned out 
> to not work well for Apache Beam.
> Thread: 
> https://lists.apache.org/thread.html/9cc3191b21489c855ed502beade6d453b18e26eec46cb2c3dfe810e9@%3Cdev.beam.apache.org%3E
> Document discussing this idea:
> https://docs.google.com/document/d/184Tn0ONJ2ENHBVrhIes4seVuBSkww7xX5xb5W2VZwm8/edit#heading=h.l7mcrg761zp8
> Mailing list thread:
> https://lists.apache.org/thread.html/6138d08c551e254b5f13b26c6ba06579a49a4694f4d13ad6d164689a@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[beam-site] 01/02: Contribute: document how to find and add reviewers

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit f1dcace7db47779c5a9589202c0522687d15a815
Author: Udi Meiri 
AuthorDate: Fri Aug 31 11:42:37 2018 -0700

Contribute: document how to find and add reviewers
---
 src/contribute/index.md | 30 ++
 1 file changed, 30 insertions(+)

diff --git a/src/contribute/index.md b/src/contribute/index.md
index b202214..a26ca89 100644
--- a/src/contribute/index.md
+++ b/src/contribute/index.md
@@ -228,6 +228,36 @@ $ python setup.py sdist > /dev/null && \
 --sdk_location dist/apache-beam-2.5.0.dev0.tar.gz
 ```
 
+## Reviews
+
+Reviewers for [apache/beam](https://github.com/apache/beam) are listed in
+Prow-style OWNERS files. A description of these files can be found
+[here](https://go.k8s.io/owners).
+
+### Finding reviewers
+
+Currently this is a manual process. Tracking bug for automating this:
+[BEAM-4790](https://issues.apache.org/jira/browse/BEAM-4790).
+
+For each file to be reviewed, look for an OWNERS file in its directory. Pick a
+single reviewer from that file. If the directory doesn't contain an OWNERS 
file,
+go up a directory. Keep going until you find one. Try to limit the number of
+reviewers to 2 per PR if possible, to minimize reviewer load.
+
+### Adding yourself as a reviewer
+
+Find the deepest sub-directory that contains the files you want to be a 
reviewer
+for and add your Github username under `reviewers` in the OWNERS file (create a
+new OWNERS file if necessary).
+
+The Beam project currently only uses the `reviewers` key in OWNERS and no other
+features, as reviewer selection is still a manual process.
+
+
+
 ## Contributing to the website
 
 The Beam website is in the [Beam Site GitHub



[beam-site] branch mergebot updated (d065f1e -> 49563e7)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from d065f1e  This closes #547
 add 2aca6fa  Prepare repository for deployment.
 new f1dcace  Contribute: document how to find and add reviewers
 new 49563e7  This closes #545

The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/eclipse/index.html | 10 +-
 src/contribute/index.md   | 30 ++
 2 files changed, 31 insertions(+), 9 deletions(-)



[beam-site] 02/02: This closes #545

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 49563e7d97d2b37a2f02869c45976830c7f33fec
Merge: 2aca6fa f1dcace
Author: Mergebot 
AuthorDate: Wed Sep 5 20:52:44 2018 +

This closes #545

 src/contribute/index.md | 30 ++
 1 file changed, 30 insertions(+)



[beam-site] 01/01: Prepare repository for deployment.

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 2aca6fa7a6d7be380f1fb5fc941a2edbc29b805c
Author: Mergebot 
AuthorDate: Wed Sep 5 20:52:02 2018 +

Prepare repository for deployment.
---
 content/contribute/eclipse/index.html | 10 +-
 1 file changed, 1 insertion(+), 9 deletions(-)

diff --git a/content/contribute/eclipse/index.html 
b/content/contribute/eclipse/index.html
index 6ee2b6d..480e7f2 100644
--- a/content/contribute/eclipse/index.html
+++ b/content/contribute/eclipse/index.html
@@ -227,15 +227,7 @@ limitations under the License.
 Eclipse Tips
 
 
-  These are best-effort community-contributed tips, and are not…
-
-  
-… guaranteed to work with any particular Eclipse setup.
-… the only or best way to work.
-… an endorsement of Eclipse over alternatives.
-  
-
-  Mastering Eclipse is, ultimately, your responsibility.
+  These are best-effort community-contributed tips, and are not guaranteed 
to work with any particular Eclipse setup.
 
 
 Eclipse version



[beam-site] branch asf-site updated (7a75616 -> 2aca6fa)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from 7a75616  Prepare repository for deployment.
 add 8a1a418  Update eclipse.md
 add d065f1e  This closes #547
 new 2aca6fa  Prepare repository for deployment.

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/eclipse/index.html | 10 +-
 src/contribute/eclipse.md |  8 +---
 2 files changed, 2 insertions(+), 16 deletions(-)



[jira] [Work logged] (BEAM-5124) Write Euphoria in Beam documentation

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5124?focusedWorklogId=141529=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141529
 ]

ASF GitHub Bot logged work on BEAM-5124:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:50
Start Date: 05/Sep/18 20:50
Worklog Time Spent: 10m 
  Work Description: melap commented on issue #540: [BEAM-5124] DSL Euphoria 
documentation update
URL: https://github.com/apache/beam-site/pull/540#issuecomment-418876911
 
 
   R: @jbonofre 


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141529)
Time Spent: 2h 10m  (was: 2h)

> Write Euphoria in Beam documentation
> 
>
> Key: BEAM-5124
> URL: https://issues.apache.org/jira/browse/BEAM-5124
> Project: Beam
>  Issue Type: Sub-task
>  Components: dsl-euphoria
>Reporter: Vaclav Plajt
>Assignee: Vaclav Plajt
>Priority: Major
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[beam-site] 01/02: Update eclipse.md

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 8a1a418c54a669080519cf0a54795f8a7d3ebed8
Author: Rafael Fernandez 
AuthorDate: Fri Aug 31 19:12:29 2018 -0700

Update eclipse.md

Friendlier language...
---
 src/contribute/eclipse.md | 8 +---
 1 file changed, 1 insertion(+), 7 deletions(-)

diff --git a/src/contribute/eclipse.md b/src/contribute/eclipse.md
index bedf2bd..4eff1f0 100644
--- a/src/contribute/eclipse.md
+++ b/src/contribute/eclipse.md
@@ -20,13 +20,7 @@ limitations under the License.
 
 # Eclipse Tips
 
-> These are best-effort community-contributed tips, and are not...
->
-> - ... guaranteed to work with any particular Eclipse setup.
-> - ... the only or best way to work.
-> - ... an endorsement of Eclipse over alternatives.
->
-> Mastering Eclipse is, ultimately, your responsibility.
+> These are best-effort community-contributed tips, and are not guaranteed to 
work with any particular Eclipse setup.
 
 ## Eclipse version
 



[beam-site] 02/02: This closes #547

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit d065f1e64f095dd2f4f98a1fa0463663d3a08851
Merge: 7a75616 8a1a418
Author: Mergebot 
AuthorDate: Wed Sep 5 20:48:59 2018 +

This closes #547

 src/contribute/eclipse.md | 8 +---
 1 file changed, 1 insertion(+), 7 deletions(-)



[beam-site] branch mergebot updated (91d5227 -> d065f1e)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from 91d5227  This closes #544
 add 7a75616  Prepare repository for deployment.
 new 8a1a418  Update eclipse.md
 new d065f1e  This closes #547

The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/portability/index.html | 5 +++--
 src/contribute/eclipse.md | 8 +---
 2 files changed, 4 insertions(+), 9 deletions(-)



[beam-site] 01/01: Prepare repository for deployment.

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 7a7561660612a9561be7107fd3fc92a9e2eb4a5f
Author: Mergebot 
AuthorDate: Wed Sep 5 20:48:19 2018 +

Prepare repository for deployment.
---
 content/contribute/portability/index.html | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/content/contribute/portability/index.html 
b/content/contribute/portability/index.html
index bae564b..cb90344 100644
--- a/content/contribute/portability/index.html
+++ b/content/contribute/portability/index.html
@@ -417,7 +417,7 @@ their respective components.
 Status
 
 MVP in progress (near completion for Flink runner). See the
-https://docs.google.com/spreadsheets/d/1KDa_FGn1ShjomGd-UUDOhuh2q73de2tPz6BqHpzqvNI/edit?usp=sharing;>Portability
 support table
+https://s.apache.org/apache-beam-portability-support-table;>Portability 
support table
 for details.
 
 Running Python wordcount on 
Flink
@@ -437,7 +437,8 @@ docker
 
   Start local Flink cluster
   Create shaded JobService jar: ./gradlew 
:beam-runners-flink_2.11-job-server:installShadowDist
-  Start JobService with Flink web service endpoint: java -jar 
./runners/flink/job-server/build/install/beam-runners-flink_2.11-job-server-shadow/lib/beam-runners-flink_2.11-job-server-*.jar
 "--job-host=localhost:8099" "--artifacts-dir=/tmp/flink-artifacts" 
"--flink-master-url=localhost:8081"
+  Optional optimization step: Place the generated JobServer Jar beam/runners/flink/job-server/build/libs/beam-runners-flink_2.11-job-server-2.7.0-SNAPSHOT.jar
 in flink/lib and change class loading 
order for Flink by adding classloader.resolve-order: parent-first to 
conf/flink-conf.yaml.
+  Start JobService with Flink web service endpoint: ./gradlew 
:beam-runners-flink_2.11-job-server:runShadow 
-PflinkMasterUrl=localhost:8081
   Submit the pipeline.
 
 



[beam-site] branch asf-site updated (27eac2e -> 7a75616)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from 27eac2e  Prepare repository for deployment.
 add 0c32c28  Update Portable Flink cluster run instructions
 add 91d5227  This closes #544
 new 7a75616  Prepare repository for deployment.

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/portability/index.html | 5 +++--
 src/contribute/portability.md | 7 ---
 2 files changed, 7 insertions(+), 5 deletions(-)



[beam-site] 02/02: This closes #544

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 91d522768cd811cc7549b135961e819d130c7c98
Merge: 27eac2e 0c32c28
Author: Mergebot 
AuthorDate: Wed Sep 5 20:45:00 2018 +

This closes #544

 src/contribute/portability.md | 7 ---
 1 file changed, 4 insertions(+), 3 deletions(-)



[beam-site] branch mergebot updated (6932df9 -> 91d5227)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from 6932df9  This closes #546
 add 27eac2e  Prepare repository for deployment.
 new 0c32c28  Update Portable Flink cluster run instructions
 new 91d5227  This closes #544

The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/intellij/index.html | 10 +-
 src/contribute/portability.md  |  7 ---
 2 files changed, 5 insertions(+), 12 deletions(-)



[beam-site] 01/02: Update Portable Flink cluster run instructions

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 0c32c2823673bcfa5f146be9705b2d469b13d3ab
Author: Ankur 
AuthorDate: Wed Aug 29 14:42:47 2018 -0700

Update Portable Flink cluster run instructions
---
 src/contribute/portability.md | 7 ---
 1 file changed, 4 insertions(+), 3 deletions(-)

diff --git a/src/contribute/portability.md b/src/contribute/portability.md
index 4e3be37..c3abaad 100644
--- a/src/contribute/portability.md
+++ b/src/contribute/portability.md
@@ -175,7 +175,7 @@ their respective components.
 ### Status
 
 MVP in progress (near completion for Flink runner). See the
-[Portability support 
table](https://docs.google.com/spreadsheets/d/1KDa_FGn1ShjomGd-UUDOhuh2q73de2tPz6BqHpzqvNI/edit?usp=sharing)
+[Portability support 
table](https://s.apache.org/apache-beam-portability-support-table)
 for details.
 
  Running Python wordcount on Flink
@@ -192,7 +192,8 @@ To run on a separate [Flink 
cluster](https://ci.apache.org/projects/flink/flink-
 
 1. Start local Flink cluster
 2. Create shaded JobService jar: `./gradlew 
:beam-runners-flink_2.11-job-server:installShadowDist`
-3. Start JobService with Flink web service endpoint: `java -jar 
./runners/flink/job-server/build/install/beam-runners-flink_2.11-job-server-shadow/lib/beam-runners-flink_2.11-job-server-*.jar
 "--job-host=localhost:8099" "--artifacts-dir=/tmp/flink-artifacts" 
"--flink-master-url=localhost:8081"`
-4. Submit the pipeline.
+3. Optional optimization step: Place the generated JobServer Jar 
`beam/runners/flink/job-server/build/libs/beam-runners-flink_2.11-job-server-2.7.0-SNAPSHOT.jar`
 in `flink/lib` and change class loading order for Flink by adding 
`classloader.resolve-order: parent-first` to `conf/flink-conf.yaml`.
+4. Start JobService with Flink web service endpoint: `./gradlew 
:beam-runners-flink_2.11-job-server:runShadow -PflinkMasterUrl=localhost:8081`
+5. Submit the pipeline.
 
 Note: A subset of the functionality is also supported in streaming mode; use 
`--streaming` in the command line to enable it.



[jira] [Work logged] (BEAM-4790) Make it easier to select code reviewers for PRs

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4790?focusedWorklogId=141528=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141528
 ]

ASF GitHub Bot logged work on BEAM-4790:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:45
Start Date: 05/Sep/18 20:45
Worklog Time Spent: 10m 
  Work Description: melap commented on issue #545: [BEAM-4790] Contribute: 
document how to find and add reviewers
URL: https://github.com/apache/beam-site/pull/545#issuecomment-418875200
 
 
   @asfgit merge


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141528)
Time Spent: 3.5h  (was: 3h 20m)

> Make it easier to select code reviewers for PRs
> ---
>
> Key: BEAM-4790
> URL: https://issues.apache.org/jira/browse/BEAM-4790
> Project: Beam
>  Issue Type: Improvement
>  Components: project-management
>Reporter: Udi Meiri
>Assignee: Udi Meiri
>Priority: Major
>  Time Spent: 3.5h
>  Remaining Estimate: 0h
>
> The original idea was to use Github's CODEOWNERS feature, but that turned out 
> to not work well for Apache Beam.
> Thread: 
> https://lists.apache.org/thread.html/9cc3191b21489c855ed502beade6d453b18e26eec46cb2c3dfe810e9@%3Cdev.beam.apache.org%3E
> Document discussing this idea:
> https://docs.google.com/document/d/184Tn0ONJ2ENHBVrhIes4seVuBSkww7xX5xb5W2VZwm8/edit#heading=h.l7mcrg761zp8
> Mailing list thread:
> https://lists.apache.org/thread.html/6138d08c551e254b5f13b26c6ba06579a49a4694f4d13ad6d164689a@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (BEAM-5258) Investigate if we can disable Row type flattening in Calcite

2018-09-05 Thread Rui Wang (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604918#comment-16604918
 ] 

Rui Wang commented on BEAM-5258:


The feedback from Calcite community saying that disabling flatterer makes 
sense. It might not be a minor change though.

> Investigate if we can disable Row type flattening in Calcite
> 
>
> Key: BEAM-5258
> URL: https://issues.apache.org/jira/browse/BEAM-5258
> Project: Beam
>  Issue Type: Sub-task
>  Components: dsl-sql
>Reporter: Rui Wang
>Assignee: Rui Wang
>Priority: Major
>
> Either disable the flattening in PlannerImpl or Flattener could be a good 
> start.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[beam-site] 01/01: Prepare repository for deployment.

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 27eac2e570dac5df1a1767f9e6aa19851d8a2dd8
Author: Mergebot 
AuthorDate: Wed Sep 5 20:44:19 2018 +

Prepare repository for deployment.
---
 content/contribute/intellij/index.html | 10 +-
 1 file changed, 1 insertion(+), 9 deletions(-)

diff --git a/content/contribute/intellij/index.html 
b/content/contribute/intellij/index.html
index 5da99be..c4de113 100644
--- a/content/contribute/intellij/index.html
+++ b/content/contribute/intellij/index.html
@@ -225,15 +225,7 @@ limitations under the License.
 IntelliJ Tips
 
 
-  These are best-effort community-contributed tips, and are not…
-
-  
-… guaranteed to work with any particular IntelliJ setup.
-… the only or best way to work.
-… an endorsement of IntelliJ over alternatives.
-  
-
-  Mastering IntelliJ is, ultimately, your responsibility.
+  These are best-effort community-contributed tips, and are not guaranteed 
to work with any particular IntelliJ setup.
 
 
 Create a working Gradle module



[beam-site] branch asf-site updated (423f9e8 -> 27eac2e)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from 423f9e8  Prepare repository for deployment.
 add 1627e8b  Update intellij.md
 add 6932df9  This closes #546
 new 27eac2e  Prepare repository for deployment.

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 content/contribute/intellij/index.html | 10 +-
 src/contribute/intellij.md |  8 +---
 2 files changed, 2 insertions(+), 16 deletions(-)



[beam-site] 02/02: This closes #546

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 6932df9bde9c39892210b0128f6412828620a0f6
Merge: 423f9e8 1627e8b
Author: Mergebot 
AuthorDate: Wed Sep 5 20:41:16 2018 +

This closes #546

 src/contribute/intellij.md | 8 +---
 1 file changed, 1 insertion(+), 7 deletions(-)



[beam-site] branch mergebot updated (d8070cb -> 6932df9)

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a change to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git.


from d8070cb  This closes #542
 add 423f9e8  Prepare repository for deployment.
 new 1627e8b  Update intellij.md
 new 6932df9  This closes #546

The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 .../2018/08/21/beam-summit-europe.html}|  61 +-
 content/blog/index.html|  19 +
 content/feed.xml   | 639 ++---
 {src => content}/images/blog/Facebook-AD.png   | Bin
 content/index.html |  10 +-
 src/contribute/intellij.md |   8 +-
 6 files changed, 98 insertions(+), 639 deletions(-)
 copy content/{beam/capability/2016/04/03/presentation-materials.html => 
blog/2018/08/21/beam-summit-europe.html} (74%)
 copy {src => content}/images/blog/Facebook-AD.png (100%)



[beam-site] 01/02: Update intellij.md

2018-09-05 Thread mergebot-role
This is an automated email from the ASF dual-hosted git repository.

mergebot-role pushed a commit to branch mergebot
in repository https://gitbox.apache.org/repos/asf/beam-site.git

commit 1627e8bd84eb04265f38ffe89e53e6bbc58e
Author: Rafael Fernandez 
AuthorDate: Fri Aug 31 19:11:31 2018 -0700

Update intellij.md

Soften the language a bit.
---
 src/contribute/intellij.md | 8 +---
 1 file changed, 1 insertion(+), 7 deletions(-)

diff --git a/src/contribute/intellij.md b/src/contribute/intellij.md
index 2b4f5bd..e30911e 100644
--- a/src/contribute/intellij.md
+++ b/src/contribute/intellij.md
@@ -20,13 +20,7 @@ limitations under the License.
 
 # IntelliJ Tips
 
-> These are best-effort community-contributed tips, and are not...
->
-> - ... guaranteed to work with any particular IntelliJ setup.
-> - ... the only or best way to work.
-> - ... an endorsement of IntelliJ over alternatives.
->
-> Mastering IntelliJ is, ultimately, your responsibility.
+> These are best-effort community-contributed tips, and are not guaranteed to 
work with any particular IntelliJ setup. 
 
 ## Create a working Gradle module
 



[jira] [Work logged] (BEAM-4790) Make it easier to select code reviewers for PRs

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4790?focusedWorklogId=141523=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141523
 ]

ASF GitHub Bot logged work on BEAM-4790:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:33
Start Date: 05/Sep/18 20:33
Worklog Time Spent: 10m 
  Work Description: udim commented on issue #545: [BEAM-4790] Contribute: 
document how to find and add reviewers
URL: https://github.com/apache/beam-site/pull/545#issuecomment-418871778
 
 
   Thanks!


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141523)
Time Spent: 3h 20m  (was: 3h 10m)

> Make it easier to select code reviewers for PRs
> ---
>
> Key: BEAM-4790
> URL: https://issues.apache.org/jira/browse/BEAM-4790
> Project: Beam
>  Issue Type: Improvement
>  Components: project-management
>Reporter: Udi Meiri
>Assignee: Udi Meiri
>Priority: Major
>  Time Spent: 3h 20m
>  Remaining Estimate: 0h
>
> The original idea was to use Github's CODEOWNERS feature, but that turned out 
> to not work well for Apache Beam.
> Thread: 
> https://lists.apache.org/thread.html/9cc3191b21489c855ed502beade6d453b18e26eec46cb2c3dfe810e9@%3Cdev.beam.apache.org%3E
> Document discussing this idea:
> https://docs.google.com/document/d/184Tn0ONJ2ENHBVrhIes4seVuBSkww7xX5xb5W2VZwm8/edit#heading=h.l7mcrg761zp8
> Mailing list thread:
> https://lists.apache.org/thread.html/6138d08c551e254b5f13b26c6ba06579a49a4694f4d13ad6d164689a@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-4790) Make it easier to select code reviewers for PRs

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4790?focusedWorklogId=141522=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141522
 ]

ASF GitHub Bot logged work on BEAM-4790:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:26
Start Date: 05/Sep/18 20:26
Worklog Time Spent: 10m 
  Work Description: melap commented on issue #545: [BEAM-4790] Contribute: 
document how to find and add reviewers
URL: https://github.com/apache/beam-site/pull/545#issuecomment-418869774
 
 
   @asfgit merge


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141522)
Time Spent: 3h 10m  (was: 3h)

> Make it easier to select code reviewers for PRs
> ---
>
> Key: BEAM-4790
> URL: https://issues.apache.org/jira/browse/BEAM-4790
> Project: Beam
>  Issue Type: Improvement
>  Components: project-management
>Reporter: Udi Meiri
>Assignee: Udi Meiri
>Priority: Major
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> The original idea was to use Github's CODEOWNERS feature, but that turned out 
> to not work well for Apache Beam.
> Thread: 
> https://lists.apache.org/thread.html/9cc3191b21489c855ed502beade6d453b18e26eec46cb2c3dfe810e9@%3Cdev.beam.apache.org%3E
> Document discussing this idea:
> https://docs.google.com/document/d/184Tn0ONJ2ENHBVrhIes4seVuBSkww7xX5xb5W2VZwm8/edit#heading=h.l7mcrg761zp8
> Mailing list thread:
> https://lists.apache.org/thread.html/6138d08c551e254b5f13b26c6ba06579a49a4694f4d13ad6d164689a@%3Cdev.beam.apache.org%3E



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5149) Add support to the Java SDK harness to merge windows

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5149?focusedWorklogId=141521=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141521
 ]

ASF GitHub Bot logged work on BEAM-5149:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:25
Start Date: 05/Sep/18 20:25
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on issue #6222: [BEAM-5149] Add 
support for the Java SDK harness to merge windows.
URL: https://github.com/apache/beam/pull/6222#issuecomment-418869259
 
 
   @youngoli PTAL


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141521)
Time Spent: 1h 40m  (was: 1.5h)

> Add support to the Java SDK harness to merge windows
> 
>
> Key: BEAM-5149
> URL: https://issues.apache.org/jira/browse/BEAM-5149
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-harness
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Major
>  Labels: portability
>  Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> Window merging function:
>  
> {code:java}
> Input: KV>
> Output: KV, 
> iterable>>> 
> {code}
> For each set of original windows, a list of all unmerged windows is output 
> alongside a map of merged window to set of consumed windows. All original 
> windows must be contained in either the unmerged original window set or one 
> of the consumed original window sets. Each original window can only be part 
> of one output set. The nonce is used by a runner to associate each input with 
> its output. The nonce is represented as an opaque set of bytes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Closed] (BEAM-4696) Execute Jenkins website tests in a Docker container

2018-09-05 Thread Udi Meiri (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Udi Meiri closed BEAM-4696.
---
   Resolution: Fixed
Fix Version/s: Not applicable

> Execute Jenkins website tests in a Docker container
> ---
>
> Key: BEAM-4696
> URL: https://issues.apache.org/jira/browse/BEAM-4696
> Project: Beam
>  Issue Type: Improvement
>  Components: testing, website
>Reporter: Scott Wegner
>Assignee: Udi Meiri
>Priority: Major
> Fix For: Not applicable
>
>
> Currently, the website tests run in a vanilla Linux environment, which 
> require a prerequisite step to install Ruby. The install script is flaky and 
> adds extra time to the job.
> Instead, we should run the website pre-commits inside the pre-built ruby/2.5 
> docker image so that we don't need to worry about installing extra 
> dependencies.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Closed] (BEAM-4606) Upper bound for pytz dependency

2018-09-05 Thread Udi Meiri (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Udi Meiri closed BEAM-4606.
---
   Resolution: Fixed
Fix Version/s: Not applicable

> Upper bound for pytz dependency
> ---
>
> Key: BEAM-4606
> URL: https://issues.apache.org/jira/browse/BEAM-4606
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-core
>Reporter: Ahmet Altay
>Assignee: Udi Meiri
>Priority: Major
> Fix For: Not applicable
>
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> Do we need an upper bound for the pytz dependency? 
> ([https://github.com/apache/beam/blob/release-2.5.0/sdks/python/setup.py#L108)]
>  We typically have upper bounds, in order to avoid future breakages due to a 
> possibility of breaking/backward incompatible change of that depepdency.
> Good practice is to upper bound either at known version, or next major 
> version. Do we need an exception for pytz because it does not seem to be 
> following semantic versioning?
> cc: [~yifanzou] Is this something dependency notifier can warn on? Dependency 
> without upper version bounds.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-4374) Update existing metrics in the FN API to use new Metric Schema

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4374?focusedWorklogId=141520=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141520
 ]

ASF GitHub Bot logged work on BEAM-4374:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:20
Start Date: 05/Sep/18 20:20
Worklog Time Spent: 10m 
  Work Description: pabloem commented on issue #6205: [BEAM-4374] 
Implementing a subset of the new metrics framework in python.
URL: https://github.com/apache/beam/pull/6205#issuecomment-418867827
 
 
   Lint issue is breaking python precommits ^^


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141520)
Time Spent: 6h  (was: 5h 50m)

> Update existing metrics in the FN API to use new Metric Schema
> --
>
> Key: BEAM-4374
> URL: https://issues.apache.org/jira/browse/BEAM-4374
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model
>Reporter: Alex Amato
>Priority: Major
>  Time Spent: 6h
>  Remaining Estimate: 0h
>
> Update existing metrics to use the new proto and cataloging schema defined in:
> [_https://s.apache.org/beam-fn-api-metrics_]
>  * Check in new protos
>  * Define catalog file for metrics
>  * Port existing metrics to use this new format, based on catalog 
> names+metadata



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5149) Add support to the Java SDK harness to merge windows

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5149?focusedWorklogId=141518=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141518
 ]

ASF GitHub Bot logged work on BEAM-5149:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:16
Start Date: 05/Sep/18 20:16
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on a change in pull request #6222: 
[BEAM-5149] Add support for the Java SDK harness to merge windows.
URL: https://github.com/apache/beam/pull/6222#discussion_r215411282
 
 

 ##
 File path: 
sdks/java/harness/src/main/java/org/apache/beam/fn/harness/WindowMergingFnRunner.java
 ##
 @@ -0,0 +1,158 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.beam.fn.harness;
+
+import com.google.auto.service.AutoService;
+import com.google.common.collect.ImmutableMap;
+import com.google.common.collect.Sets;
+import java.io.IOException;
+import java.util.ArrayList;
+import java.util.Collection;
+import java.util.Collections;
+import java.util.List;
+import java.util.Map;
+import org.apache.beam.model.pipeline.v1.RunnerApi;
+import org.apache.beam.model.pipeline.v1.RunnerApi.PTransform;
+import org.apache.beam.model.pipeline.v1.RunnerApi.StandardPTransforms;
+import org.apache.beam.runners.core.construction.BeamUrns;
+import org.apache.beam.runners.core.construction.WindowingStrategyTranslation;
+import org.apache.beam.sdk.fn.function.ThrowingFunction;
+import org.apache.beam.sdk.transforms.windowing.BoundedWindow;
+import org.apache.beam.sdk.transforms.windowing.WindowFn;
+import org.apache.beam.sdk.transforms.windowing.WindowFn.MergeContext;
+import org.apache.beam.sdk.values.KV;
+
+/**
+ * Merges windows using a {@link 
org.apache.beam.sdk.transforms.windowing.WindowFn}.
+ *
+ * Window merging function:
+ *
+ * 
+ *   Input: {@code KV>}
+ *   Output: {@code KV, 
iterable Add support to the Java SDK harness to merge windows
> 
>
> Key: BEAM-5149
> URL: https://issues.apache.org/jira/browse/BEAM-5149
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-harness
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Major
>  Labels: portability
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> Window merging function:
>  
> {code:java}
> Input: KV>
> Output: KV, 
> iterable>>> 
> {code}
> For each set of original windows, a list of all unmerged windows is output 
> alongside a map of merged window to set of consumed windows. All original 
> windows must be contained in either the unmerged original window set or one 
> of the consumed original window sets. Each original window can only be part 
> of one output set. The nonce is used by a runner to associate each input with 
> its output. The nonce is represented as an opaque set of bytes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5149) Add support to the Java SDK harness to merge windows

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5149?focusedWorklogId=141517=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141517
 ]

ASF GitHub Bot logged work on BEAM-5149:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:15
Start Date: 05/Sep/18 20:15
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on a change in pull request #6222: 
[BEAM-5149] Add support for the Java SDK harness to merge windows.
URL: https://github.com/apache/beam/pull/6222#discussion_r215410924
 
 

 ##
 File path: 
sdks/java/harness/src/main/java/org/apache/beam/fn/harness/WindowMappingFnRunner.java
 ##
 @@ -32,9 +32,17 @@
 import org.apache.beam.sdk.values.KV;
 
 /**
- * Maps windows using a window mapping fn. The input is {@link KV} with the 
key being a nonce and
- * the value being a window, the output must be a {@link KV} with the key 
being the same nonce as
- * the input and the value being the mapped window.
+ * Represents mapping of main input window onto side input window.
+ *
+ * Side input window mapping function:
+ *
+ * 
+ *   Input: {@code KV}
+ *   Output: {@code KV}
+ * 
+ *
+ * For each main input window, the side input window is returned. The nonce 
is used by a runner
 
 Review comment:
   Ditto on copy paste typo


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141517)
Time Spent: 1h 10m  (was: 1h)

> Add support to the Java SDK harness to merge windows
> 
>
> Key: BEAM-5149
> URL: https://issues.apache.org/jira/browse/BEAM-5149
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-harness
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Major
>  Labels: portability
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> Window merging function:
>  
> {code:java}
> Input: KV>
> Output: KV, 
> iterable>>> 
> {code}
> For each set of original windows, a list of all unmerged windows is output 
> alongside a map of merged window to set of consumed windows. All original 
> windows must be contained in either the unmerged original window set or one 
> of the consumed original window sets. Each original window can only be part 
> of one output set. The nonce is used by a runner to associate each input with 
> its output. The nonce is represented as an opaque set of bytes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5107) Support ES 6.x for ElasticsearchIO

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5107?focusedWorklogId=141513=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141513
 ]

ASF GitHub Bot logged work on BEAM-5107:


Author: ASF GitHub Bot
Created on: 05/Sep/18 20:06
Start Date: 05/Sep/18 20:06
Worklog Time Spent: 10m 
  Work Description: timrobertson100 commented on issue #6211: [BEAM-5107] 
Support ES-6.x for ElasticsearchIO
URL: https://github.com/apache/beam/pull/6211#issuecomment-418863949
 
 
   Thank you @dattran-vn01 and congratulations. This is a really great addition.
   
   Sorry for my silence, but I have been / am very busy on other things this 
week and next.
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141513)
Time Spent: 11.5h  (was: 11h 20m)

> Support ES 6.x for ElasticsearchIO
> --
>
> Key: BEAM-5107
> URL: https://issues.apache.org/jira/browse/BEAM-5107
> Project: Beam
>  Issue Type: Improvement
>  Components: io-java-elasticsearch
>Reporter: Dat Tran
>Assignee: Etienne Chauchot
>Priority: Major
>  Time Spent: 11.5h
>  Remaining Estimate: 0h
>
> Elasticsearch has released 6.3.2 but ElasticsearchIO only supports 2x-5.x.
> We should support ES 6.x for ElasticsearchIO.
> https://www.elastic.co/guide/en/elasticsearch/reference/current/index.html
> https://github.com/apache/beam/blob/master/sdks/java/io/elasticsearch/src/main/java/org/apache/beam/sdk/io/elasticsearch/ElasticsearchIO.java



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (BEAM-5308) JobBundleFactory BindException with FlinkRunner and remote cluster

2018-09-05 Thread Ankur Goenka (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604840#comment-16604840
 ] 

Ankur Goenka commented on BEAM-5308:


I agree, the bug is setting the port range.

> JobBundleFactory BindException with FlinkRunner and remote cluster
> --
>
> Key: BEAM-5308
> URL: https://issues.apache.org/jira/browse/BEAM-5308
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: Thomas Weise
>Assignee: Maximilian Michels
>Priority: Major
>  Labels: portability
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Repeated execution of the same job on remote Flink cluster (not embedded in 
> job server) fails with bind exception. There seem to be 2 issues:
>  * Multiple instances of job bundle factory cannot be created (port conflict)
>  * Job bundle factory is not released after job completes (and Docker 
> container keeps on running). That's not the case in embedded mode).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (BEAM-5308) JobBundleFactory BindException with FlinkRunner and remote cluster

2018-09-05 Thread Thomas Weise (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604836#comment-16604836
 ] 

Thomas Weise edited comment on BEAM-5308 at 9/5/18 7:40 PM:


That is an optimization and a different issue. Here we have a bug that multiple 
job bundle factories cannot coexist in a single JVM (which must be possible). 
Max has already identified the bug, see linked PR.

 


was (Author: thw):
That is an optimization and a different issue. Here we have a bug that multiple 
harnesses cannot run in a single JVM (which must be possible). Max has already 
identified the bug, see linked PR.

 

> JobBundleFactory BindException with FlinkRunner and remote cluster
> --
>
> Key: BEAM-5308
> URL: https://issues.apache.org/jira/browse/BEAM-5308
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: Thomas Weise
>Assignee: Maximilian Michels
>Priority: Major
>  Labels: portability
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Repeated execution of the same job on remote Flink cluster (not embedded in 
> job server) fails with bind exception. There seem to be 2 issues:
>  * Multiple instances of job bundle factory cannot be created (port conflict)
>  * Job bundle factory is not released after job completes (and Docker 
> container keeps on running). That's not the case in embedded mode).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (BEAM-5308) JobBundleFactory BindException with FlinkRunner and remote cluster

2018-09-05 Thread Thomas Weise (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604836#comment-16604836
 ] 

Thomas Weise commented on BEAM-5308:


That is an optimization and a different issue. Here we have a bug that multiple 
harnesses cannot run in a single JVM (which must be possible). Max has already 
identified the bug, see linked PR.

 

> JobBundleFactory BindException with FlinkRunner and remote cluster
> --
>
> Key: BEAM-5308
> URL: https://issues.apache.org/jira/browse/BEAM-5308
> Project: Beam
>  Issue Type: Task
>  Components: runner-flink
>Reporter: Thomas Weise
>Assignee: Maximilian Michels
>Priority: Major
>  Labels: portability
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Repeated execution of the same job on remote Flink cluster (not embedded in 
> job server) fails with bind exception. There seem to be 2 issues:
>  * Multiple instances of job bundle factory cannot be created (port conflict)
>  * Job bundle factory is not released after job completes (and Docker 
> container keeps on running). That's not the case in embedded mode).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread Ankur Goenka (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604833#comment-16604833
 ] 

Ankur Goenka commented on BEAM-5283:


The root cause seems to be related to permission.

 

 :beam-sdks-python:setupVirtualenv FAILED
New python executable in 
/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_PortableValidatesRunner_Flink_Gradle/src/sdks/python/build/gradleenv/bin/python2
Also creating executable in 
/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_PortableValidatesRunner_Flink_Gradle/src/sdks/python/build/gradleenv/bin/python
Installing setuptools, pkg_resources, pip, wheel...done.
Running virtualenv with interpreter /usr/bin/python2
Collecting tox==3.0.0
  Using cached 
https://files.pythonhosted.org/packages/e6/41/4dcfd713282bf3213b0384320fa8841e4db032ddcb80bc08a540159d42a8/tox-3.0.0-py2.py3-none-any.whl
Collecting grpcio-tools==1.3.5
  Using cached 
https://files.pythonhosted.org/packages/05/f6/0296e29b1bac6f85d2a8556d48adf825307f73109a3c2c17fb734292db0a/grpcio_tools-1.3.5-cp27-cp27mu-manylinux1_x86_64.whl
Collecting pluggy<1.0,>=0.3.0 (from tox==3.0.0)
  Using cached 
https://files.pythonhosted.org/packages/f5/f1/5a93c118663896d83f7bcbfb7f657ce1d0c0d617e6b4a443a53abcc658ca/pluggy-0.7.1-py2.py3-none-any.whl
Requirement not upgraded as not directly required: six in 
/usr/local/lib/python2.7/dist-packages (from tox==3.0.0) (1.11.0)
Requirement not upgraded as not directly required: virtualenv>=1.11.2 in 
/usr/lib/python2.7/dist-packages (from tox==3.0.0) (15.0.1)
Collecting py>=1.4.17 (from tox==3.0.0)
  Using cached 
https://files.pythonhosted.org/packages/c8/47/d179b80ab1dc1bfd46a0c87e391be47e6c7ef5831a9c138c5c49d1756288/py-1.6.0-py2.py3-none-any.whl
Collecting grpcio>=1.3.5 (from grpcio-tools==1.3.5)
  Using cached 
https://files.pythonhosted.org/packages/bd/a6/4bad0d1a49071363dc6547a5178656fe375c80535128c12bb65c59d1a329/grpcio-1.14.2-cp27-cp27mu-manylinux1_x86_64.whl
Collecting protobuf>=3.2.0 (from grpcio-tools==1.3.5)
  Using cached 
https://files.pythonhosted.org/packages/b8/c2/b7f587c0aaf8bf2201405e8162323037fe8d17aa21d3c7dda811b8d01469/protobuf-3.6.1-cp27-cp27mu-manylinux1_x86_64.whl
Requirement not upgraded as not directly required: enum34>=1.0.4 in 
/usr/local/lib/python2.7/dist-packages (from 
grpcio>=1.3.5->grpcio-tools==1.3.5) (1.1.6)
Collecting futures>=2.2.0 (from grpcio>=1.3.5->grpcio-tools==1.3.5)
  Using cached 
https://files.pythonhosted.org/packages/2d/99/b2c4e9d5a30f6471e410a146232b4118e697fa3ffc06d6a65efde84debd0/futures-3.2.0-py2-none-any.whl
Requirement not upgraded as not directly required: setuptools in 
/usr/local/lib/python2.7/dist-packages (from 
protobuf>=3.2.0->grpcio-tools==1.3.5) (39.0.1)
Installing collected packages: pluggy, py, tox, futures, grpcio, protobuf, 
grpcio-tools
Could not install packages due to an EnvironmentError: [Errno 13] Permission 
denied: '/usr/local/lib/python2.7/dist-packages/pluggy-0.7.1.dist-info'
Consider using the `--user` option or check the permissions.

You are using pip version 10.0.1, however version 18.0 is available.
You should consider upgrading via the 'pip install --upgrade pip' command.

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5149) Add support to the Java SDK harness to merge windows

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5149?focusedWorklogId=141502=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141502
 ]

ASF GitHub Bot logged work on BEAM-5149:


Author: ASF GitHub Bot
Created on: 05/Sep/18 19:05
Start Date: 05/Sep/18 19:05
Worklog Time Spent: 10m 
  Work Description: youngoli commented on a change in pull request #6222: 
[BEAM-5149] Add support for the Java SDK harness to merge windows.
URL: https://github.com/apache/beam/pull/6222#discussion_r212776929
 
 

 ##
 File path: model/pipeline/src/main/proto/beam_runner_api.proto
 ##
 @@ -202,9 +202,35 @@ message StandardPTransforms {
 // Payload: TestStreamPayload
 TEST_STREAM = 5 [(beam_urn) = "urn:beam:transform:teststream:v1"];
 
-// Represents mapping of main input window into side input window.
-// Payload: serialized WindowMappingFn.
+// Represents mapping of main input window onto side input window.
+//
+// Side input window mapping function:
+// Input: KV
+// Output: KV
+//
+// For each main input window, the side input window is returned. The
 
 Review comment:
   Doesn't the Input and Output above imply the opposite of this? This line 
makes it sound like the input is a MainInputWindow and the output a 
SideInputWindow,


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141502)
Time Spent: 50m  (was: 40m)

> Add support to the Java SDK harness to merge windows
> 
>
> Key: BEAM-5149
> URL: https://issues.apache.org/jira/browse/BEAM-5149
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-harness
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Major
>  Labels: portability
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> Window merging function:
>  
> {code:java}
> Input: KV>
> Output: KV, 
> iterable>>> 
> {code}
> For each set of original windows, a list of all unmerged windows is output 
> alongside a map of merged window to set of consumed windows. All original 
> windows must be contained in either the unmerged original window set or one 
> of the consumed original window sets. Each original window can only be part 
> of one output set. The nonce is used by a runner to associate each input with 
> its output. The nonce is represented as an opaque set of bytes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5149) Add support to the Java SDK harness to merge windows

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5149?focusedWorklogId=141499=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141499
 ]

ASF GitHub Bot logged work on BEAM-5149:


Author: ASF GitHub Bot
Created on: 05/Sep/18 19:05
Start Date: 05/Sep/18 19:05
Worklog Time Spent: 10m 
  Work Description: youngoli commented on a change in pull request #6222: 
[BEAM-5149] Add support for the Java SDK harness to merge windows.
URL: https://github.com/apache/beam/pull/6222#discussion_r212777355
 
 

 ##
 File path: 
sdks/java/harness/src/main/java/org/apache/beam/fn/harness/WindowMappingFnRunner.java
 ##
 @@ -32,9 +32,17 @@
 import org.apache.beam.sdk.values.KV;
 
 /**
- * Maps windows using a window mapping fn. The input is {@link KV} with the 
key being a nonce and
- * the value being a window, the output must be a {@link KV} with the key 
being the same nonce as
- * the input and the value being the mapped window.
+ * Represents mapping of main input window onto side input window.
+ *
+ * Side input window mapping function:
+ *
+ * 
+ *   Input: {@code KV}
+ *   Output: {@code KV}
+ * 
+ *
+ * For each main input window, the side input window is returned. The nonce 
is used by a runner
 
 Review comment:
   Same question as earlier comment.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141499)
Time Spent: 40m  (was: 0.5h)

> Add support to the Java SDK harness to merge windows
> 
>
> Key: BEAM-5149
> URL: https://issues.apache.org/jira/browse/BEAM-5149
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-harness
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Major
>  Labels: portability
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> Window merging function:
>  
> {code:java}
> Input: KV>
> Output: KV, 
> iterable>>> 
> {code}
> For each set of original windows, a list of all unmerged windows is output 
> alongside a map of merged window to set of consumed windows. All original 
> windows must be contained in either the unmerged original window set or one 
> of the consumed original window sets. Each original window can only be part 
> of one output set. The nonce is used by a runner to associate each input with 
> its output. The nonce is represented as an opaque set of bytes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5149) Add support to the Java SDK harness to merge windows

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5149?focusedWorklogId=141500=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141500
 ]

ASF GitHub Bot logged work on BEAM-5149:


Author: ASF GitHub Bot
Created on: 05/Sep/18 19:05
Start Date: 05/Sep/18 19:05
Worklog Time Spent: 10m 
  Work Description: youngoli commented on a change in pull request #6222: 
[BEAM-5149] Add support for the Java SDK harness to merge windows.
URL: https://github.com/apache/beam/pull/6222#discussion_r215102198
 
 

 ##
 File path: 
sdks/java/harness/src/main/java/org/apache/beam/fn/harness/WindowMergingFnRunner.java
 ##
 @@ -0,0 +1,158 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.beam.fn.harness;
+
+import com.google.auto.service.AutoService;
+import com.google.common.collect.ImmutableMap;
+import com.google.common.collect.Sets;
+import java.io.IOException;
+import java.util.ArrayList;
+import java.util.Collection;
+import java.util.Collections;
+import java.util.List;
+import java.util.Map;
+import org.apache.beam.model.pipeline.v1.RunnerApi;
+import org.apache.beam.model.pipeline.v1.RunnerApi.PTransform;
+import org.apache.beam.model.pipeline.v1.RunnerApi.StandardPTransforms;
+import org.apache.beam.runners.core.construction.BeamUrns;
+import org.apache.beam.runners.core.construction.WindowingStrategyTranslation;
+import org.apache.beam.sdk.fn.function.ThrowingFunction;
+import org.apache.beam.sdk.transforms.windowing.BoundedWindow;
+import org.apache.beam.sdk.transforms.windowing.WindowFn;
+import org.apache.beam.sdk.transforms.windowing.WindowFn.MergeContext;
+import org.apache.beam.sdk.values.KV;
+
+/**
+ * Merges windows using a {@link 
org.apache.beam.sdk.transforms.windowing.WindowFn}.
+ *
+ * Window merging function:
+ *
+ * 
+ *   Input: {@code KV>}
+ *   Output: {@code KV, 
iterable, ...>` here being used simply 
because there's no coder that handles non-keyed pairs? Because it doesn't seem 
that the set of unmerged windows is actually used as a key associated with the 
map of `MergedWindow` to `ConsumedOriginalWindow`, but instead as just data 
paired with the map.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141500)

> Add support to the Java SDK harness to merge windows
> 
>
> Key: BEAM-5149
> URL: https://issues.apache.org/jira/browse/BEAM-5149
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-harness
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Major
>  Labels: portability
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> Window merging function:
>  
> {code:java}
> Input: KV>
> Output: KV, 
> iterable>>> 
> {code}
> For each set of original windows, a list of all unmerged windows is output 
> alongside a map of merged window to set of consumed windows. All original 
> windows must be contained in either the unmerged original window set or one 
> of the consumed original window sets. Each original window can only be part 
> of one output set. The nonce is used by a runner to associate each input with 
> its output. The nonce is represented as an opaque set of bytes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5149) Add support to the Java SDK harness to merge windows

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5149?focusedWorklogId=141501=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141501
 ]

ASF GitHub Bot logged work on BEAM-5149:


Author: ASF GitHub Bot
Created on: 05/Sep/18 19:05
Start Date: 05/Sep/18 19:05
Worklog Time Spent: 10m 
  Work Description: youngoli commented on a change in pull request #6222: 
[BEAM-5149] Add support for the Java SDK harness to merge windows.
URL: https://github.com/apache/beam/pull/6222#discussion_r215382967
 
 

 ##
 File path: 
sdks/java/harness/src/main/java/org/apache/beam/fn/harness/WindowMergingFnRunner.java
 ##
 @@ -0,0 +1,158 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.beam.fn.harness;
+
+import com.google.auto.service.AutoService;
+import com.google.common.collect.ImmutableMap;
+import com.google.common.collect.Sets;
+import java.io.IOException;
+import java.util.ArrayList;
+import java.util.Collection;
+import java.util.Collections;
+import java.util.List;
+import java.util.Map;
+import org.apache.beam.model.pipeline.v1.RunnerApi;
+import org.apache.beam.model.pipeline.v1.RunnerApi.PTransform;
+import org.apache.beam.model.pipeline.v1.RunnerApi.StandardPTransforms;
+import org.apache.beam.runners.core.construction.BeamUrns;
+import org.apache.beam.runners.core.construction.WindowingStrategyTranslation;
+import org.apache.beam.sdk.fn.function.ThrowingFunction;
+import org.apache.beam.sdk.transforms.windowing.BoundedWindow;
+import org.apache.beam.sdk.transforms.windowing.WindowFn;
+import org.apache.beam.sdk.transforms.windowing.WindowFn.MergeContext;
+import org.apache.beam.sdk.values.KV;
+
+/**
+ * Merges windows using a {@link 
org.apache.beam.sdk.transforms.windowing.WindowFn}.
+ *
+ * Window merging function:
+ *
+ * 
+ *   Input: {@code KV>}
+ *   Output: {@code KV, 
iterable>>>}
+ * 
+ *
+ * For each set of original windows, a list of all unmerged windows is 
output alongside a map of
+ * merged window to set of consumed windows. All original windows must be 
contained in either the
+ * unmerged original window set or one of the consumed original window sets. 
Each original window
+ * can only be part of one output set. The nonce is used by a runner to 
associate each input with
+ * its output. The nonce is represented as an opaque set of bytes.
+ */
+public abstract class WindowMergingFnRunner {
+  static final String URN = 
BeamUrns.getUrn(StandardPTransforms.Primitives.MERGE_WINDOWS);
+
+  /**
+   * A registrar which provides a factory to handle merging windows based upon 
the {@link WindowFn}.
+   */
+  @AutoService(PTransformRunnerFactory.Registrar.class)
+  public static class Registrar implements PTransformRunnerFactory.Registrar {
+
+@Override
+public Map getPTransformRunnerFactories() 
{
+  return ImmutableMap.of(
+  URN,
+  
MapFnRunners.forValueMapFnFactory(WindowMergingFnRunner::createMapFunctionForPTransform));
+}
+  }
+
+  static 
+  ThrowingFunction>, KV, 
Iterable>
+  createMapFunctionForPTransform(String ptransformId, PTransform 
ptransform)
+  throws IOException {
+RunnerApi.SdkFunctionSpec payload =
+RunnerApi.SdkFunctionSpec.parseFrom(ptransform.getSpec().getPayload());
+
+WindowFn windowFn =
+(WindowFn) 
WindowingStrategyTranslation.windowFnFromProto(payload);
+return WindowMergingFnRunner.create(windowFn)::mergeWindows;
+  }
+
+  static  WindowMergingFnRunner 
create(WindowFn windowFn) {
+if (windowFn.isNonMerging()) {
+  return new NonMergingWindowFnRunner();
+} else {
+  return new MergingWindowFnRunner(windowFn);
 
 Review comment:
   It seems confusing that there's a class named MergingWindowFnRunner that 
extends WindowMergingFnRunner. Maybe name it something like 
WindowMergingFnRunnerImpl? Or if you want to be descriptive, maybe 
MergingViaWindowFnRunner, or WindowMergingViaWindowFnRunner, or some other 
variation like that.


This is an automated message from the Apache Git Service.
To respond to 

[jira] [Work logged] (BEAM-4461) Create a library of useful transforms that use schemas

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4461?focusedWorklogId=141488=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141488
 ]

ASF GitHub Bot logged work on BEAM-4461:


Author: ASF GitHub Bot
Created on: 05/Sep/18 18:35
Start Date: 05/Sep/18 18:35
Worklog Time Spent: 10m 
  Work Description: akedin commented on a change in pull request #6318: 
[BEAM-4461] Some fixes to Combiners needed for Schema support.
URL: https://github.com/apache/beam/pull/6318#discussion_r215360597
 
 

 ##
 File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/CombineFns.java
 ##
 @@ -112,6 +113,16 @@ public static ComposeCombineFnBuilder compose() {
   return new ComposedCombineFn().with(extractInputFn, combineFn, 
outputTag);
 }
 
+/** Like {@link #with(SimpleFunction, CombineFn, TupleTag)} bit with an 
explicit input coder. */
 
 Review comment:
   typo *but


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141488)
Time Spent: 5h 40m  (was: 5.5h)

> Create a library of useful transforms that use schemas
> --
>
> Key: BEAM-4461
> URL: https://issues.apache.org/jira/browse/BEAM-4461
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-core
>Reporter: Reuven Lax
>Assignee: Reuven Lax
>Priority: Major
>  Time Spent: 5h 40m
>  Remaining Estimate: 0h
>
> e.g. JoinBy(fields). Project, Filter, etc.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-4461) Create a library of useful transforms that use schemas

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4461?focusedWorklogId=141489=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141489
 ]

ASF GitHub Bot logged work on BEAM-4461:


Author: ASF GitHub Bot
Created on: 05/Sep/18 18:35
Start Date: 05/Sep/18 18:35
Worklog Time Spent: 10m 
  Work Description: akedin commented on a change in pull request #6318: 
[BEAM-4461] Some fixes to Combiners needed for Schema support.
URL: https://github.com/apache/beam/pull/6318#discussion_r215381299
 
 

 ##
 File path: sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/Top.java
 ##
 @@ -168,7 +168,40 @@ private Top() {
* {@code PCollection} of {@code KV}s and return the top values associated 
with each key.
*/
   public static > Combine.Globally> 
largest(int count) {
-return Combine.globally(new TopCombineFn<>(count, new Natural()));
+return Combine.globally(largestFn(count));
+  }
+
+  /** Returns a {@link TopCombineFn} that aggregates the largest count values. 
*/
+  public static > TopCombineFn> 
largestFn(int count) {
+return new TopCombineFn>(count, new Natural()) {};
+  }
+  /** Returns a {@link TopCombineFn} that aggregates the largest count long 
values. */
+  public static TopCombineFn> largestLongsFn(int count) {
+return new TopCombineFn>(count, new Natural()) 
{};
 
 Review comment:
   can java infer more of `<>`? E.g. `new TopCombineFn<>(count, new 
Natural()) {};`
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141489)
Time Spent: 5h 50m  (was: 5h 40m)

> Create a library of useful transforms that use schemas
> --
>
> Key: BEAM-4461
> URL: https://issues.apache.org/jira/browse/BEAM-4461
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-core
>Reporter: Reuven Lax
>Assignee: Reuven Lax
>Priority: Major
>  Time Spent: 5h 50m
>  Remaining Estimate: 0h
>
> e.g. JoinBy(fields). Project, Filter, etc.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-4461) Create a library of useful transforms that use schemas

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4461?focusedWorklogId=141490=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141490
 ]

ASF GitHub Bot logged work on BEAM-4461:


Author: ASF GitHub Bot
Created on: 05/Sep/18 18:35
Start Date: 05/Sep/18 18:35
Worklog Time Spent: 10m 
  Work Description: akedin commented on a change in pull request #6318: 
[BEAM-4461] Some fixes to Combiners needed for Schema support.
URL: https://github.com/apache/beam/pull/6318#discussion_r215361562
 
 

 ##
 File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/CombineFns.java
 ##
 @@ -470,7 +589,10 @@ public CoCombineResult extractOutput(Object[] 
accumulator, Context c) {
 throws CannotProvideCoderException {
   List> coders = Lists.newArrayList();
   for (int i = 0; i < combineFnCount; ++i) {
-Coder inputCoder = 
registry.getOutputCoder(extractInputFns.get(i), dataCoder);
+Coder inputCoder =
+combineInputCoders.get(i).isPresent()
 
 Review comment:
   I think something like 
`combineInputCoders.get(i).or(registry.getOutputCoder(extractInputFns.get(i), 
dataCoder))` should work.
   
   Also, I believe this logic should be changed, it would be clearer if the 
coders was not a list of optionals, but a list of coders, and the registry was 
consulted before this instead of adding `Optional.absent()`. The end goal is 
not to have optionals but use the values from registry instead when no coder is 
passed. Similar to how `CombineFnUtil.toFnWithContext(globalCombineFn)` is 
called.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141490)

> Create a library of useful transforms that use schemas
> --
>
> Key: BEAM-4461
> URL: https://issues.apache.org/jira/browse/BEAM-4461
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-core
>Reporter: Reuven Lax
>Assignee: Reuven Lax
>Priority: Major
>  Time Spent: 5h 50m
>  Remaining Estimate: 0h
>
> e.g. JoinBy(fields). Project, Filter, etc.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-4461) Create a library of useful transforms that use schemas

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4461?focusedWorklogId=141487=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141487
 ]

ASF GitHub Bot logged work on BEAM-4461:


Author: ASF GitHub Bot
Created on: 05/Sep/18 18:35
Start Date: 05/Sep/18 18:35
Worklog Time Spent: 10m 
  Work Description: akedin commented on a change in pull request #6318: 
[BEAM-4461] Some fixes to Combiners needed for Schema support.
URL: https://github.com/apache/beam/pull/6318#discussion_r215378279
 
 

 ##
 File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/CombineFns.java
 ##
 @@ -272,6 +301,60 @@ private ComposedCombineFn(
   .addAll(extractInputFns)
   .add(extractInputFn)
   .build(),
+  ImmutableList.>builder()
+  .addAll(combineInputCoders)
+  .add(Optional.absent())
+  .build(),
+  ImmutableList.>builder()
+  .addAll(fnsWithContext)
+  .add(combineFn)
+  .build(),
+  
ImmutableList.>builder().addAll(outputTags).add(outputTag).build());
+}
+
+/** Returns a {@link ComposedCombineFn} with an additional {@link 
CombineFn}. */
+public  ComposedCombineFn with(
+SimpleFunction extractInputFn,
+Coder combineInputCoder,
+CombineFn combineFn,
+TupleTag outputTag) {
+  checkUniqueness(outputTags, outputTag);
+  return new ComposedCombineFn<>(
+  ImmutableList.>builder()
+  .addAll(extractInputFns)
+  .add(extractInputFn)
+  .build(),
+  ImmutableList.>builder()
+  .addAll(combineInputCoders)
+  .add(Optional.of(combineInputCoder))
+  .build(),
+  ImmutableList.>builder().addAll(combineFns).add(combineFn).build(),
+  
ImmutableList.>builder().addAll(outputTags).add(outputTag).build());
+}
+
+/**
+ * Returns a {@link ComposedCombineFnWithContext} with an additional {@link
+ * CombineFnWithContext}.
+ */
+public  ComposedCombineFnWithContext with(
+SimpleFunction extractInputFn,
+Coder combineInputCoder,
+CombineFnWithContext combineFn,
+TupleTag outputTag) {
+  checkUniqueness(outputTags, outputTag);
+  List> fnsWithContext = 
Lists.newArrayList();
+  for (CombineFn fn : combineFns) {
+fnsWithContext.add(CombineFnUtil.toFnWithContext(fn));
 
 Review comment:
   nit: can this be rewritten as 
`combineFns.stream().map(CombilneFnUtil::toFnWithContext).collect(toList())`?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141487)
Time Spent: 5h 40m  (was: 5.5h)

> Create a library of useful transforms that use schemas
> --
>
> Key: BEAM-4461
> URL: https://issues.apache.org/jira/browse/BEAM-4461
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-core
>Reporter: Reuven Lax
>Assignee: Reuven Lax
>Priority: Major
>  Time Spent: 5h 40m
>  Remaining Estimate: 0h
>
> e.g. JoinBy(fields). Project, Filter, etc.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (BEAM-5283) Enable Python Portable Flink PostCommit Tests to Jenkins

2018-09-05 Thread Thomas Weise (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-5283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604761#comment-16604761
 ] 

Thomas Weise commented on BEAM-5283:


The Jenkins job currently fails because the build it is trying to access a 
non-existing directory:
{code:java}
20:02:31 file or directory 
'/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_PortableValidatesRunner_Flink_Gradle/src/runners/flink/job-server/src/main/java',
 not found
20:02:31 Skipping task ':beam-runners-flink_2.11-job-server:compileJava' as it 
has no source files and no previous output files.
{code}
[https://builds.apache.org/job/beam_PostCommit_Python_PortableValidatesRunner_Flink_Gradle/29/console]

[~angoenka] can you please take a look?

 

 

> Enable Python Portable Flink PostCommit Tests to Jenkins
> 
>
> Key: BEAM-5283
> URL: https://issues.apache.org/jira/browse/BEAM-5283
> Project: Beam
>  Issue Type: Test
>  Components: testing
>Reporter: Ankur Goenka
>Assignee: Jason Kuster
>Priority: Major
>  Labels: CI
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5297) Support for provided configuration in Intellij Idea

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5297?focusedWorklogId=141464=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141464
 ]

ASF GitHub Bot logged work on BEAM-5297:


Author: ASF GitHub Bot
Created on: 05/Sep/18 18:03
Start Date: 05/Sep/18 18:03
Worklog Time Spent: 10m 
  Work Description: dmvk edited a comment on issue #6329: [BEAM-5297] Add 
propdeps-idea plugin.
URL: https://github.com/apache/beam/pull/6329#issuecomment-418824832
 
 
   Hello Ryan,
   
   I also see deps listed as `compile` scoped, but they don't get picked up as 
in your setup.
   
   https://user-images.githubusercontent.com/299781/45111783-1b617900-b146-11e8-8ede-4a1558ad9c50.png;>
   
   I don't really understand idea internals. The plugin adds deps to PROVIDED 
scope.
   
   
https://github.com/spring-gradle-plugins/propdeps-plugin/blob/master/src/main/groovy/io/spring/gradle/propdeps/PropDepsIdeaPlugin.groovy
   
   Also, the plugin authors list few relevant issues in the javadoc:
   http://youtrack.jetbrains.com/issue/IDEA-107046
   http://youtrack.jetbrains.com/issue/IDEA-117668
   
   Applying the plugin solves the problem.
   
   I hope this will helps.
   
   D.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141464)
Time Spent: 50m  (was: 40m)

> Support for provided configuration in Intellij Idea
> ---
>
> Key: BEAM-5297
> URL: https://issues.apache.org/jira/browse/BEAM-5297
> Project: Beam
>  Issue Type: Bug
>  Components: build-system
>Reporter: David Moravek
>Assignee: David Moravek
>Priority: Minor
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> Intellij Idea (2018.2.1) does not pick up provided dependencies.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5297) Support for provided configuration in Intellij Idea

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5297?focusedWorklogId=141465=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141465
 ]

ASF GitHub Bot logged work on BEAM-5297:


Author: ASF GitHub Bot
Created on: 05/Sep/18 18:03
Start Date: 05/Sep/18 18:03
Worklog Time Spent: 10m 
  Work Description: dmvk edited a comment on issue #6329: [BEAM-5297] Add 
propdeps-idea plugin.
URL: https://github.com/apache/beam/pull/6329#issuecomment-418824832
 
 
   Hello Ryan,
   
   I also see deps listed as `compile` scoped, but they don't get picked up as 
in your setup.
   
   https://user-images.githubusercontent.com/299781/45111783-1b617900-b146-11e8-8ede-4a1558ad9c50.png;>
   
   I don't really understand idea internals. The plugin adds deps to PROVIDED 
scope.
   
   
https://github.com/spring-gradle-plugins/propdeps-plugin/blob/master/src/main/groovy/io/spring/gradle/propdeps/PropDepsIdeaPlugin.groovy
   
   Also, the plugin authors list few relevant issues in the javadoc:
   http://youtrack.jetbrains.com/issue/IDEA-107046
   http://youtrack.jetbrains.com/issue/IDEA-117668
   
   Applying the plugin solves the problem.
   
   I hope this helps.
   
   D.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141465)
Time Spent: 1h  (was: 50m)

> Support for provided configuration in Intellij Idea
> ---
>
> Key: BEAM-5297
> URL: https://issues.apache.org/jira/browse/BEAM-5297
> Project: Beam
>  Issue Type: Bug
>  Components: build-system
>Reporter: David Moravek
>Assignee: David Moravek
>Priority: Minor
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> Intellij Idea (2018.2.1) does not pick up provided dependencies.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5297) Support for provided configuration in Intellij Idea

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5297?focusedWorklogId=141463=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141463
 ]

ASF GitHub Bot logged work on BEAM-5297:


Author: ASF GitHub Bot
Created on: 05/Sep/18 18:03
Start Date: 05/Sep/18 18:03
Worklog Time Spent: 10m 
  Work Description: dmvk edited a comment on issue #6329: [BEAM-5297] Add 
propdeps-idea plugin.
URL: https://github.com/apache/beam/pull/6329#issuecomment-418824832
 
 
   Hello Ryan,
   
   I also see deps listed as `compile` scoped, but they don't get picked up as 
in your setup.
   
   https://user-images.githubusercontent.com/299781/45111783-1b617900-b146-11e8-8ede-4a1558ad9c50.png;>
   
   I don't really understand idea internals. The plugin adds deps to PROVIDED 
scope.
   
   
https://github.com/spring-gradle-plugins/propdeps-plugin/blob/master/src/main/groovy/io/spring/gradle/propdeps/PropDepsIdeaPlugin.groovy
   
   Also they list some relevant issues in the comment:
   http://youtrack.jetbrains.com/issue/IDEA-107046
   http://youtrack.jetbrains.com/issue/IDEA-117668
   
   Applying the plugin solves the problem.
   
   I hope this will helps.
   
   D.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141463)
Time Spent: 40m  (was: 0.5h)

> Support for provided configuration in Intellij Idea
> ---
>
> Key: BEAM-5297
> URL: https://issues.apache.org/jira/browse/BEAM-5297
> Project: Beam
>  Issue Type: Bug
>  Components: build-system
>Reporter: David Moravek
>Assignee: David Moravek
>Priority: Minor
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> Intellij Idea (2018.2.1) does not pick up provided dependencies.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-5297) Support for provided configuration in Intellij Idea

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5297?focusedWorklogId=141460=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141460
 ]

ASF GitHub Bot logged work on BEAM-5297:


Author: ASF GitHub Bot
Created on: 05/Sep/18 18:02
Start Date: 05/Sep/18 18:02
Worklog Time Spent: 10m 
  Work Description: dmvk commented on issue #6329: [BEAM-5297] Add 
propdeps-idea plugin.
URL: https://github.com/apache/beam/pull/6329#issuecomment-418824832
 
 
   Hello Ryan,
   
   I also see files listed as `compile` scoped, but they don't get picked up as 
in your setup.
   
   https://user-images.githubusercontent.com/299781/45111783-1b617900-b146-11e8-8ede-4a1558ad9c50.png;>
   
   I don't really understand idea internals, but the plugin seems to be adding 
them PROVIDED scope.
   
   
https://github.com/spring-gradle-plugins/propdeps-plugin/blob/master/src/main/groovy/io/spring/gradle/propdeps/PropDepsIdeaPlugin.groovy
   
   Also they list some relevant issues in the comment:
   http://youtrack.jetbrains.com/issue/IDEA-107046
   http://youtrack.jetbrains.com/issue/IDEA-117668
   
   Applying the plugin solves the problem.
   
   I hope this will helps.
   
   D.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141460)
Time Spent: 0.5h  (was: 20m)

> Support for provided configuration in Intellij Idea
> ---
>
> Key: BEAM-5297
> URL: https://issues.apache.org/jira/browse/BEAM-5297
> Project: Beam
>  Issue Type: Bug
>  Components: build-system
>Reporter: David Moravek
>Assignee: David Moravek
>Priority: Minor
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Intellij Idea (2018.2.1) does not pick up provided dependencies.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-4863) Implement consistentWithEquals/structuralValue on FullWindowedValueCoder

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4863?focusedWorklogId=141450=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141450
 ]

ASF GitHub Bot logged work on BEAM-4863:


Author: ASF GitHub Bot
Created on: 05/Sep/18 17:39
Start Date: 05/Sep/18 17:39
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on a change in pull request #6057: 
[BEAM-4863] Implement consistentWithEquals/structuralValue on 
FullWindowedValueCoder
URL: https://github.com/apache/beam/pull/6057#discussion_r215363155
 
 

 ##
 File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/util/WindowedValue.java
 ##
 @@ -497,7 +497,7 @@ private void ensureWindowsAreASet() {
   // right, and cast the window type away here.
   @SuppressWarnings({"unchecked", "rawtypes"})
   Coder> collectionCoder =
-  (Coder) CollectionCoder.of(this.windowCoder);
+  (Coder) ListCoder.of(this.windowCoder);
 
 Review comment:
   Collections aren't required to be ordered (e.g. HashSet). It would be valid 
to return any collection which maintains duplicates such as a multiset that 
doesn't maintain order. Swapping to a ListCoder ensures that we maintain the 
ordering of windows.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141450)
Time Spent: 1h 10m  (was: 1h)

> Implement consistentWithEquals/structuralValue on FullWindowedValueCoder
> 
>
> Key: BEAM-4863
> URL: https://issues.apache.org/jira/browse/BEAM-4863
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Major
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> Implementing *consistentWithEquals*/*structuralValue* boosts significantly 
> the performance of using these values in comparison operations since it 
> doesn't require encoding the values.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-4863) Implement consistentWithEquals/structuralValue on FullWindowedValueCoder

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4863?focusedWorklogId=141449=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141449
 ]

ASF GitHub Bot logged work on BEAM-4863:


Author: ASF GitHub Bot
Created on: 05/Sep/18 17:33
Start Date: 05/Sep/18 17:33
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on a change in pull request #6057: 
[BEAM-4863] Implement consistentWithEquals/structuralValue on 
FullWindowedValueCoder
URL: https://github.com/apache/beam/pull/6057#discussion_r215361516
 
 

 ##
 File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/util/WindowedValue.java
 ##
 @@ -497,7 +497,7 @@ private void ensureWindowsAreASet() {
   // right, and cast the window type away here.
   @SuppressWarnings({"unchecked", "rawtypes"})
   Coder> collectionCoder =
-  (Coder) CollectionCoder.of(this.windowCoder);
 
 Review comment:
   The encoded/decoded bytes will still be the same since both ListCoder and 
CollectionCoder rely on the IterableLikeCoder for encoding/decoding records.
   
   Does Dataflows update care about the names of the coders used or only about 
the encoding?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141449)
Time Spent: 1h  (was: 50m)

> Implement consistentWithEquals/structuralValue on FullWindowedValueCoder
> 
>
> Key: BEAM-4863
> URL: https://issues.apache.org/jira/browse/BEAM-4863
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Major
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> Implementing *consistentWithEquals*/*structuralValue* boosts significantly 
> the performance of using these values in comparison operations since it 
> doesn't require encoding the values.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-4461) Create a library of useful transforms that use schemas

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4461?focusedWorklogId=141441=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141441
 ]

ASF GitHub Bot logged work on BEAM-4461:


Author: ASF GitHub Bot
Created on: 05/Sep/18 17:25
Start Date: 05/Sep/18 17:25
Worklog Time Spent: 10m 
  Work Description: akedin commented on a change in pull request #6316: 
[BEAM-4461] Add Unnest transform.
URL: https://github.com/apache/beam/pull/6316#discussion_r215347139
 
 

 ##
 File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/transforms/Unnest.java
 ##
 @@ -0,0 +1,182 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.beam.sdk.schemas.transforms;
+
+import com.google.auto.value.AutoValue;
+import com.google.common.collect.Lists;
+import java.util.List;
+import org.apache.beam.sdk.annotations.Experimental;
+import org.apache.beam.sdk.annotations.Experimental.Kind;
+import org.apache.beam.sdk.schemas.Schema;
+import org.apache.beam.sdk.schemas.Schema.Field;
+import org.apache.beam.sdk.transforms.DoFn;
+import org.apache.beam.sdk.transforms.PTransform;
+import org.apache.beam.sdk.transforms.ParDo;
+import org.apache.beam.sdk.transforms.SerializableFunction;
+import org.apache.beam.sdk.values.PCollection;
+import org.apache.beam.sdk.values.Row;
+/**
+ * A {@link PTransform} to unnest nested rows.
+ *
+ * For example, consider a Row with the following nestedschema:
+ *
+ * UserEvent Schema: userid: INT64 timestamp: DATETIME location: LatLong
+ *
+ * LatLong Schema: latitude: DOUBLE longitude: DOUBLE
+ *
+ * After unnesting, all of the rows will be converted to rows satisfying 
the following schema:
+ *
+ * UserEvent Schema: userid: INT64 timestamp: DATETIME location.latitude: 
DOUBLE
+ * location.longitude: DOUBLE
+ *
+ * By default nested names are concatenated to generated the unnested name, 
however {@link
+ * Unnest.Inner#withFieldNameFunction} can be used to specify a custom naming 
policy.
+ *
+ * Note that currently array and map values are not unnested.
+ */
+@Experimental(Kind.SCHEMAS)
+public class Unnest {
+  public static  Inner create() {
+return new AutoValue_Unnest_Inner.Builder()
+.setMaxLevels(Integer.MAX_VALUE)
+.setFieldNameFunction(CONCAT_FIELD_NAMES)
+.build();
+  }
+  /**
+   * This is the default naming policy for naming fields. Every field name in 
the path to a given
+   * field is concated with . characters.
+   */
+  public static final SerializableFunction, String> 
CONCAT_FIELD_NAMES =
+  l -> {
+return String.join(".", l);
+  };
+  /**
+   * This policy keeps the raw nested field name. If two differently-nested 
fields have the same
+   * name, unnesting will fail with this policy.
+   */
+  public static final SerializableFunction, String> 
KEEP_NESTED_NAME =
+  l -> {
+return l.get(l.size() - 1);
+  };
+  /** Returns the result of unnesting the given schema. The default naming 
policy is used. */
+  static Schema getUnnestedSchema(Schema schema, int maxLevels) {
+List nameComponents = Lists.newArrayList();
+return getUnnestedSchema(schema, nameComponents, CONCAT_FIELD_NAMES, 
maxLevels, 0);
+  }
+  /** Returns the result of unnesting the given schema with the given naming 
policy. */
+  static Schema getUnnestedSchema(
+  Schema schema, int maxLevels, SerializableFunction, String> 
fn) {
+List nameComponents = Lists.newArrayList();
+return getUnnestedSchema(schema, nameComponents, fn, maxLevels, 0);
+  }
+
+  private static Schema getUnnestedSchema(
+  Schema schema,
+  List nameComponents,
+  SerializableFunction, String> fn,
+  int maxLevel,
+  int currentLevel) {
+Schema.Builder builder = Schema.builder();
+for (Field field : schema.getFields()) {
+  nameComponents.add(field.getName());
+  if (field.getType().getTypeName().isCompositeType() && currentLevel < 
maxLevel) {
 
 Review comment:
   throw for arrays/maps?


This is an automated message from the Apache Git Service.
To 

[jira] [Work logged] (BEAM-4461) Create a library of useful transforms that use schemas

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4461?focusedWorklogId=141442=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141442
 ]

ASF GitHub Bot logged work on BEAM-4461:


Author: ASF GitHub Bot
Created on: 05/Sep/18 17:25
Start Date: 05/Sep/18 17:25
Worklog Time Spent: 10m 
  Work Description: akedin commented on a change in pull request #6316: 
[BEAM-4461] Add Unnest transform.
URL: https://github.com/apache/beam/pull/6316#discussion_r215354112
 
 

 ##
 File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/transforms/Unnest.java
 ##
 @@ -0,0 +1,182 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.beam.sdk.schemas.transforms;
+
+import com.google.auto.value.AutoValue;
+import com.google.common.collect.Lists;
+import java.util.List;
+import org.apache.beam.sdk.annotations.Experimental;
+import org.apache.beam.sdk.annotations.Experimental.Kind;
+import org.apache.beam.sdk.schemas.Schema;
+import org.apache.beam.sdk.schemas.Schema.Field;
+import org.apache.beam.sdk.transforms.DoFn;
+import org.apache.beam.sdk.transforms.PTransform;
+import org.apache.beam.sdk.transforms.ParDo;
+import org.apache.beam.sdk.transforms.SerializableFunction;
+import org.apache.beam.sdk.values.PCollection;
+import org.apache.beam.sdk.values.Row;
+/**
+ * A {@link PTransform} to unnest nested rows.
+ *
+ * For example, consider a Row with the following nestedschema:
+ *
+ * UserEvent Schema: userid: INT64 timestamp: DATETIME location: LatLong
+ *
+ * LatLong Schema: latitude: DOUBLE longitude: DOUBLE
+ *
+ * After unnesting, all of the rows will be converted to rows satisfying 
the following schema:
+ *
+ * UserEvent Schema: userid: INT64 timestamp: DATETIME location.latitude: 
DOUBLE
+ * location.longitude: DOUBLE
+ *
+ * By default nested names are concatenated to generated the unnested name, 
however {@link
+ * Unnest.Inner#withFieldNameFunction} can be used to specify a custom naming 
policy.
+ *
+ * Note that currently array and map values are not unnested.
+ */
+@Experimental(Kind.SCHEMAS)
+public class Unnest {
+  public static  Inner create() {
+return new AutoValue_Unnest_Inner.Builder()
+.setMaxLevels(Integer.MAX_VALUE)
+.setFieldNameFunction(CONCAT_FIELD_NAMES)
+.build();
+  }
+  /**
+   * This is the default naming policy for naming fields. Every field name in 
the path to a given
+   * field is concated with . characters.
+   */
+  public static final SerializableFunction, String> 
CONCAT_FIELD_NAMES =
+  l -> {
+return String.join(".", l);
 
 Review comment:
   I would probably use some symbol other than `"."` (maybe use `"_"` instead) 
to avoid accidental conflicts with field access operator in weird places 
downstream (e.g. accessing fields in SQL, or parsing json with these dotted 
field names).  E.g. you cannot do `JSON.parse("{ \"asd.asd\" : \"123123\" 
}").asd.asd`, which probably a some people would expect, and `eval("{ asd.asd : 
\"123123\" }")` doesn't work.
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141442)
Time Spent: 5h 20m  (was: 5h 10m)

> Create a library of useful transforms that use schemas
> --
>
> Key: BEAM-4461
> URL: https://issues.apache.org/jira/browse/BEAM-4461
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-core
>Reporter: Reuven Lax
>Assignee: Reuven Lax
>Priority: Major
>  Time Spent: 5h 20m
>  Remaining Estimate: 0h
>
> e.g. JoinBy(fields). Project, Filter, etc.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work logged] (BEAM-4461) Create a library of useful transforms that use schemas

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4461?focusedWorklogId=141443=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141443
 ]

ASF GitHub Bot logged work on BEAM-4461:


Author: ASF GitHub Bot
Created on: 05/Sep/18 17:25
Start Date: 05/Sep/18 17:25
Worklog Time Spent: 10m 
  Work Description: akedin commented on a change in pull request #6316: 
[BEAM-4461] Add Unnest transform.
URL: https://github.com/apache/beam/pull/6316#discussion_r215352115
 
 

 ##
 File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/transforms/Unnest.java
 ##
 @@ -0,0 +1,182 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.beam.sdk.schemas.transforms;
+
+import com.google.auto.value.AutoValue;
+import com.google.common.collect.Lists;
+import java.util.List;
+import org.apache.beam.sdk.annotations.Experimental;
+import org.apache.beam.sdk.annotations.Experimental.Kind;
+import org.apache.beam.sdk.schemas.Schema;
+import org.apache.beam.sdk.schemas.Schema.Field;
+import org.apache.beam.sdk.transforms.DoFn;
+import org.apache.beam.sdk.transforms.PTransform;
+import org.apache.beam.sdk.transforms.ParDo;
+import org.apache.beam.sdk.transforms.SerializableFunction;
+import org.apache.beam.sdk.values.PCollection;
+import org.apache.beam.sdk.values.Row;
+/**
+ * A {@link PTransform} to unnest nested rows.
+ *
+ * For example, consider a Row with the following nestedschema:
+ *
+ * UserEvent Schema: userid: INT64 timestamp: DATETIME location: LatLong
+ *
+ * LatLong Schema: latitude: DOUBLE longitude: DOUBLE
+ *
+ * After unnesting, all of the rows will be converted to rows satisfying 
the following schema:
+ *
+ * UserEvent Schema: userid: INT64 timestamp: DATETIME location.latitude: 
DOUBLE
+ * location.longitude: DOUBLE
+ *
+ * By default nested names are concatenated to generated the unnested name, 
however {@link
+ * Unnest.Inner#withFieldNameFunction} can be used to specify a custom naming 
policy.
+ *
+ * Note that currently array and map values are not unnested.
+ */
+@Experimental(Kind.SCHEMAS)
+public class Unnest {
+  public static  Inner create() {
+return new AutoValue_Unnest_Inner.Builder()
+.setMaxLevels(Integer.MAX_VALUE)
+.setFieldNameFunction(CONCAT_FIELD_NAMES)
+.build();
+  }
+  /**
+   * This is the default naming policy for naming fields. Every field name in 
the path to a given
+   * field is concated with . characters.
+   */
+  public static final SerializableFunction, String> 
CONCAT_FIELD_NAMES =
+  l -> {
+return String.join(".", l);
+  };
+  /**
+   * This policy keeps the raw nested field name. If two differently-nested 
fields have the same
+   * name, unnesting will fail with this policy.
+   */
+  public static final SerializableFunction, String> 
KEEP_NESTED_NAME =
+  l -> {
+return l.get(l.size() - 1);
+  };
+  /** Returns the result of unnesting the given schema. The default naming 
policy is used. */
+  static Schema getUnnestedSchema(Schema schema, int maxLevels) {
+List nameComponents = Lists.newArrayList();
+return getUnnestedSchema(schema, nameComponents, CONCAT_FIELD_NAMES, 
maxLevels, 0);
+  }
+  /** Returns the result of unnesting the given schema with the given naming 
policy. */
+  static Schema getUnnestedSchema(
+  Schema schema, int maxLevels, SerializableFunction, String> 
fn) {
+List nameComponents = Lists.newArrayList();
+return getUnnestedSchema(schema, nameComponents, fn, maxLevels, 0);
+  }
+
+  private static Schema getUnnestedSchema(
+  Schema schema,
+  List nameComponents,
+  SerializableFunction, String> fn,
+  int maxLevel,
+  int currentLevel) {
+Schema.Builder builder = Schema.builder();
+for (Field field : schema.getFields()) {
+  nameComponents.add(field.getName());
+  if (field.getType().getTypeName().isCompositeType() && currentLevel < 
maxLevel) {
+Schema nestedSchema =
+getUnnestedSchema(
+field.getType().getRowSchema(), nameComponents, fn, maxLevel, 
currentLevel + 1);
+for 

[jira] [Work logged] (BEAM-4461) Create a library of useful transforms that use schemas

2018-09-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-4461?focusedWorklogId=141439=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-141439
 ]

ASF GitHub Bot logged work on BEAM-4461:


Author: ASF GitHub Bot
Created on: 05/Sep/18 17:25
Start Date: 05/Sep/18 17:25
Worklog Time Spent: 10m 
  Work Description: akedin commented on a change in pull request #6316: 
[BEAM-4461] Add Unnest transform.
URL: https://github.com/apache/beam/pull/6316#discussion_r215348371
 
 

 ##
 File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/transforms/Unnest.java
 ##
 @@ -0,0 +1,182 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.beam.sdk.schemas.transforms;
+
+import com.google.auto.value.AutoValue;
+import com.google.common.collect.Lists;
+import java.util.List;
+import org.apache.beam.sdk.annotations.Experimental;
+import org.apache.beam.sdk.annotations.Experimental.Kind;
+import org.apache.beam.sdk.schemas.Schema;
+import org.apache.beam.sdk.schemas.Schema.Field;
+import org.apache.beam.sdk.transforms.DoFn;
+import org.apache.beam.sdk.transforms.PTransform;
+import org.apache.beam.sdk.transforms.ParDo;
+import org.apache.beam.sdk.transforms.SerializableFunction;
+import org.apache.beam.sdk.values.PCollection;
+import org.apache.beam.sdk.values.Row;
+/**
+ * A {@link PTransform} to unnest nested rows.
+ *
+ * For example, consider a Row with the following nestedschema:
+ *
+ * UserEvent Schema: userid: INT64 timestamp: DATETIME location: LatLong
+ *
+ * LatLong Schema: latitude: DOUBLE longitude: DOUBLE
+ *
+ * After unnesting, all of the rows will be converted to rows satisfying 
the following schema:
+ *
+ * UserEvent Schema: userid: INT64 timestamp: DATETIME location.latitude: 
DOUBLE
+ * location.longitude: DOUBLE
+ *
+ * By default nested names are concatenated to generated the unnested name, 
however {@link
+ * Unnest.Inner#withFieldNameFunction} can be used to specify a custom naming 
policy.
+ *
+ * Note that currently array and map values are not unnested.
+ */
+@Experimental(Kind.SCHEMAS)
+public class Unnest {
+  public static  Inner create() {
+return new AutoValue_Unnest_Inner.Builder()
+.setMaxLevels(Integer.MAX_VALUE)
+.setFieldNameFunction(CONCAT_FIELD_NAMES)
+.build();
+  }
+  /**
+   * This is the default naming policy for naming fields. Every field name in 
the path to a given
+   * field is concated with . characters.
+   */
+  public static final SerializableFunction, String> 
CONCAT_FIELD_NAMES =
+  l -> {
+return String.join(".", l);
+  };
+  /**
+   * This policy keeps the raw nested field name. If two differently-nested 
fields have the same
+   * name, unnesting will fail with this policy.
+   */
+  public static final SerializableFunction, String> 
KEEP_NESTED_NAME =
 
 Review comment:
   Do we really need to customize naming in the first iteration? I suggest 
removing `CONCAT_FIELD_NAMES` and `KEEP_NESTED_NAME`, they have a 
[YAGNI](https://martinfowler.com/bliki/Yagni.html) smell. My opinion is that 
documentation of how naming behaves should be enough to start with, and then we 
could add policies later if needed. Or we could make the two behaviors a flag 
for now, e.g. `concatFieldNames()` vs `useLeafFieldName()`, I think it would be 
clearer to the users


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 141439)
Time Spent: 5h  (was: 4h 50m)

> Create a library of useful transforms that use schemas
> --
>
> Key: BEAM-4461
> URL: https://issues.apache.org/jira/browse/BEAM-4461
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-core
>Reporter: Reuven Lax
> 

  1   2   3   >