[jira] [Updated] (BEAM-6928) Make Python SDK custom Sink the default Sink for BigQuery

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-6928:
--
Fix Version/s: (was: 2.19.0)

> Make Python SDK custom Sink the default Sink for BigQuery
> -
>
> Key: BEAM-6928
> URL: https://issues.apache.org/jira/browse/BEAM-6928
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-core
>Reporter: Pablo Estrada
>Assignee: Pablo Estrada
>Priority: Major
>
> This is for 2.14.0 - please bump version to 2.14.0 when doing 2.13.0 release.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-6892) Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS if not specified by user.

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-6892:
--
Fix Version/s: (was: 2.19.0)

> Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS 
> if not specified by user.
> ---
>
> Key: BEAM-6892
> URL: https://issues.apache.org/jira/browse/BEAM-6892
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Valentyn Tymofieiev
>Assignee: Pablo Estrada
>Priority: Major
>  Time Spent: 8h 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8992) ./gradlew :sdks:go:examples:goVet fails

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-8992:
--
Fix Version/s: (was: 2.19.0)

> ./gradlew :sdks:go:examples:goVet fails
> ---
>
> Key: BEAM-8992
> URL: https://issues.apache.org/jira/browse/BEAM-8992
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-go
>Affects Versions: 2.19.0
>Reporter: Udi Meiri
>Assignee: Robert Burke
>Priority: Major
>
> {code}
> > Task :sdks:go:examples:resolveBuildDependencies
> Resolving 
> ./github.com/apache/beam/sdks/go@/usr/local/google/home/ehudm/src/beam-release/sdks/go
> .gogradle/project_gopath/src/github.com/apache/beam/sdks/go/examples/vendor/github.com/apache/beam/sdks/go/pkg/beam/io/avroio/avroio.go:28:2:
>  cannot find package "github.com/linkedin/goavro" in any of:
> 
> /usr/local/google/home/ehudm/src/beam-release/sdks/go/examples/.gogradle/project_gopath/src/github.com/apache/beam/sdks/go/examples/vendor/github.com/linkedin/goavro
>  (vendor tree)
> 
> /usr/local/google/home/ehudm/.gradle/go/binary/1.12/go/src/github.com/linkedin/goavro
>  (from $GOROOT)
> 
> /usr/local/google/home/ehudm/src/beam-release/sdks/go/examples/.gogradle/project_gopath/src/github.com/linkedin/goavro
>  (from $GOPATH)
> > Task :sdks:go:examples:goVet FAILED
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9012) Include `-> None` on Pipeline and PipelineOptions `__init__` methods for pytype compatibility

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-9012:
--
Fix Version/s: (was: 2.19.0)

> Include `-> None` on Pipeline and PipelineOptions `__init__` methods for 
> pytype compatibility
> -
>
> Key: BEAM-9012
> URL: https://issues.apache.org/jira/browse/BEAM-9012
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-core
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: Major
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> mypy [made a decision|https://github.com/python/mypy/issues/604] to allow 
> init methods to omit {{\-> None}} return type annotations, but pytype has no 
> such feature. I think we should include {{\-> None}} annotations for pytype 
> compatibility.
> cc: [~chadrik]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9012) Include `-> None` on Pipeline and PipelineOptions `__init__` methods for pytype compatibility

2020-01-08 Thread Kenneth Knowles (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011517#comment-17011517
 ] 

Kenneth Knowles commented on BEAM-9012:
---

Peanut gallery: Seems like satisfying both static checkers is worth it in this 
case. I'm employed by Google, but my heart belongs to types/verification, so 
I'm not just saying this because of my paycheck.

> Include `-> None` on Pipeline and PipelineOptions `__init__` methods for 
> pytype compatibility
> -
>
> Key: BEAM-9012
> URL: https://issues.apache.org/jira/browse/BEAM-9012
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-core
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> mypy [made a decision|https://github.com/python/mypy/issues/604] to allow 
> init methods to omit {{\-> None}} return type annotations, but pytype has no 
> such feature. I think we should include {{\-> None}} annotations for pytype 
> compatibility.
> cc: [~chadrik]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9013) Multi-output TestStream breaks the DataflowRunner

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-9013:
--
Fix Version/s: (was: 2.19.0)

> Multi-output TestStream breaks the DataflowRunner
> -
>
> Key: BEAM-9013
> URL: https://issues.apache.org/jira/browse/BEAM-9013
> Project: Beam
>  Issue Type: Bug
>  Components: runner-dataflow
>Affects Versions: 2.17.0
>Reporter: Sam Rohde
>Assignee: Sam Rohde
>Priority: Major
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9049) MemoryMonitor thrashing detection is too aggressive for batch workers

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-9049:
--
Fix Version/s: (was: 2.19.0)

> MemoryMonitor thrashing detection is too aggressive for batch workers
> -
>
> Key: BEAM-9049
> URL: https://issues.apache.org/jira/browse/BEAM-9049
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-dataflow
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: Major
>  Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> In the streaming dataflow worker we've implemented push-back so that we will 
> reduce parallelism when there is memory pressure. Since we cannot do this on 
> the batch worker, it doesn't make sense for our thrashing detection to be so 
> aggressive. We should increase the thresholds used for thrashing detection 
> when running on Batch workloads.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9013) Multi-output TestStream breaks the DataflowRunner

2020-01-08 Thread Kenneth Knowles (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011514#comment-17011514
 ] 

Kenneth Knowles commented on BEAM-9013:
---

Can you update the "Affects Version" and "Fix Version" fields to match what 
release notes should display?

> Multi-output TestStream breaks the DataflowRunner
> -
>
> Key: BEAM-9013
> URL: https://issues.apache.org/jira/browse/BEAM-9013
> Project: Beam
>  Issue Type: Bug
>  Components: runner-dataflow
>Affects Versions: 2.17.0
>Reporter: Sam Rohde
>Assignee: Sam Rohde
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9068) Use local docker image if available for Java and Go

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-9068:
--
Fix Version/s: (was: 2.19.0)

> Use local docker image if available for Java and Go
> ---
>
> Key: BEAM-9068
> URL: https://issues.apache.org/jira/browse/BEAM-9068
> Project: Beam
>  Issue Type: Task
>  Components: build-system
>Affects Versions: 2.17.0
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
>
> Python already implemented in this way.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-7861:
--
Fix Version/s: (was: 2.19.0)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9030) Bump grpc to 1.26.0

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-9030:
--
Fix Version/s: (was: 2.19.0)

> Bump grpc to 1.26.0
> ---
>
> Key: BEAM-9030
> URL: https://issues.apache.org/jira/browse/BEAM-9030
> Project: Beam
>  Issue Type: Bug
>  Components: java-fn-execution, runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 6h
>  Remaining Estimate: 0h
>
> When submitting a Python word count job to a Flink session/standalone cluster 
> repeatedly, the meta space usage of the task manager of the Flink cluster 
> will continuously increase (about 40MB each time). The reason is that the 
> Beam classes are loaded with the user class loader in Flink and there are 
> problems with the implementation of `ProcessManager`(from Beam) and 
> `ThreadPoolCache`(from netty) which may cause the user class loader could not 
> be garbage collected even after the job finished which causes the meta space 
> memory leak eventually. You can refer to FLINK-15338[1] for more information.
> Regarding to `ProcessManager`, I have created a JIRA BEAM-9006[2] to track 
> it. Regarding to `ThreadPoolCache`, it is a Netty problem and has been fixed 
> in NETTY#8955[3]. Netty 4.1.35 Final has already included this fix and GRPC 
> 1.22.0 has already dependents on Netty 4.1.35 Final. So we need to bump the 
> version of GRPC to 1.22.0+ (currently 1.21.0).
>  
> What do you think?
> [1] https://issues.apache.org/jira/browse/FLINK-15338
> [2] https://issues.apache.org/jira/browse/BEAM-9006
> [3] [https://github.com/netty/netty/pull/8955]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9030) Bump grpc to 1.26.0

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-9030:
--
Issue Type: Improvement  (was: Bug)

> Bump grpc to 1.26.0
> ---
>
> Key: BEAM-9030
> URL: https://issues.apache.org/jira/browse/BEAM-9030
> Project: Beam
>  Issue Type: Improvement
>  Components: java-fn-execution, runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
>  Time Spent: 6h
>  Remaining Estimate: 0h
>
> When submitting a Python word count job to a Flink session/standalone cluster 
> repeatedly, the meta space usage of the task manager of the Flink cluster 
> will continuously increase (about 40MB each time). The reason is that the 
> Beam classes are loaded with the user class loader in Flink and there are 
> problems with the implementation of `ProcessManager`(from Beam) and 
> `ThreadPoolCache`(from netty) which may cause the user class loader could not 
> be garbage collected even after the job finished which causes the meta space 
> memory leak eventually. You can refer to FLINK-15338[1] for more information.
> Regarding to `ProcessManager`, I have created a JIRA BEAM-9006[2] to track 
> it. Regarding to `ThreadPoolCache`, it is a Netty problem and has been fixed 
> in NETTY#8955[3]. Netty 4.1.35 Final has already included this fix and GRPC 
> 1.22.0 has already dependents on Netty 4.1.35 Final. So we need to bump the 
> version of GRPC to 1.22.0+ (currently 1.21.0).
>  
> What do you think?
> [1] https://issues.apache.org/jira/browse/FLINK-15338
> [2] https://issues.apache.org/jira/browse/BEAM-9006
> [3] [https://github.com/netty/netty/pull/8955]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-8989) Backwards incompatible change in ParDo.getSideInputs (caught by failure when running Apache Nemo quickstart)

2020-01-08 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-8989:
--
Fix Version/s: (was: 2.19.0)

> Backwards incompatible change in ParDo.getSideInputs (caught by failure when 
> running Apache Nemo quickstart)
> 
>
> Key: BEAM-8989
> URL: https://issues.apache.org/jira/browse/BEAM-8989
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Affects Versions: 2.16.0, 2.17.0, 2.18.0
>Reporter: Luke Cwik
>Assignee: Reuven Lax
>Priority: Critical
>
> [PR/9275|https://github.com/apache/beam/pull/9275] changed 
> *ParDo.getSideInputs* from *List* to *Map PCollectionView>* which is backwards incompatible change and was released as 
> part of Beam 2.16.0 erroneously.
> Running the Apache Nemo Quickstart fails with:
>  
> {code:java}
> Exception in thread "main" java.lang.RuntimeException: Translator private 
> static void 
> org.apache.nemo.compiler.frontend.beam.PipelineTranslator.parDoMultiOutputTranslator(org.apache.nemo.compiler.frontend.beam.PipelineTranslationContext,org.apache.beam.sdk.runners.TransformHierarchy$Node,org.apache.beam.sdk.transforms.ParDo$MultiOutput)
>  have failed to translate 
> org.apache.beam.examples.WordCount$ExtractWordsFn@600b9d27Exception in thread 
> "main" java.lang.RuntimeException: Translator private static void 
> org.apache.nemo.compiler.frontend.beam.PipelineTranslator.parDoMultiOutputTranslator(org.apache.nemo.compiler.frontend.beam.PipelineTranslationContext,org.apache.beam.sdk.runners.TransformHierarchy$Node,org.apache.beam.sdk.transforms.ParDo$MultiOutput)
>  have failed to translate 
> org.apache.beam.examples.WordCount$ExtractWordsFn@600b9d27 at 
> org.apache.nemo.compiler.frontend.beam.PipelineTranslator.translatePrimitive(PipelineTranslator.java:113)
>  at 
> org.apache.nemo.compiler.frontend.beam.PipelineVisitor.visitPrimitiveTransform(PipelineVisitor.java:46)
>  at 
> org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:665)
>  at 
> org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:657)
>  at 
> org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:657)
>  at 
> org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:657)
>  at 
> org.apache.beam.sdk.runners.TransformHierarchy$Node.access$600(TransformHierarchy.java:317)
>  at 
> org.apache.beam.sdk.runners.TransformHierarchy.visit(TransformHierarchy.java:251)
>  at org.apache.beam.sdk.Pipeline.traverseTopologically(Pipeline.java:460) at 
> org.apache.nemo.compiler.frontend.beam.NemoRunner.run(NemoRunner.java:80) at 
> org.apache.nemo.compiler.frontend.beam.NemoRunner.run(NemoRunner.java:31) at 
> org.apache.beam.sdk.Pipeline.run(Pipeline.java:315) at 
> org.apache.beam.sdk.Pipeline.run(Pipeline.java:301) at 
> org.apache.beam.examples.WordCount.runWordCount(WordCount.java:185) at 
> org.apache.beam.examples.WordCount.main(WordCount.java:192)Caused by: 
> java.lang.reflect.InvocationTargetException at 
> sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  at java.lang.reflect.Method.invoke(Method.java:498) at 
> org.apache.nemo.compiler.frontend.beam.PipelineTranslator.translatePrimitive(PipelineTranslator.java:109)
>  ... 14 moreCaused by: java.lang.NoSuchMethodError: 
> org.apache.beam.sdk.transforms.ParDo$MultiOutput.getSideInputs()Ljava/util/List;
>  at 
> org.apache.nemo.compiler.frontend.beam.PipelineTranslator.parDoMultiOutputTranslator(PipelineTranslator.java:236)
>  ... 19 more{code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9075) Join re-ordering doesn't work for ZetaSQL

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9075?focusedWorklogId=368817=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368817
 ]

ASF GitHub Bot logged work on BEAM-9075:


Author: ASF GitHub Bot
Created on: 09/Jan/20 06:57
Start Date: 09/Jan/20 06:57
Worklog Time Spent: 10m 
  Work Description: amaliujia commented on issue #10542: [BEAM-9075] 
Disable JoinCommuteRule for ZetaSQL planner
URL: https://github.com/apache/beam/pull/10542#issuecomment-572419480
 
 
   LGTM
   
   It would be great if we can test by internal test suite before merging it.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368817)
Time Spent: 40m  (was: 0.5h)

> Join re-ordering doesn't work for ZetaSQL
> -
>
> Key: BEAM-9075
> URL: https://issues.apache.org/jira/browse/BEAM-9075
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql-zetasql
>Affects Versions: 2.17.0
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: Major
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> Join re-ordering can actually break queries in the ZetaSQL planner because 
> Calcite's JoinCommuteRule doesn't work when schemas have nested fields. It 
> only adjusts RexInputRef nodes and not RexFieldAccess.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9075) Join re-ordering doesn't work for ZetaSQL

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9075?focusedWorklogId=368818=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368818
 ]

ASF GitHub Bot logged work on BEAM-9075:


Author: ASF GitHub Bot
Created on: 09/Jan/20 06:57
Start Date: 09/Jan/20 06:57
Worklog Time Spent: 10m 
  Work Description: amaliujia commented on issue #10542: [BEAM-9075] 
Disable JoinCommuteRule for ZetaSQL planner
URL: https://github.com/apache/beam/pull/10542#issuecomment-572419480
 
 
   LGTM
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368818)
Time Spent: 50m  (was: 40m)

> Join re-ordering doesn't work for ZetaSQL
> -
>
> Key: BEAM-9075
> URL: https://issues.apache.org/jira/browse/BEAM-9075
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql-zetasql
>Affects Versions: 2.17.0
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: Major
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> Join re-ordering can actually break queries in the ZetaSQL planner because 
> Calcite's JoinCommuteRule doesn't work when schemas have nested fields. It 
> only adjusts RexInputRef nodes and not RexFieldAccess.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9076) [QST] How to make python sdk wheel package?

2020-01-08 Thread Keunhyun Oh (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Keunhyun Oh updated BEAM-9076:
--
Description: 
How to make python sdk wheel package?

I've modified and tested beam's python sdks.

I've tried to run command to upload local built wheel package to my local 
repository like below.
{code:java}
$> python setup.py sdist bdist_wheel upload -r local{code}
Below files are not included in the package made by me.
{code:java}
./apache_beam/coders: coder_impl.cpython-36m-x86_64-linux-gnu.so
./apache_beam/coders: stream.cpython-36m-x86_64-linux-gnu.so
./apache_beam/metrics: execution.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners: common.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: logger.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: opcounters.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: operations.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: statesampler_fast.cpython-36m-x86_64-linux-gnu.so
./apache_beam/transforms: cy_combiners.cpython-36m-x86_64-linux-gnu.so
./apache_beam/transforms: 
cy_dataflow_distribution_counter.cpython-36m-x86_64-linux-gnu.so
./apache_beam/utils: counters.cpython-36m-x86_64-linux-gnu.so
./apache_beam/utils: windowed_value.cpython-36m-x86_64-linux-gnu.so
{code}
Please, let me know what is right command to make the wheel package?

  was:
How to make python sdk wheel package?

I've modified and tested beam's python sdks.

I've tried to run command to upload local built wheel package to my local 
repository like below.
{code:java}
$> python setup.py sdist bdist_wheel upload -r local{code}
Below files are not included in the package made by me.
{code:java}
./apache_beam/coders: coder_impl.cpython-36m-x86_64-linux-gnu.so
./apache_beam/coders: stream.cpython-36m-x86_64-linux-gnu.so
./apache_beam/metrics: execution.cpython-36m-x86_64-linux-gnu.soOnly in 
./apache_beam/runners: common.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: logger.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: opcounters.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: operations.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: statesampler_fast.cpython-36m-x86_64-linux-gnu.so
./apache_beam/transforms: cy_combiners.cpython-36m-x86_64-linux-gnu.so
./apache_beam/transforms: 
cy_dataflow_distribution_counter.cpython-36m-x86_64-linux-gnu.so
./apache_beam/utils: counters.cpython-36m-x86_64-linux-gnu.soOnly in 
./apache_beam/utils: windowed_value.cpython-36m-x86_64-linux-gnu.so
{code}
Please, let me know what is right command to make the wheel package?


> [QST] How to make python sdk wheel package?
> ---
>
> Key: BEAM-9076
> URL: https://issues.apache.org/jira/browse/BEAM-9076
> Project: Beam
>  Issue Type: Task
>  Components: build-system
>Affects Versions: 2.16.0
>Reporter: Keunhyun Oh
>Priority: Major
> Fix For: Not applicable
>
>
> How to make python sdk wheel package?
> I've modified and tested beam's python sdks.
> I've tried to run command to upload local built wheel package to my local 
> repository like below.
> {code:java}
> $> python setup.py sdist bdist_wheel upload -r local{code}
> Below files are not included in the package made by me.
> {code:java}
> ./apache_beam/coders: coder_impl.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/coders: stream.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/metrics: execution.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/runners: common.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/runners/worker: logger.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/runners/worker: opcounters.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/runners/worker: operations.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/runners/worker: 
> statesampler_fast.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/transforms: cy_combiners.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/transforms: 
> cy_dataflow_distribution_counter.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/utils: counters.cpython-36m-x86_64-linux-gnu.so
> ./apache_beam/utils: windowed_value.cpython-36m-x86_64-linux-gnu.so
> {code}
> Please, let me know what is right command to make the wheel package?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9076) [QST] How to make python sdk wheel package?

2020-01-08 Thread Keunhyun Oh (Jira)
Keunhyun Oh created BEAM-9076:
-

 Summary: [QST] How to make python sdk wheel package?
 Key: BEAM-9076
 URL: https://issues.apache.org/jira/browse/BEAM-9076
 Project: Beam
  Issue Type: Task
  Components: build-system
Affects Versions: 2.16.0
Reporter: Keunhyun Oh
 Fix For: Not applicable


How to make python sdk wheel package?

I've modified and tested beam's python sdks.

I've tried to run command to upload local built wheel package to my local 
repository like below.
{code:java}
$> python setup.py sdist bdist_wheel upload -r local{code}
Below files are not included in the package made by me.
{code:java}
./apache_beam/coders: coder_impl.cpython-36m-x86_64-linux-gnu.so
./apache_beam/coders: stream.cpython-36m-x86_64-linux-gnu.so
./apache_beam/metrics: execution.cpython-36m-x86_64-linux-gnu.soOnly in 
./apache_beam/runners: common.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: logger.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: opcounters.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: operations.cpython-36m-x86_64-linux-gnu.so
./apache_beam/runners/worker: statesampler_fast.cpython-36m-x86_64-linux-gnu.so
./apache_beam/transforms: cy_combiners.cpython-36m-x86_64-linux-gnu.so
./apache_beam/transforms: 
cy_dataflow_distribution_counter.cpython-36m-x86_64-linux-gnu.so
./apache_beam/utils: counters.cpython-36m-x86_64-linux-gnu.soOnly in 
./apache_beam/utils: windowed_value.cpython-36m-x86_64-linux-gnu.so
{code}
Please, let me know what is right command to make the wheel package?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368806=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368806
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 05:07
Start Date: 09/Jan/20 05:07
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364561109
 
 

 ##
 File path: sdks/python/gen_protos.py
 ##
 @@ -47,6 +50,164 @@
 ]
 
 
+def generate_urn_files(log, out_dir):
 
 Review comment:
   Note that the parent package is excluded from pylint, so style is not an 
issue. 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368806)
Time Spent: 41h 10m  (was: 41h)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 41h 10m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368796=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368796
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 03:18
Start Date: 09/Jan/20 03:18
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364543781
 
 

 ##
 File path: sdks/python/apache_beam/transforms/sideinputs.py
 ##
 @@ -58,6 +59,16 @@ def map_via_end(source_window):
   return map_via_end
 
 
+def get_sideinput_index(tag):
+  # type: (str) -> int
+  match = re.match('side([0-9]+)(-.*)?$', tag,
+   re.DOTALL)
 
 Review comment:
   I made two assumptions with this change:
   
   - that all these instances were supposed to be doing the same thing
   - that the author who used `re.DOTALL` did so deliberately
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368796)
Time Spent: 41h  (was: 40h 50m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 41h
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9059) Migrate PTransformTranslation to use string constants

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9059?focusedWorklogId=368794=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368794
 ]

ASF GitHub Bot logged work on BEAM-9059:


Author: ASF GitHub Bot
Created on: 09/Jan/20 03:10
Start Date: 09/Jan/20 03:10
Worklog Time Spent: 10m 
  Work Description: sunjincheng121 commented on pull request #10521: 
[BEAM-9059] Use string constants in PTransformTranslation instead of getUrn()
URL: https://github.com/apache/beam/pull/10521#discussion_r364542265
 
 

 ##
 File path: 
sdks/java/harness/src/main/java/org/apache/beam/fn/harness/WindowMappingFnRunner.java
 ##
 @@ -45,7 +44,7 @@
  * to associate each input with its output. The nonce is represented as an 
opaque set of bytes.
  */
 public class WindowMappingFnRunner {
-  static final String URN = 
BeamUrns.getUrn(StandardPTransforms.Primitives.MAP_WINDOWS);
+  static final String URN = PTransformTranslation.MERGE_WINDOWS_TRANSFORM_URN;
 
 Review comment:
   should be `MAP_WINDOWS_TRANSFORM_URN`.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368794)
Time Spent: 3h 10m  (was: 3h)

> Migrate PTransformTranslation to use string constants
> -
>
> Key: BEAM-9059
> URL: https://issues.apache.org/jira/browse/BEAM-9059
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Trivial
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> This allows for the values to be used within switch case statements.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9075) Join re-ordering doesn't work for ZetaSQL

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9075?focusedWorklogId=368792=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368792
 ]

ASF GitHub Bot logged work on BEAM-9075:


Author: ASF GitHub Bot
Created on: 09/Jan/20 03:04
Start Date: 09/Jan/20 03:04
Worklog Time Spent: 10m 
  Work Description: TheNeuralBit commented on issue #10542: [BEAM-9075] 
Disable JoinCommuteRule for ZetaSQL planner
URL: https://github.com/apache/beam/pull/10542#issuecomment-572361171
 
 
   Run SQL PostCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368792)
Time Spent: 0.5h  (was: 20m)

> Join re-ordering doesn't work for ZetaSQL
> -
>
> Key: BEAM-9075
> URL: https://issues.apache.org/jira/browse/BEAM-9075
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql-zetasql
>Affects Versions: 2.17.0
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: Major
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Join re-ordering can actually break queries in the ZetaSQL planner because 
> Calcite's JoinCommuteRule doesn't work when schemas have nested fields. It 
> only adjusts RexInputRef nodes and not RexFieldAccess.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9075) Join re-ordering doesn't work for ZetaSQL

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9075?focusedWorklogId=368791=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368791
 ]

ASF GitHub Bot logged work on BEAM-9075:


Author: ASF GitHub Bot
Created on: 09/Jan/20 03:04
Start Date: 09/Jan/20 03:04
Worklog Time Spent: 10m 
  Work Description: TheNeuralBit commented on issue #10542: [BEAM-9075] 
Disable JoinCommuteRule for ZetaSQL planner
URL: https://github.com/apache/beam/pull/10542#issuecomment-572361149
 
 
   R: @amaliujia 
   CC: @robinyqiu 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368791)
Time Spent: 20m  (was: 10m)

> Join re-ordering doesn't work for ZetaSQL
> -
>
> Key: BEAM-9075
> URL: https://issues.apache.org/jira/browse/BEAM-9075
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql-zetasql
>Affects Versions: 2.17.0
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: Major
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Join re-ordering can actually break queries in the ZetaSQL planner because 
> Calcite's JoinCommuteRule doesn't work when schemas have nested fields. It 
> only adjusts RexInputRef nodes and not RexFieldAccess.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9075) Join re-ordering doesn't work for ZetaSQL

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9075?focusedWorklogId=368790=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368790
 ]

ASF GitHub Bot logged work on BEAM-9075:


Author: ASF GitHub Bot
Created on: 09/Jan/20 03:03
Start Date: 09/Jan/20 03:03
Worklog Time Spent: 10m 
  Work Description: TheNeuralBit commented on pull request #10542: 
[BEAM-9075] Disable JoinCommuteRule for ZetaSQL planner
URL: https://github.com/apache/beam/pull/10542
 
 
   Remove the `JoinCommuteRule` from rulesets used in the ZetaSQL planner. It 
currently doesn't work without struct flattening.
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Gearpump | Samza | Spark
   --- | --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python36/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python36/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python37/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python37/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCompletedBuild/)[![Build
 

[jira] [Work logged] (BEAM-8481) Python 3.7 Postcommit test -- frequent timeouts

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8481?focusedWorklogId=368781=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368781
 ]

ASF GitHub Bot logged work on BEAM-8481:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:51
Start Date: 09/Jan/20 02:51
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10378: [BEAM-8481] Fix a 
race condition in proto stubs generation.
URL: https://github.com/apache/beam/pull/10378#issuecomment-572358372
 
 
   Thanks, @ibzib for the fix for BEAM-9070!
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368781)
Time Spent: 5h 50m  (was: 5h 40m)

> Python 3.7 Postcommit test -- frequent timeouts
> ---
>
> Key: BEAM-8481
> URL: https://issues.apache.org/jira/browse/BEAM-8481
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Ahmet Altay
>Assignee: Valentyn Tymofieiev
>Priority: Critical
>  Time Spent: 5h 50m
>  Remaining Estimate: 0h
>
> [https://builds.apache.org/job/beam_PostCommit_Python37/] – this suite 
> seemingly frequently timing out. Other suites are not affected by these 
> timeouts. From the history, the issues started before Oct 10 and we cannot 
> pinpoint because history is lost.  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8481) Python 3.7 Postcommit test -- frequent timeouts

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8481?focusedWorklogId=368780=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368780
 ]

ASF GitHub Bot logged work on BEAM-8481:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:51
Start Date: 09/Jan/20 02:51
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10378: [BEAM-8481] Fix a 
race condition in proto stubs generation.
URL: https://github.com/apache/beam/pull/10378#issuecomment-572358229
 
 
   PTAL @udim or @ibzib 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368780)
Time Spent: 5h 40m  (was: 5.5h)

> Python 3.7 Postcommit test -- frequent timeouts
> ---
>
> Key: BEAM-8481
> URL: https://issues.apache.org/jira/browse/BEAM-8481
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Ahmet Altay
>Assignee: Valentyn Tymofieiev
>Priority: Critical
>  Time Spent: 5h 40m
>  Remaining Estimate: 0h
>
> [https://builds.apache.org/job/beam_PostCommit_Python37/] – this suite 
> seemingly frequently timing out. Other suites are not affected by these 
> timeouts. From the history, the issues started before Oct 10 and we cannot 
> pinpoint because history is lost.  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9075) Join re-ordering doesn't work for ZetaSQL

2020-01-08 Thread Brian Hulette (Jira)
Brian Hulette created BEAM-9075:
---

 Summary: Join re-ordering doesn't work for ZetaSQL
 Key: BEAM-9075
 URL: https://issues.apache.org/jira/browse/BEAM-9075
 Project: Beam
  Issue Type: Bug
  Components: dsl-sql-zetasql
Affects Versions: 2.17.0
Reporter: Brian Hulette
Assignee: Brian Hulette


Join re-ordering can actually break queries in the ZetaSQL planner because 
Calcite's JoinCommuteRule doesn't work when schemas have nested fields. It only 
adjusts RexInputRef nodes and not RexFieldAccess.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9074) Make it visible on Beam website that Beam signed a pledge to sunset Python 2 offering in 2020 and future versions of Beam will not support Python 2.

2020-01-08 Thread Valentyn Tymofieiev (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Valentyn Tymofieiev updated BEAM-9074:
--
Status: Open  (was: Triage Needed)

> Make it visible on Beam website that Beam signed a pledge to sunset Python 2 
> offering in 2020 and future versions of Beam will not support Python 2. 
> -
>
> Key: BEAM-9074
> URL: https://issues.apache.org/jira/browse/BEAM-9074
> Project: Beam
>  Issue Type: Improvement
>  Components: website
>Reporter: Valentyn Tymofieiev
>Assignee: Rose Nguyen
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9059) Migrate PTransformTranslation to use string constants

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9059?focusedWorklogId=368778=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368778
 ]

ASF GitHub Bot logged work on BEAM-9059:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:31
Start Date: 09/Jan/20 02:31
Worklog Time Spent: 10m 
  Work Description: angoenka commented on pull request #10521: [BEAM-9059] 
Use string constants in PTransformTranslation instead of getUrn()
URL: https://github.com/apache/beam/pull/10521#discussion_r364535090
 
 

 ##
 File path: 
sdks/java/harness/src/main/java/org/apache/beam/fn/harness/WindowMappingFnRunner.java
 ##
 @@ -45,7 +44,7 @@
  * to associate each input with its output. The nonce is represented as an 
opaque set of bytes.
  */
 public class WindowMappingFnRunner {
-  static final String URN = 
BeamUrns.getUrn(StandardPTransforms.Primitives.MAP_WINDOWS);
+  static final String URN = PTransformTranslation.MERGE_WINDOWS_TRANSFORM_URN;
 
 Review comment:
   Is this change intended?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368778)
Time Spent: 3h  (was: 2h 50m)

> Migrate PTransformTranslation to use string constants
> -
>
> Key: BEAM-9059
> URL: https://issues.apache.org/jira/browse/BEAM-9059
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Trivial
>  Time Spent: 3h
>  Remaining Estimate: 0h
>
> This allows for the values to be used within switch case statements.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368775=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368775
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:26
Start Date: 09/Jan/20 02:26
Worklog Time Spent: 10m 
  Work Description: chadrik commented on issue #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#issuecomment-572352389
 
 
   btw, I made some edits to my answers to clarify them, so you should continue 
the review via github rather than email.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368775)
Time Spent: 40h 50m  (was: 40h 40m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 40h 50m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368773=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368773
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:25
Start Date: 09/Jan/20 02:25
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364532942
 
 

 ##
 File path: sdks/python/apache_beam/coders/coder_impl.py
 ##
 @@ -907,7 +907,8 @@ def encode_to_stream(self, value, out, nested):
   buffer = create_OutputStream()
   if (self._write_state is not None
   and out.size() - start_size > self._write_state_threshold):
-tail = (value_iter[index + 1:] if isinstance(value, (list, tuple))
+tail = (value_iter[index + 1:]
+if isinstance(value_iter, (list, tuple))
 
 Review comment:
   It protects against this error:
   
   ```
   apache_beam/coders/coder_impl.py:910: error: Value of type 
"Union[Iterator[Any], List[Any], Tuple[Any, ...]]" is not indexable  [index]
   ```
   
   The original code was using the type of `value` to implicitly determine the 
type of `value_iter`.  This confuses humans and linters :)   
   
   By using `isinstance(value_iter, ...)` instead, mypy knows to reduce its 
type from `Union[Iterator[Any], List[Any], Tuple[Any, ...]]` down to 
`Union[List[Any], Tuple[Any, ...]]` before the `__getitem__` code is checked.
   
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368773)
Time Spent: 40.5h  (was: 40h 20m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 40.5h
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368774=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368774
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:25
Start Date: 09/Jan/20 02:25
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364532942
 
 

 ##
 File path: sdks/python/apache_beam/coders/coder_impl.py
 ##
 @@ -907,7 +907,8 @@ def encode_to_stream(self, value, out, nested):
   buffer = create_OutputStream()
   if (self._write_state is not None
   and out.size() - start_size > self._write_state_threshold):
-tail = (value_iter[index + 1:] if isinstance(value, (list, tuple))
+tail = (value_iter[index + 1:]
+if isinstance(value_iter, (list, tuple))
 
 Review comment:
   It protects against this error:
   
   ```
   apache_beam/coders/coder_impl.py:910: error: Value of type 
"Union[Iterator[Any], List[Any], Tuple[Any, ...]]" is not indexable  [index]
   ```
   
   The original code was using the type of `value` to implicitly determine the 
type of `value_iter`.  This confuses humans and linters :)   
   
   By using `isinstance` against `value_iter` instead, mypy knows to reduce its 
type from `Union[Iterator[Any], List[Any], Tuple[Any, ...]]` down to 
`Union[List[Any], Tuple[Any, ...]]` before the `__getitem__` code is checked.
   
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368774)
Time Spent: 40h 40m  (was: 40.5h)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 40h 40m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368772=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368772
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:25
Start Date: 09/Jan/20 02:25
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364532942
 
 

 ##
 File path: sdks/python/apache_beam/coders/coder_impl.py
 ##
 @@ -907,7 +907,8 @@ def encode_to_stream(self, value, out, nested):
   buffer = create_OutputStream()
   if (self._write_state is not None
   and out.size() - start_size > self._write_state_threshold):
-tail = (value_iter[index + 1:] if isinstance(value, (list, tuple))
+tail = (value_iter[index + 1:]
+if isinstance(value_iter, (list, tuple))
 
 Review comment:
   It protects against this error:
   
   ```
   apache_beam/coders/coder_impl.py:910: error: Value of type 
"Union[Iterator[Any], List[Any], Tuple[Any, ...]]" is not indexable  [index]
   ```
   
   The original code was using the type of `value` to implicitly determine the 
type of `value_iter`.  This confuses humans and linters :)   By using 
`isinstance(value_iter, ...)` mypy knows to reduce its type from 
`Union[Iterator[Any], List[Any], Tuple[Any, ...]]` down to `Union[List[Any], 
Tuple[Any, ...]]` before the `__getitem__` code is checked.
   
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368772)
Time Spent: 40h 20m  (was: 40h 10m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 40h 20m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368771=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368771
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:24
Start Date: 09/Jan/20 02:24
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364532942
 
 

 ##
 File path: sdks/python/apache_beam/coders/coder_impl.py
 ##
 @@ -907,7 +907,8 @@ def encode_to_stream(self, value, out, nested):
   buffer = create_OutputStream()
   if (self._write_state is not None
   and out.size() - start_size > self._write_state_threshold):
-tail = (value_iter[index + 1:] if isinstance(value, (list, tuple))
+tail = (value_iter[index + 1:]
+if isinstance(value_iter, (list, tuple))
 
 Review comment:
   It protects against this error:
   
   ```
   apache_beam/coders/coder_impl.py:910: error: Value of type 
"Union[Iterator[Any], List[Any], Tuple[Any, ...]]" is not indexable  [index]
   ```
   
   The original code was using the type of `value` to implicitly determine the 
type of `value_iter`.  This confuses humans and linters :)   By using 
`value_iter` mypy knows to reduce `Union[Iterator[Any], List[Any], Tuple[Any, 
...]]` down to `Union[List[Any], Tuple[Any, ...]]` before the `__getitem__` 
code is checked.
   
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368771)
Time Spent: 40h 10m  (was: 40h)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 40h 10m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368770=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368770
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:23
Start Date: 09/Jan/20 02:23
Worklog Time Spent: 10m 
  Work Description: chadrik commented on issue #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#issuecomment-572351739
 
 
   @udim  thanks for the review!  very good questions. 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368770)
Time Spent: 40h  (was: 39h 50m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 40h
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368769=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368769
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:20
Start Date: 09/Jan/20 02:20
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364533360
 
 

 ##
 File path: sdks/python/apache_beam/runners/common.py
 ##
 @@ -879,6 +892,7 @@ def process(self, windowed_value):
   return self.do_fn_invoker.invoke_process(windowed_value)
 except BaseException as exn:
   self._reraise_augmented(exn)
+  return None
 
 Review comment:
   same :)
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368769)
Time Spent: 39h 50m  (was: 39h 40m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 39h 50m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368768=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368768
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:20
Start Date: 09/Jan/20 02:20
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364533315
 
 

 ##
 File path: sdks/python/apache_beam/runners/common.py
 ##
 @@ -793,6 +804,8 @@ def current_element_progress(self):
 restriction_tracker = self.threadsafe_restriction_tracker
 if restriction_tracker:
   return restriction_tracker.current_progress()
+else:
+  return None
 
 Review comment:
   yup, required
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368768)
Time Spent: 39h 40m  (was: 39.5h)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 39h 40m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9074) Make it visible on Beam website that Beam signed a pledge to sunset Python 2 offering in 2020 and future versions of Beam will not support Python 2.

2020-01-08 Thread Valentyn Tymofieiev (Jira)
Valentyn Tymofieiev created BEAM-9074:
-

 Summary: Make it visible on Beam website that Beam signed a pledge 
to sunset Python 2 offering in 2020 and future versions of Beam will not 
support Python 2. 
 Key: BEAM-9074
 URL: https://issues.apache.org/jira/browse/BEAM-9074
 Project: Beam
  Issue Type: Improvement
  Components: website
Reporter: Valentyn Tymofieiev
Assignee: Rose Nguyen






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368767=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368767
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:19
Start Date: 09/Jan/20 02:19
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364532942
 
 

 ##
 File path: sdks/python/apache_beam/coders/coder_impl.py
 ##
 @@ -907,7 +907,8 @@ def encode_to_stream(self, value, out, nested):
   buffer = create_OutputStream()
   if (self._write_state is not None
   and out.size() - start_size > self._write_state_threshold):
-tail = (value_iter[index + 1:] if isinstance(value, (list, tuple))
+tail = (value_iter[index + 1:]
+if isinstance(value_iter, (list, tuple))
 
 Review comment:
   
   It protects against this error:
   
   ```
   apache_beam/coders/coder_impl.py:910: error: Value of type 
"Union[Iterator[Any], List[Any], Tuple[Any, ...]]" is not indexable  [index]
   ```
   
   The code is using the type of `value` to implicitly determine the type of 
`value_iter`.  This confuses humans and linters :)   By using `value_iter` mypy 
knows to reduce `Union[Iterator[Any], List[Any], Tuple[Any, ...]]` down to 
`Union[List[Any], Tuple[Any, ...]]` before the `__getitem__` code is checked.
   
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368767)
Time Spent: 39.5h  (was: 39h 20m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 39.5h
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9073) PipelineVisitor is topologically-order dependent

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9073?focusedWorklogId=368765=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368765
 ]

ASF GitHub Bot logged work on BEAM-9073:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:15
Start Date: 09/Jan/20 02:15
Worklog Time Spent: 10m 
  Work Description: rohdesamuel commented on pull request #10541: 
[BEAM-9073] Fixes order-dependence in PipelineVisitor
URL: https://github.com/apache/beam/pull/10541
 
 
   The Python PipelineVisitor is topologically-order dependent and can visit 
the same transform multiple times. The fix is to journal each transform we 
visit.
   
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [ ] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Gearpump | Samza | Spark
   --- | --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 

[jira] [Work logged] (BEAM-9073) PipelineVisitor is topologically-order dependent

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9073?focusedWorklogId=368766=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368766
 ]

ASF GitHub Bot logged work on BEAM-9073:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:15
Start Date: 09/Jan/20 02:15
Worklog Time Spent: 10m 
  Work Description: rohdesamuel commented on issue #10541: [BEAM-9073] 
Fixes order-dependence in PipelineVisitor
URL: https://github.com/apache/beam/pull/10541#issuecomment-572349674
 
 
   R: @lukecwik can you review this please?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368766)
Time Spent: 20m  (was: 10m)

> PipelineVisitor is topologically-order dependent
> 
>
> Key: BEAM-9073
> URL: https://issues.apache.org/jira/browse/BEAM-9073
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Sam Rohde
>Assignee: Sam Rohde
>Priority: Major
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> The Python PipelineVisitor is topologically-order dependent and can visit the 
> same transform multiple times.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368764=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368764
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:13
Start Date: 09/Jan/20 02:13
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364531963
 
 

 ##
 File path: sdks/python/apache_beam/pvalue.py
 ##
 @@ -82,7 +82,7 @@ class PValue(object):
   """
 
   def __init__(self,
-   pipeline,  # type: Pipeline
+   pipeline,  # type: Optional[Pipeline]
 
 Review comment:
   
   Yes, here:
   
   ```python
   class PCollection(PValue, Generic[T]):
 [...]
 @staticmethod
 def from_runner_api(proto, context):
   # type: (beam_runner_api_pb2.PCollection, PipelineContext) -> PCollection
   # Producer and tag will be filled in later, the key point is that the
   # same object is returned for the same pcollection id.
   return PCollection(
   None,
   element_type=context.element_type_from_coder_id(proto.coder_id),
   windowing=context.windowing_strategies.get_by_id(
   proto.windowing_strategy_id),
   is_bounded=proto.is_bounded == beam_runner_api_pb2.IsBounded.BOUNDED)
   ```
   
   It would be nice if we could get rid of this case somehow, because by making 
this optional we have to deal with the possibility of `pvalue.pipeline` being 
`None` throughout the code base.  I went back and forth on whether to make the 
arg optional or simply ignore the error in the method above, but I think I 
decided that the method above was a common case and thus we needed the 
protection against None-values throughout the code.
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368764)
Time Spent: 39h 20m  (was: 39h 10m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 39h 20m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9073) PipelineVisitor is topologically-order dependent

2020-01-08 Thread Sam Rohde (Jira)
Sam Rohde created BEAM-9073:
---

 Summary: PipelineVisitor is topologically-order dependent
 Key: BEAM-9073
 URL: https://issues.apache.org/jira/browse/BEAM-9073
 Project: Beam
  Issue Type: Bug
  Components: sdk-py-core
Reporter: Sam Rohde
Assignee: Sam Rohde


The Python PipelineVisitor is topologically-order dependent and can visit the 
same transform multiple times.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8575) Add more Python validates runner tests

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8575?focusedWorklogId=368762=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368762
 ]

ASF GitHub Bot logged work on BEAM-8575:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:07
Start Date: 09/Jan/20 02:07
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #10190: [BEAM-8575] Added 
two unit tests to CombineTest class to test that Co…
URL: https://github.com/apache/beam/pull/10190#issuecomment-572347542
 
 
   Run Python PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368762)
Time Spent: 43h 20m  (was: 43h 10m)

> Add more Python validates runner tests
> --
>
> Key: BEAM-8575
> URL: https://issues.apache.org/jira/browse/BEAM-8575
> Project: Beam
>  Issue Type: Test
>  Components: sdk-py-core, testing
>Reporter: wendy liu
>Assignee: wendy liu
>Priority: Major
>  Time Spent: 43h 20m
>  Remaining Estimate: 0h
>
> This is the umbrella issue to track the work of adding more Python tests to 
> improve test coverage.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8575) Add more Python validates runner tests

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8575?focusedWorklogId=368761=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368761
 ]

ASF GitHub Bot logged work on BEAM-8575:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:06
Start Date: 09/Jan/20 02:06
Worklog Time Spent: 10m 
  Work Description: angoenka commented on issue #10190: [BEAM-8575] Added 
two unit tests to CombineTest class to test that Co…
URL: https://github.com/apache/beam/pull/10190#issuecomment-572347325
 
 
   Retest this please
   
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368761)
Time Spent: 43h 10m  (was: 43h)

> Add more Python validates runner tests
> --
>
> Key: BEAM-8575
> URL: https://issues.apache.org/jira/browse/BEAM-8575
> Project: Beam
>  Issue Type: Test
>  Components: sdk-py-core, testing
>Reporter: wendy liu
>Assignee: wendy liu
>Priority: Major
>  Time Spent: 43h 10m
>  Remaining Estimate: 0h
>
> This is the umbrella issue to track the work of adding more Python tests to 
> improve test coverage.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9059) Migrate PTransformTranslation to use string constants

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9059?focusedWorklogId=368757=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368757
 ]

ASF GitHub Bot logged work on BEAM-9059:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:05
Start Date: 09/Jan/20 02:05
Worklog Time Spent: 10m 
  Work Description: sunjincheng121 commented on issue #10521: [BEAM-9059] 
Use string constants in PTransformTranslation instead of getUrn()
URL: https://github.com/apache/beam/pull/10521#issuecomment-572347043
 
 
   Run JavaPortabilityApi PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368757)
Time Spent: 2.5h  (was: 2h 20m)

> Migrate PTransformTranslation to use string constants
> -
>
> Key: BEAM-9059
> URL: https://issues.apache.org/jira/browse/BEAM-9059
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Trivial
>  Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> This allows for the values to be used within switch case statements.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9059) Migrate PTransformTranslation to use string constants

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9059?focusedWorklogId=368758=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368758
 ]

ASF GitHub Bot logged work on BEAM-9059:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:05
Start Date: 09/Jan/20 02:05
Worklog Time Spent: 10m 
  Work Description: sunjincheng121 commented on issue #10521: [BEAM-9059] 
Use string constants in PTransformTranslation instead of getUrn()
URL: https://github.com/apache/beam/pull/10521#issuecomment-572347074
 
 
   Run Python2_PVR_Flink PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368758)
Time Spent: 2h 40m  (was: 2.5h)

> Migrate PTransformTranslation to use string constants
> -
>
> Key: BEAM-9059
> URL: https://issues.apache.org/jira/browse/BEAM-9059
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Trivial
>  Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> This allows for the values to be used within switch case statements.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9059) Migrate PTransformTranslation to use string constants

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9059?focusedWorklogId=368760=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368760
 ]

ASF GitHub Bot logged work on BEAM-9059:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:05
Start Date: 09/Jan/20 02:05
Worklog Time Spent: 10m 
  Work Description: sunjincheng121 commented on issue #10521: [BEAM-9059] 
Use string constants in PTransformTranslation instead of getUrn()
URL: https://github.com/apache/beam/pull/10521#issuecomment-572346917
 
 
   LGTM. +1 to merge once the jenkins turns green.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368760)
Time Spent: 2h 50m  (was: 2h 40m)

> Migrate PTransformTranslation to use string constants
> -
>
> Key: BEAM-9059
> URL: https://issues.apache.org/jira/browse/BEAM-9059
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Trivial
>  Time Spent: 2h 50m
>  Remaining Estimate: 0h
>
> This allows for the values to be used within switch case statements.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9059) Migrate PTransformTranslation to use string constants

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9059?focusedWorklogId=368756=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368756
 ]

ASF GitHub Bot logged work on BEAM-9059:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:05
Start Date: 09/Jan/20 02:05
Worklog Time Spent: 10m 
  Work Description: sunjincheng121 commented on issue #10521: [BEAM-9059] 
Use string constants in PTransformTranslation instead of getUrn()
URL: https://github.com/apache/beam/pull/10521#issuecomment-572346993
 
 
   Run Java PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368756)
Time Spent: 2h 20m  (was: 2h 10m)

> Migrate PTransformTranslation to use string constants
> -
>
> Key: BEAM-9059
> URL: https://issues.apache.org/jira/browse/BEAM-9059
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Trivial
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
> This allows for the values to be used within switch case statements.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9059) Migrate PTransformTranslation to use string constants

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9059?focusedWorklogId=368755=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368755
 ]

ASF GitHub Bot logged work on BEAM-9059:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:04
Start Date: 09/Jan/20 02:04
Worklog Time Spent: 10m 
  Work Description: sunjincheng121 commented on issue #10521: [BEAM-9059] 
Use string constants in PTransformTranslation instead of getUrn()
URL: https://github.com/apache/beam/pull/10521#issuecomment-572346917
 
 
   LGTM. +1 to merge once the jenkins turns green.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368755)
Time Spent: 2h 10m  (was: 2h)

> Migrate PTransformTranslation to use string constants
> -
>
> Key: BEAM-9059
> URL: https://issues.apache.org/jira/browse/BEAM-9059
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Luke Cwik
>Assignee: Luke Cwik
>Priority: Trivial
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> This allows for the values to be used within switch case statements.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368754=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368754
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:04
Start Date: 09/Jan/20 02:04
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364518511
 
 

 ##
 File path: sdks/python/apache_beam/utils/profiler.py
 ##
 @@ -105,6 +105,7 @@ def create_profiler(profile_id, **kwargs):
 if random.random() < options.profile_sample_rate:
   return Profile(profile_id, options.profile_location, **kwargs)
   return create_profiler
+return None
 
 Review comment:
   if a function returns an optional result (i.e. `Union[None, X]`) then mypy 
requires that you explicitly return `None` for the optional case(s).
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368754)
Time Spent: 39h 10m  (was: 39h)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 39h 10m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368753=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368753
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:02
Start Date: 09/Jan/20 02:02
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364529725
 
 

 ##
 File path: sdks/python/apache_beam/transforms/sideinputs.py
 ##
 @@ -58,6 +59,16 @@ def map_via_end(source_window):
   return map_via_end
 
 
+def get_sideinput_index(tag):
+  # type: (str) -> int
+  match = re.match('side([0-9]+)(-.*)?$', tag,
+   re.DOTALL)
 
 Review comment:
   There were three usages of this regex.  One used `re.DOTALL` so I went with 
the most permissive. 
   
   - transforms.core:
   
   ```python
   int(re.match('side([0-9]+)(-.*)?$', tag).group(1))
   ```
   
   - runners.worker.bundle_processor: 
   
   ```python
   int(re.match('side([0-9]+)(-.*)?$', tag, re.DOTALL).group(1))
   ```
   
   - pipeline:
   
   ```python
   int(re.match('side([0-9]+)(-.*)?$', tag).group(1))
   ```
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368753)
Time Spent: 39h  (was: 38h 50m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 39h
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368752=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368752
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 02:01
Start Date: 09/Jan/20 02:01
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364529725
 
 

 ##
 File path: sdks/python/apache_beam/transforms/sideinputs.py
 ##
 @@ -58,6 +59,16 @@ def map_via_end(source_window):
   return map_via_end
 
 
+def get_sideinput_index(tag):
+  # type: (str) -> int
+  match = re.match('side([0-9]+)(-.*)?$', tag,
+   re.DOTALL)
 
 Review comment:
   There were three usages of this regex.  One used `re.DOTALL` so I went with 
the most permissive. 
   
   - transforms.core:
   
   ```python
int(re.match('side([0-9]+)(-.*)?$', tag).group(1))
   ```
   
   - runners.worker.bundle_processor: 
   
   ```python
   int(re.match('side([0-9]+)(-.*)?$', tag, re.DOTALL).group(1))
   ```
   
   - pipeline:
   
   ```python
   int(re.match('side([0-9]+)(-.*)?$', tag).group(1))
   ```
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368752)
Time Spent: 38h 50m  (was: 38h 40m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 38h 50m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8575) Add more Python validates runner tests

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8575?focusedWorklogId=368747=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368747
 ]

ASF GitHub Bot logged work on BEAM-8575:


Author: ASF GitHub Bot
Created on: 09/Jan/20 01:52
Start Date: 09/Jan/20 01:52
Worklog Time Spent: 10m 
  Work Description: bumblebee-coming commented on issue #10190: [BEAM-8575] 
Added two unit tests to CombineTest class to test that Co…
URL: https://github.com/apache/beam/pull/10190#issuecomment-572344103
 
 
   Retest this please
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368747)
Time Spent: 43h  (was: 42h 50m)

> Add more Python validates runner tests
> --
>
> Key: BEAM-8575
> URL: https://issues.apache.org/jira/browse/BEAM-8575
> Project: Beam
>  Issue Type: Test
>  Components: sdk-py-core, testing
>Reporter: wendy liu
>Assignee: wendy liu
>Priority: Major
>  Time Spent: 43h
>  Remaining Estimate: 0h
>
> This is the umbrella issue to track the work of adding more Python tests to 
> improve test coverage.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368746=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368746
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 01:50
Start Date: 09/Jan/20 01:50
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364527420
 
 

 ##
 File path: sdks/python/apache_beam/portability/__init__.py
 ##
 @@ -17,3 +17,18 @@
 
 """For internal use only; no backwards-compatibility guarantees."""
 from __future__ import absolute_import
+
+from typing import TYPE_CHECKING
+from typing import NamedTuple
+
+if TYPE_CHECKING:
+  from apache_beam.portability.api import metrics_pb2
+
+
+PropertiesFromEnumValue = NamedTuple(
 
 Review comment:
   yeah, I typically follow that convention as well.  In this case, putting 
`PropertiesFromEnumValue ` in common_urns.py would cause an import cycle, 
because it imports the _urn.py files which need access to 
`PropertiesFromEnumValue`.  
   
   I considered putting `PropertiesFromEnumValue` in a new module, but it felt 
like overkill.  Happy to do so, if it's preferable. 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368746)
Time Spent: 38h 40m  (was: 38.5h)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 38h 40m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368745=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368745
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 01:50
Start Date: 09/Jan/20 01:50
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364527420
 
 

 ##
 File path: sdks/python/apache_beam/portability/__init__.py
 ##
 @@ -17,3 +17,18 @@
 
 """For internal use only; no backwards-compatibility guarantees."""
 from __future__ import absolute_import
+
+from typing import TYPE_CHECKING
+from typing import NamedTuple
+
+if TYPE_CHECKING:
+  from apache_beam.portability.api import metrics_pb2
+
+
+PropertiesFromEnumValue = NamedTuple(
 
 Review comment:
   yeah, I typically follow that convention as well.  In this case, 
common_urns.py would cause an import cycle, because it imports the _urn.py 
files which need access to `PropertiesFromEnumValue`.  
   
   I considered putting `PropertiesFromEnumValue` in a new module, but it felt 
like overkill.  Happy to do so, if it's preferable. 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368745)
Time Spent: 38.5h  (was: 38h 20m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 38.5h
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9030) Bump grpc to 1.26.0

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9030?focusedWorklogId=368744=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368744
 ]

ASF GitHub Bot logged work on BEAM-9030:


Author: ASF GitHub Bot
Created on: 09/Jan/20 01:43
Start Date: 09/Jan/20 01:43
Worklog Time Spent: 10m 
  Work Description: sunjincheng121 commented on issue #10463: [BEAM-9030] 
Bump grpc to 1.26.0
URL: https://github.com/apache/beam/pull/10463#issuecomment-572341918
 
 
   Thank you for all of your review and merging!
   
   I have bring up the discussion about gRPC Vendor Release, the detail can be 
found in [1]. We can open the second half of the upgrade  PR once the release 
is out.
   
   [1] 
https://lists.apache.org/thread.html/7e4de745a224efc24fd5c82737160ef050cbd18a99b45c4e3434fd56%40%3Cdev.beam.apache.org%3E
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368744)
Time Spent: 6h  (was: 5h 50m)

> Bump grpc to 1.26.0
> ---
>
> Key: BEAM-9030
> URL: https://issues.apache.org/jira/browse/BEAM-9030
> Project: Beam
>  Issue Type: Bug
>  Components: java-fn-execution, runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 6h
>  Remaining Estimate: 0h
>
> When submitting a Python word count job to a Flink session/standalone cluster 
> repeatedly, the meta space usage of the task manager of the Flink cluster 
> will continuously increase (about 40MB each time). The reason is that the 
> Beam classes are loaded with the user class loader in Flink and there are 
> problems with the implementation of `ProcessManager`(from Beam) and 
> `ThreadPoolCache`(from netty) which may cause the user class loader could not 
> be garbage collected even after the job finished which causes the meta space 
> memory leak eventually. You can refer to FLINK-15338[1] for more information.
> Regarding to `ProcessManager`, I have created a JIRA BEAM-9006[2] to track 
> it. Regarding to `ThreadPoolCache`, it is a Netty problem and has been fixed 
> in NETTY#8955[3]. Netty 4.1.35 Final has already included this fix and GRPC 
> 1.22.0 has already dependents on Netty 4.1.35 Final. So we need to bump the 
> version of GRPC to 1.22.0+ (currently 1.21.0).
>  
> What do you think?
> [1] https://issues.apache.org/jira/browse/FLINK-15338
> [2] https://issues.apache.org/jira/browse/BEAM-9006
> [3] [https://github.com/netty/netty/pull/8955]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8575) Add more Python validates runner tests

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8575?focusedWorklogId=368743=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368743
 ]

ASF GitHub Bot logged work on BEAM-8575:


Author: ASF GitHub Bot
Created on: 09/Jan/20 01:41
Start Date: 09/Jan/20 01:41
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10277: [BEAM-8575] 
Reenable passing VR tests.
URL: https://github.com/apache/beam/pull/10277#issuecomment-572341387
 
 
   Run Python Dataflow ValidatesRunner
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368743)
Time Spent: 42h 50m  (was: 42h 40m)

> Add more Python validates runner tests
> --
>
> Key: BEAM-8575
> URL: https://issues.apache.org/jira/browse/BEAM-8575
> Project: Beam
>  Issue Type: Test
>  Components: sdk-py-core, testing
>Reporter: wendy liu
>Assignee: wendy liu
>Priority: Major
>  Time Spent: 42h 50m
>  Remaining Estimate: 0h
>
> This is the umbrella issue to track the work of adding more Python tests to 
> improve test coverage.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9051) BigQueryUtils toBeamRow to support GenericRecords with millisecond precision timestamps

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9051?focusedWorklogId=368742=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368742
 ]

ASF GitHub Bot logged work on BEAM-9051:


Author: ASF GitHub Bot
Created on: 09/Jan/20 01:35
Start Date: 09/Jan/20 01:35
Worklog Time Spent: 10m 
  Work Description: RyanBerti commented on issue #10540: [BEAM-9051] 
BigQueryUtils toBeamRow support the Avro timestamp-millis logical type
URL: https://github.com/apache/beam/pull/10540#issuecomment-572339923
 
 
   R: @lukecwik
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368742)
Time Spent: 20m  (was: 10m)

> BigQueryUtils toBeamRow to support GenericRecords with millisecond precision 
> timestamps
> ---
>
> Key: BEAM-9051
> URL: https://issues.apache.org/jira/browse/BEAM-9051
> Project: Beam
>  Issue Type: Improvement
>  Components: io-java-gcp
>Affects Versions: 2.16.0
>Reporter: Ryan Berti
>Assignee: Ryan Berti
>Priority: Minor
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Currently, BigQueryUtils assumes all Avro GenericRecords which utilize 
> timestamp fields include timestamp data in microseconds when converting from 
> GenericRecords to Beam Rows 
> ([https://github.com/apache/beam/blob/master/sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/bigquery/BigQueryUtils.java#L568]).
>  The implementation should also support timestamp data in milliseconds; this 
> should be feasible by referencing the LogicalType associated with the avro 
> field to determine if the underlying value is represented in millis or micros.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9051) BigQueryUtils toBeamRow to support GenericRecords with millisecond precision timestamps

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9051?focusedWorklogId=368741=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368741
 ]

ASF GitHub Bot logged work on BEAM-9051:


Author: ASF GitHub Bot
Created on: 09/Jan/20 01:34
Start Date: 09/Jan/20 01:34
Worklog Time Spent: 10m 
  Work Description: RyanBerti commented on pull request #10540: [BEAM-9051] 
BigQueryUtils toBeamRow support the Avro timestamp-millis logical type
URL: https://github.com/apache/beam/pull/10540
 
 
   This change allows users with Avro GenericRecords that utilize 
non-microsecond timestamp values to utilize BigQueryUtils.toBeamRow to convert 
those GenericRecords into Beam Rows. This functionality was required for our 
implementation, as were generating GenericRecords from Scala case classes and 
didn't have direct access to the implementation which chose the precision of 
the timestamp value to be generated based on Instant fields (these Instant 
fields were converted to milliseconds in our case). Testing shows that the 
functionality works for nested rows and arrays. The implementation should not 
affect any existing use cases as it preserves the original API. 
   
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [ ] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Gearpump | Samza | Spark
   --- | --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 

[jira] [Commented] (BEAM-8496) remove SDF translators in flink streaming transform translator

2020-01-08 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011288#comment-17011288
 ] 

Kai Jiang commented on BEAM-8496:
-

For failed nexmark on flink job, it was fixed by 
[https://github.com/apache/beam/commit/4af507f567605723af940c2af3b2912d0948adf5]
 actually.

> remove SDF translators in flink streaming transform translator
> --
>
> Key: BEAM-8496
> URL: https://issues.apache.org/jira/browse/BEAM-8496
> Project: Beam
>  Issue Type: Bug
>  Components: runner-flink
>Reporter: Kai Jiang
>Assignee: Kai Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 7.5h
>  Remaining Estimate: 0h
>
> Since URN of SDF has been moved to runners-core-construction-java, we need to 
> remove it.
> Otherwise, in failed nexmark Jenkins 
> [job|https://builds.apache.org/job/beam_PostCommit_Java_Nexmark_Flink/4128/console],
>  it causes duplicated transformer registered in 
> [PTransformTranslation.KnownTransformPayloadTranslator()|https://github.com/apache/beam/blob/c2f0d282337f3ae0196a7717712396a5a41fdde1/runners/core-construction-java/src/main/java/org/apache/beam/runners/core/construction/PTransformTranslation.java#L290]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=368734=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368734
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 09/Jan/20 01:16
Start Date: 09/Jan/20 01:16
Worklog Time Spent: 10m 
  Work Description: aaltay commented on pull request #10536: [BEAM-7861] 
Add direct_running_mode option for direct runners to switch between 
multi_threading and multi_processing easily
URL: https://github.com/apache/beam/pull/10536#discussion_r364520622
 
 

 ##
 File path: sdks/python/apache_beam/options/pipeline_options.py
 ##
 @@ -438,6 +438,12 @@ def _add_argparse_args(cls, parser):
 type=int,
 default=1,
 help='number of parallel running workers.')
+parser.add_argument(
+'--direct_running_mode',
+default=None,
 
 Review comment:
   Would it make sense to add another choice for `EmbeddedWorkerHandler` and 
change the default to that, instead of defaulting to `None`?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368734)
Time Spent: 40m  (was: 0.5h)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368636=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368636
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 01:07
Start Date: 09/Jan/20 01:07
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364518632
 
 

 ##
 File path: sdks/python/gen_protos.py
 ##
 @@ -47,6 +50,164 @@
 ]
 
 
+def generate_urn_files(log, out_dir):
+  """
+  Create a pyi stub for common_urns
 
 Review comment:
   ah yes.  sorry.  will also add an explanation of how this works. 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368636)
Time Spent: 38h 20m  (was: 38h 10m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 38h 20m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368634=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368634
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 01:06
Start Date: 09/Jan/20 01:06
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364518511
 
 

 ##
 File path: sdks/python/apache_beam/utils/profiler.py
 ##
 @@ -105,6 +105,7 @@ def create_profiler(profile_id, **kwargs):
 if random.random() < options.profile_sample_rate:
   return Profile(profile_id, options.profile_location, **kwargs)
   return create_profiler
+return None
 
 Review comment:
   if a function returns an optional result (i.e. `Union[None, X]`) then mypy 
requires that you explicitly return `None` for the optional case.
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368634)
Time Spent: 38h 10m  (was: 38h)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 38h 10m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8882) Allow Dataflow to automatically choose portability or not.

2020-01-08 Thread Robert Bradshaw (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011180#comment-17011180
 ] 

Robert Bradshaw commented on BEAM-8882:
---

I would word this as "beam.Create no longer preserves order unless
reshuffle=False is passed in as an argument." It's still odd to say,
as PCollections are, generically, unordered, but in this case people
may have assumed order + fusion.



> Allow Dataflow to automatically choose portability or not.
> --
>
> Key: BEAM-8882
> URL: https://issues.apache.org/jira/browse/BEAM-8882
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Robert Bradshaw
>Assignee: Robert Bradshaw
>Priority: Critical
> Fix For: 2.18.0
>
>  Time Spent: 5h 50m
>  Remaining Estimate: 0h
>
> We would like the Dataflow service to be able to automatically choose whether 
> to run pipelines in a portable way. In order to do this, we need to provide 
> more information even if portability is not explicitly requested. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368537=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368537
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:56
Start Date: 09/Jan/20 00:56
Worklog Time Spent: 10m 
  Work Description: chadrik commented on pull request #10367: [BEAM-7746] 
Add python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364516322
 
 

 ##
 File path: sdks/python/gen_protos.py
 ##
 @@ -47,6 +50,164 @@
 ]
 
 
+def generate_urn_files(log, out_dir):
 
 Review comment:
   Here is `beam_job_api_pb2_urn.py`:
   
   ```python
   from builtins import object
   from .. import PropertiesFromEnumValue
   from . import metrics_pb2
   EMPTY_MONITORING_INFO_LABEL_PROPS = metrics_pb2.MonitoringInfoLabelProps()
   EMPTY_MONITORING_INFO_SPEC = metrics_pb2.MonitoringInfoSpec()
   
   class JobMessage(object):
   
 class MessageImportance(object):
   MESSAGE_IMPORTANCE_UNSPECIFIED = PropertiesFromEnumValue(u'', u'', 
EMPTY_MONITORING_INFO_SPEC, EMPTY_MONITORING_INFO_LABEL_PROPS)
   JOB_MESSAGE_DEBUG = PropertiesFromEnumValue(u'', u'', 
EMPTY_MONITORING_INFO_SPEC, EMPTY_MONITORING_INFO_LABEL_PROPS)
   JOB_MESSAGE_DETAILED = PropertiesFromEnumValue(u'', u'', 
EMPTY_MONITORING_INFO_SPEC, EMPTY_MONITORING_INFO_LABEL_PROPS)
   JOB_MESSAGE_BASIC = PropertiesFromEnumValue(u'', u'', 
EMPTY_MONITORING_INFO_SPEC, EMPTY_MONITORING_INFO_LABEL_PROPS)
   JOB_MESSAGE_WARNING = PropertiesFromEnumValue(u'', u'', 
EMPTY_MONITORING_INFO_SPEC, EMPTY_MONITORING_INFO_LABEL_PROPS)
   JOB_MESSAGE_ERROR = PropertiesFromEnumValue(u'', u'', 
EMPTY_MONITORING_INFO_SPEC, EMPTY_MONITORING_INFO_LABEL_PROPS)
   
   
   class JobState(object):
   
 class Enum(object):
   UNSPECIFIED = PropertiesFromEnumValue(u'', u'', 
EMPTY_MONITORING_INFO_SPEC, EMPTY_MONITORING_INFO_LABEL_PROPS)
   STOPPED = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   RUNNING = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   DONE = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   FAILED = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   CANCELLED = PropertiesFromEnumValue(u'', u'', 
EMPTY_MONITORING_INFO_SPEC, EMPTY_MONITORING_INFO_LABEL_PROPS)
   UPDATED = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   DRAINING = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   DRAINED = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   STARTING = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   CANCELLING = PropertiesFromEnumValue(u'', u'', 
EMPTY_MONITORING_INFO_SPEC, EMPTY_MONITORING_INFO_LABEL_PROPS)
   UPDATING = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   
   
   class PipelineOptionType(object):
   
 class Enum(object):
   STRING = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   BOOLEAN = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   INTEGER = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   NUMBER = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   ARRAY = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   OBJECT = PropertiesFromEnumValue(u'', u'', EMPTY_MONITORING_INFO_SPEC, 
EMPTY_MONITORING_INFO_LABEL_PROPS)
   ```
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368537)
Time Spent: 38h  (was: 37h 50m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 38h
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I 

[jira] [Work logged] (BEAM-9030) Bump grpc to 1.26.0

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9030?focusedWorklogId=368534=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368534
 ]

ASF GitHub Bot logged work on BEAM-9030:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:50
Start Date: 09/Jan/20 00:50
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on pull request #10463: [BEAM-9030] 
Bump grpc to 1.26.0
URL: https://github.com/apache/beam/pull/10463
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368534)
Time Spent: 5h 40m  (was: 5.5h)

> Bump grpc to 1.26.0
> ---
>
> Key: BEAM-9030
> URL: https://issues.apache.org/jira/browse/BEAM-9030
> Project: Beam
>  Issue Type: Bug
>  Components: java-fn-execution, runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 5h 40m
>  Remaining Estimate: 0h
>
> When submitting a Python word count job to a Flink session/standalone cluster 
> repeatedly, the meta space usage of the task manager of the Flink cluster 
> will continuously increase (about 40MB each time). The reason is that the 
> Beam classes are loaded with the user class loader in Flink and there are 
> problems with the implementation of `ProcessManager`(from Beam) and 
> `ThreadPoolCache`(from netty) which may cause the user class loader could not 
> be garbage collected even after the job finished which causes the meta space 
> memory leak eventually. You can refer to FLINK-15338[1] for more information.
> Regarding to `ProcessManager`, I have created a JIRA BEAM-9006[2] to track 
> it. Regarding to `ThreadPoolCache`, it is a Netty problem and has been fixed 
> in NETTY#8955[3]. Netty 4.1.35 Final has already included this fix and GRPC 
> 1.22.0 has already dependents on Netty 4.1.35 Final. So we need to bump the 
> version of GRPC to 1.22.0+ (currently 1.21.0).
>  
> What do you think?
> [1] https://issues.apache.org/jira/browse/FLINK-15338
> [2] https://issues.apache.org/jira/browse/BEAM-9006
> [3] [https://github.com/netty/netty/pull/8955]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9030) Bump grpc to 1.26.0

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9030?focusedWorklogId=368535=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368535
 ]

ASF GitHub Bot logged work on BEAM-9030:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:50
Start Date: 09/Jan/20 00:50
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on issue #10463: [BEAM-9030] Bump 
grpc to 1.26.0
URL: https://github.com/apache/beam/pull/10463#issuecomment-572328778
 
 
   Linkage checker reported no errors.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368535)
Time Spent: 5h 50m  (was: 5h 40m)

> Bump grpc to 1.26.0
> ---
>
> Key: BEAM-9030
> URL: https://issues.apache.org/jira/browse/BEAM-9030
> Project: Beam
>  Issue Type: Bug
>  Components: java-fn-execution, runner-flink
>Reporter: sunjincheng
>Assignee: sunjincheng
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 5h 50m
>  Remaining Estimate: 0h
>
> When submitting a Python word count job to a Flink session/standalone cluster 
> repeatedly, the meta space usage of the task manager of the Flink cluster 
> will continuously increase (about 40MB each time). The reason is that the 
> Beam classes are loaded with the user class loader in Flink and there are 
> problems with the implementation of `ProcessManager`(from Beam) and 
> `ThreadPoolCache`(from netty) which may cause the user class loader could not 
> be garbage collected even after the job finished which causes the meta space 
> memory leak eventually. You can refer to FLINK-15338[1] for more information.
> Regarding to `ProcessManager`, I have created a JIRA BEAM-9006[2] to track 
> it. Regarding to `ThreadPoolCache`, it is a Netty problem and has been fixed 
> in NETTY#8955[3]. Netty 4.1.35 Final has already included this fix and GRPC 
> 1.22.0 has already dependents on Netty 4.1.35 Final. So we need to bump the 
> version of GRPC to 1.22.0+ (currently 1.21.0).
>  
> What do you think?
> [1] https://issues.apache.org/jira/browse/FLINK-15338
> [2] https://issues.apache.org/jira/browse/BEAM-9006
> [3] [https://github.com/netty/netty/pull/8955]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9065) Spark runner accumulates metrics (incorrectly) between runs

2020-01-08 Thread Udi Meiri (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011100#comment-17011100
 ] 

Udi Meiri commented on BEAM-9065:
-

2.18.0 has been cut 5 weeks ago. Let's push this fix to 2.19.0 and this as a 
Known Issue to the release notes.

> Spark runner accumulates metrics (incorrectly) between runs
> ---
>
> Key: BEAM-9065
> URL: https://issues.apache.org/jira/browse/BEAM-9065
> Project: Beam
>  Issue Type: Bug
>  Components: runner-spark
>Reporter: Etienne Chauchot
>Assignee: Etienne Chauchot
>Priority: Major
> Fix For: 2.18.0
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> When pipeline.run() is called, MetricsAccumulator (wrapper of 
> MetricsContainerStepMap spark accumulator) is initialized. Spark needs this 
> class to be a singleton for failover. The problem is that when several 
> pipelines are run inside the same JVM, the initialization of 
> MetricsAccumulator singleton does not reset the underlying spark accumulator  
> causing metrics to be accumulated between runs.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8882) Allow Dataflow to automatically choose portability or not.

2020-01-08 Thread Udi Meiri (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011099#comment-17011099
 ] 

Udi Meiri commented on BEAM-8882:
-

I think the best thing here is to add a release note telling:
Users running pipeline with beam.Create using DirectRunner (such as in tests) 
may need to add the keyword reshuffle=False to Create() if they're seeing weird 
behavior.
Reshuffle use in Create has the effect of randomizing the order of the 
resulting PCollection.


> Allow Dataflow to automatically choose portability or not.
> --
>
> Key: BEAM-8882
> URL: https://issues.apache.org/jira/browse/BEAM-8882
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Robert Bradshaw
>Assignee: Robert Bradshaw
>Priority: Critical
> Fix For: 2.18.0
>
>  Time Spent: 5h 50m
>  Remaining Estimate: 0h
>
> We would like the Dataflow service to be able to automatically choose whether 
> to run pipelines in a portable way. In order to do this, we need to provide 
> more information even if portability is not explicitly requested. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8951) Stop using nose in load tests

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8951?focusedWorklogId=368527=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368527
 ]

ASF GitHub Bot logged work on BEAM-8951:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:35
Start Date: 09/Jan/20 00:35
Worklog Time Spent: 10m 
  Work Description: Ardagan commented on issue #10435: [BEAM-8951] Stop 
using nose in load tests
URL: https://github.com/apache/beam/pull/10435#issuecomment-572325201
 
 
   Run Load Tests Java Combine Portable Flink Batch
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368527)
Time Spent: 6h  (was: 5h 50m)

> Stop using nose in load tests
> -
>
> Key: BEAM-8951
> URL: https://issues.apache.org/jira/browse/BEAM-8951
> Project: Beam
>  Issue Type: Sub-task
>  Components: testing
>Reporter: Kamil Wasilewski
>Assignee: Kamil Wasilewski
>Priority: Major
> Fix For: Not applicable
>
>  Time Spent: 6h
>  Remaining Estimate: 0h
>
> The community is considering moving away from nose to pytest: 
> https://issues.apache.org/jira/browse/BEAM-3713. We should change the way of 
> running Python load tests: instead of being subclasses of 
> `unittest.TestCase`, they could be plain Python scripts, just like wordcount 
> examples. This will bring one additional benefit: _LOAD_TEST_ENABLED_ guard 
> will be no longer needed and could be safely removed



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8575) Add more Python validates runner tests

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8575?focusedWorklogId=368524=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368524
 ]

ASF GitHub Bot logged work on BEAM-8575:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:34
Start Date: 09/Jan/20 00:34
Worklog Time Spent: 10m 
  Work Description: chamikaramj commented on issue #10159: [BEAM-8575] 
Added a unit test to CombineTest class to test that Combi…
URL: https://github.com/apache/beam/pull/10159#issuecomment-572324905
 
 
   Retest this please
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368524)
Time Spent: 42.5h  (was: 42h 20m)

> Add more Python validates runner tests
> --
>
> Key: BEAM-8575
> URL: https://issues.apache.org/jira/browse/BEAM-8575
> Project: Beam
>  Issue Type: Test
>  Components: sdk-py-core, testing
>Reporter: wendy liu
>Assignee: wendy liu
>Priority: Major
>  Time Spent: 42.5h
>  Remaining Estimate: 0h
>
> This is the umbrella issue to track the work of adding more Python tests to 
> improve test coverage.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8575) Add more Python validates runner tests

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8575?focusedWorklogId=368523=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368523
 ]

ASF GitHub Bot logged work on BEAM-8575:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:34
Start Date: 09/Jan/20 00:34
Worklog Time Spent: 10m 
  Work Description: chamikaramj commented on issue #10190: [BEAM-8575] 
Added two unit tests to CombineTest class to test that Co…
URL: https://github.com/apache/beam/pull/10190#issuecomment-572324861
 
 
   Retest this please
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368523)
Time Spent: 42h 20m  (was: 42h 10m)

> Add more Python validates runner tests
> --
>
> Key: BEAM-8575
> URL: https://issues.apache.org/jira/browse/BEAM-8575
> Project: Beam
>  Issue Type: Test
>  Components: sdk-py-core, testing
>Reporter: wendy liu
>Assignee: wendy liu
>Priority: Major
>  Time Spent: 42h 20m
>  Remaining Estimate: 0h
>
> This is the umbrella issue to track the work of adding more Python tests to 
> improve test coverage.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8575) Add more Python validates runner tests

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8575?focusedWorklogId=368526=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368526
 ]

ASF GitHub Bot logged work on BEAM-8575:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:34
Start Date: 09/Jan/20 00:34
Worklog Time Spent: 10m 
  Work Description: chamikaramj commented on issue #10159: [BEAM-8575] 
Added a unit test to CombineTest class to test that Combi…
URL: https://github.com/apache/beam/pull/10159#issuecomment-572324976
 
 
   Retest this please
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368526)
Time Spent: 42h 40m  (was: 42.5h)

> Add more Python validates runner tests
> --
>
> Key: BEAM-8575
> URL: https://issues.apache.org/jira/browse/BEAM-8575
> Project: Beam
>  Issue Type: Test
>  Components: sdk-py-core, testing
>Reporter: wendy liu
>Assignee: wendy liu
>Priority: Major
>  Time Spent: 42h 40m
>  Remaining Estimate: 0h
>
> This is the umbrella issue to track the work of adding more Python tests to 
> improve test coverage.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368517=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368517
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:31
Start Date: 09/Jan/20 00:31
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364492849
 
 

 ##
 File path: sdks/python/apache_beam/utils/profiler.py
 ##
 @@ -105,6 +105,7 @@ def create_profiler(profile_id, **kwargs):
 if random.random() < options.profile_sample_rate:
   return Profile(profile_id, options.profile_location, **kwargs)
   return create_profiler
+return None
 
 Review comment:
   Isn't this redundant?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368517)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 37h 40m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368513=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368513
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:31
Start Date: 09/Jan/20 00:31
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364402749
 
 

 ##
 File path: sdks/python/gen_protos.py
 ##
 @@ -47,6 +50,164 @@
 ]
 
 
+def generate_urn_files(log, out_dir):
+  """
+  Create a pyi stub for common_urns
 
 Review comment:
   Same comment from parent PR:
   Please update the docstring.
   IIUC, this does not generate a pyi stub, but several _urns.py files that are 
imported by common_urns.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368513)
Time Spent: 37h 10m  (was: 37h)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 37h 10m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368514=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368514
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:31
Start Date: 09/Jan/20 00:31
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364359100
 
 

 ##
 File path: sdks/python/apache_beam/transforms/sideinputs.py
 ##
 @@ -58,6 +59,16 @@ def map_via_end(source_window):
   return map_via_end
 
 
+def get_sideinput_index(tag):
+  # type: (str) -> int
+  match = re.match('side([0-9]+)(-.*)?$', tag,
+   re.DOTALL)
 
 Review comment:
   Why `re.DOTALL`? It's not present in the code this function is replacing.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368514)
Time Spent: 37h 20m  (was: 37h 10m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 37h 20m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368516=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368516
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:31
Start Date: 09/Jan/20 00:31
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364397975
 
 

 ##
 File path: sdks/python/apache_beam/portability/__init__.py
 ##
 @@ -17,3 +17,18 @@
 
 """For internal use only; no backwards-compatibility guarantees."""
 from __future__ import absolute_import
+
+from typing import TYPE_CHECKING
+from typing import NamedTuple
+
+if TYPE_CHECKING:
+  from apache_beam.portability.api import metrics_pb2
+
+
+PropertiesFromEnumValue = NamedTuple(
 
 Review comment:
   We typically don't put anything beyond import statements in `__init__.py` 
files.
   Could this be put in common_urns.py?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368516)
Time Spent: 37h 40m  (was: 37.5h)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 37h 40m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368520=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368520
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:31
Start Date: 09/Jan/20 00:31
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364483283
 
 

 ##
 File path: sdks/python/apache_beam/pvalue.py
 ##
 @@ -82,7 +82,7 @@ class PValue(object):
   """
 
   def __init__(self,
-   pipeline,  # type: Pipeline
+   pipeline,  # type: Optional[Pipeline]
 
 Review comment:
   Is (1) above incorrect? Are some PValues initialized without a pipeline?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368520)
Time Spent: 37h 50m  (was: 37h 40m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 37h 50m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368519=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368519
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:31
Start Date: 09/Jan/20 00:31
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364501177
 
 

 ##
 File path: sdks/python/apache_beam/runners/common.py
 ##
 @@ -879,6 +892,7 @@ def process(self, windowed_value):
   return self.do_fn_invoker.invoke_process(windowed_value)
 except BaseException as exn:
   self._reraise_augmented(exn)
+  return None
 
 Review comment:
   Same
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368519)
Time Spent: 37h 50m  (was: 37h 40m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 37h 50m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368515=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368515
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:31
Start Date: 09/Jan/20 00:31
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364472090
 
 

 ##
 File path: sdks/python/gen_protos.py
 ##
 @@ -47,6 +50,164 @@
 ]
 
 
+def generate_urn_files(log, out_dir):
 
 Review comment:
   Could you post somewhere the output files?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368515)
Time Spent: 37.5h  (was: 37h 20m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 37.5h
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368518=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368518
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:31
Start Date: 09/Jan/20 00:31
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364501015
 
 

 ##
 File path: sdks/python/apache_beam/runners/common.py
 ##
 @@ -793,6 +804,8 @@ def current_element_progress(self):
 restriction_tracker = self.threadsafe_restriction_tracker
 if restriction_tracker:
   return restriction_tracker.current_progress()
+else:
+  return None
 
 Review comment:
   Again, isn't this redundant? Does mypy require it?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368518)
Time Spent: 37h 50m  (was: 37h 40m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 37h 50m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=368512=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368512
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:31
Start Date: 09/Jan/20 00:31
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10367: [BEAM-7746] Add 
python type hints (part 2)
URL: https://github.com/apache/beam/pull/10367#discussion_r364352387
 
 

 ##
 File path: sdks/python/apache_beam/coders/coder_impl.py
 ##
 @@ -907,7 +907,8 @@ def encode_to_stream(self, value, out, nested):
   buffer = create_OutputStream()
   if (self._write_state is not None
   and out.size() - start_size > self._write_state_threshold):
-tail = (value_iter[index + 1:] if isinstance(value, (list, tuple))
+tail = (value_iter[index + 1:]
+if isinstance(value_iter, (list, tuple))
 
 Review comment:
   The replacement of value with value_iter seems to be logically equivalent, 
so what was mypy complaining about here?
   Was the change made to let mypy reason about the type of `tail`?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368512)
Time Spent: 37h  (was: 36h 50m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 37h
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8575) Add more Python validates runner tests

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8575?focusedWorklogId=368507=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368507
 ]

ASF GitHub Bot logged work on BEAM-8575:


Author: ASF GitHub Bot
Created on: 09/Jan/20 00:09
Start Date: 09/Jan/20 00:09
Worklog Time Spent: 10m 
  Work Description: bumblebee-coming commented on issue #10159: [BEAM-8575] 
Added a unit test to CombineTest class to test that Combi…
URL: https://github.com/apache/beam/pull/10159#issuecomment-572318605
 
 
   Run Python PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368507)
Time Spent: 42h 10m  (was: 42h)

> Add more Python validates runner tests
> --
>
> Key: BEAM-8575
> URL: https://issues.apache.org/jira/browse/BEAM-8575
> Project: Beam
>  Issue Type: Test
>  Components: sdk-py-core, testing
>Reporter: wendy liu
>Assignee: wendy liu
>Priority: Major
>  Time Spent: 42h 10m
>  Remaining Estimate: 0h
>
> This is the umbrella issue to track the work of adding more Python tests to 
> improve test coverage.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9071) Ignore pytype untill the typing feature is fully developed

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9071?focusedWorklogId=368505=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368505
 ]

ASF GitHub Bot logged work on BEAM-9071:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:56
Start Date: 08/Jan/20 23:56
Worklog Time Spent: 10m 
  Work Description: boyuanzz commented on pull request #10533: [BEAM-9071] 
Ignore pytype before typing is fully developed.
URL: https://github.com/apache/beam/pull/10533
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368505)
Time Spent: 1.5h  (was: 1h 20m)

> Ignore pytype untill the typing feature is fully developed
> --
>
> Key: BEAM-9071
> URL: https://issues.apache.org/jira/browse/BEAM-9071
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Boyuan Zhang
>Assignee: Udi Meiri
>Priority: Major
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> Please remove all # pytype: skip-file after feature fully supported.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9027) [SQL] ZetaSQL unparsing should produce valid result

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9027?focusedWorklogId=368497=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368497
 ]

ASF GitHub Bot logged work on BEAM-9027:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:38
Start Date: 08/Jan/20 23:38
Worklog Time Spent: 10m 
  Work Description: apilloud commented on issue #10539: [BEAM-9027] Unparse 
DOY/DOW/WEEK Enums properly for ZetaSQL
URL: https://github.com/apache/beam/pull/10539#issuecomment-572310694
 
 
   R: @robinyqiu 
   cc: @amaliujia 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368497)
Time Spent: 3h 20m  (was: 3h 10m)

> [SQL] ZetaSQL unparsing should produce valid result
> ---
>
> Key: BEAM-9027
> URL: https://issues.apache.org/jira/browse/BEAM-9027
> Project: Beam
>  Issue Type: Improvement
>  Components: dsl-sql-zetasql
>Reporter: Kirill Kozlov
>Assignee: Kirill Kozlov
>Priority: Major
>  Time Spent: 3h 20m
>  Remaining Estimate: 0h
>
> * ZetaSQL does not recognize keyword INTERVAL
>  * Calcite cannot unparse RexNode back to bytes literal
>  * Calcite cannot unparse some floating point literals correctly
>  * Calcite cannot unparse some string literals correctly
>  * Calcite cannot unparse types correctly for CAST function



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9027) [SQL] ZetaSQL unparsing should produce valid result

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9027?focusedWorklogId=368496=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368496
 ]

ASF GitHub Bot logged work on BEAM-9027:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:38
Start Date: 08/Jan/20 23:38
Worklog Time Spent: 10m 
  Work Description: apilloud commented on issue #10538: [BEAM-9027] 
Backport BigQuerySqlDialect fixes
URL: https://github.com/apache/beam/pull/10538#issuecomment-572310661
 
 
   R: @robinyqiu 
   cc: @amaliujia 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368496)
Time Spent: 3h 10m  (was: 3h)

> [SQL] ZetaSQL unparsing should produce valid result
> ---
>
> Key: BEAM-9027
> URL: https://issues.apache.org/jira/browse/BEAM-9027
> Project: Beam
>  Issue Type: Improvement
>  Components: dsl-sql-zetasql
>Reporter: Kirill Kozlov
>Assignee: Kirill Kozlov
>Priority: Major
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> * ZetaSQL does not recognize keyword INTERVAL
>  * Calcite cannot unparse RexNode back to bytes literal
>  * Calcite cannot unparse some floating point literals correctly
>  * Calcite cannot unparse some string literals correctly
>  * Calcite cannot unparse types correctly for CAST function



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8630) Prototype of BeamSQL Calc using ZetaSQL Expression Evaluator

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8630?focusedWorklogId=368493=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368493
 ]

ASF GitHub Bot logged work on BEAM-8630:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:37
Start Date: 08/Jan/20 23:37
Worklog Time Spent: 10m 
  Work Description: apilloud commented on issue #10537: [BEAM-8630] Use 
column numbers for BeamZetaSqlCalRel
URL: https://github.com/apache/beam/pull/10537#issuecomment-572310483
 
 
   R: @robinyqiu 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368493)
Time Spent: 5h 50m  (was: 5h 40m)

> Prototype of BeamSQL Calc using ZetaSQL Expression Evaluator
> 
>
> Key: BEAM-8630
> URL: https://issues.apache.org/jira/browse/BEAM-8630
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql
>Reporter: Yueyang Qiu
>Assignee: Yueyang Qiu
>Priority: Major
>  Time Spent: 5h 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9027) [SQL] ZetaSQL unparsing should produce valid result

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9027?focusedWorklogId=368489=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368489
 ]

ASF GitHub Bot logged work on BEAM-9027:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:35
Start Date: 08/Jan/20 23:35
Worklog Time Spent: 10m 
  Work Description: apilloud commented on pull request #10539: [BEAM-9027] 
Unparse DOY/DOW/WEEK Enums properly for ZetaSQL
URL: https://github.com/apache/beam/pull/10539
 
 
   Fixes enums for unparsing, passes compliance tests with ZetaSQL Calc. Unit 
tests aren't possible because of mismatches of these types being 0 or 1 indexed.
   
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [ ] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Gearpump | Samza | Spark
   --- | --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 

[jira] [Work logged] (BEAM-9027) [SQL] ZetaSQL unparsing should produce valid result

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9027?focusedWorklogId=368485=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368485
 ]

ASF GitHub Bot logged work on BEAM-9027:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:34
Start Date: 08/Jan/20 23:34
Worklog Time Spent: 10m 
  Work Description: apilloud commented on pull request #10538: [BEAM-9027] 
Backport BigQuerySqlDialect apache/calcite@e63c252c
URL: https://github.com/apache/beam/pull/10538
 
 
   Pull in fixes from apache/calcite@e63c252c
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [ ] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Gearpump | Samza | Spark
   --- | --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/)[![Build
 

[jira] [Work logged] (BEAM-8630) Prototype of BeamSQL Calc using ZetaSQL Expression Evaluator

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8630?focusedWorklogId=368484=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368484
 ]

ASF GitHub Bot logged work on BEAM-8630:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:33
Start Date: 08/Jan/20 23:33
Worklog Time Spent: 10m 
  Work Description: apilloud commented on pull request #10537: [BEAM-8630] 
Use column numbers for BeamZetaSqlCalRel
URL: https://github.com/apache/beam/pull/10537
 
 
   Use column numbers for BeamZetaSqlCalRel, this fixes multiple test failures.
   
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [ ] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Gearpump | Samza | Spark
   --- | --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/)[![Build
 

[jira] [Commented] (BEAM-1589) Add OnWindowExpiration method to Stateful DoFn

2020-01-08 Thread Reuven Lax (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011071#comment-17011071
 ] 

Reuven Lax commented on BEAM-1589:
--

output timestamp support is now in (Flink support pending).

I think we should just use the same timer that's used for garbage collection to 
fire OnWindowExpiration callbacks. In order to make this work, we need to 
modify this timer to have an output timestamp inside the window (maybe the end 
of the window?).

> Add OnWindowExpiration method to Stateful DoFn
> --
>
> Key: BEAM-1589
> URL: https://issues.apache.org/jira/browse/BEAM-1589
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-core, sdk-java-core
>Reporter: Jingsong Lee
>Priority: Major
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> See BEAM-1517
> This allows the user to do some work before the state's garbage collection.
> It seems kind of annoying, but on the other hand forgetting to set a final 
> timer to flush state is probably data loss most of the time.
> FlinkRunner does this work very simply, but other runners, such as 
> DirectRunner, need to traverse all the states to do this, and maybe it's a 
> little hard.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9071) Ignore pytype untill the typing feature is fully developed

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9071?focusedWorklogId=368481=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368481
 ]

ASF GitHub Bot logged work on BEAM-9071:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:27
Start Date: 08/Jan/20 23:27
Worklog Time Spent: 10m 
  Work Description: boyuanzz commented on issue #10533: [BEAM-9071] Ignore 
pytype before typing is fully developed.
URL: https://github.com/apache/beam/pull/10533#issuecomment-572307858
 
 
   I did some manual check and changes. Waiting for test passing.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368481)
Time Spent: 1h 20m  (was: 1h 10m)

> Ignore pytype untill the typing feature is fully developed
> --
>
> Key: BEAM-9071
> URL: https://issues.apache.org/jira/browse/BEAM-9071
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Boyuan Zhang
>Assignee: Udi Meiri
>Priority: Major
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> Please remove all # pytype: skip-file after feature fully supported.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8933) BigQuery IO should support read/write in Arrow format

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8933?focusedWorklogId=368482=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368482
 ]

ASF GitHub Bot logged work on BEAM-8933:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:27
Start Date: 08/Jan/20 23:27
Worklog Time Spent: 10m 
  Work Description: kennknowles commented on pull request #10369: 
[BEAM-8933] BigQueryIO Arrow for read
URL: https://github.com/apache/beam/pull/10369#discussion_r364494539
 
 

 ##
 File path: 
sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/bigquery/BigQueryStorageQuerySource.java
 ##
 @@ -57,6 +59,32 @@
 priority,
 location,
 kmsKey,
+format,
+parseFn,
+outputCoder,
+bqServices);
+  }
+
+  public static  BigQueryStorageQuerySource create(
 
 Review comment:
   Good catch. The whole class is still experimental, so changes should be OK 
to make. On the other hand, I wonder if the sources should be marked 
`@Internal` or even made non-public with only the transform being public.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368482)
Time Spent: 4h 50m  (was: 4h 40m)

> BigQuery IO should support read/write in Arrow format
> -
>
> Key: BEAM-8933
> URL: https://issues.apache.org/jira/browse/BEAM-8933
> Project: Beam
>  Issue Type: Improvement
>  Components: io-java-gcp
>Reporter: Kirill Kozlov
>Assignee: Kirill Kozlov
>Priority: Major
>  Time Spent: 4h 50m
>  Remaining Estimate: 0h
>
> As of right now BigQuery uses Avro format for reading and writing.
> We should add a config to BigQueryIO to specify which format to use: Arrow or 
> Avro (with Avro as default).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8717) Beam Dependency Update Request: org.apache.commons:commons-lang3

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8717?focusedWorklogId=368478=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368478
 ]

ASF GitHub Bot logged work on BEAM-8717:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:24
Start Date: 08/Jan/20 23:24
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on issue #10524: [BEAM-8717] Update 
commons-lang3 to version 3.9
URL: https://github.com/apache/beam/pull/10524#issuecomment-572305568
 
 
   Should we be vendoring commons-lang3 instead of using the shaded classes 
from beam-sdks-java-core?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368478)
Time Spent: 2h 20m  (was: 2h 10m)

> Beam Dependency Update Request: org.apache.commons:commons-lang3
> 
>
> Key: BEAM-8717
> URL: https://issues.apache.org/jira/browse/BEAM-8717
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Assignee: Ismaël Mejía
>Priority: Major
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:43:43.060362 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:11:02.203215 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:17:32.152530 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:16:47.060229 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:17:09.857528 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:12:21.614448 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:15:59.144846 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8717) Beam Dependency Update Request: org.apache.commons:commons-lang3

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8717?focusedWorklogId=368477=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368477
 ]

ASF GitHub Bot logged work on BEAM-8717:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:23
Start Date: 08/Jan/20 23:23
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on pull request #10524: [BEAM-8717] 
Update commons-lang3 to version 3.9
URL: https://github.com/apache/beam/pull/10524#discussion_r364493129
 
 

 ##
 File path: 
buildSrc/src/main/groovy/org/apache/beam/gradle/BeamModulePlugin.groovy
 ##
 @@ -432,7 +432,7 @@ class BeamModulePlugin implements Plugin {
 commons_compress: 
"org.apache.commons:commons-compress:1.19",
 commons_csv : 
"org.apache.commons:commons-csv:1.4",
 commons_io  : 
"commons-io:commons-io:2.6",
-commons_lang3   : 
"org.apache.commons:commons-lang3:3.6",
+commons_lang3   : 
"org.apache.commons:commons-lang3:3.9",
 
 Review comment:
   You could remove this from this list and have the full maven coordinates 
appear in sdks/java/core/build.gradle since it is no longer used elsewhere.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368477)
Time Spent: 2h 10m  (was: 2h)

> Beam Dependency Update Request: org.apache.commons:commons-lang3
> 
>
> Key: BEAM-8717
> URL: https://issues.apache.org/jira/browse/BEAM-8717
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Assignee: Ismaël Mejía
>Priority: Major
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:43:43.060362 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:11:02.203215 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:17:32.152530 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:16:47.060229 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:17:09.857528 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:12:21.614448 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:15:59.144846 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current 

[jira] [Work logged] (BEAM-8717) Beam Dependency Update Request: org.apache.commons:commons-lang3

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8717?focusedWorklogId=368476=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368476
 ]

ASF GitHub Bot logged work on BEAM-8717:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:19
Start Date: 08/Jan/20 23:19
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on issue #10524: [BEAM-8717] Update 
commons-lang3 to version 3.9
URL: https://github.com/apache/beam/pull/10524#issuecomment-572305568
 
 
   Should we be vendoring commons-lang3 instead of using the shaded classes 
from beam-sdks-java-core?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368476)
Time Spent: 2h  (was: 1h 50m)

> Beam Dependency Update Request: org.apache.commons:commons-lang3
> 
>
> Key: BEAM-8717
> URL: https://issues.apache.org/jira/browse/BEAM-8717
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Assignee: Ismaël Mejía
>Priority: Major
>  Time Spent: 2h
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:43:43.060362 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:11:02.203215 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:17:32.152530 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:16:47.060229 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:17:09.857528 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:12:21.614448 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:15:59.144846 
> -
> Please consider upgrading the dependency 
> org.apache.commons:commons-lang3. 
> The current version is 3.6. The latest version is 3.9 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=368469=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368469
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 08/Jan/20 23:04
Start Date: 08/Jan/20 23:04
Worklog Time Spent: 10m 
  Work Description: Hannah-Jiang commented on issue #10536: [BEAM-7861] Add 
direct_running_mode option for direct runners to switch between multi_threading 
and multi_processing easily
URL: https://github.com/apache/beam/pull/10536#issuecomment-572300418
 
 
   Run Python PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368469)
Time Spent: 0.5h  (was: 20m)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-5605) Support Portable SplittableDoFn for batch

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-5605?focusedWorklogId=368467=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368467
 ]

ASF GitHub Bot logged work on BEAM-5605:


Author: ASF GitHub Bot
Created on: 08/Jan/20 22:58
Start Date: 08/Jan/20 22:58
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on issue #10535: [BEAM-5605] Add 
support for executing pair with restriction, split restriction, split and size 
restriction, process element and restriction and process sized element and 
restriction within the Java SDK harness.
URL: https://github.com/apache/beam/pull/10535#issuecomment-572298643
 
 
   R: @youngoli 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368467)
Time Spent: 4.5h  (was: 4h 20m)

> Support Portable SplittableDoFn for batch
> -
>
> Key: BEAM-5605
> URL: https://issues.apache.org/jira/browse/BEAM-5605
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>Reporter: Scott Wegner
>Assignee: Luke Cwik
>Priority: Major
>  Labels: portability
>  Time Spent: 4.5h
>  Remaining Estimate: 0h
>
> Roll-up item tracking work towards supporting portable SplittableDoFn for 
> batch



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-08 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=368465=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-368465
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 08/Jan/20 22:55
Start Date: 08/Jan/20 22:55
Worklog Time Spent: 10m 
  Work Description: Hannah-Jiang commented on issue #10536: [BEAM-7861] Add 
direct_running_mode option for direct runners to switch between multi_threading 
and multi_processing easily
URL: https://github.com/apache/beam/pull/10536#issuecomment-572297690
 
 
   R: @robertwb 
   Cc: @aaltay 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 368465)
Time Spent: 20m  (was: 10m)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-08 Thread Hannah Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hannah Jiang updated BEAM-7861:
---
Fix Version/s: 2.19.0

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


  1   2   3   >