[jira] [Assigned] (BEAM-1198) ViewFn: explicitly decouple runner materialization of side inputs from SDK-specific mapping

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles reassigned BEAM-1198:
-

Assignee: Kenneth Knowles

> ViewFn: explicitly decouple runner materialization of side inputs from 
> SDK-specific mapping
> ---
>
> Key: BEAM-1198
> URL: https://issues.apache.org/jira/browse/BEAM-1198
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model-fn-api, beam-model-runner-api
>    Reporter: Kenneth Knowles
>Assignee: Kenneth Knowles
>
> For side inputs, the field {{PCollectionView.fromIterableInternal}} implies 
> an "iterable" materialization of the contents of a PCollection, which is 
> adapted to the desired user-facing type within a UDF (today the 
> PCollectionView "is" the UDF)
> In practice, runners get adequate performance by special casing just a couple 
> of types of PCollectionView in a rather cumbersome manner.
> The design to improve this is https://s.apache.org/beam-side-inputs-1-pager 
> and this makes the de facto standard approach the actual model.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (BEAM-846) Decouple side input window mapping from WindowFn

2016-12-21 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613384#comment-15613384
 ] 

Kenneth Knowles edited comment on BEAM-846 at 12/21/16 9:19 PM:


Design 1-pager is https://s.apache.org/beam-windowmappingfn-1-pager


was (Author: kenn):
Design 1-pager is https://s.apache.org/beam-side-inputs-1-pager and a couple 
PRs have been authored 
([#520|https://github.com/apache/incubator-beam/pull/520] and 
[#1076|https://github.com/apache/incubator-beam/pull/1076]) attributed to 
BEAM-115 (the "all of the Runner API" ticket)


> Decouple side input window mapping from WindowFn
> 
>
> Key: BEAM-846
> URL: https://issues.apache.org/jira/browse/BEAM-846
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model-runner-api, sdk-java-core
>Reporter: Robert Bradshaw
>Assignee: Kenneth Knowles
>  Labels: backward-incompatible
>
> Currently the main WindowFn provides as getSideInputWindow method. Instead, 
> this mapping should be specified per-side-input (thought the default mapping 
> would remain the same). 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-210) Allow control of empty ON_TIME panes analogous to final panes

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-210:
-
Component/s: (was: beam-model)

> Allow control of empty ON_TIME panes analogous to final panes
> -
>
> Key: BEAM-210
> URL: https://issues.apache.org/jira/browse/BEAM-210
> Project: Beam
>  Issue Type: Bug
>  Components: beam-model-runner-api, sdk-java-core
>Reporter: Mark Shields
>Assignee: Thomas Groh
>
> Today, ON_TIME panes are emitted whether or not they are empty. We had 
> decided that for final panes the user would want to control this behavior, to 
> control data volume. But for ON_TIME panes no such control exists. The 
> rationale is perhaps that the ON_TIME pane is a fundamental result that 
> should not be elided. To be considered: whether this is what we want.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-260) WindowMappingFn: Know the getSideInputWindow upper bound to release side input resources

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-260:
-
Component/s: (was: beam-model)
 beam-model-fn-api

> WindowMappingFn: Know the getSideInputWindow upper bound to release side 
> input resources
> 
>
> Key: BEAM-260
> URL: https://issues.apache.org/jira/browse/BEAM-260
> Project: Beam
>  Issue Type: Bug
>  Components: beam-model-fn-api, beam-model-runner-api
>Reporter: Mark Shields
>Assignee: Kenneth Knowles
>
> We currently have no static knowledge about the getSideInputWindow function, 
> and runners are thus forced to hold on to all side input state / elements in 
> case a future element reaches back into an earlier side input element.
> Maybe we need an upper bound on lag from current to result of 
> getSideInputWindow so we can have a progressing gc horizon as we do for  GKB 
> window state. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-846) Decouple side input window mapping from WindowFn

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-846:
-
Component/s: (was: beam-model)

> Decouple side input window mapping from WindowFn
> 
>
> Key: BEAM-846
> URL: https://issues.apache.org/jira/browse/BEAM-846
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model-runner-api, sdk-java-core
>Reporter: Robert Bradshaw
>    Assignee: Kenneth Knowles
>  Labels: backward-incompatible
>
> Currently the main WindowFn provides as getSideInputWindow method. Instead, 
> this mapping should be specified per-side-input (thought the default mapping 
> would remain the same). 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1193) Give Coders URNs and document their binary formats outside the Java code base

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1193:
--
Component/s: (was: beam-model)

> Give Coders URNs and document their binary formats outside the Java code base
> -
>
> Key: BEAM-1193
> URL: https://issues.apache.org/jira/browse/BEAM-1193
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model-runner-api
>        Reporter: Kenneth Knowles
>Assignee: Frances Perry
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-653) Refine specification for WindowFn.isCompatible()

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-653:
-
Component/s: beam-model-runner-api

> Refine specification for WindowFn.isCompatible() 
> -
>
> Key: BEAM-653
> URL: https://issues.apache.org/jira/browse/BEAM-653
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model, beam-model-runner-api
>        Reporter: Kenneth Knowles
>
> {{WindowFn#isCompatible}} doesn't really have a spec. In practice, it is used 
> primarily when flattening together multiple PCollections. All of the 
> WindowFns must be compatible, and then just a single WindowFn is selected 
> arbitrarily for the output PCollection.
> In consequence, downstream of the Flatten, the merging behavior will be taken 
> from this WindowFn.
> Currently, there are some mismatches:
>  - Sessions with different gap durations _are_ compatible today, but probably 
> shouldn't be since merging makes little sense. (The use of tiny proto-windows 
> is an implementation detail anyhow)
>  - SlidingWindows and FixedWindows _could_ reasonably be compatible if they 
> had the same duration, though it might be odd.
> Either way, we should just nail down what we actually mean so we can arrive 
> at a verdict in these cases.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1149) Side input access fails in direct runner (possibly others too) when input element in multiple windows

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1149:
--
Component/s: runner-core

> Side input access fails in direct runner (possibly others too) when input 
> element in multiple windows
> -
>
> Key: BEAM-1149
> URL: https://issues.apache.org/jira/browse/BEAM-1149
> Project: Beam
>  Issue Type: Bug
>  Components: runner-core
>Reporter: Eugene Kirpichov
>Assignee: Kenneth Knowles
>Priority: Blocker
> Fix For: 0.4.0-incubating
>
>
> {code:java}
>   private static class FnWithSideInputs extends DoFn<String, String> {
> private final PCollectionView view;
> private FnWithSideInputs(PCollectionView view) {
>   this.view = view;
> }
> @ProcessElement
> public void processElement(ProcessContext c) {
>   c.output(c.element() + ":" + c.sideInput(view));
> }
>   }
>   @Test
>   public void testSideInputsWithMultipleWindows() {
> Pipeline p = TestPipeline.create();
> MutableDateTime mutableNow = Instant.now().toMutableDateTime();
> mutableNow.setMillisOfSecond(0);
> Instant now = mutableNow.toInstant();
> SlidingWindows windowFn =
> 
> SlidingWindows.of(Duration.standardSeconds(5)).every(Duration.standardSeconds(1));
> PCollectionView view = 
> p.apply(Create.of(1)).apply(View.asSingleton());
> PCollection res =
> p.apply(Create.timestamped(TimestampedValue.of("a", now)))
> .apply(Window.into(windowFn))
> .apply(ParDo.of(new FnWithSideInputs(view)).withSideInputs(view));
> PAssert.that(res).containsInAnyOrder("a:1");
> p.run();
>   }
> {code}
> This fails with the following exception:
> {code}
> org.apache.beam.sdk.Pipeline$PipelineExecutionException: 
> java.lang.IllegalStateException: sideInput called when main input element is 
> in multiple windows
>   at 
> org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:343)
>   at org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:1)
>   at org.apache.beam.sdk.Pipeline.run(Pipeline.java:176)
>   at org.apache.beam.sdk.testing.TestPipeline.run(TestPipeline.java:112)
>   at 
> Caused by: java.lang.IllegalStateException: sideInput called when main input 
> element is in multiple windows
>   at 
> org.apache.beam.runners.core.SimpleDoFnRunner$DoFnProcessContext.sideInput(SimpleDoFnRunner.java:514)
>   at 
> org.apache.beam.sdk.transforms.ParDoTest$FnWithSideInputs.processElement(ParDoTest.java:738)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1198) ViewFn: explicitly decouple runner materialization of side inputs from SDK-specific mapping

2016-12-21 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1198:
-

 Summary: ViewFn: explicitly decouple runner materialization of 
side inputs from SDK-specific mapping
 Key: BEAM-1198
 URL: https://issues.apache.org/jira/browse/BEAM-1198
 Project: Beam
  Issue Type: New Feature
  Components: beam-model-fn-api, beam-model-runner-api
Reporter: Kenneth Knowles


For side inputs, the field {{PCollectionView.fromIterableInternal}} implies an 
"iterable" materialization of the contents of a PCollection, which is adapted 
to the desired user-facing type within a UDF (today the PCollectionView "is" 
the UDF)

In practice, runners get adequate performance by special casing just a couple 
of types of PCollectionView in a rather cumbersome manner.

The design to improve this is https://s.apache.org/beam-side-inputs-1-pager and 
this makes the de facto standard approach the actual model.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Closed] (BEAM-1003) Enable caching of side-input dependent computations

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles closed BEAM-1003.
-
   Resolution: Duplicate
Fix Version/s: Not applicable

Looks like an exact duplicate, a la form resubmission.

> Enable caching of side-input dependent computations
> ---
>
> Key: BEAM-1003
> URL: https://issues.apache.org/jira/browse/BEAM-1003
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model
>Reporter: Robert Bradshaw
>Assignee: Frances Perry
> Fix For: Not applicable
>
>
> Sometimes the kind of computations one wants to perform in startBundle depend 
> on side inputs (and, implicitly, the window). For example, one might want to 
> initialize a (non-serializable) stateful object. In particular, this leads to 
> users incorrectly (in the case of triggered or non-globally-windowed side 
> inputs) memoizing this computation in the first processElement call. 
> One option would be to fold this into a customizable ViewFn. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-260) WindowMappingFn: Know the getSideInputWindow upper bound to release side input resources

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-260:
-
Summary: WindowMappingFn: Know the getSideInputWindow upper bound to 
release side input resources  (was: Know the getSideInputWindow upper bound so 
can gc side input state)

> WindowMappingFn: Know the getSideInputWindow upper bound to release side 
> input resources
> 
>
> Key: BEAM-260
> URL: https://issues.apache.org/jira/browse/BEAM-260
> Project: Beam
>  Issue Type: Bug
>  Components: beam-model, beam-model-runner-api
>Reporter: Mark Shields
>Assignee: Kenneth Knowles
>
> We currently have no static knowledge about the getSideInputWindow function, 
> and runners are thus forced to hold on to all side input state / elements in 
> case a future element reaches back into an earlier side input element.
> Maybe we need an upper bound on lag from current to result of 
> getSideInputWindow so we can have a progressing gc horizon as we do for  GKB 
> window state. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1196) Serialize/deserialize Pipeline/TransformHierarchy to JSON

2016-12-21 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1196:
-

 Summary: Serialize/deserialize Pipeline/TransformHierarchy to JSON
 Key: BEAM-1196
 URL: https://issues.apache.org/jira/browse/BEAM-1196
 Project: Beam
  Issue Type: New Feature
  Components: beam-model-runner-api, sdk-java-core
Reporter: Kenneth Knowles


There are two sketches of a concrete format for a Pipeline:

1. The Avro schema in the design doc at https://s.apache.org/beam-runner-api 2. 
The JSON schema at https://github.com/apache/incubator-beam/pull/662

These should be pushed all the way through and added to the Java SDK.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1195) Give triggers URNs / JSON formats

2016-12-21 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1195:
-

 Summary: Give triggers URNs / JSON formats
 Key: BEAM-1195
 URL: https://issues.apache.org/jira/browse/BEAM-1195
 Project: Beam
  Issue Type: New Feature
  Components: beam-model-runner-api
Reporter: Kenneth Knowles
Assignee: Kenneth Knowles


We have recently gotten to the point where triggers are just syntax, but it is 
still shipped via Java serialization. To make it language-independent, we need 
a concrete syntax.

Something like the following is fairly concise, tag adjacent to payload. I 
haven't bothered making up fully verbose/namespaced URNs here.

{code}
{
"$urn": "OrFinally",
"main": {
  "$urn": "EndOfWindow",
  "early": 
},
"finally": {
  "$urn": "AfterCount",
  "count": 45
}
}
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1192) Give transforms URNs, use them instead of instanceof checks

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1192:
--
Summary: Give transforms URNs, use them instead of instanceof checks  (was: 
Add URNs to known transforms instead of using instanceof checks)

> Give transforms URNs, use them instead of instanceof checks
> ---
>
> Key: BEAM-1192
> URL: https://issues.apache.org/jira/browse/BEAM-1192
> Project: Beam
>  Issue Type: Improvement
>  Components: beam-model-runner-api
>        Reporter: Kenneth Knowles
>
> In the [Beam Runner AP|https://s.apache.org/beam-runner-api], transforms of 
> interest to runners are to be identified by URN.
> Currently, Java-based runners use `instanceof` checks to both override 
> transforms and to implement primitive transforms. This language and 
> SDK-specific behavior should be replaced by adding these URNs, and checking 
> them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1192) Give transforms URNs, use them instead of instanceof checks

2016-12-21 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1192:
--
Issue Type: New Feature  (was: Improvement)

> Give transforms URNs, use them instead of instanceof checks
> ---
>
> Key: BEAM-1192
> URL: https://issues.apache.org/jira/browse/BEAM-1192
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model-runner-api
>        Reporter: Kenneth Knowles
>
> In the [Beam Runner AP|https://s.apache.org/beam-runner-api], transforms of 
> interest to runners are to be identified by URN.
> Currently, Java-based runners use `instanceof` checks to both override 
> transforms and to implement primitive transforms. This language and 
> SDK-specific behavior should be replaced by adding these URNs, and checking 
> them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1193) Give Coders URNs and document their binary formats outside the Java code base

2016-12-21 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1193:
-

 Summary: Give Coders URNs and document their binary formats 
outside the Java code base
 Key: BEAM-1193
 URL: https://issues.apache.org/jira/browse/BEAM-1193
 Project: Beam
  Issue Type: New Feature
  Components: beam-model, beam-model-runner-api
Reporter: Kenneth Knowles
Assignee: Frances Perry






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1192) Add URNs to known transforms instead of using instanceof checks

2016-12-21 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1192:
-

 Summary: Add URNs to known transforms instead of using instanceof 
checks
 Key: BEAM-1192
 URL: https://issues.apache.org/jira/browse/BEAM-1192
 Project: Beam
  Issue Type: Improvement
  Components: beam-model-runner-api
Reporter: Kenneth Knowles


In the [Beam Runner AP|https://s.apache.org/beam-runner-api], transforms of 
interest to runners are to be identified by URN.

Currently, Java-based runners use `instanceof` checks to both override 
transforms and to implement primitive transforms. This language and 
SDK-specific behavior should be replaced by adding these URNs, and checking 
them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1191) Eliminate OldDoFn from the SDK

2016-12-20 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1191:
-

 Summary: Eliminate OldDoFn from the SDK
 Key: BEAM-1191
 URL: https://issues.apache.org/jira/browse/BEAM-1191
 Project: Beam
  Issue Type: Improvement
  Components: sdk-java-core
Reporter: Kenneth Knowles
Priority: Minor


We are far enough along that now {{OldDoFn}} is not usable by users and 
BEAM-498 is closed out. The remaining occurrences are limited to runners and 
things that should end up in runners-core.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (BEAM-498) Make DoFnWithContext the new DoFn

2016-12-20 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles resolved BEAM-498.
--
   Resolution: Fixed
Fix Version/s: 0.3.0-incubating

> Make DoFnWithContext the new DoFn
> -
>
> Key: BEAM-498
> URL: https://issues.apache.org/jira/browse/BEAM-498
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>        Reporter: Kenneth Knowles
>    Assignee: Kenneth Knowles
>  Labels: backward-incompatible
> Fix For: 0.3.0-incubating
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1067) apex.examples.WordCountTest.testWordCountExample is flaky

2016-12-20 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1067:
--
Summary: apex.examples.WordCountTest.testWordCountExample is flaky  (was: 
apex.examples.WordCountTest.testWordCountExample is be flaky)

> apex.examples.WordCountTest.testWordCountExample is flaky
> -
>
> Key: BEAM-1067
> URL: https://issues.apache.org/jira/browse/BEAM-1067
> Project: Beam
>  Issue Type: Bug
>  Components: runner-apex
>Reporter: Stas Levin
>Assignee: Thomas Weise
>
> Seems that 
> {{org.apache.beam.runners.apex.examples.WordCountTest.testWordCountExample}} 
> is flaky.
> For example, 
> [this|https://builds.apache.org/job/beam_PreCommit_Java_MavenInstall/5408/org.apache.beam$beam-runners-apex/testReport/org.apache.beam.runners.apex.examples/WordCountTest/testWordCountExample/
>  ] run failed although no changes were made in {{runner-apex}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1189) Add guide for release verifiers in the release guide

2016-12-20 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1189:
-

 Summary: Add guide for release verifiers in the release guide
 Key: BEAM-1189
 URL: https://issues.apache.org/jira/browse/BEAM-1189
 Project: Beam
  Issue Type: Improvement
  Components: website
Reporter: Kenneth Knowles
Assignee: James Malone


This came up during the 0.4.0-incubating release discussion.

There is this checklist: 
http://incubator.apache.org/guides/releasemanagement.html#check-list

And we could point to that but make more detailed Beam-specific instructions on 
http://beam.apache.org/contribute/release-guide/#vote-on-the-release-candidate

And the template for the vote email should include a link to suggested 
verification steps.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Reopened] (BEAM-1186) Migrate the remaining tests to use TestPipeline as a JUnit rule.

2016-12-20 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles reopened BEAM-1186:
---

> Migrate the remaining tests to use TestPipeline as a JUnit rule.
> 
>
> Key: BEAM-1186
> URL: https://issues.apache.org/jira/browse/BEAM-1186
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Stas Levin
>Assignee: Stas Levin
>Priority: Minor
> Fix For: 0.5.0-incubating
>
>
> Following up on [BEAM-1176|https://issues.apache.org/jira/browse/BEAM-1176], 
> the following tests still have direct calls to {{TestPipeline.create()}}:
> * {{AvroIOGeneratedClassTest#runTestRead}}
> * {{ApproximateUniqueTest#runApproximateUniqueWithDuplicates}}
> * {{ApproximateUniqueTest#runApproximateUniqueWithSkewedDistributions}}
> * {{SampleTest#runPickAnyTest}}
> * {{BigtableIOTest#runReadTest}}
> Consider using [parametrised 
> tests|https://github.com/Pragmatists/junitparams] as suggested by [~lcwik].



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (BEAM-1176) Make our test suites use @Rule TestPipeline

2016-12-20 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles resolved BEAM-1176.
---
   Resolution: Fixed
Fix Version/s: 0.5.0-incubating

> Make our test suites use @Rule TestPipeline
> ---
>
> Key: BEAM-1176
> URL: https://issues.apache.org/jira/browse/BEAM-1176
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>        Reporter: Kenneth Knowles
>Assignee: Stas Levin
>Priority: Minor
> Fix For: 0.5.0-incubating
>
>
> Now that [~staslev] has made {{TestPipeline}} a JUnit rule that performs 
> useful sanity checks, we should port all of our tests to it so that they set 
> a good example for users. Maybe we'll even catch some straggling tests with 
> errors :-)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (BEAM-1186) Migrate the remaining tests to use TestPipeline as a JUnit rule.

2016-12-20 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles resolved BEAM-1186.
---
   Resolution: Fixed
Fix Version/s: 0.5.0-incubating

> Migrate the remaining tests to use TestPipeline as a JUnit rule.
> 
>
> Key: BEAM-1186
> URL: https://issues.apache.org/jira/browse/BEAM-1186
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Stas Levin
>Assignee: Stas Levin
>Priority: Minor
> Fix For: 0.5.0-incubating
>
>
> Following up on [BEAM-1176|https://issues.apache.org/jira/browse/BEAM-1176], 
> the following tests still have direct calls to {{TestPipeline.create()}}:
> * {{AvroIOGeneratedClassTest#runTestRead}}
> * {{ApproximateUniqueTest#runApproximateUniqueWithDuplicates}}
> * {{ApproximateUniqueTest#runApproximateUniqueWithSkewedDistributions}}
> * {{SampleTest#runPickAnyTest}}
> * {{BigtableIOTest#runReadTest}}
> Consider using [parametrised 
> tests|https://github.com/Pragmatists/junitparams] as suggested by [~lcwik].



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1067) apex.examples.WordCountTest.testWordCountExample is be flaky

2016-12-19 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1067:
--
Summary: apex.examples.WordCountTest.testWordCountExample is be flaky  
(was: apex.examples.WordCountTest.testWordCountExample may be flaky)

> apex.examples.WordCountTest.testWordCountExample is be flaky
> 
>
> Key: BEAM-1067
> URL: https://issues.apache.org/jira/browse/BEAM-1067
> Project: Beam
>  Issue Type: Bug
>  Components: runner-apex
>Reporter: Stas Levin
>Assignee: Thomas Weise
>
> Seems that 
> {{org.apache.beam.runners.apex.examples.WordCountTest.testWordCountExample}} 
> is flaky.
> For example, 
> [this|https://builds.apache.org/job/beam_PreCommit_Java_MavenInstall/5408/org.apache.beam$beam-runners-apex/testReport/org.apache.beam.runners.apex.examples/WordCountTest/testWordCountExample/
>  ] run failed although no changes were made in {{runner-apex}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-440) Create.values() returns a type-unsafe Coder

2016-12-18 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-440:
-
Assignee: Jason White

> Create.values() returns a type-unsafe Coder
> ---
>
> Key: BEAM-440
> URL: https://issues.apache.org/jira/browse/BEAM-440
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Daniel Halperin
>Assignee: Jason White
>  Labels: newbie, starter
>
> {{Create.values()}} with no arguments will default to a {{VoidCoder}}, unless 
> one is set later with {{setCoder(Coder)}}.
> Although it will encode its input correctly, this seems like a bad choice in 
> many cases. E.g., with {{Flatten}}:
> {code}
> PCollection<KV<SomeClass, Integer>> initial = p.apply("First", 
> Create.<KV<SomeClass, Integer>>of());
> PCollection<KV<SomeClass, Integer>> second =
> p.apply("Second", Create.of("a", "b")).apply(ParDo.of(new 
> MyAvroDoFn()));
> PCollectionList
> .of(initial).and(second)
> .apply(Flatten.<KV<SomeClass, Integer>>pCollections());
> {code}
> This crashes trying to cast a KV from "Second" to a Void.class.
> 1. Suggest throwing a warning in {{getDefaultOutputCoder}} when defaulting to 
> {{VoidCoder}} for an empty elements list. Should this be an error?
> 2. Suggest adding something like {{Create.empty(TypeDescriptor)}} to handle 
> this case properly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-440) Create.values() returns a type-unsafe Coder

2016-12-18 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760233#comment-15760233
 ] 

Kenneth Knowles commented on BEAM-440:
--

The root issue here is actually worse. There are two facets.

1. {{Create}} based on the value is unsafe in general. (full disclosure: I 
implemented this unsafe "feature"). It adds a coder that is safe for _those 
values only_ but not for their static type.

{code}
PCollection allStrings = pipeline.apply("Create strings", 
Create.of("hello"));
{code}

The coder inference infrastructure reasonably assumes that 
{{allStrings.getCoder()}} returns a coder that can handle all values of type 
{{Object}}, which is as false in this example as it is for an empty {{Create}}. 
Transforms like {{MapElements<Object, Object>}} will propagate the coder for 
{{Object}} which will break if any non-string is in the output (which obviously 
it could be).

2. {{Flatten}} quite explicitly assumes the same thing and grabs the first 
coder it sees, so when you flatten a bunch of {{PCollection}} together 
that have been made with {{Create}} you are in trouble.

So for this issue, we should probably just remove 
https://github.com/apache/incubator-beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/Create.java#L496.

At the time we added it, it seemed handy to infer a coder based on the values, 
since we didn't have any other information to go on due to the limits of Java 
reflection, but the idea doesn't work without further validation which requires 
as much user input as just setting the coder or type descriptor anyhow.

> Create.values() returns a type-unsafe Coder
> ---
>
> Key: BEAM-440
> URL: https://issues.apache.org/jira/browse/BEAM-440
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Daniel Halperin
>  Labels: newbie, starter
>
> {{Create.values()}} with no arguments will default to a {{VoidCoder}}, unless 
> one is set later with {{setCoder(Coder)}}.
> Although it will encode its input correctly, this seems like a bad choice in 
> many cases. E.g., with {{Flatten}}:
> {code}
> PCollection<KV<SomeClass, Integer>> initial = p.apply("First", 
> Create.<KV<SomeClass, Integer>>of());
> PCollection<KV<SomeClass, Integer>> second =
> p.apply("Second", Create.of("a", "b")).apply(ParDo.of(new 
> MyAvroDoFn()));
> PCollectionList
> .of(initial).and(second)
> .apply(Flatten.<KV<SomeClass, Integer>>pCollections());
> {code}
> This crashes trying to cast a KV from "Second" to a Void.class.
> 1. Suggest throwing a warning in {{getDefaultOutputCoder}} when defaulting to 
> {{VoidCoder}} for an empty elements list. Should this be an error?
> 2. Suggest adding something like {{Create.empty(TypeDescriptor)}} to handle 
> this case properly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-440) Create.values() returns a type-unsafe Coder

2016-12-18 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-440:
-
Description: 
{{Create.values()}} with no arguments will default to a {{VoidCoder}}, unless 
one is set later with {{setCoder(Coder)}}.

Although it will encode its input correctly, this seems like a bad choice in 
many cases. E.g., with {{Flatten}}:

{code}
PCollection<KV<SomeClass, Integer>> initial = p.apply("First", 
Create.<KV<SomeClass, Integer>>of());
PCollection<KV<SomeClass, Integer>> second =
p.apply("Second", Create.of("a", "b")).apply(ParDo.of(new 
MyAvroDoFn()));
PCollectionList
.of(initial).and(second)
.apply(Flatten.<KV<SomeClass, Integer>>pCollections());
{code}

This crashes trying to cast a KV from "Second" to a Void.class.

1. Suggest throwing a warning in {{getDefaultOutputCoder}} when defaulting to 
{{VoidCoder}} for an empty elements list. Should this be an error?

2. Suggest adding something like {{Create.empty(TypeDescriptor)}} to handle 
this case properly.

  was:
Create.values() with no arguments will default to a VoidCoder, unless one is 
set later with #setCoder(Coder).

Although it will encode its input correctly, this seems like a bad choice in 
many cases. E.g., with Flatten:

PCollection<KV<SomeClass, Integer>> initial = p.apply("First", 
Create.<KV<SomeClass, Integer>>of());
PCollection<KV<SomeClass, Integer>> second =
p.apply("Second", Create.of("a", "b")).apply(ParDo.of(new 
MyAvroDoFn()));
PCollectionList
.of(initial).and(second)
.apply(Flatten.<KV<SomeClass, Integer>>pCollections());

This crashes trying to cast a KV from "Second" to a Void.class.

1. Suggest throwing a warning in #getDefaultOutputCoder when defaulting to 
VoidCoder for an empty elements list. Should this be an error?

2. Suggest adding something like Create.empty(TypeDescriptor) to handle this 
case properly.


> Create.values() returns a type-unsafe Coder
> ---
>
> Key: BEAM-440
> URL: https://issues.apache.org/jira/browse/BEAM-440
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Daniel Halperin
>  Labels: newbie, starter
>
> {{Create.values()}} with no arguments will default to a {{VoidCoder}}, unless 
> one is set later with {{setCoder(Coder)}}.
> Although it will encode its input correctly, this seems like a bad choice in 
> many cases. E.g., with {{Flatten}}:
> {code}
> PCollection<KV<SomeClass, Integer>> initial = p.apply("First", 
> Create.<KV<SomeClass, Integer>>of());
> PCollection<KV<SomeClass, Integer>> second =
> p.apply("Second", Create.of("a", "b")).apply(ParDo.of(new 
> MyAvroDoFn()));
> PCollectionList
> .of(initial).and(second)
> .apply(Flatten.<KV<SomeClass, Integer>>pCollections());
> {code}
> This crashes trying to cast a KV from "Second" to a Void.class.
> 1. Suggest throwing a warning in {{getDefaultOutputCoder}} when defaulting to 
> {{VoidCoder}} for an empty elements list. Should this be an error?
> 2. Suggest adding something like {{Create.empty(TypeDescriptor)}} to handle 
> this case properly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-210) Allow control of empty ON_TIME panes analogous to final panes

2016-12-18 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-210:
-
Component/s: (was: runner-core)
 sdk-java-core
 beam-model-runner-api
 beam-model

> Allow control of empty ON_TIME panes analogous to final panes
> -
>
> Key: BEAM-210
> URL: https://issues.apache.org/jira/browse/BEAM-210
> Project: Beam
>  Issue Type: Bug
>  Components: beam-model, beam-model-runner-api, sdk-java-core
>Reporter: Mark Shields
>Assignee: Thomas Groh
>
> Today, ON_TIME panes are emitted whether or not they are empty. We had 
> decided that for final panes the user would want to control this behavior, to 
> control data volume. But for ON_TIME panes no such control exists. The 
> rationale is perhaps that the ON_TIME pane is a fundamental result that 
> should not be elided. To be considered: whether this is what we want.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-885) Move PipelineOptions from Pipeline.create() to Pipeline.run()

2016-12-18 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-885:
-
Component/s: beam-model-runner-api

> Move PipelineOptions from Pipeline.create() to Pipeline.run()
> -
>
> Key: BEAM-885
> URL: https://issues.apache.org/jira/browse/BEAM-885
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model-runner-api, sdk-java-core
>Reporter: Thomas Groh
>Assignee: Thomas Groh
>  Labels: backward-incompatible
>
> The specification of a Pipeline should be independent of its PipelineOptions. 
> This delays specification of the options, including choices like Pipeline 
> Runner.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-646) Get runners out of the apply()

2016-12-18 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-646:
-
Component/s: beam-model-runner-api

> Get runners out of the apply()
> --
>
> Key: BEAM-646
> URL: https://issues.apache.org/jira/browse/BEAM-646
> Project: Beam
>  Issue Type: Improvement
>  Components: beam-model-runner-api, sdk-java-core
>        Reporter: Kenneth Knowles
>Assignee: Thomas Groh
>
> Right now, the runner intercepts calls to apply() and replaces transforms as 
> we go. This means that there is no "original" user graph. For portability and 
> misc architectural benefits, we would like to build the original graph first, 
> and have the runner override later.
> Some runners already work in this manner, but we could integrate it more 
> smoothly, with more validation, via some handy APIs on e.g. the Pipeline 
> object.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-260) Know the getSideInputWindow upper bound so can gc side input state

2016-12-18 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-260:
-
Component/s: beam-model-runner-api

> Know the getSideInputWindow upper bound so can gc side input state
> --
>
> Key: BEAM-260
> URL: https://issues.apache.org/jira/browse/BEAM-260
> Project: Beam
>  Issue Type: Bug
>  Components: beam-model, beam-model-runner-api
>Reporter: Mark Shields
>    Assignee: Kenneth Knowles
>
> We currently have no static knowledge about the getSideInputWindow function, 
> and runners are thus forced to hold on to all side input state / elements in 
> case a future element reaches back into an earlier side input element.
> Maybe we need an upper bound on lag from current to result of 
> getSideInputWindow so we can have a progressing gc horizon as we do for  GKB 
> window state. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-742) Move trigger state machines to runners-core, convert triggers to AST

2016-12-18 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-742:
-
Component/s: beam-model-runner-api

> Move trigger state machines to runners-core, convert triggers to AST
> 
>
> Key: BEAM-742
> URL: https://issues.apache.org/jira/browse/BEAM-742
> Project: Beam
>  Issue Type: Sub-task
>  Components: beam-model-runner-api, runner-core, sdk-java-core
>        Reporter: Kenneth Knowles
>    Assignee: Kenneth Knowles
> Fix For: 0.4.0-incubating
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-846) Decouple side input window mapping from WindowFn

2016-12-18 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-846:
-
Component/s: beam-model-runner-api

> Decouple side input window mapping from WindowFn
> 
>
> Key: BEAM-846
> URL: https://issues.apache.org/jira/browse/BEAM-846
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model, beam-model-runner-api, sdk-java-core
>Reporter: Robert Bradshaw
>    Assignee: Kenneth Knowles
>  Labels: backward-incompatible
>
> Currently the main WindowFn provides as getSideInputWindow method. Instead, 
> this mapping should be specified per-side-input (thought the default mapping 
> would remain the same). 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-1159) DoFnSignature should have info on the fn's side inputs and outputs

2016-12-18 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760080#comment-15760080
 ] 

Kenneth Knowles commented on BEAM-1159:
---

The expected resolution of this ticket was actually designed long ago, and is 
included as an appendix to the new DoFn design, here: 
https://s.apache.org/a-new-dofn#heading=h.1budnm7l01ko

It hasn't been a priority yet, but it isn't difficult. It can probably be done 
in a backwards-compatible manner, though it would be cleaner if we have time to 
add the new form of support and remove the old way.

> DoFnSignature should have info on the fn's side inputs and outputs
> --
>
> Key: BEAM-1159
> URL: https://issues.apache.org/jira/browse/BEAM-1159
> Project: Beam
>  Issue Type: Improvement
>Reporter: Eugene Kirpichov
>
> This is logically part of the fn itself, rather than its enclosing transform.
> Example where this would have been important: 
> signature.processElement().observesWindow() should return true for a DoFn 
> that has any side inputs, since side inputs are windowed. See BEAM-1149.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (BEAM-1149) Side input access fails in direct runner (possibly others too) when input element in multiple windows

2016-12-18 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles resolved BEAM-1149.
---
Resolution: Fixed

> Side input access fails in direct runner (possibly others too) when input 
> element in multiple windows
> -
>
> Key: BEAM-1149
> URL: https://issues.apache.org/jira/browse/BEAM-1149
> Project: Beam
>  Issue Type: Bug
>Reporter: Eugene Kirpichov
>    Assignee: Kenneth Knowles
>Priority: Blocker
> Fix For: 0.4.0-incubating
>
>
> {code:java}
>   private static class FnWithSideInputs extends DoFn<String, String> {
> private final PCollectionView view;
> private FnWithSideInputs(PCollectionView view) {
>   this.view = view;
> }
> @ProcessElement
> public void processElement(ProcessContext c) {
>   c.output(c.element() + ":" + c.sideInput(view));
> }
>   }
>   @Test
>   public void testSideInputsWithMultipleWindows() {
> Pipeline p = TestPipeline.create();
> MutableDateTime mutableNow = Instant.now().toMutableDateTime();
> mutableNow.setMillisOfSecond(0);
> Instant now = mutableNow.toInstant();
> SlidingWindows windowFn =
> 
> SlidingWindows.of(Duration.standardSeconds(5)).every(Duration.standardSeconds(1));
> PCollectionView view = 
> p.apply(Create.of(1)).apply(View.asSingleton());
> PCollection res =
> p.apply(Create.timestamped(TimestampedValue.of("a", now)))
> .apply(Window.into(windowFn))
> .apply(ParDo.of(new FnWithSideInputs(view)).withSideInputs(view));
> PAssert.that(res).containsInAnyOrder("a:1");
> p.run();
>   }
> {code}
> This fails with the following exception:
> {code}
> org.apache.beam.sdk.Pipeline$PipelineExecutionException: 
> java.lang.IllegalStateException: sideInput called when main input element is 
> in multiple windows
>   at 
> org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:343)
>   at org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:1)
>   at org.apache.beam.sdk.Pipeline.run(Pipeline.java:176)
>   at org.apache.beam.sdk.testing.TestPipeline.run(TestPipeline.java:112)
>   at 
> Caused by: java.lang.IllegalStateException: sideInput called when main input 
> element is in multiple windows
>   at 
> org.apache.beam.runners.core.SimpleDoFnRunner$DoFnProcessContext.sideInput(SimpleDoFnRunner.java:514)
>   at 
> org.apache.beam.sdk.transforms.ParDoTest$FnWithSideInputs.processElement(ParDoTest.java:738)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-1164) Allow a DoFn to opt in to mutating it's input

2016-12-18 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15760074#comment-15760074
 ] 

Kenneth Knowles commented on BEAM-1164:
---

I think with the new {{DoFn}} there is a fairly elegant solution here.

Today we write:

{code}
new DoFn<Foo, Baz>() {
  @ProcessElement
  public void processElem(ProcessContext ctx) {
... ctx.element() ...
  }
}
{code}

We'd like to allow the user to request only the element, both for clarity and 
for potential optimization, as in

{code}
new DoFn<Foo, Baz>() {
  @ProcessElement
  public void processElem(Element elem) {
... elem.get() ...
  }
}
{code}

where {{Element}} is a distinguished inner class, to avoid repeating verbose 
input types.

>From here, it is a short step to saying that you want a mutable element:

{code}
new DoFn<Foo, Baz>() {
  @ProcessElement
  public void processElem(MutableElement elem) {
... elem.get().setBizzle(...) ...
  }
}
{code}

At the level of the "JSON" runner API, we will need to tag the user-defined 
function with the fact that it intends to mutate its input. The Java SDK will 
analyze the method signature, as usual, to discern this automatically.

A runner will then be free to decide between disabling optimizations or cloning 
elements when necessary.

> Allow a DoFn to opt in to mutating it's input
> -
>
> Key: BEAM-1164
> URL: https://issues.apache.org/jira/browse/BEAM-1164
> Project: Beam
>  Issue Type: Bug
>  Components: beam-model
>Reporter: Frances Perry
>Priority: Minor
>
> Runners generally can't tell if a DoFn is mutating inputs, but assuming so by 
> default leads to significant performance implications from unnecessary 
> copying (around sibling fusion, etc). So instead the model prevents mutating 
> inputs, and the Direct Runner validates this behavior. (See: 
> http://beam.incubator.apache.org/contribute/design-principles/#make-efficient-things-easy-rather-than-make-easy-things-efficient)
>  
> However, if users are processing a small number of large records by making 
> incremental changes (for example, genomics use cases), the cost of 
> immutability requirement can be very large. As a workaround, users sometimes 
> do suboptimal things (fusing ParDos by hand) or undefined things when they 
> expect the immutability requirement is unnecessarily strict (adding no-op 
> coders in places they hope the runner won't be materializing things, mutating 
> things anyway when they don't expect sibling fusion to happen, etc).
> We should consider adding a signal (MutatingDoFn?) that users explicitly opt 
> in to to say their code may mutate inputs. The runner can then use this 
> assumption to either prevent optimizations that would break in the face of 
> this or insert additional copies as needed to allow optimizations to preserve 
> semantics.
> See this related user@ discussion:
> https://lists.apache.org/thread.html/f39689f54147117f3fc54c498eff1a20fa73f1be5b5cad5b6f816fd3@%3Cuser.beam.apache.org%3E



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-651) Making TypedPValue.setTypeDescriptorInternal no longer Internal

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-651:
-
Summary: Making TypedPValue.setTypeDescriptorInternal no longer Internal  
(was: Consider making TypedPValue.setTypeDescriptorInternal no longer Internal)

> Making TypedPValue.setTypeDescriptorInternal no longer Internal
> ---
>
> Key: BEAM-651
> URL: https://issues.apache.org/jira/browse/BEAM-651
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>        Reporter: Kenneth Knowles
>Assignee: Neelesh Srinivas Salian
>Priority: Minor
>  Labels: easy, easyfix, starter
> Fix For: 0.4.0-incubating
>
>
> This would give fairly pithy answers to StackOverflow questions sometimes.
> When choosing between .getOutputCoder() and .getOutputTypeDescriptor() for a 
> transform/DoFn we often choose the type, so the coder registry can do its 
> thing.
> This would also give a similar choice between .setCoder(...) and 
> .setTypeDescriptor(...).
> And anyhow we have the intention of removing our practice of the "*Internal" 
> suffix, so this one might be most easily solved by making it public.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-651) Add public TypedPValue.setTypeDescriptor

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-651:
-
Summary: Add public TypedPValue.setTypeDescriptor  (was: Making 
TypedPValue.setTypeDescriptorInternal no longer Internal)

> Add public TypedPValue.setTypeDescriptor
> 
>
> Key: BEAM-651
> URL: https://issues.apache.org/jira/browse/BEAM-651
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>        Reporter: Kenneth Knowles
>Assignee: Neelesh Srinivas Salian
>Priority: Minor
>  Labels: easy, easyfix, starter
> Fix For: 0.4.0-incubating
>
>
> This would give fairly pithy answers to StackOverflow questions sometimes.
> When choosing between .getOutputCoder() and .getOutputTypeDescriptor() for a 
> transform/DoFn we often choose the type, so the coder registry can do its 
> thing.
> This would also give a similar choice between .setCoder(...) and 
> .setTypeDescriptor(...).
> And anyhow we have the intention of removing our practice of the "*Internal" 
> suffix, so this one might be most easily solved by making it public.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-35) Remove timer multiplexing and give timers identifiers

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-35?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-35:

Issue Type: Improvement  (was: New Feature)

> Remove timer multiplexing and give timers identifiers
> -
>
> Key: BEAM-35
> URL: https://issues.apache.org/jira/browse/BEAM-35
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-core
>        Reporter: Kenneth Knowles
>    Assignee: Kenneth Knowles
> Fix For: 0.4.0-incubating
>
>
> Today, timers set by the runner core of the SDK are identified by timestamp, 
> so multiple timers for the same time result in only one backend timer. This 
> is runner-specific and obsolete, and makes it unsafe to support timer 
> deletion because backend timers have no real owner. User-facing timers will 
> require explicit identifiers anyhow.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-864) RAT check fails on DEPENDENCIES

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-864:
-
Summary: RAT check fails on DEPENDENCIES  (was: Update to latest Apache 
Maven-Parent)

> RAT check fails on DEPENDENCIES
> ---
>
> Key: BEAM-864
> URL: https://issues.apache.org/jira/browse/BEAM-864
> Project: Beam
>  Issue Type: Bug
>  Components: build-system
>Reporter: Aljoscha Krettek
>Assignee: Jean-Baptiste Onofré
> Fix For: 0.4.0-incubating
>
>
> The release plugin creates a DEPENDENCIES file that is not properly excluded 
> from the rat check with the current version of the Apache Maven-Parent that 
> we are using.
> This is the relevant RAT Jira issue: 
> https://issues.apache.org/jira/browse/RAT-184



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-808) Increase "spark.port.maxRetries" to avoid BindException in ROS.

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-808:
-
Issue Type: Bug  (was: Improvement)

> Increase "spark.port.maxRetries" to avoid BindException in ROS.
> ---
>
> Key: BEAM-808
> URL: https://issues.apache.org/jira/browse/BEAM-808
> Project: Beam
>  Issue Type: Bug
>  Components: runner-spark
>Reporter: Amit Sela
>Assignee: Amit Sela
> Fix For: 0.4.0-incubating
>
>
> The default is 16, and it's hard to know which port failed to bind since 
> there's no logging.
> Let's start with 64.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-830) Launcher for ApexRunner execution on YARN cluster

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-830:
-
Issue Type: New Feature  (was: Improvement)

> Launcher for ApexRunner execution on YARN cluster 
> --
>
> Key: BEAM-830
> URL: https://issues.apache.org/jira/browse/BEAM-830
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-apex
>Reporter: Thomas Weise
>Assignee: Thomas Weise
> Fix For: 0.4.0-incubating
>
>
> Currently the ApexRunner only support execution in embedded mode. Add the 
> support to package the dependencies and run the Apex app on a YARN cluster.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1060) Make DoFnTester use new DoFn

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1060:
--
Issue Type: New Feature  (was: Improvement)

> Make DoFnTester use new DoFn
> 
>
> Key: BEAM-1060
> URL: https://issues.apache.org/jira/browse/BEAM-1060
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>Reporter: Eugene Kirpichov
>Assignee: Eugene Kirpichov
> Fix For: 0.4.0-incubating
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-252) Make Regex Transform

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-252:
-
Issue Type: New Feature  (was: Improvement)

> Make Regex Transform
> 
>
> Key: BEAM-252
> URL: https://issues.apache.org/jira/browse/BEAM-252
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>Reporter: Jesse Anderson
>Assignee: Jesse Anderson
> Fix For: 0.4.0-incubating
>
>
> There needs to be an easier way to run Regular Expressions as part of a 
> transform. This will make string-based ETL much easier.
> The transform should support using the matches and find methods. The 
> transform should allow you to choose a group in the regex to output. The 
> transform should allow single strings to be output or KV's of strings.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1060) Make DoFnTester use new DoFn

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1060:
--
Issue Type: Improvement  (was: New Feature)

> Make DoFnTester use new DoFn
> 
>
> Key: BEAM-1060
> URL: https://issues.apache.org/jira/browse/BEAM-1060
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Eugene Kirpichov
>Assignee: Eugene Kirpichov
> Fix For: 0.4.0-incubating
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-939) New credentials code broke Dataflow runner

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-939:
-
Issue Type: Bug  (was: New Feature)

> New credentials code broke Dataflow runner
> --
>
> Key: BEAM-939
> URL: https://issues.apache.org/jira/browse/BEAM-939
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-gcp
>Affects Versions: Not applicable
>Reporter: Daniel Halperin
>Assignee: Luke Cwik
>Priority: Minor
> Fix For: 0.4.0-incubating
>
>
> https://builds.apache.org/view/Beam/job/beam_PostCommit_MavenVerify/1753/
> {code}
> java.lang.NoSuchMethodError: 
> com.google.auth.oauth2.GoogleCredentials.getApplicationDefault(Lcom/google/api/client/http/HttpTransport;)Lcom/google/auth/oauth2/GoogleCredentials;
>   at 
> com.google.cloud.bigtable.config.CredentialFactory.getApplicationDefaultCredential(CredentialFactory.java:207)
>   at 
> com.google.cloud.bigtable.config.CredentialFactory.getCredentials(CredentialFactory.java:112)
>   at 
> com.google.cloud.bigtable.grpc.io.CredentialInterceptorCache.getCredentialsInterceptor(CredentialInterceptorCache.java:94)
>   at 
> com.google.cloud.bigtable.grpc.BigtableSession.(BigtableSession.java:272)
>   at 
> org.apache.beam.sdk.io.gcp.bigtable.BigtableServiceImpl.tableExists(BigtableServiceImpl.java:81)
>   at 
> org.apache.beam.sdk.io.gcp.bigtable.BigtableIO$Read.validate(BigtableIO.java:296)
>   at 
> org.apache.beam.sdk.io.gcp.bigtable.BigtableIO$Read.validate(BigtableIO.java:185)
>   at org.apache.beam.sdk.Pipeline.applyInternal(Pipeline.java:399)
>   at org.apache.beam.sdk.Pipeline.applyTransform(Pipeline.java:307)
>   at org.apache.beam.sdk.values.PBegin.apply(PBegin.java:47)
>   at org.apache.beam.sdk.Pipeline.apply(Pipeline.java:158)
>   at 
> org.apache.beam.sdk.io.gcp.bigtable.BigtableReadIT.testE2EBigtableRead(BigtableReadIT.java:53)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
>   at 
> org.apache.maven.surefire.junitcore.pc.Scheduler$1.run(Scheduler.java:393)
>   at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>   at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>   at java.lang.Thread.run(Thread.java:745)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-651) Consider making TypedPValue.setTypeDescriptorInternal no longer Internal

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-651:
-
Issue Type: New Feature  (was: Wish)

> Consider making TypedPValue.setTypeDescriptorInternal no longer Internal
> 
>
> Key: BEAM-651
> URL: https://issues.apache.org/jira/browse/BEAM-651
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>        Reporter: Kenneth Knowles
>Assignee: Neelesh Srinivas Salian
>Priority: Minor
>  Labels: easy, easyfix, starter
> Fix For: 0.4.0-incubating
>
>
> This would give fairly pithy answers to StackOverflow questions sometimes.
> When choosing between .getOutputCoder() and .getOutputTypeDescriptor() for a 
> transform/DoFn we often choose the type, so the coder registry can do its 
> thing.
> This would also give a similar choice between .setCoder(...) and 
> .setTypeDescriptor(...).
> And anyhow we have the intention of removing our practice of the "*Internal" 
> suffix, so this one might be most easily solved by making it public.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-947) WindowedWordCountIT fails due to not setting "--output" pipeline options

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-947:
-
Issue Type: Bug  (was: New Feature)

> WindowedWordCountIT fails due to not setting "--output" pipeline options
> 
>
> Key: BEAM-947
> URL: https://issues.apache.org/jira/browse/BEAM-947
> Project: Beam
>  Issue Type: Bug
>  Components: examples-java
>Reporter: Kenneth Knowles
>Assignee: Kenneth Knowles
>Priority: Blocker
> Fix For: 0.4.0-incubating
>
>
> Example failure: 
> https://builds.apache.org/job/beam_PreCommit_MavenVerify/4793/org.apache.beam$beam-examples-java/testReport/junit/org.apache.beam.examples/WindowedWordCountIT/testWindowedWordCountInStreaming/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-261) Apache Apex Runner

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-261:
-
Issue Type: New Feature  (was: Wish)

> Apache Apex Runner
> --
>
> Key: BEAM-261
> URL: https://issues.apache.org/jira/browse/BEAM-261
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-apex
>Reporter: Suminda Dharmasena
>Assignee: Thomas Weise
> Fix For: 0.4.0-incubating
>
>
> Like Spark, Flink and GearPump, Apache Apex also does have advantages. Is it 
> possible to have a runner for Apache Apex?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1176) Make our test suites use @Rule TestPipeline

2016-12-17 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1176:
-

 Summary: Make our test suites use @Rule TestPipeline
 Key: BEAM-1176
 URL: https://issues.apache.org/jira/browse/BEAM-1176
 Project: Beam
  Issue Type: Improvement
  Components: sdk-java-core
Reporter: Kenneth Knowles
Assignee: Stas Levin
Priority: Minor


Now that [~staslev] has made {{TestPipeline}} a JUnit rule that performs useful 
sanity checks, we should port all of our tests to it so that they set a good 
example for users. Maybe we'll even catch some straggling tests with errors :-)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (BEAM-85) PAssert needs sanity check that it's used correctly

2016-12-17 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-85?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles resolved BEAM-85.
-
   Resolution: Fixed
Fix Version/s: 0.5.0-incubating

When used as a `@Rule` this is now the case. Now we need to port the tests, 
which I'd say fits more into BEAM0-298 or potentially a new ticket.

> PAssert needs sanity check that it's used correctly
> ---
>
> Key: BEAM-85
> URL: https://issues.apache.org/jira/browse/BEAM-85
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>Reporter: Daniel Halperin
>Assignee: Stas Levin
> Fix For: 0.5.0-incubating
>
>
> We should validate two things:
> # DataflowAssert is not added to a pipeline that has already been run.
> # The pipeline is run after the DataflowAssert is added.
> If either of these are not validated, then it is possible that the test 
> doesn't actually verify anything.
> This code should throw an assertion error or fail in some other way.
> {code}
> Pipeline p = TestPipeline.create();
> PCollection value = p.apply(Create.of(Boolean.FALSE));
> p.run();
> DataflowAssert.thatSingleton(value).isEqualTo(true);
> {code}
> but it would pass silently.
> similarly, this code wills pass silently:
> {code}
> Pipeline p = TestPipeline.create();
> PCollection value = p.apply(Create.of(Boolean.FALSE));
> DataflowAssert.thatSingleton(value).isEqualTo(true);
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1175) Separate runners-core Javadoc from SDK Javadoc, and perhaps other artifacts

2016-12-16 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1175:
-

 Summary: Separate runners-core Javadoc from SDK Javadoc, and 
perhaps other artifacts
 Key: BEAM-1175
 URL: https://issues.apache.org/jira/browse/BEAM-1175
 Project: Beam
  Issue Type: Bug
  Components: website
Reporter: Kenneth Knowles
Assignee: James Malone


Currently, the runners-core Javadoc is included with the SDK's Javadoc on the 
website.

Pipeline authors should only be viewing the SDK's Javadoc, and likely the IOs.

Generally, while I do think we can't really separate every IO Javadoc, it is 
extremely confusing to have Javadoc for separate POM dependencies included 
together, since the classes listed will not actually be available to someone 
who just depends on the SDK.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1172) Precommit integration tests should include some merging windows

2016-12-16 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1172:
-

 Summary: Precommit integration tests should include some merging 
windows
 Key: BEAM-1172
 URL: https://issues.apache.org/jira/browse/BEAM-1172
 Project: Beam
  Issue Type: Bug
  Components: testing
Reporter: Kenneth Knowles
Assignee: Kenneth Knowles






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1139) Investigate failures in precommit - Apex & Kryo

2016-12-16 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1139:
--
Assignee: (was: Kenneth Knowles)

> Investigate failures in precommit - Apex & Kryo
> ---
>
> Key: BEAM-1139
> URL: https://issues.apache.org/jira/browse/BEAM-1139
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-apex
>    Reporter: Kenneth Knowles
>Priority: Minor
>
> https://builds.apache.org/view/Beam/job/beam_PreCommit_Java_MavenInstall/org.apache.beam$beam-examples-java/5775/testReport/junit/org.apache.beam.examples/WordCountIT/testE2EWordCount/
> This is not necessarily a bug in the Apex runner, but it looks like this 
> class cannot be serialized via Kryo while the Apex runner needs it to be. 
> Probably the fix is to roll-forwards a simple change to make it Kryo 
> serializable.
> It is not clear to me the difference between this test run and others. 
> Clearly there is a coverage gap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (BEAM-1148) Port PAssert away from Aggregators

2016-12-14 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15749192#comment-15749192
 ] 

Kenneth Knowles edited comment on BEAM-1148 at 12/14/16 7:17 PM:
-

The trouble with side output + combine is that we don't have access to the 
result via PipelineResult (aka handle to the job). So I think there's actual 
design work here.


was (Author: kenn):
The trouble with side output + combine is that we don't have access to the 
result via PipelineResult. So I think there's actual design work here.

> Port PAssert away from Aggregators
> --
>
> Key: BEAM-1148
> URL: https://issues.apache.org/jira/browse/BEAM-1148
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>        Reporter: Kenneth Knowles
>
> One step in the removal of Aggregators (in favor of Metrics) is to remove our 
> reliance on them for PAssert checking.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-1148) Port PAssert away from Aggregators

2016-12-14 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15749192#comment-15749192
 ] 

Kenneth Knowles commented on BEAM-1148:
---

The trouble with side output + combine is that we don't have access to the 
result via PipelineResult. So I think there's actual design work here.

> Port PAssert away from Aggregators
> --
>
> Key: BEAM-1148
> URL: https://issues.apache.org/jira/browse/BEAM-1148
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>        Reporter: Kenneth Knowles
>
> One step in the removal of Aggregators (in favor of Metrics) is to remove our 
> reliance on them for PAssert checking.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Reopened] (BEAM-1149) Side input access fails in direct runner (possibly others too) when input element in multiple windows

2016-12-14 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles reopened BEAM-1149:
---
  Assignee: Kenneth Knowles  (was: Eugene Kirpichov)

Still failing Spark postcommit 
https://builds.apache.org/job/beam_PostCommit_Java_RunnableOnService_Spark/409/org.apache.beam$beam-runners-spark/testReport/org.apache.beam.sdk.transforms/ParDoTest/testSideInputsWithMultipleWindows/


> Side input access fails in direct runner (possibly others too) when input 
> element in multiple windows
> -
>
> Key: BEAM-1149
> URL: https://issues.apache.org/jira/browse/BEAM-1149
> Project: Beam
>  Issue Type: Bug
>Reporter: Eugene Kirpichov
>    Assignee: Kenneth Knowles
>Priority: Blocker
> Fix For: 0.4.0-incubating
>
>
> {code:java}
>   private static class FnWithSideInputs extends DoFn<String, String> {
> private final PCollectionView view;
> private FnWithSideInputs(PCollectionView view) {
>   this.view = view;
> }
> @ProcessElement
> public void processElement(ProcessContext c) {
>   c.output(c.element() + ":" + c.sideInput(view));
> }
>   }
>   @Test
>   public void testSideInputsWithMultipleWindows() {
> Pipeline p = TestPipeline.create();
> MutableDateTime mutableNow = Instant.now().toMutableDateTime();
> mutableNow.setMillisOfSecond(0);
> Instant now = mutableNow.toInstant();
> SlidingWindows windowFn =
> 
> SlidingWindows.of(Duration.standardSeconds(5)).every(Duration.standardSeconds(1));
> PCollectionView view = 
> p.apply(Create.of(1)).apply(View.asSingleton());
> PCollection res =
> p.apply(Create.timestamped(TimestampedValue.of("a", now)))
> .apply(Window.into(windowFn))
> .apply(ParDo.of(new FnWithSideInputs(view)).withSideInputs(view));
> PAssert.that(res).containsInAnyOrder("a:1");
> p.run();
>   }
> {code}
> This fails with the following exception:
> {code}
> org.apache.beam.sdk.Pipeline$PipelineExecutionException: 
> java.lang.IllegalStateException: sideInput called when main input element is 
> in multiple windows
>   at 
> org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:343)
>   at org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:1)
>   at org.apache.beam.sdk.Pipeline.run(Pipeline.java:176)
>   at org.apache.beam.sdk.testing.TestPipeline.run(TestPipeline.java:112)
>   at 
> Caused by: java.lang.IllegalStateException: sideInput called when main input 
> element is in multiple windows
>   at 
> org.apache.beam.runners.core.SimpleDoFnRunner$DoFnProcessContext.sideInput(SimpleDoFnRunner.java:514)
>   at 
> org.apache.beam.sdk.transforms.ParDoTest$FnWithSideInputs.processElement(ParDoTest.java:738)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1149) Side input access fails in direct runner (possibly others too) when input element in multiple windows

2016-12-13 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1149:
--
Assignee: Eugene Kirpichov

> Side input access fails in direct runner (possibly others too) when input 
> element in multiple windows
> -
>
> Key: BEAM-1149
> URL: https://issues.apache.org/jira/browse/BEAM-1149
> Project: Beam
>  Issue Type: Bug
>Reporter: Eugene Kirpichov
>Assignee: Eugene Kirpichov
>Priority: Blocker
>
> {code:java}
>   private static class FnWithSideInputs extends DoFn<String, String> {
> private final PCollectionView view;
> private FnWithSideInputs(PCollectionView view) {
>   this.view = view;
> }
> @ProcessElement
> public void processElement(ProcessContext c) {
>   c.output(c.element() + ":" + c.sideInput(view));
> }
>   }
>   @Test
>   public void testSideInputsWithMultipleWindows() {
> Pipeline p = TestPipeline.create();
> MutableDateTime mutableNow = Instant.now().toMutableDateTime();
> mutableNow.setMillisOfSecond(0);
> Instant now = mutableNow.toInstant();
> SlidingWindows windowFn =
> 
> SlidingWindows.of(Duration.standardSeconds(5)).every(Duration.standardSeconds(1));
> PCollectionView view = 
> p.apply(Create.of(1)).apply(View.asSingleton());
> PCollection res =
> p.apply(Create.timestamped(TimestampedValue.of("a", now)))
> .apply(Window.into(windowFn))
> .apply(ParDo.of(new FnWithSideInputs(view)).withSideInputs(view));
> PAssert.that(res).containsInAnyOrder("a:1");
> p.run();
>   }
> {code}
> This fails with the following exception:
> {code}
> org.apache.beam.sdk.Pipeline$PipelineExecutionException: 
> java.lang.IllegalStateException: sideInput called when main input element is 
> in multiple windows
>   at 
> org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:343)
>   at org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:1)
>   at org.apache.beam.sdk.Pipeline.run(Pipeline.java:176)
>   at org.apache.beam.sdk.testing.TestPipeline.run(TestPipeline.java:112)
>   at 
> Caused by: java.lang.IllegalStateException: sideInput called when main input 
> element is in multiple windows
>   at 
> org.apache.beam.runners.core.SimpleDoFnRunner$DoFnProcessContext.sideInput(SimpleDoFnRunner.java:514)
>   at 
> org.apache.beam.sdk.transforms.ParDoTest$FnWithSideInputs.processElement(ParDoTest.java:738)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Closed] (BEAM-1150) Side inputs broken with sliding windows (or any multi-window WindowFn)

2016-12-13 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles closed BEAM-1150.
-

> Side inputs broken with sliding windows (or any multi-window WindowFn)
> --
>
> Key: BEAM-1150
> URL: https://issues.apache.org/jira/browse/BEAM-1150
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>        Reporter: Kenneth Knowles
>Assignee: Eugene Kirpichov
>Priority: Blocker
> Fix For: Not applicable
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (BEAM-1150) Side inputs broken with sliding windows (or any multi-window WindowFn)

2016-12-13 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles resolved BEAM-1150.
---
   Resolution: Duplicate
Fix Version/s: Not applicable

> Side inputs broken with sliding windows (or any multi-window WindowFn)
> --
>
> Key: BEAM-1150
> URL: https://issues.apache.org/jira/browse/BEAM-1150
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>        Reporter: Kenneth Knowles
>Assignee: Eugene Kirpichov
>Priority: Blocker
> Fix For: Not applicable
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1150) Side inputs broken with sliding windows (or any multi-window WindowFn)

2016-12-13 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1150:
-

 Summary: Side inputs broken with sliding windows (or any 
multi-window WindowFn)
 Key: BEAM-1150
 URL: https://issues.apache.org/jira/browse/BEAM-1150
 Project: Beam
  Issue Type: Bug
  Components: sdk-java-core
Reporter: Kenneth Knowles
Assignee: Eugene Kirpichov
Priority: Blocker






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1148) Port PAssert away from Aggregators

2016-12-13 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1148:
-

 Summary: Port PAssert away from Aggregators
 Key: BEAM-1148
 URL: https://issues.apache.org/jira/browse/BEAM-1148
 Project: Beam
  Issue Type: New Feature
  Components: sdk-java-core
Reporter: Kenneth Knowles


One step in the removal of Aggregators (in favor of Metrics) is to remove our 
reliance on them for PAssert checking.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (BEAM-967) Add Jenkins postcommit for Apex runner RunnableOnService tests

2016-12-13 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles reassigned BEAM-967:


Assignee: Kenneth Knowles  (was: Jason Kuster)

> Add Jenkins postcommit for Apex runner RunnableOnService tests
> --
>
> Key: BEAM-967
> URL: https://issues.apache.org/jira/browse/BEAM-967
> Project: Beam
>  Issue Type: Bug
>  Components: runner-apex, testing
>        Reporter: Kenneth Knowles
>    Assignee: Kenneth Knowles
>
> During development of the Apex runner, all RunnableOnService tests were run 
> in precommit. Now that it has been merged, they are not. It needs the usual 
> treatment that we give runners.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-967) Add Jenkins postcommit for Apex runner RunnableOnService tests

2016-12-13 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15746035#comment-15746035
 ] 

Kenneth Knowles commented on BEAM-967:
--

Since we now have access to Jenkins config via the DSL, I'll give it a try.

> Add Jenkins postcommit for Apex runner RunnableOnService tests
> --
>
> Key: BEAM-967
> URL: https://issues.apache.org/jira/browse/BEAM-967
> Project: Beam
>  Issue Type: Bug
>  Components: runner-apex, testing
>        Reporter: Kenneth Knowles
>Assignee: Jason Kuster
>
> During development of the Apex runner, all RunnableOnService tests were run 
> in precommit. Now that it has been merged, they are not. It needs the usual 
> treatment that we give runners.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-1139) Investigate failures in precommit - Apex & Kryo

2016-12-13 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15745780#comment-15745780
 ] 

Kenneth Knowles commented on BEAM-1139:
---

This looks like it is actually a problem with our example pom and/or Jenkins 
invocation, where the Spark runner's deps override the Apex runner's deps. I'll 
take this and mess with the pom.

> Investigate failures in precommit - Apex & Kryo
> ---
>
> Key: BEAM-1139
> URL: https://issues.apache.org/jira/browse/BEAM-1139
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-apex
>    Reporter: Kenneth Knowles
>    Assignee: Kenneth Knowles
>Priority: Minor
>
> https://builds.apache.org/view/Beam/job/beam_PreCommit_Java_MavenInstall/org.apache.beam$beam-examples-java/5775/testReport/junit/org.apache.beam.examples/WordCountIT/testE2EWordCount/
> This is not necessarily a bug in the Apex runner, but it looks like this 
> class cannot be serialized via Kryo while the Apex runner needs it to be. 
> Probably the fix is to roll-forwards a simple change to make it Kryo 
> serializable.
> It is not clear to me the difference between this test run and others. 
> Clearly there is a coverage gap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (BEAM-1139) Investigate failures in precommit - Apex & Kryo

2016-12-13 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles reassigned BEAM-1139:
-

Assignee: Kenneth Knowles  (was: Thomas Weise)

> Investigate failures in precommit - Apex & Kryo
> ---
>
> Key: BEAM-1139
> URL: https://issues.apache.org/jira/browse/BEAM-1139
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-apex
>    Reporter: Kenneth Knowles
>    Assignee: Kenneth Knowles
>Priority: Minor
>
> https://builds.apache.org/view/Beam/job/beam_PreCommit_Java_MavenInstall/org.apache.beam$beam-examples-java/5775/testReport/junit/org.apache.beam.examples/WordCountIT/testE2EWordCount/
> This is not necessarily a bug in the Apex runner, but it looks like this 
> class cannot be serialized via Kryo while the Apex runner needs it to be. 
> Probably the fix is to roll-forwards a simple change to make it Kryo 
> serializable.
> It is not clear to me the difference between this test run and others. 
> Clearly there is a coverage gap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-1089) Jenkins comments on PRs are too many & too large

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15744080#comment-15744080
 ] 

Kenneth Knowles commented on BEAM-1089:
---

[~jasonkuster] I see what you mean [here in our 
configs|https://github.com/apache/incubator-beam/blob/master/.jenkins/common_job_properties.groovy#L124].
 Can the whole block be deleted or some such?

> Jenkins comments on PRs are too many & too large
> 
>
> Key: BEAM-1089
> URL: https://issues.apache.org/jira/browse/BEAM-1089
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>    Reporter: Kenneth Knowles
>Assignee: Jason Kuster
>
> Lately, I've been finding review comments somewhat drowned out by asfbot 
> copying build results onto a PR. It also generates a lot of needless email. I 
> have not yet tried to devise just the right filter, hoping we can just return 
> to the normal practice of leaving just a commit status.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-1089) Jenkins comments on PRs are too many & too large

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15744077#comment-15744077
 ] 

Kenneth Knowles commented on BEAM-1089:
---

[~neelesh77] that sounds really great as well!

> Jenkins comments on PRs are too many & too large
> 
>
> Key: BEAM-1089
> URL: https://issues.apache.org/jira/browse/BEAM-1089
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>    Reporter: Kenneth Knowles
>Assignee: Jason Kuster
>
> Lately, I've been finding review comments somewhat drowned out by asfbot 
> copying build results onto a PR. It also generates a lot of needless email. I 
> have not yet tried to devise just the right filter, hoping we can just return 
> to the normal practice of leaving just a commit status.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1143) Timestamps on Jenkins log lines

2016-12-12 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1143:
-

 Summary: Timestamps on Jenkins log lines
 Key: BEAM-1143
 URL: https://issues.apache.org/jira/browse/BEAM-1143
 Project: Beam
  Issue Type: Improvement
  Components: testing
Reporter: Kenneth Knowles
Assignee: Jason Kuster
Priority: Minor


I suspect this might be doable more universally in the groovy DSL scripts, but 
we would gain some value by a port of 
https://github.com/apache/incubator-beam/commit/7f82a573d00a5a30331b7bbb8757e55f4a2d93ae
 to the most appropriate analog for Jenkins. (in the worst case, just exactly 
porting the env var)

We are currently regularly bottlenecked on build duration/backlog and the time 
seems to exist outside of the durations accounted for by Maven's usual output.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (BEAM-1139) Investigate failures in precommit - Apex & Kryo

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15743997#comment-15743997
 ] 

Kenneth Knowles edited comment on BEAM-1139 at 12/13/16 3:18 AM:
-

Unfortunately not. Further bummer: the postcommit is not a superset of the 
precommit so it never broke. That would provide a much easier trail.


was (Author: kenn):
Unfortunately not. Further bummer: the postcommit does not mirror the precommit 
so it never broke. That would provide a much easier trail.

> Investigate failures in precommit - Apex & Kryo
> ---
>
> Key: BEAM-1139
> URL: https://issues.apache.org/jira/browse/BEAM-1139
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-apex
>    Reporter: Kenneth Knowles
>Assignee: Thomas Weise
>Priority: Minor
>
> https://builds.apache.org/view/Beam/job/beam_PreCommit_Java_MavenInstall/org.apache.beam$beam-examples-java/5775/testReport/junit/org.apache.beam.examples/WordCountIT/testE2EWordCount/
> This is not necessarily a bug in the Apex runner, but it looks like this 
> class cannot be serialized via Kryo while the Apex runner needs it to be. 
> Probably the fix is to roll-forwards a simple change to make it Kryo 
> serializable.
> It is not clear to me the difference between this test run and others. 
> Clearly there is a coverage gap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1139) Investigate failures in precommit - Apex & Kryo

2016-12-12 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1139:
--
Summary: Investigate failures in precommit - Apex & Kryo  (was: Failures in 
precommit - Apex & Kryo)

> Investigate failures in precommit - Apex & Kryo
> ---
>
> Key: BEAM-1139
> URL: https://issues.apache.org/jira/browse/BEAM-1139
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-apex
>Reporter: Kenneth Knowles
>Assignee: Thomas Weise
>Priority: Minor
>
> https://builds.apache.org/view/Beam/job/beam_PreCommit_Java_MavenInstall/org.apache.beam$beam-examples-java/5775/testReport/junit/org.apache.beam.examples/WordCountIT/testE2EWordCount/
> This is not necessarily a bug in the Apex runner, but it looks like this 
> class cannot be serialized via Kryo while the Apex runner needs it to be. 
> Probably the fix is to roll-forwards a simple change to make it Kryo 
> serializable.
> It is not clear to me the difference between this test run and others. 
> Clearly there is a coverage gap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1139) Failures in precommit - Apex & Kryo

2016-12-12 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1139:
--
Priority: Minor  (was: Blocker)

> Failures in precommit - Apex & Kryo
> ---
>
> Key: BEAM-1139
> URL: https://issues.apache.org/jira/browse/BEAM-1139
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-apex
>    Reporter: Kenneth Knowles
>Assignee: Thomas Weise
>Priority: Minor
>
> https://builds.apache.org/view/Beam/job/beam_PreCommit_Java_MavenInstall/org.apache.beam$beam-examples-java/5775/testReport/junit/org.apache.beam.examples/WordCountIT/testE2EWordCount/
> This is not necessarily a bug in the Apex runner, but it looks like this 
> class cannot be serialized via Kryo while the Apex runner needs it to be. 
> Probably the fix is to roll-forwards a simple change to make it Kryo 
> serializable.
> It is not clear to me the difference between this test run and others. 
> Clearly there is a coverage gap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-1139) Failures in precommit - Apex & Kryo

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15743997#comment-15743997
 ] 

Kenneth Knowles commented on BEAM-1139:
---

Unfortunately not. Further bummer: the postcommit does not mirror the precommit 
so it never broke. That would provide a much easier trail.

> Failures in precommit - Apex & Kryo
> ---
>
> Key: BEAM-1139
> URL: https://issues.apache.org/jira/browse/BEAM-1139
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-apex
>    Reporter: Kenneth Knowles
>Assignee: Thomas Weise
>Priority: Blocker
>
> https://builds.apache.org/view/Beam/job/beam_PreCommit_Java_MavenInstall/org.apache.beam$beam-examples-java/5775/testReport/junit/org.apache.beam.examples/WordCountIT/testE2EWordCount/
> This is not necessarily a bug in the Apex runner, but it looks like this 
> class cannot be serialized via Kryo while the Apex runner needs it to be. 
> Probably the fix is to roll-forwards a simple change to make it Kryo 
> serializable.
> It is not clear to me the difference between this test run and others. 
> Clearly there is a coverage gap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-967) Add Jenkins postcommit for Apex runner RunnableOnService tests

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15743877#comment-15743877
 ] 

Kenneth Knowles commented on BEAM-967:
--

This would likely have caught BEAM-1139 sooner. Any progress?

> Add Jenkins postcommit for Apex runner RunnableOnService tests
> --
>
> Key: BEAM-967
> URL: https://issues.apache.org/jira/browse/BEAM-967
> Project: Beam
>  Issue Type: Bug
>  Components: runner-apex, testing
>        Reporter: Kenneth Knowles
>Assignee: Jason Kuster
>
> During development of the Apex runner, all RunnableOnService tests were run 
> in precommit. Now that it has been merged, they are not. It needs the usual 
> treatment that we give runners.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-843) Use New DoFn Directly in Flink Runner

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15743596#comment-15743596
 ] 

Kenneth Knowles commented on BEAM-843:
--

Removed the version tag because I think it was just added when it was filed, 
not deliberately to block release. Aljoscha, feel free to indicate that you 
really mean to ship this now. We don't mean to remove any intentional tag.

> Use New DoFn Directly in Flink Runner
> -
>
> Key: BEAM-843
> URL: https://issues.apache.org/jira/browse/BEAM-843
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-flink
>Reporter: Aljoscha Krettek
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-843) Use New DoFn Directly in Flink Runner

2016-12-12 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-843:
-
Fix Version/s: (was: 0.4.0-incubating)

> Use New DoFn Directly in Flink Runner
> -
>
> Key: BEAM-843
> URL: https://issues.apache.org/jira/browse/BEAM-843
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-flink
>Reporter: Aljoscha Krettek
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-843) Use New DoFn Directly in Flink Runner

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15743588#comment-15743588
 ] 

Kenneth Knowles commented on BEAM-843:
--

I think the only new capabilities it would allow are not yet implemented anyhow.

> Use New DoFn Directly in Flink Runner
> -
>
> Key: BEAM-843
> URL: https://issues.apache.org/jira/browse/BEAM-843
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-flink
>Reporter: Aljoscha Krettek
> Fix For: 0.4.0-incubating
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1139) Failures in precommit - Apex & Kryo

2016-12-12 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1139:
--
Assignee: Thomas Weise  (was: Kenneth Knowles)

> Failures in precommit - Apex & Kryo
> ---
>
> Key: BEAM-1139
> URL: https://issues.apache.org/jira/browse/BEAM-1139
> Project: Beam
>  Issue Type: Improvement
>  Components: runner-apex
>    Reporter: Kenneth Knowles
>Assignee: Thomas Weise
>Priority: Blocker
>
> https://builds.apache.org/view/Beam/job/beam_PreCommit_Java_MavenInstall/org.apache.beam$beam-examples-java/5775/testReport/junit/org.apache.beam.examples/WordCountIT/testE2EWordCount/
> This is not necessarily a bug in the Apex runner, but it looks like this 
> class cannot be serialized via Kryo while the Apex runner needs it to be. 
> Probably the fix is to roll-forwards a simple change to make it Kryo 
> serializable.
> It is not clear to me the difference between this test run and others. 
> Clearly there is a coverage gap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1139) Failures in precommit - Apex & Kryo

2016-12-12 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1139:
-

 Summary: Failures in precommit - Apex & Kryo
 Key: BEAM-1139
 URL: https://issues.apache.org/jira/browse/BEAM-1139
 Project: Beam
  Issue Type: Improvement
  Components: runner-apex
Reporter: Kenneth Knowles
Assignee: Thomas Weise
Priority: Blocker


https://builds.apache.org/view/Beam/job/beam_PreCommit_Java_MavenInstall/org.apache.beam$beam-examples-java/5775/testReport/junit/org.apache.beam.examples/WordCountIT/testE2EWordCount/

This is not necessarily a bug in the Apex runner, but it looks like this class 
cannot be serialized via Kryo while the Apex runner needs it to be. Probably 
the fix is to roll-forwards a simple change to make it Kryo serializable.

It is not clear to me the difference between this test run and others. Clearly 
there is a coverage gap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1138) Consider merging KeyedPValueTrackingVisitor with DirectGraphVisitor

2016-12-12 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1138:
-

 Summary: Consider merging KeyedPValueTrackingVisitor with 
DirectGraphVisitor
 Key: BEAM-1138
 URL: https://issues.apache.org/jira/browse/BEAM-1138
 Project: Beam
  Issue Type: Improvement
  Components: runner-direct
Reporter: Kenneth Knowles
Assignee: Thomas Groh
Priority: Minor






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1033) BigQueryMatcher is flaky

2016-12-12 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1033:
--
Summary: BigQueryMatcher is flaky  (was: WindowedWordCountIT is flaky)

> BigQueryMatcher is flaky
> 
>
> Key: BEAM-1033
> URL: https://issues.apache.org/jira/browse/BEAM-1033
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>Reporter: Pei He
>Assignee: Mark Liu
>
> Jenkins link:
> https://builds.apache.org/job/beam_PreCommit_MavenVerify/5145/console
> Running org.apache.beam.examples.WindowedWordCountIT
> Tests run: 2, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 304.282 sec 
> <<< FAILURE! - in org.apache.beam.examples.WindowedWordCountIT
> testWindowedWordCountInBatch(org.apache.beam.examples.WindowedWordCountIT)  
> Time elapsed: 304.282 sec  <<< FAILURE!
> java.lang.AssertionError: 
> Expected: Expected checksum is (cd5b52939257e12428a9fa085c32a84dd209b180)
>  but: Invalid BigQuery response: 
> {"jobComplete":false,"jobReference":{"jobId":"job_0STNX_OD83tQOzo6MvmqXCrk61U","projectId":"apache-beam-testing"},"kind":"bigquery#queryResponse"}
>   at org.hamcrest.MatcherAssert.assertThat(MatcherAssert.java:20)
>   at org.hamcrest.MatcherAssert.assertThat(MatcherAssert.java:8)
>   at 
> org.apache.beam.runners.dataflow.testing.TestDataflowRunner.run(TestDataflowRunner.java:164)
>   at 
> org.apache.beam.runners.dataflow.testing.TestDataflowRunner.run(TestDataflowRunner.java:93)
>   at 
> org.apache.beam.runners.dataflow.testing.TestDataflowRunner.run(TestDataflowRunner.java:61)
>   at org.apache.beam.sdk.Pipeline.run(Pipeline.java:179)
>   at 
> org.apache.beam.examples.WindowedWordCount.main(WindowedWordCount.java:224)
>   at 
> org.apache.beam.examples.WindowedWordCountIT.testWindowedWordCountPipeline(WindowedWordCountIT.java:88)
>   at 
> org.apache.beam.examples.WindowedWordCountIT.testWindowedWordCountInBatch(WindowedWordCountIT.java:59)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
>   at 
> org.apache.maven.surefire.junitcore.pc.Scheduler$1.run(Scheduler.java:393)
>   at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>   at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>   at java.lang.Thread.run(Thread.java:745)
> Results :
> Failed tests: 
>   
> WindowedWordCountIT.testWindowedWordCountInBatch:59->testWindowedWordCountPipeline:88
>  
> Expected: Expected checksum is (cd5b52939257e12428a9fa085c32a84dd209b180)
>  but: Invalid BigQuery response: 
> {"jobComplete":false,"jobReference":{"jobId":"job_0STNX_OD83tQOzo6MvmqXCrk61U","projectId":"apache-beam-testing"},"kind":"bigquery#queryResponse"}
> Tests run: 3, Failures: 1, Errors: 0, Skipped: 0



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-1033) BigQueryMatcher is flaky

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15742850#comment-15742850
 ] 

Kenneth Knowles commented on BEAM-1033:
---

Changing the title because after BEAM-1135 WindowedWordCountIT does not use 
this matcher.

> BigQueryMatcher is flaky
> 
>
> Key: BEAM-1033
> URL: https://issues.apache.org/jira/browse/BEAM-1033
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>Reporter: Pei He
>Assignee: Mark Liu
>
> Jenkins link:
> https://builds.apache.org/job/beam_PreCommit_MavenVerify/5145/console
> Running org.apache.beam.examples.WindowedWordCountIT
> Tests run: 2, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 304.282 sec 
> <<< FAILURE! - in org.apache.beam.examples.WindowedWordCountIT
> testWindowedWordCountInBatch(org.apache.beam.examples.WindowedWordCountIT)  
> Time elapsed: 304.282 sec  <<< FAILURE!
> java.lang.AssertionError: 
> Expected: Expected checksum is (cd5b52939257e12428a9fa085c32a84dd209b180)
>  but: Invalid BigQuery response: 
> {"jobComplete":false,"jobReference":{"jobId":"job_0STNX_OD83tQOzo6MvmqXCrk61U","projectId":"apache-beam-testing"},"kind":"bigquery#queryResponse"}
>   at org.hamcrest.MatcherAssert.assertThat(MatcherAssert.java:20)
>   at org.hamcrest.MatcherAssert.assertThat(MatcherAssert.java:8)
>   at 
> org.apache.beam.runners.dataflow.testing.TestDataflowRunner.run(TestDataflowRunner.java:164)
>   at 
> org.apache.beam.runners.dataflow.testing.TestDataflowRunner.run(TestDataflowRunner.java:93)
>   at 
> org.apache.beam.runners.dataflow.testing.TestDataflowRunner.run(TestDataflowRunner.java:61)
>   at org.apache.beam.sdk.Pipeline.run(Pipeline.java:179)
>   at 
> org.apache.beam.examples.WindowedWordCount.main(WindowedWordCount.java:224)
>   at 
> org.apache.beam.examples.WindowedWordCountIT.testWindowedWordCountPipeline(WindowedWordCountIT.java:88)
>   at 
> org.apache.beam.examples.WindowedWordCountIT.testWindowedWordCountInBatch(WindowedWordCountIT.java:59)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
>   at 
> org.apache.maven.surefire.junitcore.pc.Scheduler$1.run(Scheduler.java:393)
>   at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>   at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>   at java.lang.Thread.run(Thread.java:745)
> Results :
> Failed tests: 
>   
> WindowedWordCountIT.testWindowedWordCountInBatch:59->testWindowedWordCountPipeline:88
>  
> Expected: Expected checksum is (cd5b52939257e12428a9fa085c32a84dd209b180)
>  but: Invalid BigQuery response: 
> {"jobComplete":false,"jobReference":{"jobId":"job_0STNX_OD83tQOzo6MvmqXCrk61U","projectId":"apache-beam-testing"},"kind":"bigquery#queryResponse"}
> Tests run: 3, Failures: 1, Errors: 0, Skipped: 0



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1135) Make WindowedWordCount example more independent of runner and execution style

2016-12-12 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1135:
-

 Summary: Make WindowedWordCount example more independent of runner 
and execution style
 Key: BEAM-1135
 URL: https://issues.apache.org/jira/browse/BEAM-1135
 Project: Beam
  Issue Type: Improvement
  Components: examples-java
Reporter: Kenneth Knowles
Assignee: Kenneth Knowles
 Fix For: 0.4.0-incubating


Right now, WindowedWordCount has not received the same nice revision as our 
other examples. The PR has been open for a bit I am breaking it out into its 
own ticket.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (BEAM-35) Remove timer multiplexing and give timers identifiers

2016-12-12 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-35?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles resolved BEAM-35.
-
   Resolution: Fixed
Fix Version/s: 0.4.0-incubating

> Remove timer multiplexing and give timers identifiers
> -
>
> Key: BEAM-35
> URL: https://issues.apache.org/jira/browse/BEAM-35
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-core
>        Reporter: Kenneth Knowles
>    Assignee: Kenneth Knowles
> Fix For: 0.4.0-incubating
>
>
> Today, timers set by the runner core of the SDK are identified by timestamp, 
> so multiple timers for the same time result in only one backend timer. This 
> is runner-specific and obsolete, and makes it unsafe to support timer 
> deletion because backend timers have no real owner. User-facing timers will 
> require explicit identifiers anyhow.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (BEAM-1121) Update documentation following rename of PTransform.apply

2016-12-12 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles resolved BEAM-1121.
---
   Resolution: Fixed
Fix Version/s: Not applicable

> Update documentation following rename of PTransform.apply
> -
>
> Key: BEAM-1121
> URL: https://issues.apache.org/jira/browse/BEAM-1121
> Project: Beam
>  Issue Type: Bug
>  Components: website
>Reporter: Daniel Halperin
>    Assignee: Kenneth Knowles
> Fix For: Not applicable
>
>
> Since PTransform#apply does not exist any more, significant website 
> documentation may be wrong.
> Fix version: 0.4.0-incubating really just means this needs to be done as part 
> of the 0.4.0-incubating release, since this change will make it into said 
> release.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (BEAM-1121) Update documentation following rename of PTransform.apply

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15742785#comment-15742785
 ] 

Kenneth Knowles edited comment on BEAM-1121 at 12/12/16 6:55 PM:
-

Aside from Javadoc, this was fixed three days ago, by 
https://github.com/apache/incubator-beam-site/pull/104, tagged with BEAM-438. 
If you find any omissions, let me know ASAP.


was (Author: kenn):
Aside from Javadoc, this was fixed three days ago it was filed, by 
https://github.com/apache/incubator-beam-site/pull/104, tagged with BEAM-438. 
If you find any omissions, let me know ASAP.

> Update documentation following rename of PTransform.apply
> -
>
> Key: BEAM-1121
> URL: https://issues.apache.org/jira/browse/BEAM-1121
> Project: Beam
>  Issue Type: Bug
>  Components: website
>Reporter: Daniel Halperin
>    Assignee: Kenneth Knowles
> Fix For: Not applicable
>
>
> Since PTransform#apply does not exist any more, significant website 
> documentation may be wrong.
> Fix version: 0.4.0-incubating really just means this needs to be done as part 
> of the 0.4.0-incubating release, since this change will make it into said 
> release.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (BEAM-1121) Update documentation following rename of PTransform.apply

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15742785#comment-15742785
 ] 

Kenneth Knowles edited comment on BEAM-1121 at 12/12/16 6:54 PM:
-

Aside from Javadoc, this was fixed three days ago it was filed, by 
https://github.com/apache/incubator-beam-site/pull/104, tagged with BEAM-438. 
If you find any omissions, let me know ASAP.


was (Author: kenn):
Aside from Javadoc, this was fixed three days before it was filed, by 
https://github.com/apache/incubator-beam-site/pull/104, tagged with BEAM-438. 
If you find any omissions, let me know ASAP.

> Update documentation following rename of PTransform.apply
> -
>
> Key: BEAM-1121
> URL: https://issues.apache.org/jira/browse/BEAM-1121
> Project: Beam
>  Issue Type: Bug
>  Components: website
>Reporter: Daniel Halperin
>    Assignee: Kenneth Knowles
>
> Since PTransform#apply does not exist any more, significant website 
> documentation may be wrong.
> Fix version: 0.4.0-incubating really just means this needs to be done as part 
> of the 0.4.0-incubating release, since this change will make it into said 
> release.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (BEAM-1121) Update documentation following rename of PTransform.apply

2016-12-12 Thread Kenneth Knowles (JIRA)

[ 
https://issues.apache.org/jira/browse/BEAM-1121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15742785#comment-15742785
 ] 

Kenneth Knowles commented on BEAM-1121:
---

Aside from Javadoc, this was fixed three days before it was filed, by 
https://github.com/apache/incubator-beam-site/pull/104, tagged with BEAM-438. 
If you find any omissions, let me know ASAP.

> Update documentation following rename of PTransform.apply
> -
>
> Key: BEAM-1121
> URL: https://issues.apache.org/jira/browse/BEAM-1121
> Project: Beam
>  Issue Type: Bug
>  Components: website
>Reporter: Daniel Halperin
>    Assignee: Kenneth Knowles
>
> Since PTransform#apply does not exist any more, significant website 
> documentation may be wrong.
> Fix version: 0.4.0-incubating really just means this needs to be done as part 
> of the 0.4.0-incubating release, since this change will make it into said 
> release.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1132) Jenkins JaCoCo plugin for Beam

2016-12-11 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1132:
-

 Summary: Jenkins JaCoCo plugin for Beam
 Key: BEAM-1132
 URL: https://issues.apache.org/jira/browse/BEAM-1132
 Project: Beam
  Issue Type: New Feature
  Components: testing
Reporter: Kenneth Knowles
Assignee: Davor Bonaci


Jenkins has a JaCoCo plugin and other Apache projects use it. We should try it 
too (and might as well disable coveralls, as I don't know anyone who looks at 
it).

If this takes more than 10 minutes to set up, then another option is to just 
archive JaCoCo's HTML reports so we can browse them.

Either of these should take just minutes and yield huge benefits in visibility 
of where we have really bad coverage.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1131) Consider japicmp for testing backwards-compatibility

2016-12-11 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1131:
-

 Summary: Consider japicmp for testing backwards-compatibility
 Key: BEAM-1131
 URL: https://issues.apache.org/jira/browse/BEAM-1131
 Project: Beam
  Issue Type: New Feature
  Components: testing
Reporter: Kenneth Knowles
Assignee: Jason Kuster


japicmp is what Flink uses to test backwards-compatibility, configuration here: 
https://github.com/apache/flink/blob/41d5875bfc272f2cd5c7e8c8523036684865c1ce/pom.xml#L1184

At our first stable release, we should activate this plugin too! (configuration 
to be determined by community discussion). We should have it installed and 
configured beforehand, informational-only, so that we have a sense of it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1115) Support for new Timer API in Spark runner

2016-12-08 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1115:
--
Component/s: (was: runner-apex)
 runner-spark

> Support for new Timer API in Spark runner
> -
>
> Key: BEAM-1115
> URL: https://issues.apache.org/jira/browse/BEAM-1115
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-spark
>        Reporter: Kenneth Knowles
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (BEAM-1117) Support for new Timer API in Direct runner

2016-12-08 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles reassigned BEAM-1117:
-

Assignee: Kenneth Knowles  (was: Thomas Groh)

> Support for new Timer API in Direct runner
> --
>
> Key: BEAM-1117
> URL: https://issues.apache.org/jira/browse/BEAM-1117
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-direct
>        Reporter: Kenneth Knowles
>    Assignee: Kenneth Knowles
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (BEAM-1116) Support for new Timer API in Flink runner

2016-12-08 Thread Kenneth Knowles (JIRA)

 [ 
https://issues.apache.org/jira/browse/BEAM-1116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1116:
--
Assignee: (was: Maximilian Michels)

> Support for new Timer API in Flink runner
> -
>
> Key: BEAM-1116
> URL: https://issues.apache.org/jira/browse/BEAM-1116
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-flink
>        Reporter: Kenneth Knowles
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1118) Support for new Timer API in Gearpump runner

2016-12-08 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1118:
-

 Summary: Support for new Timer API in Gearpump runner
 Key: BEAM-1118
 URL: https://issues.apache.org/jira/browse/BEAM-1118
 Project: Beam
  Issue Type: New Feature
  Components: runner-gearpump
Reporter: Kenneth Knowles






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1117) Support for new Timer API in Direct runner

2016-12-08 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1117:
-

 Summary: Support for new Timer API in Direct runner
 Key: BEAM-1117
 URL: https://issues.apache.org/jira/browse/BEAM-1117
 Project: Beam
  Issue Type: New Feature
  Components: runner-direct
Reporter: Kenneth Knowles
Assignee: Thomas Groh






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1116) Support for new Timer API in Flink runner

2016-12-08 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1116:
-

 Summary: Support for new Timer API in Flink runner
 Key: BEAM-1116
 URL: https://issues.apache.org/jira/browse/BEAM-1116
 Project: Beam
  Issue Type: New Feature
  Components: runner-flink
Reporter: Kenneth Knowles
Assignee: Maximilian Michels






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1114) Support for new Timer API in Apex runner

2016-12-08 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1114:
-

 Summary: Support for new Timer API in Apex runner
 Key: BEAM-1114
 URL: https://issues.apache.org/jira/browse/BEAM-1114
 Project: Beam
  Issue Type: New Feature
  Components: runner-apex
Reporter: Kenneth Knowles






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1115) Support for new Timer API in Spark runner

2016-12-08 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1115:
-

 Summary: Support for new Timer API in Spark runner
 Key: BEAM-1115
 URL: https://issues.apache.org/jira/browse/BEAM-1115
 Project: Beam
  Issue Type: New Feature
  Components: runner-apex
Reporter: Kenneth Knowles






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (BEAM-1113) Support for new Timer API in Dataflow runner

2016-12-08 Thread Kenneth Knowles (JIRA)
Kenneth Knowles created BEAM-1113:
-

 Summary: Support for new Timer API in Dataflow runner
 Key: BEAM-1113
 URL: https://issues.apache.org/jira/browse/BEAM-1113
 Project: Beam
  Issue Type: New Feature
  Components: runner-dataflow
Reporter: Kenneth Knowles
Assignee: Kenneth Knowles






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


  1   2   3   4   5   >