jkff commented on a change in pull request #4175: [BEAM-3247] fix Sample.any performance URL: https://github.com/apache/beam/pull/4175#discussion_r153282900
########## File path: sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/Sample.java ########## @@ -209,29 +202,67 @@ public void populateDisplayData(DisplayData.Builder builder) { } /** - * A {@link DoFn} that returns up to limit elements from the side input PCollection. + * A {@link DoFn} that outputs up to limit elements. */ - private static class SampleAnyDoFn<T> extends DoFn<Void, T> { - long limit; - final PCollectionView<Iterable<T>> iterableView; + private static class SampleAnyDoFn<T> extends DoFn<T, T> { Review comment: Is this DoFn really needed? I'm wondering if your implementation of the combiner is sufficient for performance. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services