TanuSharma2511 commented on issue #34756: URL: https://github.com/apache/beam/issues/34756#issuecomment-2834160817
If I understand the code correctly, for the first suggestion i.e. a fixed high enough value, ~O(10K) [here](https://github.com/apache/beam/blob/930d14abc998d7f67b636259657377240b75929a/sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/Reshuffle.java#L198) we can assign default value of numBuckets 10k if not given. For python, default value is already set [here](https://github.com/apache/beam/blob/930d14abc998d7f67b636259657377240b75929a/sdks/python/apache_beam/transforms/util.py#L1112). For the 2nd suggestion, can we take some idea from [this](https://github.com/apache/beam/blob/930d14abc998d7f67b636259657377240b75929a/runners/google-cloud-dataflow-java/src/main/java/org/apache/beam/runners/dataflow/DataflowRunner.java#L2593) ? @arunpandianp , can I work on this ? Also, please let us know which suggestion should we focus ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@beam.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org