robertwb commented on PR #30368: URL: https://github.com/apache/beam/pull/30368#issuecomment-1971629031
I'm all for trying to find better names for this. I thought about Partition, but Beam already has a Partition operation that's a bit different: it takes in a integer N and splits the PCollection into N numbered PCollections: https://beam.apache.org/documentation/programming-guide/#partition My other hesitation is that partition is used like shard (and roughly) parallelism of otherwise homogenous datasets in other systems like Kafka. But that's not a blocker. I really haven't settled on naming that I like here (and went back and forth when writing this). Maybe we should bikeshed on the dev list. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
