aglinxinyuan opened a new pull request, #4746: URL: https://github.com/apache/texera/pull/4746
### What changes were proposed in this PR? Add `PartitionersSpec` covering the four send-semantics partitioners that currently lack unit tests (only `RangeBasedShufflePartitioner` had existing coverage in `RangeBasedShuffleSpec`): - `OneToOnePartitioner` — `getBucketIndex` always emits `Iterator(0)`; `allReceivers` selects the channel whose `fromWorkerId` matches the actor id - `BroadcastPartitioner` — `getBucketIndex` yields every receiver index; `allReceivers` is deduplicated - `RoundRobinPartitioner` — `getBucketIndex` cycles bucket indices (asserting the current contract that the first emitted index is 1, since the implementation increments before emitting); `allReceivers` preserves channel order while deduplicating - `HashBasedShufflePartitioner` — `getBucketIndex` is in-range, deterministic for the same hash-key, depends only on the configured hash-attribute subset, and falls back to the full tuple when no hash attributes are configured; `allReceivers` deduplicates ### Any related issues, documentation, discussions? Closes #4745 ### How was this PR tested? `sbt "WorkflowExecutionService/testOnly org.apache.texera.amber.engine.architecture.sendsemantics.partitioners.PartitionersSpec"` — 11/11 tests pass. ### Was this PR authored or co-authored using generative AI tooling? Generated-by: Claude Code (Claude Opus 4.7) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
