aglinxinyuan opened a new pull request, #4746:
URL: https://github.com/apache/texera/pull/4746

   ### What changes were proposed in this PR?
   
   Add `PartitionersSpec` covering the four send-semantics partitioners that 
currently lack unit tests (only `RangeBasedShufflePartitioner` had existing 
coverage in `RangeBasedShuffleSpec`):
   
   - `OneToOnePartitioner` — `getBucketIndex` always emits `Iterator(0)`; 
`allReceivers` selects the channel whose `fromWorkerId` matches the actor id
   - `BroadcastPartitioner` — `getBucketIndex` yields every receiver index; 
`allReceivers` is deduplicated
   - `RoundRobinPartitioner` — `getBucketIndex` cycles bucket indices 
(asserting the current contract that the first emitted index is 1, since the 
implementation increments before emitting); `allReceivers` preserves channel 
order while deduplicating
   - `HashBasedShufflePartitioner` — `getBucketIndex` is in-range, 
deterministic for the same hash-key, depends only on the configured 
hash-attribute subset, and falls back to the full tuple when no hash attributes 
are configured; `allReceivers` deduplicates
   
   ### Any related issues, documentation, discussions?
   
   Closes #4745
   
   ### How was this PR tested?
   
   `sbt "WorkflowExecutionService/testOnly 
org.apache.texera.amber.engine.architecture.sendsemantics.partitioners.PartitionersSpec"`
 — 11/11 tests pass.
   
   ### Was this PR authored or co-authored using generative AI tooling?
   
   Generated-by: Claude Code (Claude Opus 4.7)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to