aglinxinyuan opened a new issue, #4745:
URL: https://github.com/apache/texera/issues/4745

   ### Task Summary
   
   Four of the five partitioners under 
`amber/src/main/scala/org/apache/texera/amber/engine/architecture/sendsemantics/partitioners/`
 currently have no unit tests; only `RangeBasedShufflePartitioner` is covered 
(by `RangeBasedShuffleSpec`). Add `PartitionersSpec` covering the remaining 
four:
   
   - **OneToOnePartitioner** — `getBucketIndex` always emits `Iterator(0)`; 
`allReceivers` selects the channel whose `fromWorkerId` matches the actor id
   - **BroadcastPartitioner** — `getBucketIndex` yields every receiver index; 
`allReceivers` is deduplicated
   - **RoundRobinPartitioner** — `getBucketIndex` cycles bucket indices 
(verifying current contract that the first emitted index is 1, since the 
implementation increments before emitting); `allReceivers` preserves channel 
order while deduplicating
   - **HashBasedShufflePartitioner** — `getBucketIndex` is in-range, 
deterministic for the same hash-key, depends only on the configured 
hash-attribute subset, and falls back to the full tuple when no hash attributes 
are configured; `allReceivers` deduplicates
   
   ### Priority
   P3 – Low
   
   ### Task Type
   - [x] Testing / QA


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to