AgenticSpark opened a new pull request, #56727:
URL: https://github.com/apache/spark/pull/56727

   ### What changes were proposed in this pull request?
   
   Convert `_LEGACY_ERROR_TEMP_2003` into an internal error. 
`mapSizeExceedArraySizeWhenZipMapError` now returns 
`SparkException.internalError(...)` and the `_LEGACY_ERROR_TEMP_2003` entry is 
removed from `error-conditions.json`.
   
   ### Why are the changes needed?
   
   Part of the effort to assign proper names to legacy error classes 
(SPARK-42839). This error is thrown only by `map_zip_with` 
(`MapZipWith.assertSizeOfArrayBuffer`) when the number of unique keys exceeds 
`ByteArrayMethods.MAX_ROUNDED_ARRAY_LENGTH` (~2^31). That condition is not 
reachable from user code, so the error is not user-facing. Per the ticket, 
errors that cannot be reproduced from user space should become internal errors 
rather than be given a user-facing name.
   
   ### Does this PR introduce _any_ user-facing change?
   
   No.
   
   ### How was this patch tested?
   
   No test triggers this error (the array-size limit cannot be reached from 
user code, so there is nothing to assert with `checkError()`). Verified that no 
references to `_LEGACY_ERROR_TEMP_2003` remain in the tree and that 
`error-conditions.json` is still valid JSON with sorted keys.
   
   ### Was this patch authored or co-authored using generative AI tooling?
   
   Generated-by: GitHub Copilot CLI (Claude Opus 4.8)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to