HyukjinKwon opened a new pull request #34902: URL: https://github.com/apache/spark/pull/34902
### What changes were proposed in this pull request? This PR proposes to switch default index to `distributed-sequence` by default. ### Why are the changes needed? `sequence` type relies on sending all data to one executor that easily causes OOM. We should better switch to `distributed-sequence` type that truly distributes the data. ### Does this PR introduce _any_ user-facing change? Ideally no. Order might be affected but that's not already guaranteed. ### How was this patch tested? Existing CI should test it out. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
