Github user viirya commented on the issue:
https://github.com/apache/spark/pull/20442
`numBuckets` is a default param and can cause persistence bug too if we add
multi-column error handling logic. I think we have two options:
1. Ignore `numBuckets` when `inputCols` and `numBucketsArray` are set.
Don't raise error if it is set.
2. Similar to `outputCol`, also skip the default value of `numBuckets` if
`inputCols` is set when saving the metadata.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]