Github user sr11231 commented on the issue:
https://github.com/apache/spark/pull/17758
Why is it OK to have duplicate columns when you read from an RDD/Dataset but
not when you read directly from a file? Maybe it should be a configurable option?
When you have duplicate columns, how can you deal with column selection or
renaming?
