jeff303 opened a new pull request #25807: [SPARK-25153][CORE] Improve error messages for columns with dots/periods
URL: https://github.com/apache/spark/pull/25807

Check schema fields to see if they contain the exact column name, add to error message in `Dataset#resolve`

Add test for extra error message piece

### What changes were proposed in this pull request?

Adds a check to the else clause of `Dataset#resolve`, which is the branch taken when a column cannot be resolved. If the unresolved column name appears verbatim among the schema fields, a suffix is appended to the `AnalysisException` message suggesting that the name may need to be quoted with backticks.

### Why are the changes needed?

New Spark users commonly forget to quote column names that contain dots.

### Does this PR introduce any user-facing change?

No (other than the extra suffix on the error message).

### How was this patch tested?

`test` was run for `core` in `sbt`, and passed.
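The check described above can be illustrated with a small, Spark-free sketch. This is a simplified model and not the code from the patch: `AnalysisError`, `resolve`, the flat list of field names, and the exact wording of both messages are assumptions made for illustration. In particular, the toy resolver assumes a schema with no nested fields, so an unquoted dotted name never resolves.

```python
class AnalysisError(Exception):
    """Hypothetical stand-in for Spark's AnalysisException."""


def resolve(col_name, field_names):
    """Toy resolver over a flat schema (assumption: no nested fields).

    A backtick-quoted name is matched literally; an unquoted name
    containing a dot is treated as nested access and therefore fails.
    """
    fields = ", ".join(field_names)
    # Illustrative wording only, not the message Spark emits.
    message = f'Cannot resolve column name "{col_name}" among ({fields})'

    quoted = (
        len(col_name) >= 2
        and col_name.startswith("`")
        and col_name.endswith("`")
    )
    if quoted:
        literal = col_name[1:-1]
        if literal in field_names:
            return literal
        raise AnalysisError(message)

    if "." not in col_name and col_name in field_names:
        return col_name

    # The extra check: the unresolved name exists verbatim in the schema,
    # so the user most likely forgot to quote it.
    if col_name in field_names:
        message += f"; did you mean to quote it with backticks, as `{col_name}`?"
    raise AnalysisError(message)
```

With a schema of `["a.b", "c"]`, calling `resolve("a.b", ...)` raises an error that carries the quoting hint, while `resolve("x.y", ...)` raises the plain error, because `x.y` is not a field name and quoting would not help.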
