brettplarson commented on pull request #26945: URL: https://github.com/apache/spark/pull/26945#issuecomment-829268778
Hello, and thanks for making this PR.

* Is there any long-term guidance on how columns should be named when Spark is used? Is this documented anywhere in either the Parquet or Spark docs? I am having a hard time finding any specific guidance on naming columns.
* Is there a long-term plan by the Spark team to address this? The problem is that people will use pandas to create a DataFrame with this "invalid" name, and it does not become an issue until the data is written to Parquet from Spark, which could happen when a project is already pretty far along.

Please let me know. Thank you!

-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
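For illustration only, here is a minimal sketch of how the problem described in the comment could be caught early, before the data ever reaches Spark. It assumes the set of characters that Spark's Parquet writer rejected in column names at the time of this discussion (space, `,`, `;`, `{`, `}`, `(`, `)`, newline, tab, `=`). The helper names `find_invalid_columns` and `sanitize_column`, and the underscore replacement policy, are hypothetical and not part of Spark or pandas.

```python
import re

# Assumption: characters rejected in Parquet column names by Spark's writer
# at the time of this discussion: space , ; { } ( ) newline tab =
INVALID_PARQUET_CHARS = re.compile(r"[ ,;{}()\n\t=]")


def find_invalid_columns(columns):
    """Return the column names that contain a character from the assumed invalid set."""
    return [name for name in columns if INVALID_PARQUET_CHARS.search(name)]


def sanitize_column(name, replacement="_"):
    """Replace every assumed-invalid character with `replacement` (hypothetical policy)."""
    return INVALID_PARQUET_CHARS.sub(replacement, name)


# Example column names such as a pandas user might create without any warning.
columns = ["user id", "amount(usd)", "country"]

bad = find_invalid_columns(columns)
renamed = {name: sanitize_column(name) for name in bad}
```

With pandas, a mapping like `renamed` could be passed to `df.rename(columns=renamed)` before the DataFrame is handed to Spark. This replacement policy can map two different names to the same result, so a real project would also want to check for collisions.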
