ravikiranpagidi commented on issue #55884: URL: https://github.com/apache/spark/issues/55884#issuecomment-4511890926
Hi, I’m Ravikiran Pagidi. I would like to work on this issue.

My proposed approach is to add an opt-in CSV option, defaulting to false for backward compatibility, that keeps digit-only values with leading zeros as StringType during inferSchema.

Example:

000123 -> StringType
00123 -> StringType
0 -> IntegerType
123 -> IntegerType

I can add the implementation and tests in CSVInferSchemaSuite, and update the CSV data source documentation.

Does this approach sound acceptable to the maintainers?
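
The inference rule proposed above can be sketched in plain Python. This is only an illustration of the intended behaviour, not Spark code: the function name `infer_csv_field_type` and the flag `keep_leading_zeros_as_string` are hypothetical, since the comment does not name the option. The sketch covers digit-only fields only. It ignores integer range (so it never widens to a larger numeric type), and it simply returns StringType for anything that is not all digits. Treating a multi-zero value such as `00` as a string is also an assumption, because the comment does not list that case.

```python
import re


def infer_csv_field_type(value: str, keep_leading_zeros_as_string: bool = False) -> str:
    """Return a Spark-style type name for one CSV field (illustrative only)."""
    # Simplification: anything that is not purely ASCII digits is reported
    # as a string; real schema inference would try other types first.
    if not re.fullmatch(r"[0-9]+", value):
        return "StringType"

    # Hypothetical opt-in behaviour: a digit-only value longer than one
    # character that starts with "0" keeps its textual form.
    if keep_leading_zeros_as_string and len(value) > 1 and value.startswith("0"):
        return "StringType"

    # Default (flag off): digit-only values are inferred as integers.
    return "IntegerType"


# The four examples from the proposal, with the hypothetical option enabled.
for raw in ["000123", "00123", "0", "123"]:
    print(raw, "->", infer_csv_field_type(raw, keep_leading_zeros_as_string=True))
```

With the flag left at its default of false, all four values are reported as IntegerType in this sketch, which mirrors the backward-compatibility requirement in the proposal.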
