ravikiranpagidi commented on issue #55884:
URL: https://github.com/apache/spark/issues/55884#issuecomment-4511890926

   Hi, I’m Ravikiran Pagidi. I would like to work on this issue.
   
   I propose adding an opt-in CSV option, defaulting to false for backward 
compatibility, that makes inferSchema keep digit-only values with leading 
zeros as StringType instead of inferring a numeric type.
   
   Example:
   000123 -> StringType
   00123  -> StringType
   0      -> IntegerType
   123    -> IntegerType
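
The rule in the example above can be sketched in a few lines. This is a minimal illustration in plain Python, not Spark's actual Scala inference code: the function name `infer_field_type` and the flag `preserve_leading_zeros` are hypothetical stand-ins for the proposed option, and the sketch ignores signs, nulls, and the wider numeric types (long, decimal, double) that real inference would also have to handle.

```python
def infer_field_type(value: str, preserve_leading_zeros: bool = False) -> str:
    """Return an illustrative type name for a single CSV field.

    Hypothetical helper: it only distinguishes digit-only tokens from
    everything else, which is enough to show the proposed rule.
    """
    token = value.strip()

    # Anything that is not purely ASCII digits stays a string in this sketch.
    # An empty token also lands here, because "".isdigit() is False.
    if not (token.isascii() and token.isdigit()):
        return "StringType"

    # Proposed opt-in behaviour: a multi-digit token starting with "0"
    # (for example "00123") is kept as a string so the zeros survive.
    # A lone "0" has no leading zero to preserve and stays numeric.
    if preserve_leading_zeros and len(token) > 1 and token.startswith("0"):
        return "StringType"

    return "IntegerType"


if __name__ == "__main__":
    for sample in ["000123", "00123", "0", "123"]:
        print(sample, "->", infer_field_type(sample, preserve_leading_zeros=True))
```

With the flag left at its default of `False`, every digit-only token is still reported as `IntegerType`, which mirrors the backward-compatible default described above.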
   
   I can add the implementation and tests in CSVInferSchemaSuite, and update 
the CSV data source documentation.
   
   Does this approach sound acceptable to the maintainers?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

