jxu opened a new issue, #55884:
URL: https://github.com/apache/spark/issues/55884

   Version: Spark 3.5
   
   When a data source contains a value like "000123", it is usually a 
fixed-width ID whose leading zeros are significant, so schema inference should 
not turn it into the integer 123. Zip codes are a common example. There should 
be an option to stay safe and keep such values as strings. Otherwise I have to 
turn off inferSchema and either specify the whole schema manually or cast 
columns afterwards. 
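
   To illustrate the behavior I have in mind, here is a minimal sketch in plain 
Python. It is not Spark's inference code: the function name 
`infer_column_type` and the exact rule (any all-digit value with a leading 
zero keeps the whole column as a string) are assumptions made only for 
illustration.

```python
def infer_column_type(values):
    """Return "int" or "string" for a column of raw text values.

    Hypothetical rule: a column is only inferred as int when every
    non-empty value is an integer literal without a leading zero.
    """
    non_empty = [v.strip() for v in values if v is not None and v.strip()]
    if not non_empty:
        # Nothing to infer from, so fall back to the safe type.
        return "string"
    for v in non_empty:
        # Allow an optional sign in front of the digits.
        digits = v[1:] if v[0] in "+-" else v
        if not (digits.isascii() and digits.isdigit()):
            return "string"
        # "0" alone is a plain integer; "000123" looks like a fixed-width ID.
        if len(digits) > 1 and digits[0] == "0":
            return "string"
    return "int"
```

   With this rule, a column of values such as "000123" and "004567" stays a 
string, while a column of "123", "0" and "-45" is still inferred as an integer.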
   
   The same request was raised here: 
https://stackoverflow.com/questions/50299192/pyspark-inferschema-numbers-that-start-with-0
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

