Omega359 commented on issue #13351:
URL: https://github.com/apache/datafusion/issues/13351#issuecomment-2468280000

   The difference in `to_timestamp(1)` is almost certainly tz related as DF 
udf's don't have access to the context's timezone.
   
   As for why the DF version has multiple arguments - it's because it's better. 
None of those other systems were designed with handling human entered data in 
mind. Dates and times especially are consistently different, especially if you 
go outside a particular locality (US - mm/dd/yyyy, most everywhere else 
dd/mm/yyyy as a very simple example).
   
   If you ever have to clean 100's of billions of records and try to get 
consist dates and timestamps out of them having a function that accepts many 
possible formats to try is very refreshing. The alternative of 
when/when/when/when/when/............./otherwise statements is substantially 
poorer ux. For a single field I have up to 10 different formats I must try - 
and this is just for US specific data!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to