[GitHub] spark issue #18664: [SPARK-21375][PYSPARK][SQL][WIP] Add Date and Timestamp ...

gatorsmile Thu, 05 Oct 2017 23:28:33 -0700

Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/18664
  
    I think all of the involved reviewers agree this is a pretty serious design 
issue. We are unable to change the behavior after we officially release it. 
Thus, we have to be very very careful about the design. 
    
    Can anyone first write user facing documents to explain the proposal and 
the potential impact? 
    
    Let us list a few typical scenarios 
    
    Given the users/applications containing the `Timestamp` in their `Dataset` 
and their processing algorithms have the corresponding logics:
    
    - For the **new** users/applications, they first enabled Arrow and later 
hit an Arrow bug? Can they simply turn off `spark.sql.execution.arrow.enable`? 
If not, what should they do?
    
    - For the **existing** users/applications, they want to utilize Arrow for 
better performance. Can they just turn on `spark.sql.execution.arrow.enable`? 
What should they do?




---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark issue #18664: [SPARK-21375][PYSPARK][SQL][WIP] Add Date and Timestamp ...

Reply via email to