[GitHub] spark pull request #21427: [SPARK-24324][PYTHON] Pandas Grouped Map UDF shou...

BryanCutler Fri, 22 Jun 2018 10:06:59 -0700

Github user BryanCutler commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21427#discussion_r197509567
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowUtils.scala ---
    @@ -120,4 +121,19 @@ object ArrowUtils {
           StructField(field.getName, dt, field.isNullable)
         })
       }
    +
    +  /** Return Map with conf settings to be used in ArrowPythonRunner */
    +  def getPythonRunnerConfMap(conf: SQLConf): Map[String, String] = {
    +    val timeZoneConf = if (conf.pandasRespectSessionTimeZone) {
    +      Seq(SQLConf.SESSION_LOCAL_TIMEZONE.key -> conf.sessionLocalTimeZone)
    +    } else {
    +      Nil
    +    }
    +    val pandasColsByPosition = if (conf.pandasGroupedMapAssignColumnssByPosition) {
    --- End diff --
    
    I think it's better to just omit the config for the default case; that way it's easier to process in the worker.
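
A rough sketch of the idea, not the PR's actual code: each setting contributes an entry only when it differs from its default, and the receiving side treats a missing key as the default. `RunnerSettings`, `RunnerConfSketch`, and both key names are hypothetical stand-ins so the example runs without Spark, and it assumes the by-position flag defaults to false.

```scala
// Hypothetical stand-in for the SQLConf values read in the diff above.
final case class RunnerSettings(
    respectSessionTimeZone: Boolean,
    sessionLocalTimeZone: String,
    assignColumnsByPosition: Boolean)

object RunnerConfSketch {
  // Hypothetical key names, used only for this illustration.
  val TimeZoneKey = "example.session.timeZone"
  val ByPositionKey = "example.groupedMap.assignColumnsByPosition"

  /** Emit an entry only when a setting differs from its (assumed) default. */
  def toConfMap(s: RunnerSettings): Map[String, String] = {
    val timeZoneConf =
      if (s.respectSessionTimeZone) Seq(TimeZoneKey -> s.sessionLocalTimeZone) else Nil
    val byPositionConf =
      if (s.assignColumnsByPosition) Seq(ByPositionKey -> "true") else Nil
    (timeZoneConf ++ byPositionConf).toMap
  }

  /** Receiving side: a missing key simply means "use the default" (false here). */
  def assignByPosition(conf: Map[String, String]): Boolean =
    conf.get(ByPositionKey).contains("true")
}
```

With this shape the reader never has to parse a "false" value; the presence of the key is the only signal it needs.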


