[GitHub] spark pull request #19459: [SPARK-20791][PYSPARK] Use Arrow to create Spark ...

viirya Thu, 19 Oct 2017 18:05:05 -0700

Github user viirya commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19459#discussion_r145859471
  
    --- Diff: python/pyspark/sql/session.py ---
    @@ -510,6 +578,12 @@ def createDataFrame(self, data, schema=None, 
samplingRatio=None, verifySchema=Tr
             except Exception:
                 has_pandas = False
             if has_pandas and isinstance(data, pandas.DataFrame):
    +            if self.conf.get("spark.sql.execution.arrow.enabled", 
"false").lower() == "true" \
    +                    and len(data) > 0:
    +                df = self._createFromPandasWithArrow(data, schema)
    +                # Fallback to create DataFrame without arrow if return None
    +                if df is not None:
    --- End diff --
    
    Shall we show some log message to users in this case?



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #19459: [SPARK-20791][PYSPARK] Use Arrow to create Spark ...

Reply via email to