[GitHub] spark pull request #21368: [SPARK-16451][repl] Fail shell if SparkSession fa...

felixcheung Mon, 21 May 2018 22:43:10 -0700

Github user felixcheung commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21368#discussion_r189781578
  
    --- Diff: python/pyspark/sql/session.py ---
    @@ -547,6 +547,40 @@ def _create_from_pandas_with_arrow(self, pdf, schema, 
timezone):
             df._schema = schema
             return df
     
    +    @staticmethod
    +    def _create_shell_session():
    +        """
    +        Initialize a SparkSession for a pyspark shell session. This is 
called from shell.py
    +        to make error handling simpler without needing to declare local 
variables in that
    +        script, which would expose those to users.
    +        """
    +        import py4j
    +        from pyspark.conf import SparkConf
    +        from pyspark.context import SparkContext
    +        try:
    +            # Try to access HiveConf, it will raise exception if Hive is 
not added
    +            conf = SparkConf()
    +            if conf.get('spark.sql.catalogImplementation', 'hive').lower() 
== 'hive':
    +                SparkContext._jvm.org.apache.hadoop.hive.conf.HiveConf()
    +                return SparkSession.builder\
    +                    .enableHiveSupport()\
    +                    .getOrCreate()
    +            else:
    +                return SparkSession.builder.getOrCreate()
    +        except py4j.protocol.Py4JError:
    +            if conf.get('spark.sql.catalogImplementation', '').lower() == 
'hive':
    +                warnings.warn("Fall back to non-hive support because 
failing to access HiveConf, "
    +                              "please make sure you build spark with hive")
    +
    +        try:
    +            return SparkSession.builder.getOrCreate()
    --- End diff --
    
    the call flow seems to be changed here? I think this line is meant to be 
inside the handling of Py4JError?



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #21368: [SPARK-16451][repl] Fail shell if SparkSession fa...

Reply via email to