Re: [PR] [SPARK-46171][SQL][PYTHON][R][DOCS] Remove `HiveContext` from Apache Spark 4.0 [spark]

2023-11-29 Thread via GitHub


dongjoon-hyun commented on PR #44077:
URL: https://github.com/apache/spark/pull/44077#issuecomment-1832478448

   Thank you for your decision, @LuciferYang .


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] [SPARK-46171][SQL][PYTHON][R][DOCS] Remove `HiveContext` from Apache Spark 4.0 [spark]

2023-11-29 Thread via GitHub


LuciferYang closed pull request #44077: [SPARK-46171][SQL][PYTHON][R][DOCS] 
Remove `HiveContext` from Apache Spark 4.0
URL: https://github.com/apache/spark/pull/44077


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] [SPARK-46171][SQL][PYTHON][R][DOCS] Remove `HiveContext` from Apache Spark 4.0 [spark]

2023-11-29 Thread via GitHub


LuciferYang commented on PR #44077:
URL: https://github.com/apache/spark/pull/44077#issuecomment-1832467193

   After carefully reading  
https://lists.apache.org/thread/mrx0y078cf3ozs7czykvv864y6dr55xq, I have 
decided to abandon the deletion of HiveContext. As @gatorsmile said, its 
maintenance cost is not high, but it will increase the cost of users migrating 
to Spark4.0, so I also believe it is not worth deleting it in this context.
   
   I will close this pr. Thanks @dongjoon-hyun @gatorsmile 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] [SPARK-46171][SQL][PYTHON][R][DOCS] Remove `HiveContext` from Apache Spark 4.0 [spark]

2023-11-29 Thread via GitHub


LuciferYang commented on code in PR #44077:
URL: https://github.com/apache/spark/pull/44077#discussion_r1409591501


##
python/pyspark/sql/context.py:
##
@@ -311,6 +312,24 @@ def registerJavaFunction(
 )
 return self.sparkSession.udf.registerJavaFunction(name, javaClassName, 
returnType)
 
+# TODO(andrew): delete this once we refactor things to take in SparkSession
+def _inferSchema(self, rdd: RDD, samplingRatio: Optional[float] = None) -> 
StructType:

Review Comment:
   Thanks @dongjoon-hyun, the code in this file should be okay now.
   



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] [SPARK-46171][SQL][PYTHON][R][DOCS] Remove `HiveContext` from Apache Spark 4.0 [spark]

2023-11-29 Thread via GitHub


dongjoon-hyun commented on PR #44077:
URL: https://github.com/apache/spark/pull/44077#issuecomment-1832316270

   I agree with you, @LuciferYang . Thank you for the heads-up email.
   Since we need an official vote for this specific API, I replied your email 
thread.
   Let's see the community atmosphere.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] [SPARK-46171][SQL][PYTHON][R][DOCS] Remove `HiveContext` from Apache Spark 4.0 [spark]

2023-11-29 Thread via GitHub


LuciferYang commented on code in PR #44077:
URL: https://github.com/apache/spark/pull/44077#discussion_r1409563409


##
python/pyspark/sql/context.py:
##
@@ -311,6 +312,24 @@ def registerJavaFunction(
 )
 return self.sparkSession.udf.registerJavaFunction(name, javaClassName, 
returnType)
 
+# TODO(andrew): delete this once we refactor things to take in SparkSession
+def _inferSchema(self, rdd: RDD, samplingRatio: Optional[float] = None) -> 
StructType:

Review Comment:
   Sorry, I think I made a mistake during the rebase.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] [SPARK-46171][SQL][PYTHON][R][DOCS] Remove `HiveContext` from Apache Spark 4.0 [spark]

2023-11-29 Thread via GitHub


dongjoon-hyun commented on code in PR #44077:
URL: https://github.com/apache/spark/pull/44077#discussion_r1409558060


##
python/pyspark/sql/context.py:
##
@@ -311,6 +312,24 @@ def registerJavaFunction(
 )
 return self.sparkSession.udf.registerJavaFunction(name, javaClassName, 
returnType)
 
+# TODO(andrew): delete this once we refactor things to take in SparkSession
+def _inferSchema(self, rdd: RDD, samplingRatio: Optional[float] = None) -> 
StructType:

Review Comment:
   What is this addition?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



Re: [PR] [SPARK-46171][SQL][PYTHON][R][DOCS] Remove `HiveContext` from Apache Spark 4.0 [spark]

2023-11-29 Thread via GitHub


LuciferYang commented on PR #44077:
URL: https://github.com/apache/spark/pull/44077#issuecomment-1831679067

   Already send a email to dev mail list for discussion 
https://lists.apache.org/thread/qfznmh1dvjf9r3qn2qc8zkryk3x1t05w


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org