HyukjinKwon opened a new pull request #26501: 
[SPARK-29875][PYTHON][SQL][BRANCH-2.4] Avoid to use deprecated 
pyarrow.open_stream API in Spark 2.4.x
URL: https://github.com/apache/spark/pull/26501
 
 
   ### What changes were proposed in this pull request?
   
   This PR proposes to avoid to use deprecated PyArrow API `open_stream`. It 
was deprecated as of 0.12.0 (https://github.com/apache/arrow/pull/3244).
   
   Root cause is that we use separate forked process and each process emits the 
warning (printing once via "default" filter at `warnings` package); however, we 
should avoid to use deprecated APIs anyway.
   
   In the current master, it was fixed when we upgrade PyArrow from 0.10.0 to 
0.12.0 at 
https://github.com/apache/spark/commit/16990f929921b3f784a85f3afbe1a22fbe77d895
   
   ### Why are the changes needed?
   
   if we use PyArrow higher then 0.12.0, Spark 2.4.x shows a bunch of annoying 
warnings as below:
   
   ```
   from pyspark.sql.functions import pandas_udf, PandasUDFType
   
   @pandas_udf("integer", PandasUDFType.SCALAR)  # doctest: +SKIP
   def add_one(x):
       return x + 1
   
   spark.range(100).select(add_one("id")).collect()
   ```
   
   ```
   UserWarning: pyarrow.open_stream is deprecated, please use 
pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   /usr/local/lib/python3.7/site-packages/pyarrow/__init__.py:157: UserWarning: 
pyarrow.open_stream is deprecated, please use pyarrow.ipc.open_stream
     warnings.warn("pyarrow.open_stream is deprecated, please use "
   ```
   
   ### Does this PR introduce any user-facing change?
   
   Remove annoying warning messages.
   
   ### How was this patch tested?
   
   Manually tested.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to