asmello opened a new pull request #31306:
URL: https://github.com/apache/spark/pull/31306


   ### What changes were proposed in this pull request?
   This PR implements 
[SPARK-34214](https://issues.apache.org/jira/browse/SPARK-34214) by exposing 
the existing `regexp_extract_all` SQL function in the PySpark API.
   
   ### Why are the changes needed?
   Please refer to 
[SPARK-24884](https://issues.apache.org/jira/browse/SPARK-24884) for why this 
function is useful. This PR only exposes it in the PySpark API, for 
consistency with the SQL function and wider availability to users.
   
   ### Does this PR introduce _any_ user-facing change?
   Yes, a new function is made available in the PySpark API, with an 
accompanying docstring. I also tweaked the description of the existing 
`regexp_extract` function to highlight how its behaviour differs from that of 
`regexp_extract_all`.
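
To illustrate the behavioural difference mentioned above, here is a rough pure-Python analogy built on the standard `re` module. It is not the Spark implementation: the helper names `extract_first` and `extract_all` are hypothetical, Spark uses Java regex semantics rather than Python's, and the empty-string fallback is an assumption based on the documented behaviour of `regexp_extract`.

```python
import re


def extract_first(text, pattern, idx=1):
    """Rough analogue of regexp_extract: group `idx` of the first match only."""
    match = re.search(pattern, text)
    if match is None:
        return ""  # assumption: no match yields an empty string
    return match.group(idx) or ""


def extract_all(text, pattern, idx=1):
    """Rough analogue of regexp_extract_all: group `idx` of every match, in order."""
    return [m.group(idx) or "" for m in re.finditer(pattern, text)]


text = "100-200, 300-400"
pattern = r"(\d+)-(\d+)"

first = extract_first(text, pattern, 1)
every = extract_all(text, pattern, 1)

print(first)  # 100
print(every)  # ['100', '300']
```

The first helper stops at the first match, while the second collects the requested group from every match into a list, which is the distinction the updated docstring is meant to draw.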
   
   ### How was this patch tested?
   I tested it locally in a PySpark console session. I didn't find any existing 
tests for `regexp_extract`, so I didn't add any for the new function, but I'm 
happy to do so if desired.




