asmello opened a new pull request #31306: URL: https://github.com/apache/spark/pull/31306
### What changes were proposed in this pull request?

This PR implements [SPARK-34214](https://issues.apache.org/jira/browse/SPARK-34214) by exposing the existing `regexp_extract_all` SQL function in the PySpark API.

### Why are the changes needed?

Please refer to [SPARK-24884](https://issues.apache.org/jira/browse/SPARK-24884) for why this function is useful. This PR merely exposes it in the PySpark API, for consistency and so that it is more readily available to users.

### Does this PR introduce _any_ user-facing change?

Yes, a new function is made available in the PySpark API. The associated docstring is included. I also tweaked the description of the original `regexp_extract` function to highlight how its behaviour differs from that of `regexp_extract_all`.

### How was this patch tested?

I tested it locally in a pyspark console session. I didn't find any tests for `regexp_extract`, so I didn't add any for the new function, but I'm happy to do so if desired.
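
To illustrate the behavioural difference between `regexp_extract` and `regexp_extract_all` mentioned above, here is a minimal pure-Python sketch of the two semantics as I understand them. It is not the code in this PR: the helper names `extract_first` and `extract_all` are hypothetical, the default group index of 1 is an assumption, and it uses Python's `re` module rather than the Java regex engine that Spark relies on, so edge cases may differ.

```python
import re


def extract_first(text, pattern, idx=1):
    """Hypothetical stand-in for regexp_extract: group `idx` of the FIRST match."""
    m = re.search(pattern, text)
    if m is None:
        return ""  # assumption: no match yields an empty string
    return m.group(idx) or ""  # a group that did not participate becomes ""


def extract_all(text, pattern, idx=1):
    """Hypothetical stand-in for regexp_extract_all: group `idx` of EVERY match."""
    return [m.group(idx) or "" for m in re.finditer(pattern, text)]


text = "100-200, 300-400"
pattern = r"(\d+)-(\d+)"

first = extract_first(text, pattern, 1)  # only the first match contributes
every = extract_all(text, pattern, 1)    # one entry per match

print(first)
print(every)
```

With this input, `first` is the single string `"100"`, while `every` is the list `["100", "300"]`: one value per match rather than just the first.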
