Josh Rosen created SPARK-28427:
----------------------------------
Summary: Support more Postgres JSON functions
Key: SPARK-28427
URL: https://issues.apache.org/jira/browse/SPARK-28427
Project: Spark
Issue Type: Sub-task
Components: SQL
Affects Versions: 3.0.0
Reporter: Josh Rosen
Postgres features a number of JSON functions that are missing in Spark:
https://www.postgresql.org/docs/9.3/functions-json.html
Redshift's JSON functions
(https://docs.aws.amazon.com/redshift/latest/dg/json-functions.html) have
partial overlap with the Postgres list.
Some of these functions can be expressed in terms of compositions of existing
Spark functions. For example, I think that {{json_array_length}} can be
expressed with {{cardinality}} and {{from_json}}, but there's a caveat related
to legacy Hive compatibility (see the demo notebook at
https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/5796212617691211/45530874214710/4901752417050771/latest.html
for more details).
I'm filing this ticket so that we can triage the list of Postgres JSON features
and decide which ones make sense to support in Spark. After we've done that, we
can create individual tickets for specific functions and features.
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]