Jayant Kumar created SPARK-40277:
------------------------------------
Summary: Use DataFrame's column for referring to DDL schema for
from_csv() and from_json()
Key: SPARK-40277
URL: https://issues.apache.org/jira/browse/SPARK-40277
Project: Spark
Issue Type: New Feature
Components: SQL
Affects Versions: 3.0.0
Reporter: Jayant Kumar
With Spark's DataFrame API, one has to explicitly pass a StructType to
functions like from_csv and from_json. This works fine in general.
In certain circumstances, when the schema depends on one of the DataFrame's
fields, it gets complicated and one has to switch to the RDD API. This requires
adding extra libraries along with additional parsing logic.
I am exploring a way to support such use cases within the DataFrame API and
the functions themselves.
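
To illustrate the kind of per-row parsing logic the RDD workaround implies, here is a
minimal plain-Python sketch. It assumes each record carries a simple DDL string
(e.g. "id INT, name STRING") next to its CSV payload. The names parse_ddl, parse_row,
and _CONVERTERS are hypothetical and not part of Spark; the type mapping covers only a
few primitive types and does not handle nested or parameterized types such as
DECIMAL(10, 2). In Spark, a function like parse_row would presumably be applied through
df.rdd.map(...).

```python
import csv
import io

# Hypothetical mapping from a few DDL type names to Python converters.
# Real Spark DDL supports many more types; this is only an illustration.
_CONVERTERS = {
    "INT": int,
    "BIGINT": int,
    "DOUBLE": float,
    "STRING": str,
    "BOOLEAN": lambda s: s.strip().lower() == "true",
}


def parse_ddl(ddl):
    """Turn 'a INT, b STRING' into [('a', int), ('b', str)]."""
    fields = []
    for part in ddl.split(","):
        name, type_name = part.split()
        fields.append((name, _CONVERTERS[type_name.upper()]))
    return fields


def parse_row(ddl, payload):
    """Parse one CSV payload using the schema carried in the same record."""
    fields = parse_ddl(ddl)
    values = next(csv.reader(io.StringIO(payload)))
    if len(values) != len(fields):
        raise ValueError("payload does not match schema: %r" % ddl)
    return {name: convert(value)
            for (name, convert), value in zip(fields, values)}


# Each record has its own schema, which a single fixed StructType cannot express.
rows = [
    ("id INT, name STRING", "1,alice"),
    ("price DOUBLE, in_stock BOOLEAN", "9.5,true"),
]
parsed = [parse_row(ddl, payload) for ddl, payload in rows]
```

The point of the sketch is that the schema is resolved per record at parse time, which
is what the requested feature would let from_csv() and from_json() do from a column.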