[
https://issues.apache.org/jira/browse/SPARK-45765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Allison Wang updated SPARK-45765:
---------------------------------
Description:
Currently, the error message is super confusing when a user tries to load
multiple paths incorrectly.
For example, `spark.read.format("json").load("p1", "p2")` will have this error:
An error occurred while calling o36.load.
: org.apache.spark.SparkClassNotFoundException: [DATA_SOURCE_NOT_FOUND] Failed
to find the data source: p2. Please find packages at
`[https://spark.apache.org/third-party-projects.html]`. SQLSTATE: 42K02
This can be confusing but it's valid error message, as "p2" will be considered
as the `format` field of the load() method.
was:
Currently, the error message is super confusing when a user tries to load
multiple paths incorrectly.
For example, `spark.read.format("json").load("p1", "p2")` will have this error:
An error occurred while calling o36.load.
: org.apache.spark.SparkClassNotFoundException: [DATA_SOURCE_NOT_FOUND] Failed
to find the data source: p2. Please find packages at
`https://spark.apache.org/third-party-projects.html`. SQLSTATE: 42K02
We should fix this.
> Improve error messages when loading multiple paths in PySpark
> -------------------------------------------------------------
>
> Key: SPARK-45765
> URL: https://issues.apache.org/jira/browse/SPARK-45765
> Project: Spark
> Issue Type: Improvement
> Components: PySpark
> Affects Versions: 4.0.0
> Reporter: Allison Wang
> Priority: Major
>
> Currently, the error message is super confusing when a user tries to load
> multiple paths incorrectly.
> For example, `spark.read.format("json").load("p1", "p2")` will have this
> error:
> An error occurred while calling o36.load.
> : org.apache.spark.SparkClassNotFoundException: [DATA_SOURCE_NOT_FOUND]
> Failed to find the data source: p2. Please find packages at
> `[https://spark.apache.org/third-party-projects.html]`. SQLSTATE: 42K02
> This can be confusing but it's valid error message, as "p2" will be
> considered as the `format` field of the load() method.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]