[ 
https://issues.apache.org/jira/browse/SPARK-45765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allison Wang updated SPARK-45765:
---------------------------------
    Description: 
Currently, the error message is super confusing when a user tries to load 
multiple paths incorrectly.

For example, `spark.read.format("json").load("p1", "p2")` will have this error:

An error occurred while calling o36.load.
: org.apache.spark.SparkClassNotFoundException: [DATA_SOURCE_NOT_FOUND] Failed 
to find the data source: p2. Please find packages at 
`[https://spark.apache.org/third-party-projects.html]`. SQLSTATE: 42K02

This can be confusing but it's valid error message, as "p2" will be considered 
as the `format` field of the load() method. 

  was:
Currently, the error message is super confusing when a user tries to load 
multiple paths incorrectly.

For example, `spark.read.format("json").load("p1", "p2")` will have this error:

An error occurred while calling o36.load.
: org.apache.spark.SparkClassNotFoundException: [DATA_SOURCE_NOT_FOUND] Failed 
to find the data source: p2. Please find packages at 
`https://spark.apache.org/third-party-projects.html`. SQLSTATE: 42K02

We should fix this.


> Improve error messages when loading multiple paths in PySpark
> -------------------------------------------------------------
>
>                 Key: SPARK-45765
>                 URL: https://issues.apache.org/jira/browse/SPARK-45765
>             Project: Spark
>          Issue Type: Improvement
>          Components: PySpark
>    Affects Versions: 4.0.0
>            Reporter: Allison Wang
>            Priority: Major
>
> Currently, the error message is super confusing when a user tries to load 
> multiple paths incorrectly.
> For example, `spark.read.format("json").load("p1", "p2")` will have this 
> error:
> An error occurred while calling o36.load.
> : org.apache.spark.SparkClassNotFoundException: [DATA_SOURCE_NOT_FOUND] 
> Failed to find the data source: p2. Please find packages at 
> `[https://spark.apache.org/third-party-projects.html]`. SQLSTATE: 42K02
> This can be confusing but it's valid error message, as "p2" will be 
> considered as the `format` field of the load() method. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to