Re: [PR] [SPARK-47564][SQL] Always throw FAILED_READ_FILE error when fail to read files [spark]

via GitHub Tue, 26 Mar 2024 23:34:21 -0700


cloud-fan commented on code in PR #45723:
URL: https://github.com/apache/spark/pull/45723#discussion_r1540547369



##########
common/utils/src/main/resources/error/error-classes.json:
##########
@@ -1257,6 +1249,31 @@
     "message" : [
       "Encountered error while reading file <path>."
     ],
+    "subClass" : {
+      "CANNOT_READ_FILE_FOOTER" : {
+        "message" : [
+          "Could not read footer. Please ensure that the file is in either ORC 
or Parquet format.",
+          "If not, please convert it to a valid format. If the file is in the 
valid format, please check if it is corrupt.",
+          "If it is, you can choose to either ignore it or fix the corruption."
+        ]
+      },
+      "FILE_NOT_EXIST" : {
+        "message" : [
+          "File does not exist. It is possible the underlying files have been 
updated.",
+          "You can explicitly invalidate the cache in Spark by running 
'REFRESH TABLE tableName' command in SQL or by recreating the Dataset/DataFrame 
involved."
+        ]
+      },
+      "NO_HINT" : {
+        "message" : [
+          ""
+        ]
+      },
+      "PARQUET_COLUMN_DATA_TYPE_MISMATCH" : {
+        "message" : [
+          "Data type mismatches when reading Parquet column <column>: 
Expected: <expectedType>, Found: <actualType>."

Review Comment:
   I find the logical vs physical type a bit confusing. How about `Expected 
Spark type: ..., actual Parquet type ...`.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [PR] [SPARK-47564][SQL] Always throw FAILED_READ_FILE error when fail to read files [spark]

Reply via email to