tonydoen created SPARK-38639:
--------------------------------

             Summary: Support ignoreCorruptRecord flag parallel to ignoreCorruptFiles
                 Key: SPARK-38639
                 URL: https://issues.apache.org/jira/browse/SPARK-38639
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.2.1, 3.1.2
            Reporter: tonydoen
             Fix For: 3.2.1


The existing flags "spark.sql.files.ignoreCorruptFiles" and 
"spark.sql.files.ignoreMissingFiles" quietly skip attempted reads from files 
that are corrupted or missing, but queries can still fail when reading 
sequence files.
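
For reference, a minimal PySpark sketch of how the two existing flags are typically enabled when building a session. The application name is an arbitrary placeholder, and this only shows the file-level flags that already exist, not the record-level behaviour requested here.

```python
from pyspark.sql import SparkSession

# Enable the two existing file-level flags for this session.
# "ignore-corrupt-files-example" is a placeholder application name.
spark = (
    SparkSession.builder
    .appName("ignore-corrupt-files-example")
    .config("spark.sql.files.ignoreCorruptFiles", "true")
    .config("spark.sql.files.ignoreMissingFiles", "true")
    .getOrCreate()
)
```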

 

Being able to ignore corrupt records is useful when users want queries to 
succeed on dirty data (for example, mixed schemas in one table).

 

We would like to add a "spark.sql.hive.ignoreCorruptRecord" flag to fill out 
the functionality.
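
To illustrate the intended semantics only, here is a plain-Python sketch of record-level skipping. It is not Spark code and not the proposed implementation: the function `read_records` and its `ignore_corrupt_record` parameter are hypothetical names, and JSON lines stand in for the records of a sequence file.

```python
import json
from typing import Any, Iterable, Iterator


def read_records(lines: Iterable[str],
                 ignore_corrupt_record: bool = False) -> Iterator[Any]:
    """Yield decoded records, optionally skipping ones that fail to decode.

    Hypothetical helper: JSON decoding stands in for record deserialization.
    """
    for line in lines:
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            if ignore_corrupt_record:
                # Skip only the bad record and keep reading the rest.
                continue
            # Default behaviour: one bad record fails the whole read.
            raise
        yield record


rows = ['{"id": 1}', 'not json', '{"id": 2}']
good_rows = list(read_records(rows, ignore_corrupt_record=True))
```

With the flag off, the same input raises on the second record, which corresponds to the query failure described above. With it on, only the undecodable record is dropped and the remaining records in the same file are still returned.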


