GitHub user yhuai opened a pull request:

    https://github.com/apache/spark/pull/10288

    [SPARK-12057] [SQL] Prevent failure on corrupt JSON records

    This PR makes JSON parser and schema inference handle more cases where we 
have unparsed records. It is based on #10043. The last commit fixes the failed 
test and updates the logic of schema inference.
    
    When merge this PR, please make sure that the author is @simplyianm.
    
    JIRA: https://issues.apache.org/jira/browse/SPARK-12057
    
    Closes #10043

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/yhuai/spark handleCorruptJson

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/10288.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #10288
    
----
commit 0271f0ceecd24aa97cc813994baa4bf8b7ba5256
Author: Ian Macalinao <[email protected]>
Date:   2015-11-30T18:23:34Z

    Prevent failure on corrupt JSON records
    
    Return failed record when a record cannot be parsed. Allows parsing of 
files containing corrupt records of any form.

commit e7a032a4617823a4c7f3741516d8115b50ffa464
Author: Ian Macalinao <[email protected]>
Date:   2015-12-01T20:10:33Z

    Add regression test for corrupt record JSON parsing

commit 4fba0cacd1b301f1678eada315fbd91f58fd7618
Author: Ian Macalinao <[email protected]>
Date:   2015-12-01T20:53:33Z

    Correct schema

commit d8722bb30d347fa97c7d358284eb433edb1be6ec
Author: Yin Huai <[email protected]>
Date:   2015-12-14T04:57:20Z

    Handle more cases.

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to