[
https://issues.apache.org/jira/browse/HIVE-18295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16295717#comment-16295717
]
ASF GitHub Bot commented on HIVE-18295:
---------------------------------------
GitHub user mattk42 opened a pull request:
https://github.com/apache/hive/pull/278
HIVE-18295 - Add ability to ignore invalid values in JSON SerDe
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/mattk42/hive master
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/hive/pull/278.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #278
----
commit 7d1a1aa42c4661ba9373c21acacc059bf894d1ed
Author: Matt Knox <[email protected]>
Date: 2017-12-15T19:21:25Z
Add ability to ignore values that cannot be parsed into the final type.
----
> Add ability to ignore invalid values in JSON SerDe
> --------------------------------------------------
>
> Key: HIVE-18295
> URL: https://issues.apache.org/jira/browse/HIVE-18295
> Project: Hive
> Issue Type: Improvement
> Components: HCatalog
> Reporter: Matthew Knox
> Assignee: Matthew Knox
> Labels: pull-request-available
>
> It would be nice to be able to configure the JSON SerDe to ignore invalid
> values while parsing JSON.
> In our case our raw JSON data is ingested from multiple sources, some of
> which unreliably sanitize the data. Our current practice is to cleanse the
> data after ingestion, but that can lead to other issues as well. Having the
> ability to simply default to NULL if a value can not be parsed would be
> immensely helpful to us.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)