GitHub user brkyvz opened a pull request:

    https://github.com/apache/spark/pull/20302

    [SPARK-23094] Fix invalid character handling in JsonDataSource

    ## What changes were proposed in this pull request?
    
    There were two related fixes for `from_json`, `get_json_object` and 
`json_tuple` 
([Fix #1](https://github.com/apache/spark/commit/c8803c06854683c8761fdb3c0e4c55d5a9e22a95), 
[Fix #2](https://github.com/apache/spark/commit/86174ea89b39a300caaba6baffac70f3dc702788)), 
but they were not comprehensive. This PR extends those fixes to all 
the parsers and adds tests for each case.
    
    ## How was this patch tested?
    
    Regression tests

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/brkyvz/spark json-invfix

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20302.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20302
    
----
commit 231152ab0d615166bdea354916a9f6ca0aaf7e6a
Author: Burak Yavuz <brkyvz@...>
Date:   2018-01-18T01:01:00Z

    [ES-4104][WARMFIX] Fix invalid character handling in JsonDataSource
    
    Cherry-pick of https://github.com/databricks/spark/pull/1135
    
    ## What changes were proposed in this pull request?
    
    I shall also merge this upstream, but wanted to merge here first since it 
was an ES ticket related to Qubole workloads.
    
    There were two related fixes for `from_json`, `get_json_object` and 
`json_tuple` in OSS 
([Fix #1](https://github.com/apache/spark/commit/c8803c06854683c8761fdb3c0e4c55d5a9e22a95), 
[Fix #2](https://github.com/apache/spark/commit/86174ea89b39a300caaba6baffac70f3dc702788)), 
but they were not comprehensive. This PR extends those fixes to all 
the parsers and adds tests for each case.
    
    ## How was this patch tested?
    
    Regression tests
    
    Author: Burak Yavuz <brkyvzgmail.com>
    
    Closes #1135 from brkyvz/json-32-dbx.
    
    ## **IMPORTANT** Warmfix instructions
    
    If this PR needs to be warmfixed (i.e. merged into the release branch after 
the code freeze), please follow steps below.
    
    What type of warmfix is this? Please select **exactly one choice**, or 
write description in Other.
    
    - [ ] Regression (e.g. fixing the behavior of a feature that regressed in 
the current release cycle)
    - [ ] ES ticket fix (e.g. Customer or internally requested update/fix)
    - [ ] Other (please describe):
    
    Make the following updates to this PR:
    
    - [ ] Add `[WARMFIX]` in the title of this PR.
    - [ ] Label the PR using label(s) corresponding to the WARMFIX branch(es). 
The label name should be in the format of `dbr-branch-a.b` (e.g. 
`dbr-branch-3.2`), which matches the release branch name for Runtime release 
`a.b`.
    - [ ] Ask your team lead to sign off this warmfix and add the 
`warmfix-approved` label.
    - [ ] When merging the PR using the merge script, make sure to get this PR 
merged into the following branches:
    
      - The branch against which your PR is opened, and
      - Any extra release branch(es) corresponding to the `dbr-branch-a.b` 
label(s) applied to your PR.
    
    Author: Burak Yavuz <[email protected]>
    
    Closes #1616 from brkyvz/inv-json4x.

----

