GitHub user MaxGekk opened a pull request:

    https://github.com/apache/spark/pull/20963

    [SPARK-23849][SQL] Tests for the samplingRatio option of JSON datasource

    ## What changes were proposed in this pull request?
    
    Proposed tests checks that only subset of input dataset is touched during 
schema inferring.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/MaxGekk/spark-1 json-sampling-tests

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20963.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20963
    
----
commit 76e38a8670303afe20b530fc2a837a1363a01974
Author: Maxim Gekk <maxim.gekk@...>
Date:   2018-04-02T13:28:32Z

    Adding samplingRation tests for json

commit 1acc3ece4121ffc3209fb65a20526dec574820e5
Author: Maxim Gekk <maxim.gekk@...>
Date:   2018-04-02T13:34:06Z

    Removing debug code

commit 0d5fcfb59e112cc989753404813d32432257a0f6
Author: Maxim Gekk <maxim.gekk@...>
Date:   2018-04-02T18:45:18Z

    Adding the ticket ref to test titles

commit a664465d8a3042f06cf7a866283c34a674b24939
Author: Maxim Gekk <maxim.gekk@...>
Date:   2018-04-02T18:48:56Z

    Making Scala style checker happy

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to