GitHub user vincent-grosbois opened a pull request:

    https://github.com/apache/spark/pull/21933

    [SPARK-24917] make chunk size configurable

    Add an option in Spark configuration to change the chunk size (which is by 
default 4 Mb).
    
    This would allow to bypass the issue mentionned in SPARK-24917 by allowing 
users to define larger chunks
    
    ## What changes were proposed in this pull request?
    I'm introducing a configuration parameter to define the chunk size
    
    ## How was this patch tested?
    not tested as it's a trivial test. I can write a test if needed

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/vincent-grosbois/spark chunk_size

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/21933.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #21933
    
----
commit 0251bd517e7fd3e695cb8366ffa03de8c9e2900b
Author: vincent-grosbois <vincent.grosbois@...>
Date:   2018-07-31T15:21:24Z

    [SPARK-24917] make chunk size configurable
    
    Add an option in Spark configuration to change the chunk size (which is by 
default 4 Mb).
    
    This would allow to bypass the issue mentionned in SPARK-24917 when 
fetching large partitions (a bit less than 2 Gb)

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to