GitHub user jinxing64 opened a pull request:

    https://github.com/apache/spark/pull/18211

    [WIP][SPARK-20994] Alleviate memory pressure in StreamManager

    ## What changes were proposed in this pull request?
    
    In current code, chunks are fetched from shuffle service in two steps:
    Step-1. Send `OpenBlocks`, which contains the blocks list to to fetch;
    Step-2. Fetch the consecutive chunks from shuffle-service by `streamId` and 
`chunkIndex`
    Conceptually, there is no need to send the blocks list in step-1. Client 
can send the blockId in Step-2. Receiving `ChunkFetchRequest`, server can check 
if the chunkId is in local block manager and send back response. 
    Thus memory cost can be improved.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jinxing64/spark SPARK-20994

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/18211.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #18211
    
----
commit 883089aa824dabfb9b82a17546a953f1f0a22be4
Author: jinxing <[email protected]>
Date:   2017-06-05T09:19:18Z

    [SPARK-20994] Alleviate memory pressure in StreamManager

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to