[ 
https://issues.apache.org/jira/browse/FLINK-4399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16921355#comment-16921355
 ] 

Till Rohrmann commented on FLINK-4399:
--------------------------------------

Thanks for the detailed answer [~SleePy]. Given that we don't have an acute 
problem with too large messages I'm aware of atm and only limited review 
capacities I'd suggest that we add some utilities to offload large payloads on 
the {{BlobServer}} the next time we run into this problem.

For improving the RPC service to support large messages, I think it is not 
super critical. If you want, you can work on it but I cannot promise to help 
you with the review/merging in this release cycle. I would try to squeeze it in 
whenever there is time. So if you are ok with not getting fast feedback, then 
it could work.

> Add support for oversized messages
> ----------------------------------
>
>                 Key: FLINK-4399
>                 URL: https://issues.apache.org/jira/browse/FLINK-4399
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Coordination
>         Environment: FLIP-6 feature branch
>            Reporter: Stephan Ewen
>            Assignee: Biao Liu
>            Priority: Major
>              Labels: flip-6
>
> Currently, messages larger than the maximum Akka Framesize cause an error 
> when being transported. We should add a way to pass messages that are larger 
> than the Framesize, as may happen for:
>   - {{collect()}} calls that collect large data sets (via accumulators)
>   - Job submissions and operator deployments where the functions closures are 
> large (for example because it contains large pre-loaded data)
>   - Function restore in cases where restored state is larger than 
> checkpointed state (union state)
> I suggest to use the {{BlobManager}} to transfer large payload.
>   - On the sender side, oversized messages are stored under a transient blob 
> (which is deleted after first retrieval, or after a certain number of minutes)
>   - The sender sends a "pointer to blob message" instead.
>   - The receiver grabs the message from the blob upon receiving the pointer 
> message
> The RPC Service should be optionally initializable with a "large message 
> handler" which is internally the {{BlobManager}}.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

Reply via email to