[ 
https://issues.apache.org/jira/browse/FLINK-20860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated FLINK-20860:
-----------------------------------
    Labels: pull-request-available  (was: )

> Allow streaming operators to use managed memory
> -----------------------------------------------
>
>                 Key: FLINK-20860
>                 URL: https://issues.apache.org/jira/browse/FLINK-20860
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Runtime / Configuration, Runtime / Task
>            Reporter: Jark Wu
>            Assignee: Xintong Song
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.13.0
>
>
> We are planning to use some batch algorithms (sorting & bytes hash table) to 
> improve the performance of streaming SQL operators, especially for the the 
> mini-batch operators introduced by FLIP-145.
> Currently, we have to buffer input records and accumulators in heap (i.e. 
> Java HashMap) which is not efficient and there are potential risks of full GC 
> and OOM. With the managed memory, we can fully use the memory to buffer more 
> data without worrying about OOM and improve the performance a lot. However, 
> the managed memory is not allowed to be used in streaming operators. 
> As discussed in the mailing list [1], we have reached a consensus that we can 
> extend the configuration {{taskmanager.memory.managed.consumer-weights}} to 
> have 2 more options {{OPERATOR}} and {{STATE_BACKEND}}, the available 
> consumer options will be :
> * `OPERATOR` for both streaming and bath operators
> * `STATE_BACKEND` for state backends
> * `PYTHON` for python processes
> * `DATAPROC` as a legacy key for state backend or batch operators if
> `STATE_BACKEND` or `OPERATOR` are not specified.
> The previous default value is {{DATAPROC:70,PYTHON:30}}, the new default 
> value will be {{OPERATOR:70,STATE_BACKEND:70,PYTHON:30}}.
> The weight for OPERATOR and STATE_BACKEND will be the same value to align 
> with previous behaviors.
> [1]: 
> http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Allow-streaming-operators-to-use-managed-memory-td47327.html



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to