[ 
https://issues.apache.org/jira/browse/YARN-6426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15952779#comment-15952779
 ] 

Arun Suresh commented on YARN-6426:
-----------------------------------

[~roniburd], thanks for the patch.

I am sure compressing would help reduce the size, but there are a couple of 
things to take into account:
Upgrades, for example. For 3.0.0-alpha3, it might not be a big problem (since we 
don't guarantee upgrades from 2.x to 3.x). But if we are targeting 2.8.1 / 2.9.0, 
we need to figure out a way for:
# The 2.8.1 / 2.9.0 RM to be able to read uncompressed AppStateData znodes 
after an upgrade.
# The 2.8.0 RM to be able to ignore compressed AppStateData it gets from ZK 
after a downgrade.
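
One way the first requirement could be approached is sketched below. This is not taken from zkcompression.patch: the class name ZnodeCodec and its methods are hypothetical, and gzip is only an assumed choice of codec. The idea is that the reader checks for the two-byte gzip magic number (0x1f 0x8b) and treats anything else as a legacy uncompressed payload. As far as I can tell, a serialized protobuf message cannot begin with 0x1f, because that byte would decode as a field tag with the undefined wire type 7, so the check should not misfire on existing znodes.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical helper, not part of YARN: wraps znode payloads in gzip and
// transparently reads payloads written before compression was enabled.
final class ZnodeCodec {
    private ZnodeCodec() {}

    // Compress the serialized protobuf bytes before writing them to ZK.
    public static byte[] compress(byte[] raw) {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(buf)) {
            gz.write(raw);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buf.toByteArray();
    }

    // True if the payload starts with the gzip magic number 0x1f 0x8b.
    public static boolean looksCompressed(byte[] data) {
        return data != null && data.length >= 2
                && (data[0] & 0xff) == 0x1f
                && (data[1] & 0xff) == 0x8b;
    }

    // Return the serialized protobuf bytes, decompressing only when needed.
    // Payloads without the gzip header are returned unchanged.
    public static byte[] decode(byte[] stored) {
        if (!looksCompressed(stored)) {
            return stored;
        }
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPInputStream gz =
                 new GZIPInputStream(new ByteArrayInputStream(stored))) {
            byte[] chunk = new byte[4096];
            int n;
            while ((n = gz.read(chunk)) != -1) {
                out.write(chunk, 0, n);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toByteArray();
    }
}
```

This only covers the upgrade direction. A 2.8.0 RM has no such check and would still fail to parse a compressed znode after a downgrade, so the second requirement needs a separate answer (for example, keeping compression off by default, which is an assumption here and not something the patch is known to do).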


> Compress ZK YARN keys to scale up (especially AppStateData)
> -----------------------------------------------------------
>
>                 Key: YARN-6426
>                 URL: https://issues.apache.org/jira/browse/YARN-6426
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: yarn
>    Affects Versions: 3.0.0-alpha2
>            Reporter: Roni Burd
>            Assignee: Roni Burd
>              Labels: patch
>         Attachments: zkcompression.patch
>
>
> ZK today stores the protobuf files uncompressed. This is not an issue except 
> that if a customer job has thousands of files, AppStateData will store the 
> user context as a string with multiple URLs and it is easy to get to 1MB or 
> more. 
> This can put unnecessary strain on ZK and make the process slow. 
> The proposal is to simply compress protobufs before sending them to ZK.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
