[ 
https://issues.apache.org/jira/browse/TEZ-3216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15331233#comment-15331233
 ] 

Siddharth Seth commented on TEZ-3216:
-------------------------------------

bq. That will give you the floor value. The code is from DATA_RANGE_IN_MB. If 
we were to use the java ceil function, then we have to cast it to double type 
and cast the result back to int.
Makes sense.

bq. 16 bits equals to 65GB. This is just in case some apps have really skew 
data. As you said, the extra buffer won't stay in AM for too long. Also 
protobuf integer encoding might help with RPC size.
If we use all 32bits, would it make sense for this value to be in KB ? 
[~rajesh.balamohan] - your thoughts on this from having run lots of jobs ? The 
original partition stats range is 0, 1, 10, 100, 1000 (assuming MB level is 
adequate ?)

Mostly minor comments.
- The comment about the ShuffleVertexManager. Can you please include a TODO 
TEZ-3303 along with the comment.
- The ReportPartitionStats enum. I'd prefer not introducing a separate class 
specifically for a single config parameter. This still creates a new class, but 
can a simple enum be defined inside TezRuntimeConfiguration, and move the 
Helper methods into a separate Utility class which is for private consumption 
(and other config parameters and validation) ?
- ReportPartitionStats - Mark "false", "true" as deprecated in favor of the new 
values.
- ReportPartitionStats - Documentation for the various values. Some information 
on the impact of lossless on RPC / AM memory would be useful. (AM memory would 
depend on how a user plugin handles this property. For the ShuffleVertexManager 
- this is aggregation)
- Naming of the configs: We're dropping byte level information - so lossless 
isn't exactly correct. Maybe "precise" or something else ? Any thoughts on a 
name other than 'lossy'. Lossy would typically be seen as a negative - but is 
sufficient in a lot of cases, and has the advantage of reduced memory pressure 
/ network traffic.
- ReportPartitionStats fromString - Instead of returning null, throw an 
IllegalArgumentException if the value specified is none of the expected values 
or null.

> Support for more precise partition stats in VertexManagerEvent
> --------------------------------------------------------------
>
>                 Key: TEZ-3216
>                 URL: https://issues.apache.org/jira/browse/TEZ-3216
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Ming Ma
>            Assignee: Ming Ma
>         Attachments: TEZ-3216-2.patch, TEZ-3216.patch
>
>
> Follow up on TEZ-3206 discussion, at least for some use cases, more accurate 
> partition stats will be useful for DataMovementEvent routing. Maybe we can 
> provide a config option to allow apps to choose the more accurate partition 
> stats over RoaringBitmap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to