zhijiangW edited a comment on issue #6417: [FLINK-9913][runtime] Improve output 
serialization only once in RecordWriter
URL: https://github.com/apache/flink/pull/6417#issuecomment-413127511
 
 
   For benchmark, I also create a new job with only source and map vertex, and 
the source broadcast Long type value to all the maps. I execute the same job 
for setting the parallelism 1:100 and 1:200 separately, and the throughput 
increases 13%, 15% separately in the      following:
   
   ```
   parallelism      | throughput (before)          | throughput (now)
   1:100            | 70.760 ± 10.557 ops/ms       | 83.480 ± 1.967 ops/ms
   1:200            | 37.756 ± 1.170 ops/ms        | 43.316 ± 2.176 ops/ms
   ```
   
   In theory, we only reduce the number of serialization times and do not 
introduce any time-cost operations.  I will further verify the 
`StreamNetworkThroughputBenchmarkExecutor` as you mentioned and show your 
results after done. :)
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to