Benchmarking Apache Flink via Query Plan

2017-04-22 Thread giacom...@libero.it
Plus, I'm currently using 1.1.2 and I cannot change version due to dependency 
problems.
Thanks in advance,

 Giacomo90

>Messaggio originale
>From: "giacom...@libero.it" <giacom...@libero.it>
>Date: 21/04/2017 17.42
>To: <user@flink.apache.org>
>Subj: R: WELCOME to user@flink.apache.org
>
>Dear Users and Apache Flink devs,
>
> For each one of my distributed computation, I'm generating and 
>reading the json files produced by the getExecutionPlan() in order to 
motivate 
>my benchmarks. Is there some guide providing an explaination of the exact 
>meaning of the fields of the generated JSON file? I'm trying to 
differentiate 
>from the timing result which part of the computation time was spent sending 
>messages and which time was spent during either I/O or CPU operations.
> By the way, I also noticed that I do not get any information 
>concerning the actual data that is been used and transmitted throughout the 
>network (the actual data size and the messages' data size). 
> Moreover, currently I'm using the following way to get the JSON file
>
>> createAndRegisterDataSinks();
>> String plan = globalEnvironment.getExecutionPlan();
>> createAndRegisterDataSinks();
>> globalEnvironment.execute(getClass().getSimpleName()); // Running the 
actual 
>class
>
>  Is there a better way to do it?
>  Thanks in advance for your support,
>
>Giacomo90
>




R: WELCOME to user@flink.apache.org

2017-04-21 Thread giacom...@libero.it
Dear Users and Apache Flink devs,

 For each one of my distributed computation, I'm generating and 
reading the json files produced by the getExecutionPlan() in order to motivate 
my benchmarks. Is there some guide providing an explaination of the exact 
meaning of the fields of the generated JSON file? I'm trying to differentiate 
from the timing result which part of the computation time was spent sending 
messages and which time was spent during either I/O or CPU operations.
 By the way, I also noticed that I do not get any information 
concerning the actual data that is been used and transmitted throughout the 
network (the actual data size and the messages' data size). 
 Moreover, currently I'm using the following way to get the JSON file

> createAndRegisterDataSinks();
> String plan = globalEnvironment.getExecutionPlan();
> createAndRegisterDataSinks();
> globalEnvironment.execute(getClass().getSimpleName()); // Running the actual 
class

  Is there a better way to do it?
  Thanks in advance for your support,

Giacomo90