[ 
https://issues.apache.org/jira/browse/CRUNCH-519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552631#comment-14552631
 ] 

Josh Wills commented on CRUNCH-519:
-----------------------------------

Hey [~ronhash], thanks for this! I had a couple of q's based on looking it 
over: first, why Kbs for the size of the PCollection vs. Mbs? Also, do we only 
want to include the numReducers in the plan file when it's not specified by the 
developer, or should we always include it (maybe w/some indication as to 
whether it was hard-coded or determined by Crunch?)

> Plan dot file can display more infromation
> ------------------------------------------
>
>                 Key: CRUNCH-519
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-519
>             Project: Crunch
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Ron Hashimshony
>            Assignee: Josh Wills
>         Attachments: CRUNCH-519.diff
>
>
> The current plan dot file display nicely the jobs, with nice names and arrows.
> However it does not explain how the planner decided on the reducers number, 
> which is based on the input data size, scale factor and desired size per 
> reducer.
> I suggest adding this information to the dot file.
> An addition to the DotfileWriter class can do this easily.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to