[ https://issues.apache.org/jira/browse/CRUNCH-438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Christian Tzolov updated CRUNCH-438:
------------------------------------
    Attachment: CRUNCH-438.3.patch

Gabriel, I think your point makes sense, so I've attached a new patch (v3) that removes the functionality that writes dotfiles into the PIPELINE_DOTFILE_OUTPUT_DIR folder. I've also fixed (hopefully) the spelling of the names.

The debug diagrams will be stored in the following configuration properties:
* PCOLLECTION_LINEAGE_DOTFILE
* BASE_GRAPH_PLAN_DOTFILE
* SPLIT_GRAPH_PLAN_DOTFILE
* RTNODES_PLAN_DOTFILE

Would it make sense to enable the 'debug' dotfile generation only if pipeline.enableDebug() has been called?

Also, shall we move all DotfileWriter... classes into a dedicated package? For example: org.apache.crunch.impl.mr.plan.tracke

> Visualizations of some important internal/intermediate pipeline planning states
> -------------------------------------------------------------------------------
>
>                 Key: CRUNCH-438
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-438
>             Project: Crunch
>          Issue Type: Improvement
>          Components: Core
>    Affects Versions: 0.10.0, 0.8.3
>            Reporter: Christian Tzolov
>            Assignee: Christian Tzolov
>         Attachments: CRUNCH-438.2.patch, CRUNCH-438.3.patch, CRUNCH-438.patch
>
>
> To improve the understandability of the pipeline planning stages, it would help
> to visualize some intermediate planning states, such as:
> - PCollection lineage (visualizing the output-pcollection-targets structure).
> - MSCRPlanner's planning graphs before and after the split-up of dependent
> GBK nodes.
> - RTNode hierarchy, along with the Input and Output configurations as
> persisted in the Configuration before the execution of the pipeline.
> Most of the information can be intercepted in the MSCRPlanner#plan() method.

--
This message was sent by Atlassian JIRA
(v6.2#6252)