[ https://issues.apache.org/jira/browse/TEZ-1080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14033143#comment-14033143 ]
Siddharth Seth commented on TEZ-1080: ------------------------------------- Usage examples Complex, configuring multiple properties. {code} SortedPartitionedPairConf conf = SortedPartitionedPairConf.newBuilder("keyClass", "valClass") .setKeyComparatorClass("comparatorClass") .enableCompression(null) .configureOutput("partitionerClassName") .setSortBufferSize(1024) .setSorterNumThreads(1) .build() .configureInput() .setCombiner("Combiner", Map<String, String> combinerProperties) .setShuffleBufferFraction(0.25f) .build() .setAdditionalConf("key", "val") .setAdditionalCons(Map<String, String>) .build(); conf.getInputPayload(); conf.getOutputPayload(); {code} Simple, relying primarily on defaults {code} SortedPartitionedPairConf conf = SortedPartitionedPairConf.newBuilder("keyClass", "valClass") .configureOutput("partitionerClassName").build() .configureInput().build() .build(); conf.getInputPayload(); conf.getOutputPayload(); {code} Optionally, these configuration objects could generated the actual Edge to be used in the DAGPlan. Edge conf.createEdge(... ...) Combiners, Partitioners etc would receive separate key-value pairs, if they need additional configuration. The current method of relying on a configuration would no longer work. [~rohini], [~hagleitn] - any feedback on this API ? > Configuration for non MR based Inputs/Outputs > --------------------------------------------- > > Key: TEZ-1080 > URL: https://issues.apache.org/jira/browse/TEZ-1080 > Project: Apache Tez > Issue Type: Sub-task > Reporter: Siddharth Seth > Assignee: Siddharth Seth > Attachments: TEZ-1080.wip.1.txt, TEZ-1080.wip.2.txt > > > De-link configuration from MRHelpers (except for the YARNRunner case), and > allow for these to be configured easily - exposing necessary setters / > getters without having to rely on config keys. -- This message was sent by Atlassian JIRA (v6.2#6252)