[jira] [Commented] (TEZ-1132) Consistent naming of Input and Outputs

Siddharth Seth (JIRA) Wed, 13 Aug 2014 19:26:21 -0700

    [ 
https://issues.apache.org/jira/browse/TEZ-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14096463#comment-14096463
 ]


Siddharth Seth commented on TEZ-1132:
-------------------------------------

bq. Change KV to KeyValue in all names.
Optional, the class names are already long. Also this can get confusing w.r.t 
Readers since some are KeyValue based, others are KeyValues based.
If we evolve these to be RowBased at some point - that will just be a new set 
if Inputs/Otuputs.

bq. LocalOnFileSorterOutput should probably be removed.
bq. LocalMergedInput should probably be moved out.
+1

bq. Do we need the OnFile prefix on these? These could potentially write to 
HDFS?
Agree. I think we should remove it.

bq. Is the Shuffled prefix needed? The reader threads could potentially read 
from HDFS?
Shuffled can be interpreted in several different ways - mapreduce shuffle, just 
moving data. Probably best to just remove it to avoid confusion.

The proposed input names should also have KV/KeyValue.



> Consistent naming of Input and Outputs
> --------------------------------------
>
>                 Key: TEZ-1132
>                 URL: https://issues.apache.org/jira/browse/TEZ-1132
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: Bikas Saha
>            Assignee: Bikas Saha
>            Priority: Blocker
>
> Some places we should Sorted Partitioned. In others we should Shuffled. We 
> should use a consistent naming scheme based on Sorted, Grouped, Partitioned 
> sub-terms so that the function is clear from the name.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (TEZ-1132) Consistent naming of Input and Outputs

Reply via email to