[
https://issues.apache.org/jira/browse/TEZ-1317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083826#comment-14083826
]
Bikas Saha commented on TEZ-1317:
---------------------------------
Fixed up DataSource/SinkDescriptors to contain the necessary info and use that
info internally to infer credentials, numTasks and location hints where
relevant.
There is no addInputPaths. Its already called setInputPaths. CSV is what
fileinputformat supports. So using that since most people would be using it
today.
The exception already states that the format should be assignable from
FileInputFormat (TextInputFormat etc). It cannot be exactly FileInputFormat
since its abstract.
Changed to createConfigurer(). Sticking to create() since its creating the
descriptor.
> Simplify MRinput/MROutput configuration
> ---------------------------------------
>
> Key: TEZ-1317
> URL: https://issues.apache.org/jira/browse/TEZ-1317
> Project: Apache Tez
> Issue Type: Sub-task
> Reporter: Siddharth Seth
> Assignee: Bikas Saha
> Priority: Blocker
> Attachments: TEZ-1317.1.patch, TEZ-1317.2.patch, TEZ-1317.3.patch,
> TEZ-1317.3.patch, TEZ-1317.4.patch
>
>
> Should at least be possible to generate the correct Descriptors.
> Potentially change the addInput / addOutput APIs to accept a single entity
> which encapsulates InputDescriptor and InputInitializerDescriptor. Similarly
> for Outputs.
--
This message was sent by Atlassian JIRA
(v6.2#6252)