Arkady Borkovsky wrote:
On Apr 12, 2007, at 4:18 PM, Doug Cutting wrote:
The new classes in question are not a part of streaming, but are being added to the mapred package.
Is not Hadoop Streaming part of Hadoop MapReduce product?

Streaming is currently in contrib not in the core.

I'm just suggesting we use consistent, accurate and descriptive terminology within the core. These classes to not read nor generate lines. They do facilitate interoperability with other line-based tools like TextInputFormat and TextOutputFormat.

And are not the classes in question supposed to be referred to by "naive users" on the Hadoop Streaming command line?

I don't think we should name core classes to make the streaming command line more intuitive. If all else were equal, sure, that's a good thing, but, core classes should be named as consistently, accurately and descriptively as possible. If streaming's command line is confusing, then that should be fixed in streaming, no?

Doug

Reply via email to