April 12, 2007 4:10:59 PM Arkady Borkovsky wrote:
using "Line" in class names stresses the use of these classes for
Streaming.
On Apr 12, 2007, at 4:18 PM, Doug Cutting wrote:
The new classes in question are not a part of streaming, but are being
added to the mapred package.
Is not Hadoop Streaming part of Hadoop MapReduce product?
And are not the classes in question supposed to be referred to by
"naive users" on the Hadoop Streaming command line?
- abd
Unix commands invoked by Streaming operate on lines, not on Text.
Understanding what "Text" means requires knowing internals of Hadoop,
while "line" is used here in its common meaning.
These new classes are part of the internals of Hadoop.
Doug