I tried it myself, it doesn't work.
I've also tried stream.map.output.field.separator and
map.output.key.field.separator parameters for this purpose, they
don't work either. When hadoop sees empty string, it takes default tab
character instead.
Rasit
2009/2/4 jason hadoop
For your reduce, the parameter is stream.reduce.input.field.separator, if
you are supplying a reduce class and I believe the output format is
TextOutputFormat...
It looks like you have tried the map parameter for the separator, not the
reduce parameter.
From 0.19.0 PipeReducer:
configure:
Hello,
I'm interested in a map-reduce flow where I output only values (no keys) in
my reduce step. For example, imagine the canonical word-counting program
where I'd like my output to be an unlabeled histogram of counts instead of
(word, count) pairs.
I'm using HadoopStreaming (specifically,
If you are using the standard TextOutputFormat, and the output collector is
passed a null for the value, there will not be a trailing tab character
added to the output line.
output.collect( key, null );
Will give you the behavior you are looking for if your configuration is as I
expect.
On Tue,
Ooops, you are using streaming., and I am not familar.
As a terrible hack, you could set mapred.textoutputformat.separator to the
empty string, in your configuration.
On Tue, Feb 3, 2009 at 9:26 PM, jason hadoop jason.had...@gmail.com wrote:
If you are using the standard TextOutputFormat, and