Re: Value-Only Reduce Output

2009-02-04 Thread Rasit OZDAS
I tried it myself, it doesn't work. I've also tried stream.map.output.field.separator and map.output.key.field.separator parameters for this purpose, they don't work either. When hadoop sees empty string, it takes default tab character instead. Rasit 2009/2/4 jason hadoop

Re: Value-Only Reduce Output

2009-02-04 Thread jason hadoop
For your reduce, the parameter is stream.reduce.input.field.separator, if you are supplying a reduce class and I believe the output format is TextOutputFormat... It looks like you have tried the map parameter for the separator, not the reduce parameter. From 0.19.0 PipeReducer: configure:

Value-Only Reduce Output

2009-02-03 Thread Jack Stahl
Hello, I'm interested in a map-reduce flow where I output only values (no keys) in my reduce step. For example, imagine the canonical word-counting program where I'd like my output to be an unlabeled histogram of counts instead of (word, count) pairs. I'm using HadoopStreaming (specifically,

Re: Value-Only Reduce Output

2009-02-03 Thread jason hadoop
If you are using the standard TextOutputFormat, and the output collector is passed a null for the value, there will not be a trailing tab character added to the output line. output.collect( key, null ); Will give you the behavior you are looking for if your configuration is as I expect. On Tue,

Re: Value-Only Reduce Output

2009-02-03 Thread jason hadoop
Ooops, you are using streaming., and I am not familar. As a terrible hack, you could set mapred.textoutputformat.separator to the empty string, in your configuration. On Tue, Feb 3, 2009 at 9:26 PM, jason hadoop jason.had...@gmail.com wrote: If you are using the standard TextOutputFormat, and