subject:"Value\-Only Reduce Output"

Re: Value-Only Reduce Output

2009-02-04 Thread Jack Stahl

My (0.18.2) reduce src looks like this: write(key); clientOut_.write('\t'); write(val); clientOut_.write('\n'); which explains why avoiding the trailing tab is unavoidable. Thanks for your help, though, Jason! 2009/2/4 jason hadoop > For your reduce, th

Re: Value-Only Reduce Output

2009-02-04 Thread jason hadoop

For your reduce, the parameter is stream.reduce.input.field.separator, if you are supplying a reduce class and I believe the output format is TextOutputFormat... It looks like you have tried the map parameter for the separator, not the reduce parameter. >From 0.19.0 PipeReducer: configure:

Re: Value-Only Reduce Output

2009-02-04 Thread Rasit OZDAS

I tried it myself, it doesn't work. I've also tried stream.map.output.field.separator and map.output.key.field.separator parameters for this purpose, they don't work either. When hadoop sees empty string, it takes default tab character instead. Rasit 2009/2/4 jason hadoop > > Ooops, you are

Re: Value-Only Reduce Output

2009-02-03 Thread jason hadoop

Ooops, you are using streaming., and I am not familar. As a terrible hack, you could set mapred.textoutputformat.separator to the empty string, in your configuration. On Tue, Feb 3, 2009 at 9:26 PM, jason hadoop wrote: > If you are using the standard TextOutputFormat, and the output collector is

Re: Value-Only Reduce Output

2009-02-03 Thread jason hadoop

If you are using the standard TextOutputFormat, and the output collector is passed a null for the value, there will not be a trailing tab character added to the output line. output.collect( key, null ); Will give you the behavior you are looking for if your configuration is as I expect. On Tue, F

Value-Only Reduce Output

2009-02-03 Thread Jack Stahl

Hello, I'm interested in a map-reduce flow where I output only values (no keys) in my reduce step. For example, imagine the canonical word-counting program where I'd like my output to be an unlabeled histogram of counts instead of (word, count) pairs. I'm using HadoopStreaming (specifically, I'm

Re: Value-Only Reduce Output

Re: Value-Only Reduce Output

Re: Value-Only Reduce Output

Re: Value-Only Reduce Output

Re: Value-Only Reduce Output

Value-Only Reduce Output

6 matches

Site Navigation

Mail list logo

Footer information