when using MultipleOutputs?
We are taking our input and splitting it into 3 files, so MultipleOutputs
seems to be a natural choice. Performance is a bit slow, though.
Cheers!
David
From: David Poisson [david.pois...@ca.fujitsu.com]
Sent: Thursday
Howdy,
I want to take a look at an MR job which seems to be slower than I had
hoped. Mind you, this MR job is only running on a pseudo-distributed VM
(Cloudera CDH4).
I have modified my mapred-site.xml with the following (that last one is
commented out because it crashes my MR job):
it be interfering? Other than that,
my VM's networking is set to bridged, if that makes any difference. Mind you,
I'm trying to connect from my VM to my VM.
I'm at a loss here. Could really use some guidance. Thanks!
David
From: David Poisson [david.pois
Hi,
We are still very new at all of this HBase/Hadoop/MapReduce stuff. We are
looking for the best practices that will fit our requirements. We are currently
using the latest Cloudera VMware image (single node) for our development tests.
The problem is as follows:
We have multiple sources in