RE: newbie question

2013-01-16 Thread Dave Shine
conf) { ... } @Override public void map() { ... } } Dave Shine Sr. Software Engineer 321.939.5093 direct | 407.314.0122 mobile CI Boost(tm) Clients Outperform Online(tm) www.ciboost.comhttp://www.ciboost.com/ From: jamal sasha [mailto:jamalsha

RE: Exit code 126?

2013-01-14 Thread Dave Shine
I was referring to https://issues.apache.org/jira/browse/MAPREDUCE-2374 Dave Shine Sr. Software Engineer 321.939.5093 direct | 407.314.0122 mobile CI Boost™ Clients Outperform Online™ www.ciboost.com -Original Message- From: Jean-Marc Spaggiari [mailto:jean-m...@spaggiari.org] Sent

RE: Exit code 126?

2013-01-11 Thread Dave Shine
We see this quite a bit. There is a JIRA (I don't remember the number) that addresses this issue. It has been applied to the apache distro, but I don't think it is incorporated in and release of CDH yet. Dave Shine Sr. Software Engineer 321.939.5093 direct | 407.314.0122 mobile CI Boost

Task Attempt Failed

2012-10-08 Thread Dave Shine
. But, the job takes a couple of hours when I don't see this message, longer when I do. And I need to get this job to complete faster if possible. I'm running CDH3U3. Any help would be greatly appreciated. Thanks, Dave Shine Sr. Software Engineer 321.939.5093 direct | 407.314.0122 mobile

RE: moving data

2012-08-01 Thread Dave Shine
From the command line you can hadoop fs -copyFromLocal source destination Dave Shine Sr. Software Engineer 321.939.5093 direct | 407.314.0122 mobile CI Boost(tm) Clients Outperform Online(tm) www.ciboost.comhttp://www.ciboost.com/ From: Soheila Dehghanzadeh [mailto:sally...@gmail.com] Sent

RE: Distributing Keys across Reducers

2012-07-25 Thread Dave Shine
of the story, verify the data before you blame the software. Dave Shine Sr. Software Engineer 321.939.5093 direct |  407.314.0122 mobile CI BoostT Clients  Outperform OnlineT  www.ciboost.com -Original Message- From: Dave Shine [mailto:dave.sh...@channelintelligence.com] Sent: Friday, July

RE: Counting records

2012-07-23 Thread Dave Shine
programmatically get the values of all counters including the one you create in the Map() method. Dave Shine Sr. Software Engineer 321.939.5093 direct | 407.314.0122 mobile CI Boost(tm) Clients Outperform Online(tm) www.ciboost.com -Original Message- From: Peter Marron [mailto:peter.mar

Distributing Keys across Reducers

2012-07-20 Thread Dave Shine
. that might give me some options on how to deal with this issue? Thanks, Dave Shine Sr. Software Engineer 321.939.5093 direct | 407.314.0122 mobile [cid:image001.png@01CD6658.D1D27BC0] CI Boost(tm) Clients Outperform Online(tm) www.ciboost.comhttp://www.ciboost.com/ facebook platform | where

RE: Distributing Keys across Reducers

2012-07-20 Thread Dave Shine
Thanks Syed. I'm not using HBase, so I don't think this is related to my problem. Dave Shine Sr. Software Engineer 321.939.5093 direct | 407.314.0122 mobile CI Boost(tm) Clients Outperform Online(tm) www.ciboost.comhttp://www.ciboost.com/ From: syed kather [mailto:in.ab...@gmail.com] Sent

RE: Distributing Keys across Reducers

2012-07-20 Thread Dave Shine
. Dave Shine Sr. Software Engineer 321.939.5093 direct |  407.314.0122 mobile CI BoostT Clients  Outperform OnlineT  www.ciboost.com -Original Message- From: Harsh J [mailto:ha...@cloudera.com] Sent: Friday, July 20, 2012 11:56 AM To: mapreduce-user@hadoop.apache.org Cc: john.armstr

RE: Distributing Keys across Reducers

2012-07-20 Thread Dave Shine
have changed it to use a fewer number of reducers just to see how that effects the distribution. Dave Shine Sr. Software Engineer 321.939.5093 direct | 407.314.0122 mobile CI Boost(tm) Clients Outperform Online(tm) www.ciboost.com -Original Message- From: Tim Broberg [mailto:tim.brob

RE: Hadoop fair scheduler doubt: allocate jobs to pool

2012-03-01 Thread Dave Shine
I've just started playing with the Fair Scheduler. To specify the pool at job submission time you set the mapred.fairscheduler.pool property on the Job Conf to the name of the pool you want the job to use. Dave -Original Message- From: Merto Mertek [mailto:masmer...@gmail.com] Sent:

RE: Combining AVRO files efficiently within HDFS

2012-01-06 Thread Dave Shine
Frank, We have a very serious small file problem. I created a M/R job that combines files as it seemed best to use all the resources of the cluster rather than opening a stream and combining files single threaded or trying to do something via command line. Dave -Original Message-

dfs.name.dir and fs.checkpoint.dir

2012-01-04 Thread Dave Shine
in minutes. My question is, is there really any reason at all for specifying more than one directory in fs.checkpoint.dir? I probably did it out of paranoia when I was first configuring the cluster. How is this property configured in other Hadoop environments? Thanks, Dave Shine The information