RE: moving data

2012-08-01 Thread Dave Shine
From the command line you can use hadoop fs -copyFromLocal. Dave Shine, Sr. Software Engineer From: Soheila Dehghanzadeh [mailto:sally...@gmail.com] Sent:
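
For anyone who wants to script the copy rather than type it, here is a minimal sketch in Python that wraps the hadoop fs -copyFromLocal command mentioned in the reply. It assumes the hadoop executable is on the PATH. The helper names and the example paths are hypothetical and not from the original message.

```python
import subprocess


def copy_from_local_cmd(local_path, hdfs_path):
    """Build the argument list for `hadoop fs -copyFromLocal <localsrc> <dst>`."""
    return ["hadoop", "fs", "-copyFromLocal", local_path, hdfs_path]


def copy_from_local(local_path, hdfs_path):
    """Run the copy; raises CalledProcessError if hadoop exits non-zero.

    Assumes the `hadoop` executable is available on the PATH.
    """
    subprocess.run(copy_from_local_cmd(local_path, hdfs_path), check=True)


if __name__ == "__main__":
    # Hypothetical paths, for illustration only; this only prints the command.
    print(" ".join(copy_from_local_cmd("/tmp/data.csv", "/user/example/data.csv")))
```

Passing the arguments as a list avoids shell quoting problems when a path contains spaces.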

RE: Combining AVRO files efficiently within HDFS

2012-01-06 Thread Dave Shine
Frank, We have a very serious small-file problem. I created an M/R job that combines files, as it seemed best to use all the resources of the cluster rather than opening a stream and combining files single-threaded or trying to do something via the command line. Dave

dfs.name.dir and fs.checkpoint.dir

2012-01-04 Thread Dave Shine
livered to the NN in minutes. My question is, is there really any reason at all for specifying more than one directory in fs.checkpoint.dir? I probably did it out of paranoia when I was first configuring the cluster. How is this property configured in other Hadoop environments? Thanks, Dave Sh