fpcalc on Hadoop streaming can't find file

2014-08-25 Thread Edmund Day
I have an hdfs directory that contains audio files. I wish to run fpcalc on each file using Hadoop streaming. I can do this locally no problem, but in hadoop fpcalc cannot see the files. My code is:     import shlex     cli = './fpcalc -raw -length ' + str(sample_length) + ' ' + file_a     from

connecting hiveserver2 through ssh tunnel - time out

2014-08-25 Thread murat migdisoglu
Hello, Due to some firewall restrictions, I need to connect from tableau to the hiveserver2 through ssh tunnel.. I tried tunneling from port range 1 -10004 but tableau still times out.. my hiveserver2 is running on 10.0.0.100 and I'm on 192. network. I tried ssh murat@10.0.0.100 -L

Local file system to access hdfs blocks

2014-08-25 Thread Demai Ni
Hi, folks, New in this area. Hopefully to get a couple pointers. I am using Centos and have Hadoop set up using cdh5.1(Hadoop 2.3) I am wondering whether there is a interface to get each hdfs block information in the term of local file system. For example, I can use Hadoop fsck

Re: Missing Snapshots for 2.5.0

2014-08-25 Thread Tsuyoshi OZAWA
Hi Mark, Thanks for your reporting. I also confirmed that we cannot access jars of Hadoop 2.5.0. Karthik, could you check this problem? Thanks, - Tsuyoshi On Thu, Aug 21, 2014 at 2:08 AM, Campbell, Mark mark.campb...@xerox.com wrote: It seems that all the needed archives (yard, mapreduce,

what do you call it when you use Tez?

2014-08-25 Thread Adaryl Bob Wakefield, MBA
You've got MapReduce jobs right? What is it called if, instead, you're using Tez? A Tez job? Adaryl Bob Wakefield, MBA Principal Mass Street Analytics 913.938.6685 www.linkedin.com/in/bobwakefieldmba Twitter: @BobLovesData

namenod shutdown. epoch number mismatch

2014-08-25 Thread cho ju il
hadoop version 2.4.1 Namenode shutdown to become epoch number mismatch. Why suddenly epoch numbers mismatch ? Why suddenly namenode shutdown ? *** namenode log 2014-08-26 12:17:48,625 INFO org.apache.hadoop.hdfs.server.blockmanagement.CacheReplicationMonitor: Rescanning after 3