Did you forget to attach the output?

On Fri, Jan 27, 2012 at 3:31 PM, Matthew Rathbone <[email protected]> wrote:
> Here's my output from that (attached, it's long).
>
> The regular web-uri :8080 works fine until I submit a job, it can see the
> hadoop jobtracker and everything, but when I submit a job it goes haywire.
> I can't see anything obvious in the logs either.
>
> This is all I did:
> start a cluster
> start a job tracker
> hadoop fs -put hadoop-examples.jar
> <mkdirs>
> hadoop jar hadoop-examples.jar wordcount wordcount/input wordcount/output
>
> I figured it might be something to do with MESOS_HOME not being set in
> hadoop-env.sh, so I set that too (on all machines), but it didn't seem to
> help.
>
> If it helps, the jobtracker is still up, and it received the job, but
> doesn't see any nodes.
>
> --
> Matthew Rathbone
> Foursquare | Software Engineer | Server Engineering Team
> [email protected] | @rathboma <http://twitter.com/rathboma> |
> 4sq <http://foursquare.com/rathboma>
>
> On Friday, January 27, 2012 at 5:20 PM, Andy Konwinski wrote:
>
> It looks like a JSON parsing error in the webui python code (i.e. the error
> output shows line 11 of webui/master/index.tpl which is the json code
> "state = json.loads(data)").
>
> What happens if you go to
> http://ec2-107-21-195-96.compute-1.amazonaws.com:5050/master/state.json
> inside the firewall (or open up port 5050 in the EC2 firewall for your
> machine)?
>
> When I do this on my machine locally (before running any frameworks or
> starting any slaves), I see:
>
> {"build_date":"2012-01-25 11:19:19","build_user":"andyk","completed_frameworks":[],"frameworks":[],"id":"201201271511-0","pid":"[email protected]:5050","slaves":[],"start_time":1327705891}
>
> Andy
>
> On Fri, Jan 27, 2012 at 3:02 PM, Matthew Rathbone <[email protected]> wrote:
>
> So I spun up a mesos cluster using the ec2 scripts. So far so good.
> Then I spun up a jobtracker, that worked (after some fiddling)
> Then I tried to submit an example job (wordcount).
>
> First of all, the job tracker receives the job, but then I get these
> errors in the terminal:
> 12/01/27 22:57:12 INFO input.FileInputFormat: Total input paths to process : 0
> 12/01/27 22:57:13 INFO mapred.JobClient: Running job: job_201201272245_0002
> 12/01/27 22:57:14 INFO mapred.JobClient: map 0% reduce 0%
> channel 6: open failed: connect failed: Connection refused
> channel 7: open failed: connect failed: Connection refused
> channel 6: open failed: connect failed: Connection refused
>
> So I check on the mesos dashboard (port 8080) and I see this:
> http://cl.ly/221D193v0l012k0h3W0S
>
> It doesn't look good, anyone have any pointers? (Sorry for spamming the
> list so much over the last couple of days)
>
> --
> Matthew Rathbone
> Foursquare | Software Engineer | Server Engineering Team
> [email protected] | @rathboma (http://twitter.com/rathboma) | 4sq (http://foursquare.com/rathboma)
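
For anyone following along, the state.json check suggested above can be scripted. This is only a rough sketch, not the webui's own code. It assumes Python 3 and a master reachable on port 5050. The helper names (parse_state, fetch_state, summarize) are made up for illustration. The only field names used are ones that appear in the sample output quoted above.

```python
import json
from urllib.request import urlopen


def parse_state(raw):
    """Return (state, error): the decoded document, or None plus a message."""
    try:
        return json.loads(raw), None
    except ValueError as exc:  # json.JSONDecodeError is a subclass of ValueError
        return None, "not valid JSON: %s" % exc


def fetch_state(master_url, timeout=10):
    """Fetch <master_url>/master/state.json and return the body as text.

    master_url is an assumption, e.g. "http://<master-host>:5050".
    """
    url = master_url.rstrip("/") + "/master/state.json"
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def summarize(state):
    """Count the slaves and frameworks listed in a decoded state document."""
    return {
        "slaves": len(state.get("slaves", [])),
        "frameworks": len(state.get("frameworks", [])),
    }
```

Run from inside the firewall, something like parse_state(fetch_state("http://<master-host>:5050")) should show whether the master is returning JSON at all. If it does parse, summarize() gives a quick view of whether any slaves have registered.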
