Chris, I think I am comfortable now to add couple of steps to the multi node setup. II am guessing I need some privileges to contribute to http://samza.apache.org/learn/tutorials/0.7.0/run-in-multi-node-yarn.html (need a jira ticket and privilege to add context and close it as well)
- Shekar On Fri, Mar 13, 2015 at 1:28 PM, Shekar Tippur <ctip...@gmail.com> wrote: > After adding classpath to yarn-site.xml, I found that the jars that were > created for argos (extension of Samza) was not part of the tar.gz file that > was exposed to Http request. I changed the post install script on rpm to > expose that. > > I see that the 2 nodes are showing up on rm. We are testing redundancy now. > > - Shekar > > > > On Fri, Mar 13, 2015 at 1:20 PM, Chris Riccomini <criccom...@apache.org> > wrote: > >> Hey Shekar, >> >> Awesome, thanks! Would love to get any doc updates that would be useful. >> >> Curious: what was wrong? >> >> Cheers, >> Chris >> >> On Fri, Mar 13, 2015 at 1:00 PM, Shekar Tippur <ctip...@gmail.com> wrote: >> >> > Thanks for your help Chris. Got it to work now. I will test my case and >> > documentation further. I can edit the Samza documentation to reflect any >> > changes. >> > >> > - Shekar >> > >> > On Thu, Mar 12, 2015 at 5:19 PM, Chris Riccomini <criccom...@apache.org >> > >> > wrote: >> > >> > > Hey Shekar, >> > > >> > > Yes, this is definitely a classpath issue. The pastebin you sent does >> not >> > > include any of the samza-core/samza-yarn/scala JARs. This is rather >> > > strange, since you said you put the JARs in this path: >> > > >> > > /home/hadoop/hadoop-2.5.2/share/hadoop/hdfs/lib/ >> > > >> > > And I do see *other* JARs listed with this path. Are you sure you put >> the >> > > Samza JARs on *all* machines, not just the RM machine? According to >> the >> > > yarn-default.xml logs, it says: >> > > >> > > CLASSPATH for YARN applications. A comma-separated list of CLASSPATH >> > > entries. When this value is empty, the following default CLASSPATH for >> > YARN >> > > applications would be used. For Linux: $HADOOP_CONF_DIR, >> > > $HADOOP_COMMON_HOME/share/hadoop/common/*, >> > > $HADOOP_COMMON_HOME/share/hadoop/common/lib/*, >> > > $HADOOP_HDFS_HOME/share/hadoop/hdfs/*, >> > > $HADOOP_HDFS_HOME/share/hadoop/hdfs/lib/*, >> > > $HADOOP_YARN_HOME/share/hadoop/yarn/*, >> > > $HADOOP_YARN_HOME/share/hadoop/yarn/lib/* >> > > >> > > So, it seems like it should pick up the JARs, if they're in the NM's >> > > directory. >> > > >> > > The exception that you're now seeing seems to suggest that one of the >> > Samza >> > > containers is failing: >> > > >> > > Container for appattempt_1426204312971_0001_000002 exited with >> exitCode: >> > 1 >> > > >> > > The _000002 suffix indicates a non-AM failure (i.e. the Samza >> container >> > > failed, not the Samza AM). Can you check the AM logs, and find the >> > http:// >> > > ... >> > > link to the container logs? It should give a hint about why the >> container >> > > failed. >> > > >> > > Cheers, >> > > Chris >> > > >> > > On Thu, Mar 12, 2015 at 4:58 PM, Shekar Tippur <ctip...@gmail.com> >> > wrote: >> > > >> > > > Chris, >> > > > >> > > > Made some progress. >> > > > >> > > > By adding yarn.application.classpath to yarn-site.xml, I am no >> longer >> > > > getting class not found error. However, I am getting a different >> error: >> > > > >> > > > Application application_1426204312971_0001 failed 2 times due to AM >> > > > Container for appattempt_1426204312971_0001_000002 exited with >> > exitCode: >> > > 1 >> > > > due to: Exception from container-launch: ExitCodeException >> exitCode=1: >> > > > ExitCodeException exitCode=1: >> > > > at org.apache.hadoop.util.Shell.runCommand(Shell.java:538) >> > > > at org.apache.hadoop.util.Shell.run(Shell.java:455) >> > > > at >> > > > >> > >> org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:702) >> > > > at >> > > > >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:195) >> > > > at >> > > > >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:300) >> > > > at >> > > > >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:81) >> > > > at java.util.concurrent.FutureTask.run(FutureTask.java:262) >> > > > at >> > > > >> > > > >> > > >> > >> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) >> > > > at >> > > > >> > > > >> > > >> > >> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) >> > > > at java.lang.Thread.run(Thread.java:745) >> > > > Container exited with a non-zero exit code 1 >> > > > .Failing this attempt.. Failing the application. >> > > > >> > > > Looks like a common issue with yarn but not sure how to resolve as >> yet. >> > > > >> > > > >> > > > - Shekar >> > > > >> > > > On Thu, Mar 12, 2015 at 1:44 PM, Shekar Tippur <ctip...@gmail.com> >> > > wrote: >> > > > >> > > > > Chris - Here it is. >> > > > > >> > > > > http://pastebin.com/c3e21Hzf >> > > > > >> > > > > - Shekar >> > > > > >> > > > > On Thu, Mar 12, 2015 at 10:58 AM, Chris Riccomini < >> > > criccom...@apache.org >> > > > > >> > > > > wrote: >> > > > > >> > > > >> This is the line that I'm interested in: >> > > > >> >> > > > >> STARTUP_MSG: classpath .... >> > > > >> >> > > > >> On Thu, Mar 12, 2015 at 10:55 AM, Chris Riccomini < >> > > > criccom...@apache.org> >> > > > >> wrote: >> > > > >> >> > > > >> > Hey Shekar, >> > > > >> > >> > > > >> > Could you paste the full log on pastebin? It really seems like >> > > > >> something's >> > > > >> > missing from the classpath. If samza-yarn is there, it should >> be >> > > able >> > > > to >> > > > >> > see that file. I think the full log has a dump of the >> classpath. >> > If >> > > it >> > > > >> > doesn't, could you paste the line where the YARN NM is starting >> > up, >> > > > and >> > > > >> > dumps the full classpath? >> > > > >> > >> > > > >> > Cheers, >> > > > >> > Chris >> > > > >> > >> > > > >> > On Thu, Mar 12, 2015 at 10:17 AM, Shekar Tippur < >> > ctip...@gmail.com> >> > > > >> wrote: >> > > > >> > >> > > > >> >> I think all these jars are in place (Under >> > > > >> >> $HADOOP_YARN_HOME/share/hadoop/hdfs/lib) >> > > > >> >> >> > > > >> >> - Shekar >> > > > >> >> >> > > > >> >> On Thu, Mar 12, 2015 at 9:36 AM, Chris Riccomini < >> > > > >> criccom...@apache.org> >> > > > >> >> wrote: >> > > > >> >> >> > > > >> >> > Hey Shekar, >> > > > >> >> > >> > > > >> >> > You need that samza-yarn file on your RM/NM's classpath, >> along >> > > with >> > > > >> >> scala. >> > > > >> >> > We missed this in the docs, and are tracking the issue here: >> > > > >> >> > >> > > > >> >> > https://issues.apache.org/jira/browse/SAMZA-456 >> > > > >> >> > >> > > > >> >> > You'll also need samza-core in the classpath, based on the >> > > > >> discussion on >> > > > >> >> > SAMZA-456. Sorry about that. If you want to update the >> tutorial >> > > > when >> > > > >> you >> > > > >> >> > get your cluster working, and submit a patch, that'd be >> great! >> > :) >> > > > >> >> > >> > > > >> >> > Cheers, >> > > > >> >> > Chris >> > > > >> >> > >> > > > >> >> > On Wed, Mar 11, 2015 at 9:43 PM, Shekar Tippur < >> > > ctip...@gmail.com> >> > > > >> >> wrote: >> > > > >> >> > >> > > > >> >> > > Here is the corresponding log: >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,665 INFO [AsyncDispatcher event >> handler] >> > > > >> >> > > localizer.LocalizedResource >> > > (LocalizedResource.java:handle(203)) >> > > > - >> > > > >> >> > Resource >> > > > >> >> > > http://sprfargas102:8000/hello-samza-0.8.0-dist.tar.gz >> > > > >> transitioned >> > > > >> >> from >> > > > >> >> > > INIT to DOWNLOADING >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,665 INFO [AsyncDispatcher event >> handler] >> > > > >> >> > > localizer.ResourceLocalizationService >> > > > >> >> > > (ResourceLocalizationService.java:handle(679)) - Created >> > > > localizer >> > > > >> for >> > > > >> >> > > container_1426121400423_2587_01_000001 >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,669 INFO [LocalizerRunner for >> > > > >> >> > > container_1426121400423_2587_01_000001] >> > > > >> >> > > localizer.ResourceLocalizationService >> > > > >> >> > > (ResourceLocalizationService.java:writeCredentials(1107)) >> - >> > > > Writing >> > > > >> >> > > credentials to the nmPrivate file >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> /tmp/hadoop-hadoop/nm-local-dir/nmPrivate/container_1426121400423_2587_01_000001.tokens. >> > > > >> >> > > Credentials list: >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,675 INFO [DeletionService #0] >> > > > >> >> > > nodemanager.DefaultContainerExecutor >> > > > >> >> > > (DefaultContainerExecutor.java:deleteAsUser(378)) - >> Deleting >> > > > path : >> > > > >> >> > > >> > > > >> >> > /home/hadoop/hadoop-2.5.2/logs/userlogs/application_1426120927668_0010 >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,676 INFO [LocalizerRunner for >> > > > >> >> > > container_1426121400423_2587_01_000001] >> > > > >> >> > > nodemanager.DefaultContainerExecutor >> > > > >> >> > > (DefaultContainerExecutor.java:createUserCacheDirs(469)) - >> > > > >> >> Initializing >> > > > >> >> > > user root >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,685 INFO [LocalizerRunner for >> > > > >> >> > > container_1426121400423_2587_01_000001] >> > > > >> >> > > nodemanager.DefaultContainerExecutor >> > > > >> >> > > (DefaultContainerExecutor.java:startLocalizer(103)) - >> Copying >> > > > from >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> /tmp/hadoop-hadoop/nm-local-dir/nmPrivate/container_1426121400423_2587_01_000001.tokens >> > > > >> >> > > to >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> /tmp/hadoop-hadoop/nm-local-dir/usercache/root/appcache/application_1426121400423_2587/container_1426121400423_2587_01_000001.tokens >> > > > >> >> > > >> > > > >> >> > > *2015-03-11 20:43:09,685 INFO [LocalizerRunner for >> > > > >> >> > > container_1426121400423_2587_01_000001] >> > > > >> >> > > nodemanager.DefaultContainerExecutor >> > > > >> >> > > (DefaultContainerExecutor.java:startLocalizer(105)) - CWD >> set >> > > to >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> /tmp/hadoop-hadoop/nm-local-dir/usercache/root/appcache/application_1426121400423_2587 >> > > > >> >> > > = >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> file:/tmp/hadoop-hadoop/nm-local-dir/usercache/root/appcache/application_1426121400423_2587* >> > > > >> >> > > >> > > > >> >> > > *2015-03-11 20:43:09,716 INFO [IPC Server handler 2 on >> 8040] >> > > > >> >> > > localizer.ResourceLocalizationService >> > > > >> >> > > (ResourceLocalizationService.java:update(1007)) - DEBUG: >> > > FAILED { >> > > > >> >> > > http://sprfargas102:8000/hello-samza-0.8.0-dist.tar.gz >> > > > >> >> > > <http://sprfargas102:8000/hello-samza-0.8.0-dist.tar.gz>, >> 0, >> > > > >> ARCHIVE, >> > > > >> >> > null >> > > > >> >> > > }, java.lang.ClassNotFoundException: Class >> > > > >> >> > > org.apache.samza.util.hadoop.HttpFileSystem not found* >> > > > >> >> > > >> > > > >> >> > > *2015-03-11 20:43:09,716 INFO [IPC Server handler 2 on >> 8040] >> > > > >> >> > > localizer.LocalizedResource >> > > (LocalizedResource.java:handle(203)) >> > > > - >> > > > >> >> > Resource >> > > > >> >> > > http://sprfargas102:8000/hello-samza-0.8.0-dist.tar.gz(- >> > > > >> >> > > <http://sprfargas102:8000/hello-samza-0.8.0-dist.tar.gz(- >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> >>/tmp/hadoop-hadoop/nm-local-dir/usercache/root/appcache/application_1426121400423_2587/filecache/10/hello-samza-0.8.0-dist.tar.gz) >> > > > >> >> > > transitioned from DOWNLOADING to FAILED* >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,717 INFO [AsyncDispatcher event >> handler] >> > > > >> >> > > container.Container (ContainerImpl.java:handle(918)) - >> > > Container >> > > > >> >> > > container_1426121400423_2587_01_000001 transitioned from >> > > > >> LOCALIZING to >> > > > >> >> > > LOCALIZATION_FAILED >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,717 INFO [AsyncDispatcher event >> handler] >> > > > >> >> > > localizer.LocalResourcesTrackerImpl >> > > > >> >> > > (LocalResourcesTrackerImpl.java:handle(151)) - Container >> > > > >> >> > > container_1426121400423_2587_01_000001 sent RELEASE event >> on >> > a >> > > > >> >> resource >> > > > >> >> > > request { >> > > http://sprfargas102:8000/hello-samza-0.8.0-dist.tar.gz >> > > > , >> > > > >> 0, >> > > > >> >> > > ARCHIVE, null } not present in cache. >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,717 WARN [AsyncDispatcher event >> handler] >> > > > >> >> > > nodemanager.NMAuditLogger >> > (NMAuditLogger.java:logFailure(150)) >> > > - >> > > > >> >> > > USER=root OPERATION=Container >> > > > >> >> > > Finished - Failed TARGET=ContainerImpl RESULT=FAILURE >> > > > >> >> > DESCRIPTION=Container >> > > > >> >> > > failed with state: LOCALIZATION_FAILED >> > > > >> >> > APPID=application_1426121400423_2587 >> > > > >> >> > > CONTAINERID=container_1426121400423_2587_01_000001 >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,717 INFO [AsyncDispatcher event >> handler] >> > > > >> >> > > container.Container (ContainerImpl.java:handle(918)) - >> > > Container >> > > > >> >> > > container_1426121400423_2587_01_000001 transitioned from >> > > > >> >> > > LOCALIZATION_FAILED to DONE >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,717 INFO [AsyncDispatcher event >> handler] >> > > > >> >> > > application.Application >> > (ApplicationImpl.java:transition(340)) >> > > - >> > > > >> >> Removing >> > > > >> >> > > container_1426121400423_2587_01_000001 from application >> > > > >> >> > > application_1426121400423_2587 >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,717 INFO [AsyncDispatcher event >> handler] >> > > > >> >> > > containermanager.AuxServices >> (AuxServices.java:handle(196)) - >> > > Got >> > > > >> >> event >> > > > >> >> > > CONTAINER_STOP for appId application_1426121400423_2587 >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,717 INFO [DeletionService #2] >> > > > >> >> > > nodemanager.DefaultContainerExecutor >> > > > >> >> > > (DefaultContainerExecutor.java:deleteAsUser(369)) - >> Deleting >> > > > >> absolute >> > > > >> >> > path >> > > > >> >> > > : >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> /tmp/hadoop-hadoop/nm-local-dir/usercache/root/appcache/application_1426121400423_2587/container_1426121400423_2587_01_000001 >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,717 WARN [DeletionService #2] >> > > > >> >> > > nodemanager.DefaultContainerExecutor >> > > > >> >> > > (DefaultContainerExecutor.java:deleteAsUser(372)) - delete >> > > > returned >> > > > >> >> false >> > > > >> >> > > for path: >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> [/tmp/hadoop-hadoop/nm-local-dir/usercache/root/appcache/application_1426121400423_2587/container_1426121400423_2587_01_000001] >> > > > >> >> > > >> > > > >> >> > > 2015-03-11 20:43:09,718 WARN [LocalizerRunner for >> > > > >> >> > > container_1426121400423_2587_01_000001] ipc.Client >> > > > >> >> > (Client.java:call(1389)) >> > > > >> >> > > - interrupted waiting to send rpc request to server >> > > > >> >> > > >> > > > >> >> > > java.lang.InterruptedException >> > > > >> >> > > >> > > > >> >> > > at >> > > java.util.concurrent.FutureTask.awaitDone(FutureTask.java:400) >> > > > >> >> > > >> > > > >> >> > > at >> java.util.concurrent.FutureTask.get(FutureTask.java:187) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > >> > > > >> >> > > >> org.apache.hadoop.ipc.Client$Connection.sendRpcRequest(Client.java:1030) >> > > > >> >> > > >> > > > >> >> > > at org.apache.hadoop.ipc.Client.call(Client.java:1384) >> > > > >> >> > > >> > > > >> >> > > at org.apache.hadoop.ipc.Client.call(Client.java:1364) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:206) >> > > > >> >> > > >> > > > >> >> > > at com.sun.proxy.$Proxy29.heartbeat(Unknown Source) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.api.impl.pb.client.LocalizationProtocolPBClientImpl.heartbeat(LocalizationProtocolPBClientImpl.java:62) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer.localizeFiles(ContainerLocalizer.java:255) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer.runLocalization(ContainerLocalizer.java:169) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.startLocalizer(DefaultContainerExecutor.java:107) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1073) >> > > > >> >> > > >> > > > >> >> > > java.io.IOException: java.lang.InterruptedException >> > > > >> >> > > >> > > > >> >> > > at org.apache.hadoop.ipc.Client.call(Client.java:1390) >> > > > >> >> > > >> > > > >> >> > > at org.apache.hadoop.ipc.Client.call(Client.java:1364) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:206) >> > > > >> >> > > >> > > > >> >> > > at com.sun.proxy.$Proxy29.heartbeat(Unknown Source) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.api.impl.pb.client.LocalizationProtocolPBClientImpl.heartbeat(LocalizationProtocolPBClientImpl.java:62) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer.localizeFiles(ContainerLocalizer.java:255) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer.runLocalization(ContainerLocalizer.java:169) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.startLocalizer(DefaultContainerExecutor.java:107) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1073) >> > > > >> >> > > >> > > > >> >> > > Caused by: java.lang.InterruptedException >> > > > >> >> > > >> > > > >> >> > > at >> > > java.util.concurrent.FutureTask.awaitDone(FutureTask.java:400) >> > > > >> >> > > >> > > > >> >> > > at >> java.util.concurrent.FutureTask.get(FutureTask.java:187) >> > > > >> >> > > >> > > > >> >> > > at >> > > > >> >> > >> > > > >> >> > > >> org.apache.hadoop.ipc.Client$Connection.sendRpcRequest(Client.java:1030) >> > > > >> >> > > >> > > > >> >> > > at org.apache.hadoop.ipc.Client.call(Client.java:1384) >> > > > >> >> > > >> > > > >> >> > > ... 8 more >> > > > >> >> > > >> > > > >> >> > > On Wed, Mar 11, 2015 at 4:56 PM, Shekar Tippur < >> > > > ctip...@gmail.com> >> > > > >> >> > wrote: >> > > > >> >> > > >> > > > >> >> > > > Hello, >> > > > >> >> > > > >> > > > >> >> > > > Sorry to reopen this topic. I had setup yarn couple of >> > months >> > > > ago >> > > > >> >> and >> > > > >> >> > > cant >> > > > >> >> > > > seem to replicate this now. >> > > > >> >> > > > >> > > > >> >> > > > I see that I have done everything listed here >> > > > >> >> > > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> http://samza.apache.org/learn/tutorials/0.7.0/run-in-multi-node-yarn.html >> > > > >> >> > > > >> > > > >> >> > > > I see this error on the application side >> > > > >> >> > > > >> > > > >> >> > > > Application application_1426115467623_0492 failed 2 >> times >> > due >> > > > to >> > > > >> AM >> > > > >> >> > > > Container for appattempt_1426115467623_0492_000002 >> exited >> > > with >> > > > >> >> > exitCode: >> > > > >> >> > > > -1000 due to: java.lang.ClassNotFoundException: Class >> > > > >> >> > > > org.apache.samza.util.hadoop.HttpFileSystem not found >> > > > >> >> > > > .Failing this attempt.. Failing the application. >> > > > >> >> > > > >> > > > >> >> > > > I see that >> > > > >> >> > > > >> > > > >> >> > > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > >> /home/hadoop/hadoop-2.5.2/share/hadoop/hdfs/lib/samza-yarn_2.10-0.8.0.jar >> > > > >> >> > > > has that particular class >> > > > >> >> > > > >> > > > >> >> > > > 1739 Tue Nov 25 10:51:40 PST 2014 >> > > > >> >> > > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> >> > > > >> > > >> > >> org/apache/samza/util/hadoop/HttpFileSystem$$anonfun$getFileStatus$1.class >> > > > >> >> > > > >> > > > >> >> > > > 1570 Tue Nov 25 10:51:40 PST 2014 >> > > > >> >> > > > >> > > > >> >> >> > > > >> org/apache/samza/util/hadoop/HttpFileSystem$$anonfun$initialize$1.class >> > > > >> >> > > > >> > > > >> >> > > > 1597 Tue Nov 25 10:51:40 PST 2014 >> > > > >> >> > > > >> > > > org/apache/samza/util/hadoop/HttpFileSystem$$anonfun$open$1.class >> > > > >> >> > > > >> > > > >> >> > > > 1797 Tue Nov 25 10:51:40 PST 2014 >> > > > >> >> > > > >> > > > org/apache/samza/util/hadoop/HttpFileSystem$$anonfun$open$2.class >> > > > >> >> > > > >> > > > >> >> > > > 9549 Tue Nov 25 10:51:40 PST 2014 >> > > > >> >> > > > org/apache/samza/util/hadoop/HttpFileSystem.class >> > > > >> >> > > > >> > > > >> >> > > > >> > > > >> >> > > > I see that env is set right: >> > > > >> >> > > > >> > > > >> >> > > > >> > > > >> >> > > > HADOOP_YARN_HOME=/home/hadoop/hadoop-2.5.2 >> > > > >> >> > > > >> > > > >> >> > > > HADOOP_CONF_DIR=/home/hadoop/hadoop-2.5.2/conf >> > > > >> >> > > > >> > > > >> >> > > > Wondering if I am missing anything... >> > > > >> >> > > > - Shekar >> > > > >> >> > > > >> > > > >> >> > > >> > > > >> >> > >> > > > >> >> >> > > > >> > >> > > > >> > >> > > > >> >> > > > > >> > > > > >> > > > >> > > >> > >> > >