[jira] [Resolved] (MAPREDUCE-2695) Unhealthy nodes (health-script) are still being assigned containers

2011-09-19 Thread Vinod Kumar Vavilapalli (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli resolved MAPREDUCE-2695.


Resolution: Invalid
  Assignee: Vinod Kumar Vavilapalli  (was: Arun C Murthy)

This is invalid on the trunk/branch 0.23 code. Fixed as part of the RM refactor 
that went in a while back.

 Unhealthy nodes (health-script) are still being assigned containers
 ---

 Key: MAPREDUCE-2695
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2695
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Reporter: Arun C Murthy
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.0


 Unhealthy nodes (health-script) are still being assigned containers

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Resolved] (MAPREDUCE-2686) NPE while requesting info for a non-existing job

2011-09-19 Thread Vinod Kumar Vavilapalli (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli resolved MAPREDUCE-2686.


Resolution: Duplicate

This is getting fixed as part of MAPREDUCE-2925.

 NPE while requesting info for a non-existing job
 

 Key: MAPREDUCE-2686
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2686
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Ramya Sunil
Assignee: Siddharth Seth
Priority: Blocker
 Fix For: 0.23.0


 While performing job related operations such as job -kill, -status, -events 
 etc for an unknown job, the following NPE is seen:
 Exception in thread "main" java.lang.NullPointerException
 at 
 org.apache.hadoop.mapred.ClientServiceDelegate.refreshProxy(ClientServiceDelegate.java:112)
 at 
 org.apache.hadoop.mapred.ClientServiceDelegate.getProxy(ClientServiceDelegate.java:100)
 at 
 org.apache.hadoop.mapred.ClientServiceDelegate.getRefreshedProxy(ClientServiceDelegate.java:93)
 at 
 org.apache.hadoop.mapred.ClientServiceDelegate.getJobStatus(ClientServiceDelegate.java:383)
 at 
 org.apache.hadoop.mapred.YARNRunner.getJobStatus(YARNRunner.java:515)
 at org.apache.hadoop.mapreduce.Cluster.getJob(Cluster.java:154)
 at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:254)
 at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
 at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83)
 at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1074)
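
The trace suggests that something null is dereferenced when the requested job is unknown. Below is a minimal sketch of the kind of guard that turns such a lookup into a descriptive error. It is an illustration only: `JobLookupSketch`, `record`, `statusOf`, and the in-memory map are hypothetical stand-ins, not the ClientServiceDelegate code and not the change made under MAPREDUCE-2925.

```java
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a job status lookup; not Hadoop's ClientServiceDelegate.
final class JobLookupSketch {
    private final Map<String, String> statusByJobId = new HashMap<>();

    void record(String jobId, String status) {
        statusByJobId.put(jobId, status);
    }

    // Returns the status of a known job, or fails with a descriptive
    // IOException instead of letting a null escape and cause an NPE later.
    String statusOf(String jobId) throws IOException {
        String status = statusByJobId.get(jobId);
        if (status == null) {
            throw new IOException("Unknown job: " + jobId);
        }
        return status;
    }

    public static void main(String[] args) {
        JobLookupSketch lookup = new JobLookupSketch();
        lookup.record("job_1", "RUNNING");
        try {
            System.out.println(lookup.statusOf("job_1"));
            System.out.println(lookup.statusOf("job_2")); // unknown job
        } catch (IOException e) {
            System.out.println("Error: " + e.getMessage());
        }
    }
}
```

With a guard like this, a command such as job -status for an unknown job could report the problem to the user rather than print a stack trace.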





[jira] [Created] (MAPREDUCE-3030) RM is not processing heartbeat and continuously giving the message 'Node not found rebooting'

2011-09-19 Thread Devaraj K (JIRA)
RM is not processing heartbeat and continuously giving the message 'Node not 
found rebooting'
-

 Key: MAPREDUCE-3030
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3030
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: resourcemanager
Affects Versions: 0.24.0
Reporter: Devaraj K
Assignee: Devaraj K
Priority: Blocker


{code:title=Node Manager Logs|borderStyle=solid}
2011-09-19 13:39:29,816 INFO  webapp.WebApps (WebApps.java:start(162)) - 
Registered webapp guice modules
2011-09-19 13:39:29,817 INFO  service.AbstractService 
(AbstractService.java:start(61)) - 
Service:org.apache.hadoop.yarn.server.nodemanager.webapp.WebServer is started.
2011-09-19 13:39:29,818 INFO  service.AbstractService 
(AbstractService.java:start(61)) - Service:Dispatcher is started.
2011-09-19 13:39:29,819 INFO  nodemanager.NodeStatusUpdaterImpl 
(NodeStatusUpdaterImpl.java:start(133)) - Configured ContainerManager Address 
is 10.18.52.124:45454
2011-09-19 13:39:29,819 INFO  ipc.YarnRPC (YarnRPC.java:create(47)) - Creating 
YarnRPC for org.apache.hadoop.yarn.ipc.HadoopYarnProtoRPC
2011-09-19 13:39:29,822 INFO  ipc.HadoopYarnRPC 
(HadoopYarnProtoRPC.java:getProxy(49)) - Creating a HadoopYarnProtoRpc proxy 
for protocol interface org.apache.hadoop.yarn.server.api.ResourceTracker
2011-09-19 13:39:29,862 INFO  nodemanager.NodeStatusUpdaterImpl 
(NodeStatusUpdaterImpl.java:registerWithRM(165)) - Connected to ResourceManager 
at 0.0.0.0:8025
2011-09-19 13:39:30,369 INFO  nodemanager.NodeStatusUpdaterImpl 
(NodeStatusUpdaterImpl.java:registerWithRM(189)) - Registered with 
ResourceManager as 10.18.52.124:45454 with total resource of memory: 8192, 
2011-09-19 13:39:30,369 INFO  service.AbstractService 
(AbstractService.java:start(61)) - 
Service:org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl is 
started.
2011-09-19 13:39:30,371 INFO  service.AbstractService 
(AbstractService.java:start(61)) - 
Service:org.apache.hadoop.yarn.server.nodemanager.NodeManager is started.
{code}



{code:title=Resource Manager Logs|borderStyle=solid}
2011-09-19 14:01:03,238 INFO  resourcemanager.ResourceTrackerService 
(ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
10.18.52.124:45454
Call: 
protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
 method=nodeHeartbeat
2011-09-19 14:01:04,240 INFO  resourcemanager.ResourceTrackerService 
(ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
10.18.52.124:45454
Call: 
protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
 method=nodeHeartbeat
2011-09-19 14:01:05,242 INFO  resourcemanager.ResourceTrackerService 
(ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
10.18.52.124:45454
Call: 
protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
 method=nodeHeartbeat
2011-09-19 14:01:06,244 INFO  resourcemanager.ResourceTrackerService 
(ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
10.18.52.124:45454
Call: 
protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
 method=nodeHeartbeat
2011-09-19 14:01:07,246 INFO  resourcemanager.ResourceTrackerService 
(ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
10.18.52.124:45454
Call: 
protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
 method=nodeHeartbeat
2011-09-19 14:01:08,247 INFO  resourcemanager.ResourceTrackerService 
(ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
10.18.52.124:45454
{code}

The Node Manager is registered with the Resource Manager, yet for every 
heartbeat the Resource Manager logs the above message.
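
This log pattern is what one would expect if the heartbeat handler cannot find the node under the id it was registered with. The sketch below illustrates that lookup under stated assumptions: `NodeRegistrySketch`, `register`, `heartbeat`, and the `Action` enum are hypothetical names, not the ResourceTrackerService API, and the report does not say why the lookup fails.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch of an RM-side node registry; not ResourceTrackerService.
final class NodeRegistrySketch {
    enum Action { NORMAL, REBOOT }

    // Node ids (here "host:port" strings) that have registered.
    private final Set<String> registered = ConcurrentHashMap.newKeySet();

    void register(String nodeId) {
        registered.add(nodeId);
    }

    // A heartbeat from a node id that was never registered is answered with REBOOT.
    Action heartbeat(String nodeId) {
        return registered.contains(nodeId) ? Action.NORMAL : Action.REBOOT;
    }

    public static void main(String[] args) {
        NodeRegistrySketch rm = new NodeRegistrySketch();
        rm.register("10.18.52.124:45454");
        System.out.println(rm.heartbeat("10.18.52.124:45454"));
        System.out.println(rm.heartbeat("some-host:45454"));
    }
}
```

In a design like this, any disagreement between the id stored at registration and the id sent with the heartbeat would produce the reboot answer on every heartbeat. Whether that is what happens here is an assumption.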





[jira] [Created] (MAPREDUCE-3031) Job Client goes into infinite loop when we kill AM

2011-09-19 Thread Karam Singh (JIRA)
Job Client goes into infinite loop when we kill AM
--

 Key: MAPREDUCE-3031
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3031
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Karam Singh


Started a cluster. Submitted a sleep job with around 1 maps and 1000 reduces.
Killed AM with kill -9 7000 thousands maps got completed

The RM kept reporting the application as RUNNING,
and the job client went into an infinite loop trying to connect to the AM.





[jira] [Created] (MAPREDUCE-3032) JobHistory doesn't have error information from failed tasks

2011-09-19 Thread Vinod Kumar Vavilapalli (JIRA)
JobHistory doesn't have error information from failed tasks
---

 Key: MAPREDUCE-3032
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3032
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: applicationmaster, mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
 Fix For: 0.23.0








Hadoop-Mapreduce-0.23-Build - Build # 11 - Still Failing

2011-09-19 Thread Apache Jenkins Server
See https://builds.apache.org/job/Hadoop-Mapreduce-0.23-Build/11/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 18981 lines...]
Tests run: 4, Failures: 0, Errors: 4, Skipped: 0, Time elapsed: 6.229 sec <<< FAILURE!
Running org.apache.hadoop.mapreduce.v2.TestMRJobsWithHistoryService
Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 5.887 sec <<< FAILURE!
Running org.apache.hadoop.mapreduce.v2.TestMROldApiJobs
Tests run: 2, Failures: 0, Errors: 2, Skipped: 0, Time elapsed: 6.067 sec <<< FAILURE!
Running org.apache.hadoop.mapreduce.v2.TestUberAM
Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 0.03 sec

Results :

Tests in error: 

Tests run: 14, Failures: 0, Errors: 7, Skipped: 1

[INFO] 
[INFO] Reactor Summary:
[INFO] 
[INFO] hadoop-yarn-api ... SUCCESS [8.834s]
[INFO] hadoop-yarn-common  SUCCESS [6.152s]
[INFO] hadoop-yarn-server-common . SUCCESS [7.228s]
[INFO] hadoop-yarn-server-nodemanager  SUCCESS [32.590s]
[INFO] hadoop-yarn-server-resourcemanager  SUCCESS [42.657s]
[INFO] hadoop-yarn-server-tests .. SUCCESS [6.500s]
[INFO] hadoop-yarn-server  SUCCESS [0.071s]
[INFO] hadoop-yarn ... SUCCESS [0.067s]
[INFO] hadoop-mapreduce-client-core .. SUCCESS [2.025s]
[INFO] hadoop-mapreduce-client-common  SUCCESS [5.626s]
[INFO] hadoop-mapreduce-client-shuffle ... SUCCESS [0.966s]
[INFO] hadoop-mapreduce-client-app ... SUCCESS [1:29.216s]
[INFO] hadoop-mapreduce-client-hs  SUCCESS [7.608s]
[INFO] hadoop-mapreduce-client-jobclient . FAILURE [36.353s]
[INFO] hadoop-mapreduce-client ... SKIPPED
[INFO] hadoop-mapreduce .. SKIPPED
[INFO] 
[INFO] BUILD FAILURE
[INFO] 
[INFO] Total time: 4:06.287s
[INFO] Finished at: Mon Sep 19 13:15:04 UTC 2011
[INFO] Final Memory: 54M/399M
[INFO] 
[ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-surefire-plugin:2.7.2:test (default-test) on 
project hadoop-mapreduce-client-jobclient: There are test failures.
[ERROR] 
[ERROR] Please refer to 
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-0.23-Build/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/target/surefire-reports
 for the individual test results.
[ERROR] -> [Help 1]
[ERROR] 
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR] 
[ERROR] For more information about the errors and possible solutions, please 
read the following articles:
[ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException
[ERROR] 
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR]   mvn <goals> -rf :hadoop-mapreduce-client-jobclient
Build step 'Execute shell' marked build as failure
[FINDBUGS] Skipping publisher since build result is FAILURE
Archiving artifacts
Updating MAPREDUCE-2925
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
No tests ran.


Hadoop-Mapreduce-trunk - Build # 835 - Still Failing

2011-09-19 Thread Apache Jenkins Server
See https://builds.apache.org/job/Hadoop-Mapreduce-trunk/835/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 19057 lines...]
Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 5.917 sec <<< FAILURE!
Running org.apache.hadoop.mapreduce.v2.TestUberAM
Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 0.03 sec
Running org.apache.hadoop.mapred.TestClientServiceDelegate
Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.7 sec
Running org.apache.hadoop.mapred.TestClientRedirect
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 14.35 sec

Results :

Tests in error: 

Tests run: 14, Failures: 0, Errors: 7, Skipped: 1

[INFO] 
[INFO] Reactor Summary:
[INFO] 
[INFO] hadoop-yarn-api ... SUCCESS [9.107s]
[INFO] hadoop-yarn-common  SUCCESS [6.485s]
[INFO] hadoop-yarn-server-common . SUCCESS [7.491s]
[INFO] hadoop-yarn-server-nodemanager  SUCCESS [35.980s]
[INFO] hadoop-yarn-server-resourcemanager  SUCCESS [42.624s]
[INFO] hadoop-yarn-server-tests .. SUCCESS [6.530s]
[INFO] hadoop-yarn-server  SUCCESS [0.091s]
[INFO] hadoop-yarn ... SUCCESS [0.070s]
[INFO] hadoop-mapreduce-client-core .. SUCCESS [2.171s]
[INFO] hadoop-mapreduce-client-common  SUCCESS [5.819s]
[INFO] hadoop-mapreduce-client-shuffle ... SUCCESS [1.140s]
[INFO] hadoop-mapreduce-client-app ... SUCCESS [1:26.360s]
[INFO] hadoop-mapreduce-client-hs  SUCCESS [7.807s]
[INFO] hadoop-mapreduce-client-jobclient . FAILURE [36.439s]
[INFO] hadoop-mapreduce-client ... SKIPPED
[INFO] hadoop-mapreduce .. SKIPPED
[INFO] 
[INFO] BUILD FAILURE
[INFO] 
[INFO] Total time: 4:08.511s
[INFO] Finished at: Mon Sep 19 13:18:49 UTC 2011
[INFO] Final Memory: 54M/288M
[INFO] 
[ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-surefire-plugin:2.7.2:test (default-test) on 
project hadoop-mapreduce-client-jobclient: There are test failures.
[ERROR] 
[ERROR] Please refer to 
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-trunk/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/target/surefire-reports
 for the individual test results.
[ERROR] -> [Help 1]
[ERROR] 
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR] 
[ERROR] For more information about the errors and possible solutions, please 
read the following articles:
[ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException
[ERROR] 
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR]   mvn <goals> -rf :hadoop-mapreduce-client-jobclient
Build step 'Execute shell' marked build as failure
[FINDBUGS] Skipping publisher since build result is FAILURE
Archiving artifacts
Updating MAPREDUCE-2925
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
No tests ran.


[jira] [Created] (MAPREDUCE-3033) JobClient requires mapreduce.jobtracker.address tag in mapred-site.xml even when mapreduce.framework.name is set to yarn

2011-09-19 Thread Karam Singh (JIRA)
JobClient requires mapreduce.jobtracker.address tag in mapred-site.xml even 
when mapreduce.framework.name is set to yarn
---

 Key: MAPREDUCE-3033
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3033
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: job submission
Reporter: Karam Singh


If mapreduce.jobtracker.address is not set in mapred-site.xml
and mapreduce.framework.name is set to yarn,
job submission fails.

Tried to submit a sleep job with 1 map task. Job submission failed with the 
following exception:
11/09/19 13:19:20 INFO ipc.YarnRPC: Creating YarnRPC for 
org.apache.hadoop.yarn.ipc.HadoopYarnProtoRPC
11/09/19 13:19:20 INFO mapred.ResourceMgrDelegate: Connecting to 
ResourceManager at RMHost:8040
11/09/19 13:19:20 INFO ipc.HadoopYarnRPC: Creating a HadoopYarnProtoRpc proxy 
for protocol interface org.apache.hadoop.yarn.api.ClientRMProtocol
11/09/19 13:19:20 INFO mapred.ResourceMgrDelegate: Connected to ResourceManager 
at RMHost:8040
11/09/19 13:19:21 INFO mapred.ResourceMgrDelegate: DEBUG --- getStagingAreaDir: 
dir=/user/username/.staging
11/09/19 13:19:21 INFO mapreduce.JobSubmitter: Cleaning up the staging area 
/user/username/.staging/job_1316435926198_0004
java.lang.RuntimeException: Not a host:port pair: local
at org.apache.hadoop.net.NetUtils.createSocketAddr(NetUtils.java:148)
at org.apache.hadoop.net.NetUtils.createSocketAddr(NetUtils.java:132)
at org.apache.hadoop.mapred.Master.getMasterAddress(Master.java:42)
at org.apache.hadoop.mapred.Master.getMasterPrincipal(Master.java:47)
at 
org.apache.hadoop.mapreduce.security.TokenCache.obtainTokensForNamenodesInternal(TokenCache.java:104)
at 
org.apache.hadoop.mapreduce.security.TokenCache.obtainTokensForNamenodesInternal(TokenCache.java:90)
at 
org.apache.hadoop.mapreduce.security.TokenCache.obtainTokensForNamenodes(TokenCache.java:83)
at 
org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:346)
at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1072)
at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1069)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1135)
at org.apache.hadoop.mapreduce.Job.submit(Job.java:1069)
at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1089)
at org.apache.hadoop.mapreduce.SleepJob.run(SleepJob.java:262)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
at org.apache.hadoop.mapreduce.SleepJob.main(SleepJob.java:194)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at 
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:144)
at 
org.apache.hadoop.test.MapredTestDriver.run(MapredTestDriver.java:111)
at 
org.apache.hadoop.test.MapredTestDriver.main(MapredTestDriver.java:118)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:189)
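
The exception shows the string `local` reaching a host:port parser. The sketch below illustrates why such a value is rejected and how a caller might skip the lookup when the framework is yarn. It is written under assumptions: `HostPortSketch`, `parse`, and `needsJobTrackerAddress` are hypothetical names, and this is neither the NetUtils.createSocketAddr implementation nor the eventual fix.

```java
import java.net.InetSocketAddress;

// Hypothetical sketch; not Hadoop's NetUtils or Master classes.
final class HostPortSketch {
    // Parses "host:port" and rejects values such as "local" that carry no port.
    static InetSocketAddress parse(String target) {
        int colon = target.lastIndexOf(':');
        if (colon <= 0 || colon == target.length() - 1) {
            throw new IllegalArgumentException("Not a host:port pair: " + target);
        }
        int port;
        try {
            port = Integer.parseInt(target.substring(colon + 1));
        } catch (NumberFormatException e) {
            throw new IllegalArgumentException("Not a host:port pair: " + target, e);
        }
        // createUnresolved avoids a DNS lookup; it also validates the port range.
        return InetSocketAddress.createUnresolved(target.substring(0, colon), port);
    }

    // Assumption for illustration: only a non-yarn framework needs a JobTracker address.
    static boolean needsJobTrackerAddress(String frameworkName) {
        return !"yarn".equals(frameworkName);
    }

    public static void main(String[] args) {
        System.out.println(parse("RMHost:8040"));
        if (needsJobTrackerAddress("yarn")) {
            parse("local"); // would throw; skipped here because the framework is yarn
        }
    }
}
```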







[jira] [Created] (MAPREDUCE-3034) NM should act on a REBOOT command from RM

2011-09-19 Thread Vinod Kumar Vavilapalli (JIRA)
NM should act on a REBOOT command from RM
-

 Key: MAPREDUCE-3034
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3034
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2, nodemanager
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.0


The RM sends a reboot command to the NM in some cases, such as when the NM 
gets lost and then rejoins. In such a case, the NM should act on the command 
and reboot/reinitialize itself.

This is akin to the TT reinitializing on order from the JT. We will need to 
shut down all the services properly and reinitialize; this should automatically 
take care of killing containers, cleaning up local temporary files, etc.
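
The shutdown-then-reinitialize sequence described here can be sketched roughly as follows. This is an illustration under assumptions, not the NodeManager implementation: `RebootSketch`, its `Service` interface, `onRebootCommand`, and the `recording` helper are hypothetical, and stopping services in reverse start order is a design choice of the sketch.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of a reboot handler; not the NodeManager implementation.
final class RebootSketch {
    interface Service {
        void start();
        void stop();
    }

    private final List<Service> services = new ArrayList<>();

    void add(Service service) {
        services.add(service);
    }

    void startAll() {
        for (Service s : services) {
            s.start();
        }
    }

    // Stop in reverse start order (a design choice of this sketch).
    void stopAll() {
        for (int i = services.size() - 1; i >= 0; i--) {
            services.get(i).stop();
        }
    }

    // Invoked when a heartbeat response carries a reboot command.
    void onRebootCommand() {
        stopAll();
        startAll();
    }

    // Demo helper: a service that records its lifecycle events in a log.
    static Service recording(final String name, final List<String> log) {
        return new Service() {
            @Override public void start() { log.add("start:" + name); }
            @Override public void stop() { log.add("stop:" + name); }
        };
    }

    public static void main(String[] args) {
        List<String> log = new ArrayList<>();
        RebootSketch nm = new RebootSketch();
        nm.add(recording("containers", log));
        nm.add(recording("statusUpdater", log));
        nm.startAll();
        nm.onRebootCommand();
        System.out.println(log);
    }
}
```

If each service releases its own resources in stop(), for example by killing its containers or deleting its temporary files, the reboot path needs no cleanup logic of its own.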





[jira] [Created] (MAPREDUCE-3036) Some of the Resource Manager memory metrics go negative.

2011-09-19 Thread Robert Joseph Evans (JIRA)
Some of the Resource Manager memory metrics go negative.


 Key: MAPREDUCE-3036
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3036
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0, 0.24.0
Reporter: Robert Joseph Evans
Assignee: Robert Joseph Evans
Priority: Blocker
 Fix For: 0.23.0, 0.24.0


ReservedGB seems to always be decremented when a container is released, even 
though the container never reserved any memory.
AvailableGB also seems to be able to go negative in a few situations.
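
One way to keep a reserved-memory gauge from going negative is to decrement it only by what a container actually reserved. The sketch below shows that idea under assumptions: `ReservationGaugeSketch` and its methods are hypothetical and do not reflect the ResourceManager's metrics classes.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of a reservation gauge; not the RM's metrics code.
final class ReservationGaugeSketch {
    // Memory (in GB) currently reserved, tracked per container id.
    private final Map<String, Integer> reservedByContainer = new HashMap<>();
    private int reservedGB;

    void reserve(String containerId, int gb) {
        reservedByContainer.merge(containerId, gb, Integer::sum);
        reservedGB += gb;
    }

    // Releasing a container subtracts only what that container reserved,
    // so a container that never reserved memory leaves the gauge unchanged.
    void release(String containerId) {
        Integer gb = reservedByContainer.remove(containerId);
        if (gb != null) {
            reservedGB -= gb;
        }
    }

    int reservedGB() {
        return reservedGB;
    }

    public static void main(String[] args) {
        ReservationGaugeSketch gauge = new ReservationGaugeSketch();
        gauge.release("container_1"); // never reserved anything
        gauge.reserve("container_2", 4);
        gauge.release("container_2");
        System.out.println(gauge.reservedGB());
    }
}
```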





[jira] [Created] (MAPREDUCE-3037) Add a consistent method for getting the job configuration's XML within the AppMaster and the JobHistory

2011-09-19 Thread Jeffrey Naisbitt (JIRA)
Add a consistent method for getting the job configuration's XML within the 
AppMaster and the JobHistory
---

 Key: MAPREDUCE-3037
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3037
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Reporter: Jeffrey Naisbitt
 Fix For: 0.23.0, 0.24.0


We should add consistent functionality for accessing/downloading the actual XML 
for the jobfile configuration from the Web UI (for both the AppMaster UI and 
the JobHistory UI).  Currently, you can get an HTML table with the 
configuration through /yarn/conf (a Configuration link on the left side of 
the UI).  It would be good to add a link there that allows the user to actually 
download the XML.  We could either have a conf-xml servlet or even better we 
could do something more REST-like by allowing requests to contain the desired 
format (XML, HTML, JSON, etc?).
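
The REST-like option, in which the request names the desired format, could look roughly like the sketch below. It is an illustration under assumptions: `ConfRenderSketch`, `parseFormat`, `render`, and the choice of HTML as the default are hypothetical, and no servlet or web framework API is shown. Only the XML layout (configuration, property, name, value) follows the usual Hadoop configuration file structure.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch of format selection for a configuration page.
final class ConfRenderSketch {
    enum Format { XML, HTML }

    // Maps a request parameter such as "xml" to a format; defaults to HTML.
    static Format parseFormat(String param) {
        if (param == null) {
            return Format.HTML;
        }
        return Format.valueOf(param.trim().toUpperCase(Locale.ROOT));
    }

    // Minimal escaping for element content.
    static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    static String render(Map<String, String> conf, Format format) {
        StringBuilder out = new StringBuilder();
        switch (format) {
            case XML:
                out.append("<configuration>");
                for (Map.Entry<String, String> e : conf.entrySet()) {
                    out.append("<property><name>").append(escape(e.getKey()))
                       .append("</name><value>").append(escape(e.getValue()))
                       .append("</value></property>");
                }
                out.append("</configuration>");
                break;
            case HTML:
                out.append("<table>");
                for (Map.Entry<String, String> e : conf.entrySet()) {
                    out.append("<tr><td>").append(escape(e.getKey()))
                       .append("</td><td>").append(escape(e.getValue()))
                       .append("</td></tr>");
                }
                out.append("</table>");
                break;
        }
        return out.toString();
    }

    public static void main(String[] args) {
        Map<String, String> conf = new LinkedHashMap<>();
        conf.put("mapreduce.framework.name", "yarn");
        System.out.println(render(conf, parseFormat("xml")));
        System.out.println(render(conf, parseFormat(null)));
    }
}
```

Keeping the renderers behind one entry point would let the AppMaster UI and the JobHistory UI share the same code path, and a further format such as JSON would be one more enum constant and case.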





[jira] [Created] (MAPREDUCE-3038) job history server not starting because conf() missing HsController

2011-09-19 Thread Thomas Graves (JIRA)
job history server not starting because conf() missing HsController
---

 Key: MAPREDUCE-3038
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3038
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Thomas Graves
 Fix For: 0.23.0


Exception starting history server.


Sep 19, 2011 6:51:53 PM com.google.inject.MessageProcessor visit
INFO: An exception was caught and reported. Message: 
org.apache.hadoop.yarn.webapp.WebAppException: conf() not found in class 
org.apache.hadoop.mapreduce.v2.hs.webapp.HsController   
  
org.apache.hadoop.yarn.webapp.WebAppException: conf() not found in class 
org.apache.hadoop.mapreduce.v2.hs.webapp.HsController
at org.apache.hadoop.yarn.webapp.Router.addController(Router.java:107)
at org.apache.hadoop.yarn.webapp.Router.add(Router.java:83)
at org.apache.hadoop.yarn.webapp.WebApp.route(WebApp.java:140)
at org.apache.hadoop.yarn.webapp.WebApp.route(WebApp.java:146)
at org.apache.hadoop.mapreduce.v2.hs.webapp.HsWebApp.setup(HsWebApp.java:42)
at org.apache.hadoop.yarn.webapp.WebApp.configureServlets(WebApp.java:121)
at com.google.inject.servlet.ServletModule.configure(ServletModule.java:45)
at com.google.inject.AbstractModule.configure(AbstractModule.java:59)
at com.google.inject.spi.Elements$RecordingBinder.install(Elements.java:223)
at com.google.inject.spi.Elements.getElements(Elements.java:101)
at com.google.inject.InjectorShell$Builder.build(InjectorShell.java:135)
at com.google.inject.InjectorBuilder.build(InjectorBuilder.java:102)
at com.google.inject.Guice.createInjector(Guice.java:92)
at com.google.inject.Guice.createInjector(Guice.java:69)
at com.google.inject.Guice.createInjector(Guice.java:59)
at org.apache.hadoop.yarn.webapp.WebApps$Builder.start(WebApps.java:166)
at 
org.apache.hadoop.mapreduce.v2.hs.HistoryClientService.initializeWebApp(HistoryClientService.java:138)
at 
org.apache.hadoop.mapreduce.v2.hs.HistoryClientService.start(HistoryClientService.java:109)
at 
org.apache.hadoop.yarn.service.CompositeService.start(CompositeService.java:68)
at 
org.apache.hadoop.mapreduce.v2.hs.JobHistoryServer.main(JobHistoryServer.java:83)





[jira] [Created] (MAPREDUCE-3039) Make hadoop-common use same version of avro as HBase

2011-09-19 Thread Joep Rottinghuis (JIRA)
Make hadoop-common use same version of avro as HBase


 Key: MAPREDUCE-3039
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3039
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/capacity-sched, contrib/fair-share, 
contrib/gridmix, contrib/mrunit, contrib/mumak, contrib/raid, 
contrib/streaming, jobhistoryserver
Affects Versions: 0.22.0
Reporter: Joep Rottinghuis
Assignee: Joep Rottinghuis
 Fix For: 0.22.0


HBase depends on avro 1.5.3 whereas hadoop-common depends on 1.3.2.
When building HBase on top of hadoop, this should be consistent.
Moreover, this should be consistent between common, hdfs, and mapreduce.

Contribs seem to have declared a dependency on avro but are not in fact 
depending on it.





[jira] [Created] (MAPREDUCE-3040) TestMRJobs, TestMRJobsWithHistoryService, TestMROldApiJobs fail

2011-09-19 Thread Thomas Graves (JIRA)
TestMRJobs, TestMRJobsWithHistoryService, TestMROldApiJobs fail
---

 Key: MAPREDUCE-3040
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3040
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Thomas Graves


Running org.apache.hadoop.mapreduce.v2.TestMRJobs
Tests run: 4, Failures: 0, Errors: 4, Skipped: 0, Time elapsed: 6.229 sec <<< FAILURE!
Running org.apache.hadoop.mapreduce.v2.TestMRJobsWithHistoryService
Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 5.887 sec <<< FAILURE!
Running org.apache.hadoop.mapreduce.v2.TestMROldApiJobs
Tests run: 2, Failures: 0, Errors: 2, Skipped: 0, Time elapsed: 6.067 sec <<< FAILURE!

All of them have the exception:


java.lang.NullPointerException
at 
org.apache.hadoop.mapreduce.v2.util.MRApps.parseDistributedCacheArtifacts(MRApps.java:300)
at 
org.apache.hadoop.mapreduce.v2.util.MRApps.setupDistributedCache(MRApps.java:277)
at 
org.apache.hadoop.mapred.YARNRunner.createApplicationSubmissionContext(YARNRunner.java:349)
at org.apache.hadoop.mapred.YARNRunner.submitJob(YARNRunner.java:227)
at 
org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:376)
at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1161)
at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1158)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1135)
at org.apache.hadoop.mapreduce.Job.submit(Job.java:1158)
at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1178)
at 
org.apache.hadoop.mapreduce.v2.TestMRJobs.testSleepJob(TestMRJobs.java:147)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at 
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:44)
at 
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:15)
at 
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:41)
at 
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:20)
at 
org.junit.runners.BlockJUnit4ClassRunner.runNotIgnored(BlockJUnit4ClassRunner.java:79)
at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:71)
at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:49)
at org.junit.runners.ParentRunner$3.run(ParentRunner.java:193)
at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:52)
at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:191)





[jira] [Created] (MAPREDUCE-3041) Enhance YARN Client-RM protocol to provide access to information such as cluster's Min/Max Resource capabilities similar to that of AM-RM protocol

2011-09-19 Thread Hitesh Shah (JIRA)
Enhance YARN Client-RM protocol to provide access to information such as 
cluster's Min/Max Resource capabilities similar to that of AM-RM protocol
--

 Key: MAPREDUCE-3041
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3041
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Hitesh Shah
Assignee: Hitesh Shah
 Fix For: 0.23.0


To request a container to launch an application master, the client needs to 
know the min/max resource capabilities so as to be able to make a proper 
resource request when submitting a new application.
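
To illustrate why the client needs these limits, the sketch below fits a requested memory size into a cluster's allowed range before submission. It rests on assumptions: `ResourceRequestSketch.normalize` and its megabyte parameters are hypothetical, and the actual protocol fields are not specified in this report.

```java
// Hypothetical sketch; the real Client-RM protocol fields are not shown here.
final class ResourceRequestSketch {
    // Clamps the requested memory into the range [minMb, maxMb].
    static int normalize(int requestedMb, int minMb, int maxMb) {
        if (minMb <= 0 || maxMb < minMb) {
            throw new IllegalArgumentException(
                "Invalid capability range: " + minMb + ".." + maxMb);
        }
        return Math.max(minMb, Math.min(requestedMb, maxMb));
    }

    public static void main(String[] args) {
        // Example limits are made up for the demonstration.
        System.out.println(normalize(512, 1024, 8192));
        System.out.println(normalize(16384, 1024, 8192));
    }
}
```

Without the limits, the client can only guess a size for the application master container and learn from a failed submission that the guess was out of range.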






[jira] [Created] (MAPREDUCE-3043) Missing containers info on the nodes page

2011-09-19 Thread Ramya Sunil (JIRA)
Missing containers info on the nodes page
-

 Key: MAPREDUCE-3043
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3043
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: resourcemanager
Affects Versions: 0.23.0
Reporter: Ramya Sunil
 Fix For: 0.23.0


The containers info on the nodes page on the RM seems to be missing. This was 
useful in understanding the usage on each of the nodemanagers.





[jira] [Created] (MAPREDUCE-3044) Pipes jobs stuck without making progress

2011-09-19 Thread Ramya Sunil (JIRA)
Pipes jobs stuck without making progress


 Key: MAPREDUCE-3044
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3044
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Ramya Sunil
 Fix For: 0.23.0


A simple example pipes job gets stuck without making any progress. The AM is 
launched but the maps do not make any progress.





[jira] [Created] (MAPREDUCE-3045) Elapsed time filter on jobhistory server displays incorrect table entries

2011-09-19 Thread Ramya Sunil (JIRA)
Elapsed time filter on jobhistory server displays incorrect table entries
-

 Key: MAPREDUCE-3045
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3045
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver, mrv2
Affects Versions: 0.23.0
Reporter: Ramya Sunil
Priority: Minor
 Fix For: 0.23.0


The elapsed time filter on the jobhistory server returns incorrect results. 
For example, on a cluster where the elapsed time of every task is either 7 or 
8 sec, the filter displays non-empty table entries for 1 sec or 3 sec.





Hadoop-Mapreduce-22-branch - Build # 72 - Still Failing

2011-09-19 Thread Apache Jenkins Server
See https://builds.apache.org/job/Hadoop-Mapreduce-22-branch/72/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 10521 lines...]
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestTextOutputFormat.java:79:
 warning: [unchecked] unchecked call to write(K,V) as a member of the raw type 
org.apache.hadoop.mapred.RecordWriter
[javac]   
__CLR2_4_39vm9vmgss6ygad.R.inc(12832);theRecordWriter.write(null, null);
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestTextOutputFormat.java:80:
 warning: [unchecked] unchecked call to write(K,V) as a member of the raw type 
org.apache.hadoop.mapred.RecordWriter
[javac]   
__CLR2_4_39vm9vmgss6ygad.R.inc(12833);theRecordWriter.write(key2, val2);
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestTextOutputFormat.java:126:
 warning: [unchecked] unchecked call to write(K,V) as a member of the raw type 
org.apache.hadoop.mapred.RecordWriter
[javac]   
__CLR2_4_39vm9vmgss6ygad.R.inc(12867);theRecordWriter.write(key1, val1);
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestTextOutputFormat.java:127:
 warning: [unchecked] unchecked call to write(K,V) as a member of the raw type 
org.apache.hadoop.mapred.RecordWriter
[javac]   
__CLR2_4_39vm9vmgss6ygad.R.inc(12868);theRecordWriter.write(null, nullWritable);
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestTextOutputFormat.java:128:
 warning: [unchecked] unchecked call to write(K,V) as a member of the raw type 
org.apache.hadoop.mapred.RecordWriter
[javac]   
__CLR2_4_39vm9vmgss6ygad.R.inc(12869);theRecordWriter.write(null, val1);
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestTextOutputFormat.java:129:
 warning: [unchecked] unchecked call to write(K,V) as a member of the raw type 
org.apache.hadoop.mapred.RecordWriter
[javac]   
__CLR2_4_39vm9vmgss6ygad.R.inc(12870);theRecordWriter.write(nullWritable, val2);
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestTextOutputFormat.java:130:
 warning: [unchecked] unchecked call to write(K,V) as a member of the raw type 
org.apache.hadoop.mapred.RecordWriter
[javac]   
__CLR2_4_39vm9vmgss6ygad.R.inc(12871);theRecordWriter.write(key2, nullWritable);
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestTextOutputFormat.java:131:
 warning: [unchecked] unchecked call to write(K,V) as a member of the raw type 
org.apache.hadoop.mapred.RecordWriter
[javac]   
__CLR2_4_39vm9vmgss6ygad.R.inc(12872);theRecordWriter.write(key1, null);
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestTextOutputFormat.java:132:
 warning: [unchecked] unchecked call to write(K,V) as a member of the raw type 
org.apache.hadoop.mapred.RecordWriter
[javac]   
__CLR2_4_39vm9vmgss6ygad.R.inc(12873);theRecordWriter.write(null, null);
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestTextOutputFormat.java:133:
 warning: [unchecked] unchecked call to write(K,V) as a member of the raw type 
org.apache.hadoop.mapred.RecordWriter
[javac]   
__CLR2_4_39vm9vmgss6ygad.R.inc(12874);theRecordWriter.write(key2, val2);
[javac]  ^
[javac] 
/tmp/clover2005808645041714307.tmp/org/apache/hadoop/mapred/TestCombineFileInputFormat.java:76:
 warning: [unchecked] unchecked call to 
CombineFileRecordReader(org.apache.hadoop.mapred.JobConf,org.apache.hadoop.mapred.lib.CombineFileSplit,org.apache.hadoop.mapred.Reporter,java.lang.Class<org.apache.hadoop.mapred.RecordReader<K,V>>)
 as a member of the raw type 
org.apache.hadoop.mapred.lib.CombineFileRecordReader
[javac] __CLR2_4_37jo7jogss6yg3s.R.inc(9797);return new 
CombineFileRecordReader(job, (CombineFileSplit)split, reporter, 
CombineFileRecordReader.class);
[javac] ^
[javac]