[jira] [Commented] (MAPREDUCE-5984) native-task: upgrade lz4 to latest version

2014-07-22 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069939#comment-14069939
 ] 

Binglin Chang commented on MAPREDUCE-5984:
--

bq.  but I'm wondering if it's possible to reuse the lz4 source files that are 
already checked in for hadoop-common
Sure, I will update the patch to copy the lz4 files to the build path. And we 
can upgrade the version in hadoop-common in trunk. 


 native-task: upgrade lz4 to latest version
 ---

 Key: MAPREDUCE-5984
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5984
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: task
Reporter: Binglin Chang
Assignee: Binglin Chang
Priority: Minor
 Attachments: MAPREDUCE-5984.v1.patch






--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-2841) Task level native optimization

2014-07-22 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069945#comment-14069945
 ] 

Binglin Chang commented on MAPREDUCE-2841:
--

Hi Sean, the test succeeded on Mac OS X but failed on Ubuntu 12, so I updated 
the test a little in MAPREDUCE-5985.

 Task level native optimization
 --

 Key: MAPREDUCE-2841
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2841
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: task
 Environment: x86-64 Linux/Unix
Reporter: Binglin Chang
Assignee: Sean Zhong
 Attachments: DESIGN.html, MAPREDUCE-2841.v1.patch, 
 MAPREDUCE-2841.v2.patch, dualpivot-0.patch, dualpivotv20-0.patch, 
 fb-shuffle.patch, hadoop-3.0-mapreduce-2841-2014-7-17.patch


 I've recently been working on native optimization for MapTask based on JNI. 
 The basic idea is to add a NativeMapOutputCollector to handle k/v pairs 
 emitted by the mapper, so that sort, spill, and IFile serialization can all be 
 done in native code. A preliminary test (on Xeon E5410, jdk6u24) showed 
 promising results:
 1. Sort is about 3x-10x as fast as Java (only binary string compare is 
 supported).
 2. IFile serialization speed is about 3x that of Java, about 500MB/s; if 
 hardware CRC32C is used, things can get much faster(1G/
 3. Merge code is not completed yet, so the test uses enough io.sort.mb to 
 prevent mid-spill.
 This leads to a total speedup of 2x~3x for the whole MapTask if 
 IdentityMapper (a mapper that does nothing) is used.
 There are limitations, of course: currently only Text and BytesWritable are 
 supported, and I have not thought through many things yet, such as how to 
 support map-side combine. I had some discussion with somebody familiar with 
 Hive, and it seems these limitations won't be much of a problem for Hive to 
 benefit from these optimizations, at least. Advice or discussion about 
 improving compatibility is most welcome:) 
 Currently NativeMapOutputCollector has a static method called canEnable(), 
 which checks whether the key/value types, comparator type, and combiner are 
 all compatible; MapTask can then choose to enable NativeMapOutputCollector.
 This is only a preliminary test and more work needs to be done. I expect 
 better final results, and I believe similar optimizations can be applied to 
 the reduce task and shuffle too. 
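 As a rough illustration of the check described above, canEnable() might look 
 like the sketch below. This is a hypothetical sketch, not the code from the 
 attached patches; the supported-type, comparator, and combiner checks are 
 assumptions based on the limitations listed above.
 {code}
 // Hypothetical sketch only -- not the implementation in the patch.
 import org.apache.hadoop.io.BytesWritable;
 import org.apache.hadoop.io.Text;
 import org.apache.hadoop.mapred.JobConf;

 public class NativeMapOutputCollector {
   /**
    * Returns true only when the job's key/value types, comparator and
    * combiner are all things the native collector can handle, so MapTask
    * can safely switch to the native path.
    */
   public static boolean canEnable(JobConf job) {
     Class<?> keyClass = job.getMapOutputKeyClass();
     Class<?> valueClass = job.getMapOutputValueClass();
     boolean supportedTypes =
         (keyClass == Text.class || keyClass == BytesWritable.class)
         && (valueClass == Text.class || valueClass == BytesWritable.class);
     // Only the default byte-wise comparator is supported natively, so bail
     // out if a custom output key comparator class was configured.
     // (Config key shown for illustration.)
     boolean defaultComparator =
         job.getClass("mapred.output.key.comparator.class", null) == null;
     // Map-side combine is not handled natively yet.
     boolean noCombiner = job.getCombinerClass() == null;
     return supportedTypes && defaultComparator && noCombiner;
   }
 }
 {code}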



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (MAPREDUCE-5988) Fix dead links to the javadocs of o.a.h.mapreduce.counters

2014-07-22 Thread Akira AJISAKA (JIRA)
Akira AJISAKA created MAPREDUCE-5988:


 Summary: Fix dead links to the javadocs of o.a.h.mapreduce.counters
 Key: MAPREDUCE-5988
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.4.1
Reporter: Akira AJISAKA
Priority: Minor


In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, 
AbstractCounters and CounterGroupBase are listed, but not linked.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5988) Fix dead links to the javadocs of o.a.h.mapreduce.counters

2014-07-22 Thread Akira AJISAKA (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Akira AJISAKA updated MAPREDUCE-5988:
-

Attachment: MAPREDUCE-5988.patch

Removing {{@InterfaceAudience.Private}} from the package-info to generate the 
javadocs of {{CounterGroupBase}} and {{AbstractCounters}}.
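For context, a minimal sketch of what such a package-info.java looks like 
before the change is below (simplified; the real file may carry additional 
annotations such as {{@InterfaceStability}}). Removing the package-level 
{{@InterfaceAudience.Private}} line lets javadoc be generated for the package, 
so the entries in allclasses-frame.html get a target page to link to.

{code}
// Sketch of a package-info.java before the change (simplified):
@InterfaceAudience.Private          // <-- the line the patch removes
package org.apache.hadoop.mapreduce.counters;
import org.apache.hadoop.classification.InterfaceAudience;
{code}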

 Fix dead links to the javadocs of o.a.h.mapreduce.counters
 --

 Key: MAPREDUCE-5988
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.4.1
Reporter: Akira AJISAKA
Priority: Minor
 Attachments: MAPREDUCE-5988.patch


 In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, 
 AbstractCounters and CounterGroupBase are listed, but not linked.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project

2014-07-22 Thread Akira AJISAKA (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Akira AJISAKA updated MAPREDUCE-5988:
-

Summary: Fix dead links to the javadocs in mapreduce project  (was: Fix 
dead links to the javadocs of o.a.h.mapreduce.counters)

 Fix dead links to the javadocs in mapreduce project
 ---

 Key: MAPREDUCE-5988
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.4.1
Reporter: Akira AJISAKA
Assignee: Akira AJISAKA
Priority: Minor
 Attachments: MAPREDUCE-5988.patch


 In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some 
 classes are listed, but not linked.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5988) Fix dead links to the javadocs of o.a.h.mapreduce.counters

2014-07-22 Thread Akira AJISAKA (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Akira AJISAKA updated MAPREDUCE-5988:
-

Description: In 
http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some classes 
are listed, but not linked.  (was: In 
http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, 
AbstractCounters and CounterGroupBase are listed, but not linked.)
   Assignee: Akira AJISAKA

 Fix dead links to the javadocs of o.a.h.mapreduce.counters
 --

 Key: MAPREDUCE-5988
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.4.1
Reporter: Akira AJISAKA
Assignee: Akira AJISAKA
Priority: Minor
 Attachments: MAPREDUCE-5988.patch


 In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some 
 classes are listed, but not linked.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project

2014-07-22 Thread Akira AJISAKA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070062#comment-14070062
 ] 

Akira AJISAKA commented on MAPREDUCE-5988:
--

The classes below are linked, but undocumented:
- AbstractCounters
- CounterGroupBase
- CancelDelegationTokenRequest
- CancelDelegationTokenResponse
- GetDelegationTokenRequest
- RenewDelegationTokenRequest
- RenewDelegationTokenResponse
- HistoryFileManager
- HistoryStorage

 Fix dead links to the javadocs in mapreduce project
 ---

 Key: MAPREDUCE-5988
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.4.1
Reporter: Akira AJISAKA
Assignee: Akira AJISAKA
Priority: Minor
 Attachments: MAPREDUCE-5988.patch


 In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some 
 classes are listed, but not linked.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project

2014-07-22 Thread Akira AJISAKA (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Akira AJISAKA updated MAPREDUCE-5988:
-

Description: In 
http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some classes 
are listed, but not documented.  (was: In 
http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some classes 
are listed, but not linked.)

 Fix dead links to the javadocs in mapreduce project
 ---

 Key: MAPREDUCE-5988
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.4.1
Reporter: Akira AJISAKA
Assignee: Akira AJISAKA
Priority: Minor
 Attachments: MAPREDUCE-5988.patch


 In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some 
 classes are listed, but not documented.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project

2014-07-22 Thread Akira AJISAKA (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Akira AJISAKA updated MAPREDUCE-5988:
-

Attachment: MAPREDUCE-5988.2.patch

Removed {{@InterfaceAudience.Private}} from each package-info. I confirmed the 
javadocs of the above classes were generated.

 Fix dead links to the javadocs in mapreduce project
 ---

 Key: MAPREDUCE-5988
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.4.1
Reporter: Akira AJISAKA
Assignee: Akira AJISAKA
Priority: Minor
 Attachments: MAPREDUCE-5988.2.patch, MAPREDUCE-5988.patch


 In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some 
 classes are listed, but not documented.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project

2014-07-22 Thread Akira AJISAKA (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Akira AJISAKA updated MAPREDUCE-5988:
-

Target Version/s: 2.6.0
  Status: Patch Available  (was: Open)

 Fix dead links to the javadocs in mapreduce project
 ---

 Key: MAPREDUCE-5988
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.4.1
Reporter: Akira AJISAKA
Assignee: Akira AJISAKA
Priority: Minor
 Attachments: MAPREDUCE-5988.2.patch, MAPREDUCE-5988.patch


 In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some 
 classes are listed, but not documented.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project

2014-07-22 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070091#comment-14070091
 ] 

Hadoop QA commented on MAPREDUCE-5988:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12657101/MAPREDUCE-5988.2.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+0 tests included{color}.  The patch appears to be a 
documentation patch that doesn't require tests.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 eclipse:eclipse{color}.  The patch built with 
eclipse:eclipse.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:red}-1 core tests{color}.  The patch failed these unit tests in 
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common 
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core 
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs:

  org.apache.hadoop.mapreduce.v2.hs.TestJobHistoryParsing
  org.apache.hadoop.mapreduce.v2.hs.webapp.dao.TestJobInfo

{color:green}+1 contrib tests{color}.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4760//testReport/
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4760//console

This message is automatically generated.

 Fix dead links to the javadocs in mapreduce project
 ---

 Key: MAPREDUCE-5988
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.4.1
Reporter: Akira AJISAKA
Assignee: Akira AJISAKA
Priority: Minor
 Attachments: MAPREDUCE-5988.2.patch, MAPREDUCE-5988.patch


 In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some 
 classes are listed, but not documented.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5957) AM throws ClassNotFoundException with job classloader enabled if custom output format/committer is used

2014-07-22 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070121#comment-14070121
 ] 

Hudson commented on MAPREDUCE-5957:
---

FAILURE: Integrated in Hadoop-Yarn-trunk #620 (See 
[https://builds.apache.org/job/Hadoop-Yarn-trunk/620/])
MAPREDUCE-5957. AM throws ClassNotFoundException with job classloader enabled 
if custom output format/committer is used. Contributed by Sangjin Lee (jlowe: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1612358)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/MRAppMaster.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/commit/CommitterEventHandler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/v2/TestMRJobs.java


 AM throws ClassNotFoundException with job classloader enabled if custom 
 output format/committer is used
 ---

 Key: MAPREDUCE-5957
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5957
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 2.4.0
Reporter: Sangjin Lee
Assignee: Sangjin Lee
 Fix For: 3.0.0, 2.6.0

 Attachments: MAPREDUCE-5957.branch-2.patch, MAPREDUCE-5957.patch, 
 MAPREDUCE-5957.patch, MAPREDUCE-5957.patch, MAPREDUCE-5957.patch, 
 MAPREDUCE-5957.patch, MAPREDUCE-5957.patch


 With the job classloader enabled, the MR AM throws ClassNotFoundException if 
 a custom output format class is specified.
 {noformat}
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: 
 java.lang.RuntimeException: java.lang.ClassNotFoundException: Class 
 com.foo.test.TestOutputFormat not found
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.createOutputCommitter(MRAppMaster.java:473)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.serviceInit(MRAppMaster.java:374)
   at 
 org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster$1.run(MRAppMaster.java:1459)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:415)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.initAndStartAppMaster(MRAppMaster.java:1456)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.main(MRAppMaster.java:1389)
 Caused by: java.lang.RuntimeException: java.lang.ClassNotFoundException: 
 Class com.foo.test.TestOutputFormat not found
   at 
 org.apache.hadoop.conf.Configuration.getClass(Configuration.java:1895)
   at 
 org.apache.hadoop.mapreduce.task.JobContextImpl.getOutputFormatClass(JobContextImpl.java:222)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.createOutputCommitter(MRAppMaster.java:469)
   ... 8 more
 Caused by: java.lang.ClassNotFoundException: Class 
 com.foo.test.TestOutputFormat not found
   at 
 org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:1801)
   at 
 org.apache.hadoop.conf.Configuration.getClass(Configuration.java:1893)
   ... 10 more
 {noformat}
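 A minimal reproduction sketch is below; the class com.foo.test.TestOutputFormat 
 stands in for any user-supplied output format shipped in the job jar, and the 
 exact job setup here is an assumption, not taken from this report.
 {code}
 import com.foo.test.TestOutputFormat;           // hypothetical user class
 import org.apache.hadoop.conf.Configuration;
 import org.apache.hadoop.fs.Path;
 import org.apache.hadoop.mapreduce.Job;
 import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;

 public class CustomOutputFormatRepro {
   public static void main(String[] args) throws Exception {
     Configuration conf = new Configuration();
     // Isolate user classes from the system classpath in the AM and tasks.
     conf.setBoolean("mapreduce.job.classloader", true);

     Job job = Job.getInstance(conf, "custom-output-format-repro");
     job.setJarByClass(CustomOutputFormatRepro.class);
     FileInputFormat.addInputPath(job, new Path(args[0]));
     // Custom output format from the job jar.
     job.setOutputFormatClass(TestOutputFormat.class);
     // The AM fails while creating the output committer because the custom
     // class is only visible to the job classloader (stack trace above).
     System.exit(job.waitForCompletion(true) ? 0 : 1);
   }
 }
 {code}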



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5756) CombineFileInputFormat.getSplits() including directories in its results

2014-07-22 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070123#comment-14070123
 ] 

Hudson commented on MAPREDUCE-5756:
---

FAILURE: Integrated in Hadoop-Yarn-trunk #620 (See 
[https://builds.apache.org/job/Hadoop-Yarn-trunk/620/])
MAPREDUCE-5756. CombineFileInputFormat.getSplits() including directories in its 
results. Contributed by Jason Dere (jlowe: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1612400)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/lib/input/CombineFileInputFormat.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/lib/input/TestCombineFileInputFormat.java


 CombineFileInputFormat.getSplits() including directories in its results
 ---

 Key: MAPREDUCE-5756
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5756
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Jason Dere
Assignee: Jason Dere
 Fix For: 3.0.0, 2.6.0

 Attachments: MAPREDUCE-5756.1.patch, MAPREDUCE-5756.2.patch


 Trying to track down HIVE-6401, where we see some "is not a file" errors 
 because getSplits() is giving us directories.  I believe the culprit is 
 FileInputFormat.listStatus():
 {code}
 if (recursive && stat.isDirectory()) {
   addInputPathRecursively(result, fs, stat.getPath(),
   inputFilter);
 } else {
   result.add(stat);
 }
 {code}
 This seems to allow directories to be added to the results if 
 recursive is false.  Is this meant to return directories? If not, I think it 
 should look like this:
 {code}
 if (stat.isDirectory()) {
  if (recursive) {
   addInputPathRecursively(result, fs, stat.getPath(),
   inputFilter);
  }
 } else {
   result.add(stat);
 }
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (MAPREDUCE-5989) Add DeletionService in AM

2014-07-22 Thread Varun Saxena (JIRA)
Varun Saxena created MAPREDUCE-5989:
---

 Summary: Add DeletionService in AM
 Key: MAPREDUCE-5989
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5989
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: applicationmaster
Reporter: Varun Saxena
Assignee: Varun Saxena


In the AM, for graceful cleanup, I propose adding a DeletionService which will 
do the following:
1. Cleanup of failed tasks (temporary data need not occupy space until the NM's 
Deletion Service is invoked)
2. Staging directory deletion (during AM shutdown, it's better to place staging 
dir cleanup in the Deletion Service; refer to MAPREDUCE-4841)




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5989) Add DeletionService in AM

2014-07-22 Thread Jason Lowe (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070246#comment-14070246
 ] 

Jason Lowe commented on MAPREDUCE-5989:
---

Is this the same kind of DeletionService that the NM currently uses?  If so, I'm 
unclear on the tangible benefits, since all that service does is potentially 
postpone deletions.  As for the staging directory cleanup, implementing a 
deletion service is not needed to fix that issue.  Actually, I believe it's 
already fixed by MAPREDUCE-5476, which deletes the staging directory only after 
unregistering, so we know no other AM attempts will be launched after the 
staging directory is removed.

If you could walk through an example scenario where the deletion service is 
used and how it's useful, that would help me understand why adding such a 
service would be helpful.
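For illustration, the ordering described above can be sketched as follows; the 
class and method names are hypothetical, not the actual MRAppMaster code.

{code}
// Hypothetical sketch of the shutdown ordering from MAPREDUCE-5476;
// the method bodies are stubs, only the ordering matters.
public abstract class AmShutdownSketch {
  /** Tell the RM the app is done; no further AM attempt will be launched. */
  protected abstract void unregisterFromResourceManager() throws Exception;

  /** Remove the job's staging directory from HDFS. */
  protected abstract void cleanupStagingDir() throws Exception;

  public final void shutDownJob() throws Exception {
    // Unregister first...
    unregisterFromResourceManager();
    // ...and delete the staging directory only afterwards, so a late retry
    // can never be launched against a missing staging directory.
    cleanupStagingDir();
  }
}
{code}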

 Add DeletionService in AM
 -

 Key: MAPREDUCE-5989
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5989
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: applicationmaster
Reporter: Varun Saxena
Assignee: Varun Saxena

 In the AM, for graceful cleanup, I propose adding a DeletionService which 
 will do the following:
 1. Cleanup of failed tasks (temporary data need not occupy space until the 
 NM's Deletion Service is invoked)
 2. Staging directory deletion (during AM shutdown, it's better to place 
 staging dir cleanup in the Deletion Service; refer to MAPREDUCE-4841)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Assigned] (MAPREDUCE-4841) Application Master Retries fail due to FileNotFoundException

2014-07-22 Thread Jason Lowe (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe reassigned MAPREDUCE-4841:
-

Assignee: Jason Lowe  (was: Devaraj K)

 Application Master Retries fail due to FileNotFoundException
 

 Key: MAPREDUCE-4841
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4841
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: applicationmaster
Affects Versions: 2.0.1-alpha
Reporter: Devaraj K
Assignee: Jason Lowe
Priority: Critical

 Application attempt1 is deleting the job related files and these are not 
 present in the HDFS for following retries.
 {code:xml}
 Application application_1353724754961_0001 failed 4 times due to AM Container 
 for appattempt_1353724754961_0001_04 exited with exitCode: -1000 due to: 
 RemoteTrace: java.io.FileNotFoundException: File does not exist: 
 hdfs://hacluster:8020/tmp/hadoop-yarn/staging/mapred/.staging/job_1353724754961_0001/appTokens
  at 
 org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:752)
  at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:88) at 
 org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:49) at 
 org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:157) at 
 org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:155) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
  at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:153) at 
 org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:49) at 
 java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at 
 java.util.concurrent.FutureTask.run(FutureTask.java:138) at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441) at 
 java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at 
 java.util.concurrent.FutureTask.run(FutureTask.java:138) at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
  at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
  at java.lang.Thread.run(Thread.java:662) at LocalTrace: 
 org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: File 
 does not exist: 
 hdfs://hacluster:8020/tmp/hadoop-yarn/staging/mapred/.staging/job_1353724754961_0001/appTokens
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.convertFromProtoFormat(LocalResourceStatusPBImpl.java:217)
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.getException(LocalResourceStatusPBImpl.java:147)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.update(ResourceLocalizationService.java:822)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerTracker.processHeartbeat(ResourceLocalizationService.java:492)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.heartbeat(ResourceLocalizationService.java:221)
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.impl.pb.service.LocalizationProtocolPBServiceImpl.heartbeat(LocalizationProtocolPBServiceImpl.java:46)
  at 
 org.apache.hadoop.yarn.proto.LocalizationProtocol$LocalizationProtocolService$2.callBlockingMethod(LocalizationProtocol.java:57)
  at 
 org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:427)
  at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:924) at 
 org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1692) at 
 org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1688) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
  at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1686) .Failing this 
 attempt.. Failing the application. 
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-4841) Application Master Retries fail due to FileNotFoundException

2014-07-22 Thread Jason Lowe (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070247#comment-14070247
 ] 

Jason Lowe commented on MAPREDUCE-4841:
---

I believe this has been fixed by MAPREDUCE-5476.  [~devaraj.k] if you agree, 
then we can mark this as a duplicate of that JIRA.

 Application Master Retries fail due to FileNotFoundException
 

 Key: MAPREDUCE-4841
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4841
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: applicationmaster
Affects Versions: 2.0.1-alpha
Reporter: Devaraj K
Assignee: Jason Lowe
Priority: Critical

 Application attempt1 is deleting the job related files and these are not 
 present in the HDFS for following retries.
 {code:xml}
 Application application_1353724754961_0001 failed 4 times due to AM Container 
 for appattempt_1353724754961_0001_04 exited with exitCode: -1000 due to: 
 RemoteTrace: java.io.FileNotFoundException: File does not exist: 
 hdfs://hacluster:8020/tmp/hadoop-yarn/staging/mapred/.staging/job_1353724754961_0001/appTokens
  at 
 org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:752)
  at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:88) at 
 org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:49) at 
 org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:157) at 
 org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:155) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
  at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:153) at 
 org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:49) at 
 java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at 
 java.util.concurrent.FutureTask.run(FutureTask.java:138) at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441) at 
 java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at 
 java.util.concurrent.FutureTask.run(FutureTask.java:138) at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
  at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
  at java.lang.Thread.run(Thread.java:662) at LocalTrace: 
 org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: File 
 does not exist: 
 hdfs://hacluster:8020/tmp/hadoop-yarn/staging/mapred/.staging/job_1353724754961_0001/appTokens
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.convertFromProtoFormat(LocalResourceStatusPBImpl.java:217)
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.getException(LocalResourceStatusPBImpl.java:147)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.update(ResourceLocalizationService.java:822)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerTracker.processHeartbeat(ResourceLocalizationService.java:492)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.heartbeat(ResourceLocalizationService.java:221)
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.impl.pb.service.LocalizationProtocolPBServiceImpl.heartbeat(LocalizationProtocolPBServiceImpl.java:46)
  at 
 org.apache.hadoop.yarn.proto.LocalizationProtocol$LocalizationProtocolService$2.callBlockingMethod(LocalizationProtocol.java:57)
  at 
 org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:427)
  at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:924) at 
 org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1692) at 
 org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1688) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
  at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1686) .Failing this 
 attempt.. Failing the application. 
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-4841) Application Master Retries fail due to FileNotFoundException

2014-07-22 Thread Devaraj K (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj K resolved MAPREDUCE-4841.
--

Resolution: Fixed

It has been fixed by MAPREDUCE-5476; closing this as a duplicate of that JIRA.

 Application Master Retries fail due to FileNotFoundException
 

 Key: MAPREDUCE-4841
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4841
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: applicationmaster
Affects Versions: 2.0.1-alpha
Reporter: Devaraj K
Assignee: Jason Lowe
Priority: Critical

 Application attempt1 is deleting the job related files and these are not 
 present in the HDFS for following retries.
 {code:xml}
 Application application_1353724754961_0001 failed 4 times due to AM Container 
 for appattempt_1353724754961_0001_04 exited with exitCode: -1000 due to: 
 RemoteTrace: java.io.FileNotFoundException: File does not exist: 
 hdfs://hacluster:8020/tmp/hadoop-yarn/staging/mapred/.staging/job_1353724754961_0001/appTokens
  at 
 org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:752)
  at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:88) at 
 org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:49) at 
 org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:157) at 
 org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:155) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
  at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:153) at 
 org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:49) at 
 java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at 
 java.util.concurrent.FutureTask.run(FutureTask.java:138) at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441) at 
 java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at 
 java.util.concurrent.FutureTask.run(FutureTask.java:138) at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
  at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
  at java.lang.Thread.run(Thread.java:662) at LocalTrace: 
 org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: File 
 does not exist: 
 hdfs://hacluster:8020/tmp/hadoop-yarn/staging/mapred/.staging/job_1353724754961_0001/appTokens
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.convertFromProtoFormat(LocalResourceStatusPBImpl.java:217)
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.getException(LocalResourceStatusPBImpl.java:147)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.update(ResourceLocalizationService.java:822)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerTracker.processHeartbeat(ResourceLocalizationService.java:492)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.heartbeat(ResourceLocalizationService.java:221)
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.impl.pb.service.LocalizationProtocolPBServiceImpl.heartbeat(LocalizationProtocolPBServiceImpl.java:46)
  at 
 org.apache.hadoop.yarn.proto.LocalizationProtocol$LocalizationProtocolService$2.callBlockingMethod(LocalizationProtocol.java:57)
  at 
 org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:427)
  at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:924) at 
 org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1692) at 
 org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1688) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
  at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1686) .Failing this 
 attempt.. Failing the application. 
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Reopened] (MAPREDUCE-4841) Application Master Retries fail due to FileNotFoundException

2014-07-22 Thread Devaraj K (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj K reopened MAPREDUCE-4841:
--


 Application Master Retries fail due to FileNotFoundException
 

 Key: MAPREDUCE-4841
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4841
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: applicationmaster
Affects Versions: 2.0.1-alpha
Reporter: Devaraj K
Assignee: Jason Lowe
Priority: Critical

 Application attempt1 is deleting the job related files and these are not 
 present in the HDFS for following retries.
 {code:xml}
 Application application_1353724754961_0001 failed 4 times due to AM Container 
 for appattempt_1353724754961_0001_04 exited with exitCode: -1000 due to: 
 RemoteTrace: java.io.FileNotFoundException: File does not exist: 
 hdfs://hacluster:8020/tmp/hadoop-yarn/staging/mapred/.staging/job_1353724754961_0001/appTokens
  at 
 org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:752)
  at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:88) at 
 org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:49) at 
 org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:157) at 
 org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:155) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
  at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:153) at 
 org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:49) at 
 java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at 
 java.util.concurrent.FutureTask.run(FutureTask.java:138) at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441) at 
 java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at 
 java.util.concurrent.FutureTask.run(FutureTask.java:138) at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
  at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
  at java.lang.Thread.run(Thread.java:662) at LocalTrace: 
 org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: File 
 does not exist: 
 hdfs://hacluster:8020/tmp/hadoop-yarn/staging/mapred/.staging/job_1353724754961_0001/appTokens
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.convertFromProtoFormat(LocalResourceStatusPBImpl.java:217)
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.getException(LocalResourceStatusPBImpl.java:147)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.update(ResourceLocalizationService.java:822)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerTracker.processHeartbeat(ResourceLocalizationService.java:492)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.heartbeat(ResourceLocalizationService.java:221)
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.impl.pb.service.LocalizationProtocolPBServiceImpl.heartbeat(LocalizationProtocolPBServiceImpl.java:46)
  at 
 org.apache.hadoop.yarn.proto.LocalizationProtocol$LocalizationProtocolService$2.callBlockingMethod(LocalizationProtocol.java:57)
  at 
 org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:427)
  at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:924) at 
 org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1692) at 
 org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1688) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
  at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1686) .Failing this 
 attempt.. Failing the application. 
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-4841) Application Master Retries fail due to FileNotFoundException

2014-07-22 Thread Devaraj K (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj K resolved MAPREDUCE-4841.
--

Resolution: Duplicate

 Application Master Retries fail due to FileNotFoundException
 

 Key: MAPREDUCE-4841
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4841
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: applicationmaster
Affects Versions: 2.0.1-alpha
Reporter: Devaraj K
Assignee: Jason Lowe
Priority: Critical

 Application attempt1 is deleting the job related files and these are not 
 present in the HDFS for following retries.
 {code:xml}
 Application application_1353724754961_0001 failed 4 times due to AM Container 
 for appattempt_1353724754961_0001_04 exited with exitCode: -1000 due to: 
 RemoteTrace: java.io.FileNotFoundException: File does not exist: 
 hdfs://hacluster:8020/tmp/hadoop-yarn/staging/mapred/.staging/job_1353724754961_0001/appTokens
  at 
 org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:752)
  at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:88) at 
 org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:49) at 
 org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:157) at 
 org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:155) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
  at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:153) at 
 org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:49) at 
 java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at 
 java.util.concurrent.FutureTask.run(FutureTask.java:138) at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441) at 
 java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at 
 java.util.concurrent.FutureTask.run(FutureTask.java:138) at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
  at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
  at java.lang.Thread.run(Thread.java:662) at LocalTrace: 
 org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: File 
 does not exist: 
 hdfs://hacluster:8020/tmp/hadoop-yarn/staging/mapred/.staging/job_1353724754961_0001/appTokens
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.convertFromProtoFormat(LocalResourceStatusPBImpl.java:217)
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.getException(LocalResourceStatusPBImpl.java:147)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.update(ResourceLocalizationService.java:822)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerTracker.processHeartbeat(ResourceLocalizationService.java:492)
  at 
 org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.heartbeat(ResourceLocalizationService.java:221)
  at 
 org.apache.hadoop.yarn.server.nodemanager.api.impl.pb.service.LocalizationProtocolPBServiceImpl.heartbeat(LocalizationProtocolPBServiceImpl.java:46)
  at 
 org.apache.hadoop.yarn.proto.LocalizationProtocol$LocalizationProtocolService$2.callBlockingMethod(LocalizationProtocol.java:57)
  at 
 org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:427)
  at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:924) at 
 org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1692) at 
 org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1688) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
  at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1686) .Failing this 
 attempt.. Failing the application. 
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5963) ShuffleHandler DB schema should be versioned with compatible/incompatible changes

2014-07-22 Thread Junping Du (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Junping Du updated MAPREDUCE-5963:
--

Attachment: MAPREDUCE-5963-v2.1.patch

The latest patch fixes the findbugs warning.

 ShuffleHandler DB schema should be versioned with compatible/incompatible 
 changes
 -

 Key: MAPREDUCE-5963
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5963
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Affects Versions: 2.4.1
Reporter: Junping Du
Assignee: Junping Du
 Attachments: MAPREDUCE-5963-v2.1.patch, MAPREDUCE-5963-v2.patch, 
 MAPREDUCE-5963.patch


 ShuffleHandler persists job shuffle info into a DB schema, which should be 
 versioned with compatible/incompatible changes to support rolling upgrade.
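 A common pattern for this in other Hadoop state stores is a major.minor 
 version record: a minor-version bump stays compatible, while a different 
 major version is rejected. The sketch below illustrates that check with 
 hypothetical names; it is not taken from the attached patches.
 {code}
 import java.io.IOException;

 class ShuffleSchemaVersionCheck {
   static final int CURRENT_MAJOR = 1;
   static final int CURRENT_MINOR = 0;

   /** Only a differing major version is treated as incompatible. */
   static boolean isCompatible(int storedMajor) {
     return storedMajor == CURRENT_MAJOR;
   }

   static void checkVersion(Integer storedMajor, Integer storedMinor)
       throws IOException {
     if (storedMajor == null) {
       return; // fresh DB: caller writes the current version and continues
     }
     if (!isCompatible(storedMajor)) {
       throw new IOException("Incompatible shuffle state DB schema: found "
           + storedMajor + "." + storedMinor
           + ", expected major version " + CURRENT_MAJOR);
     }
     // Same major, possibly different minor: compatible, proceed.
   }
 }
 {code}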



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5756) CombineFileInputFormat.getSplits() including directories in its results

2014-07-22 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070284#comment-14070284
 ] 

Hudson commented on MAPREDUCE-5756:
---

FAILURE: Integrated in Hadoop-Hdfs-trunk #1812 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk/1812/])
MAPREDUCE-5756. CombineFileInputFormat.getSplits() including directories in its 
results. Contributed by Jason Dere (jlowe: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1612400)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/lib/input/CombineFileInputFormat.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/lib/input/TestCombineFileInputFormat.java


 CombineFileInputFormat.getSplits() including directories in its results
 ---

 Key: MAPREDUCE-5756
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5756
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Jason Dere
Assignee: Jason Dere
 Fix For: 3.0.0, 2.6.0

 Attachments: MAPREDUCE-5756.1.patch, MAPREDUCE-5756.2.patch


 Trying to track down HIVE-6401, where we see some "is not a file" errors 
 because getSplits() is giving us directories.  I believe the culprit is 
 FileInputFormat.listStatus():
 {code}
 if (recursive && stat.isDirectory()) {
   addInputPathRecursively(result, fs, stat.getPath(),
   inputFilter);
 } else {
   result.add(stat);
 }
 {code}
 This seems to allow directories to be added to the results if 
 recursive is false.  Is this meant to return directories? If not, I think it 
 should look like this:
 {code}
 if (stat.isDirectory()) {
  if (recursive) {
   addInputPathRecursively(result, fs, stat.getPath(),
   inputFilter);
  }
 } else {
   result.add(stat);
 }
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5957) AM throws ClassNotFoundException with job classloader enabled if custom output format/committer is used

2014-07-22 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070282#comment-14070282
 ] 

Hudson commented on MAPREDUCE-5957:
---

FAILURE: Integrated in Hadoop-Hdfs-trunk #1812 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk/1812/])
MAPREDUCE-5957. AM throws ClassNotFoundException with job classloader enabled 
if custom output format/committer is used. Contributed by Sangjin Lee (jlowe: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1612358)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/MRAppMaster.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/commit/CommitterEventHandler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/v2/TestMRJobs.java


 AM throws ClassNotFoundException with job classloader enabled if custom 
 output format/committer is used
 ---

 Key: MAPREDUCE-5957
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5957
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 2.4.0
Reporter: Sangjin Lee
Assignee: Sangjin Lee
 Fix For: 3.0.0, 2.6.0

 Attachments: MAPREDUCE-5957.branch-2.patch, MAPREDUCE-5957.patch, 
 MAPREDUCE-5957.patch, MAPREDUCE-5957.patch, MAPREDUCE-5957.patch, 
 MAPREDUCE-5957.patch, MAPREDUCE-5957.patch


 With the job classloader enabled, the MR AM throws ClassNotFoundException if 
 a custom output format class is specified.
 {noformat}
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: 
 java.lang.RuntimeException: java.lang.ClassNotFoundException: Class 
 com.foo.test.TestOutputFormat not found
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.createOutputCommitter(MRAppMaster.java:473)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.serviceInit(MRAppMaster.java:374)
   at 
 org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster$1.run(MRAppMaster.java:1459)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:415)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.initAndStartAppMaster(MRAppMaster.java:1456)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.main(MRAppMaster.java:1389)
 Caused by: java.lang.RuntimeException: java.lang.ClassNotFoundException: 
 Class com.foo.test.TestOutputFormat not found
   at 
 org.apache.hadoop.conf.Configuration.getClass(Configuration.java:1895)
   at 
 org.apache.hadoop.mapreduce.task.JobContextImpl.getOutputFormatClass(JobContextImpl.java:222)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.createOutputCommitter(MRAppMaster.java:469)
   ... 8 more
 Caused by: java.lang.ClassNotFoundException: Class 
 com.foo.test.TestOutputFormat not found
   at 
 org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:1801)
   at 
 org.apache.hadoop.conf.Configuration.getClass(Configuration.java:1893)
   ... 10 more
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5963) ShuffleHandler DB schema should be versioned with compatible/incompatible changes

2014-07-22 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070309#comment-14070309
 ] 

Hadoop QA commented on MAPREDUCE-5963:
--

{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12657123/MAPREDUCE-5963-v2.1.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 eclipse:eclipse{color}.  The patch built with 
eclipse:eclipse.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in 
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle.

{color:green}+1 contrib tests{color}.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4761//testReport/
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/4761//console

This message is automatically generated.

 ShuffleHandler DB schema should be versioned with compatible/incompatible 
 changes
 -

 Key: MAPREDUCE-5963
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5963
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Affects Versions: 2.4.1
Reporter: Junping Du
Assignee: Junping Du
 Attachments: MAPREDUCE-5963-v2.1.patch, MAPREDUCE-5963-v2.patch, 
 MAPREDUCE-5963.patch


 ShuffleHandler persists job shuffle info into a DB schema, which should be 
 versioned with compatible/incompatible changes to support rolling upgrade.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5756) CombineFileInputFormat.getSplits() including directories in its results

2014-07-22 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070356#comment-14070356
 ] 

Hudson commented on MAPREDUCE-5756:
---

SUCCESS: Integrated in Hadoop-Mapreduce-trunk #1839 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1839/])
MAPREDUCE-5756. CombineFileInputFormat.getSplits() including directories in its 
results. Contributed by Jason Dere (jlowe: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1612400)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/lib/input/CombineFileInputFormat.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/lib/input/TestCombineFileInputFormat.java


 CombineFileInputFormat.getSplits() including directories in its results
 ---

 Key: MAPREDUCE-5756
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5756
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Jason Dere
Assignee: Jason Dere
 Fix For: 3.0.0, 2.6.0

 Attachments: MAPREDUCE-5756.1.patch, MAPREDUCE-5756.2.patch


 Trying to track down HIVE-6401, where we see some "is not a file" errors 
 because getSplits() is giving us directories.  I believe the culprit is 
 FileInputFormat.listStatus():
 {code}
 if (recursive && stat.isDirectory()) {
   addInputPathRecursively(result, fs, stat.getPath(),
   inputFilter);
 } else {
   result.add(stat);
 }
 {code}
 This seems to allow directories to be added to the results if 
 recursive is false.  Is this meant to return directories? If not, I think it 
 should look like this:
 {code}
 if (stat.isDirectory()) {
  if (recursive) {
   addInputPathRecursively(result, fs, stat.getPath(),
   inputFilter);
  }
 } else {
   result.add(stat);
 }
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5957) AM throws ClassNotFoundException with job classloader enabled if custom output format/committer is used

2014-07-22 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070354#comment-14070354
 ] 

Hudson commented on MAPREDUCE-5957:
---

SUCCESS: Integrated in Hadoop-Mapreduce-trunk #1839 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1839/])
MAPREDUCE-5957. AM throws ClassNotFoundException with job classloader enabled 
if custom output format/committer is used. Contributed by Sangjin Lee (jlowe: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1612358)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/MRAppMaster.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/commit/CommitterEventHandler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/v2/TestMRJobs.java


 AM throws ClassNotFoundException with job classloader enabled if custom 
 output format/committer is used
 ---

 Key: MAPREDUCE-5957
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5957
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 2.4.0
Reporter: Sangjin Lee
Assignee: Sangjin Lee
 Fix For: 3.0.0, 2.6.0

 Attachments: MAPREDUCE-5957.branch-2.patch, MAPREDUCE-5957.patch, 
 MAPREDUCE-5957.patch, MAPREDUCE-5957.patch, MAPREDUCE-5957.patch, 
 MAPREDUCE-5957.patch, MAPREDUCE-5957.patch


 With the job classloader enabled, the MR AM throws ClassNotFoundException if 
 a custom output format class is specified.
 {noformat}
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: 
 java.lang.RuntimeException: java.lang.ClassNotFoundException: Class 
 com.foo.test.TestOutputFormat not found
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.createOutputCommitter(MRAppMaster.java:473)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.serviceInit(MRAppMaster.java:374)
   at 
 org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster$1.run(MRAppMaster.java:1459)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:415)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.initAndStartAppMaster(MRAppMaster.java:1456)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.main(MRAppMaster.java:1389)
 Caused by: java.lang.RuntimeException: java.lang.ClassNotFoundException: 
 Class com.foo.test.TestOutputFormat not found
   at 
 org.apache.hadoop.conf.Configuration.getClass(Configuration.java:1895)
   at 
 org.apache.hadoop.mapreduce.task.JobContextImpl.getOutputFormatClass(JobContextImpl.java:222)
   at 
 org.apache.hadoop.mapreduce.v2.app.MRAppMaster.createOutputCommitter(MRAppMaster.java:469)
   ... 8 more
 Caused by: java.lang.ClassNotFoundException: Class 
 com.foo.test.TestOutputFormat not found
   at 
 org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:1801)
   at 
 org.apache.hadoop.conf.Configuration.getClass(Configuration.java:1893)
   ... 10 more
 {noformat}
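
 A minimal sketch of the general pattern for resolving user classes through a separate 
 job classloader; this is illustrative only (the helper name and the way the job 
 classloader is obtained are assumptions, not the committed MRAppMaster change):
{code:java}
import org.apache.hadoop.conf.Configuration;

public class JobClassLoaderSketch {
  // Illustrative only: Configuration resolves class names through the loader
  // attached to it, so a class that exists only in the job classloader must be
  // made visible to it before getClassByName() is called.
  static Class<?> loadUserClass(Configuration conf, ClassLoader jobClassLoader,
                                String className) throws ClassNotFoundException {
    ClassLoader previous = Thread.currentThread().getContextClassLoader();
    try {
      Thread.currentThread().setContextClassLoader(jobClassLoader);
      conf.setClassLoader(jobClassLoader);    // let getClassByName() see job classes
      return conf.getClassByName(className);  // e.g. "com.foo.test.TestOutputFormat"
    } finally {
      Thread.currentThread().setContextClassLoader(previous);
    }
  }
}
{code}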



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-250) JobTracker should log the scheduling of setup/cleanup task

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-250.


Resolution: Fixed

Fairly confident this has been fixed. Closing as stale.

 JobTracker should log the scheduling of setup/cleanup task
 --

 Key: MAPREDUCE-250
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-250
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Amar Kamat

 Setup/Cleanup is launched under (m+1)^th^ tip or (r+1)^th^ tip. It will be 
 nice if jobtracker logs this info.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-2811) Adding Multiple Reducers implementations.

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-2811:


Description: Like HADOOP-372, we have a multi format Reducer too. Someone 
suggested that if we need different reducers and map implementations(like what 
i need) I was better of by writing 2 jobs. I dont quite agree. I am calculating 
2 big matrices that must be calculated in the map step, summed in the reducers 
multiplied and then written to a file. The First mapper sums a matrix  based on 
the i,j th index(key) into the file and the second mapper adds the N*1  
dimension vector that uses a new line as key. These keys must be passed as such 
to the reduce process.  (was: Like the Patch released here 
https://issues.apache.org/jira/browse/HADOOP-372 can we have a multi format 
Reducer too. Someone suggested that if we need different reducers and map 
implementations(like what i need) I was better of by writing 2 jobs. I dont 
quite agree. I am calculating 2 big matrices that must be calculated in the map 
step, summed in the reducers multiplied and then written to a file. The First 
mapper sums a matrix  based on the i,j th index(key) into the file and the 
second mapper adds the N*1  dimension vector that uses a new line as key. These 
keys must be passed as such to the reduce process.)

 Adding Multiple Reducers implementations.
 -

 Key: MAPREDUCE-2811
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2811
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: Sidharth Gupta

 Like HADOOP-372, we have a multi format Reducer too. Someone suggested that 
 if we need different reducers and map implementations (like what I need) I was 
 better off writing 2 jobs. I don't quite agree. I am calculating 2 big 
 matrices that must be calculated in the map step, summed in the reducers, 
 multiplied and then written to a file. The first mapper sums a matrix based 
 on the i,j-th index (key) into the file and the second mapper adds the N*1 
 dimension vector that uses a new line as key. These keys must be passed as 
 such to the reduce process.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-126) Job history analysis showing wrong job runtime

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-126:
---

Labels: newbie  (was: )

 Job history analysis showing wrong job runtime
 --

 Key: MAPREDUCE-126
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-126
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.1
Reporter: Amar Kamat
  Labels: newbie

 Analysis of completed jobs shows wrong runtime. Here is the faulty code
 {code:title=analysisjobhistory.jsp|borderStyle=solid}
 <b>Finished At : </b>  <%=StringUtils.getFormattedTimeWithDiff(dateFormat, 
 job.getLong(Keys.FINISH_TIME), job.getLong(Keys.LAUNCH_TIME)) %><br/>
 {code}
 I think it should be 
 {code:title=analysisjobhistory.jsp|borderStyle=solid}
 <b>Finished At : </b>  <%=StringUtils.getFormattedTimeWithDiff(dateFormat, 
 job.getLong(Keys.FINISH_TIME), job.getLong(Keys.SUBMIT_TIME)) %><br/>
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-126) Job history analysis showing wrong job runtime

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-126.


Resolution: Incomplete

This code is long gone in 2.x. Closing as stale.

 Job history analysis showing wrong job runtime
 --

 Key: MAPREDUCE-126
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-126
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.1
Reporter: Amar Kamat
  Labels: newbie

 Analysis of completed jobs shows wrong runtime. Here is the faulty code
 {code:title=analysisjobhistory.jsp|borderStyle=solid}
 <b>Finished At : </b>  <%=StringUtils.getFormattedTimeWithDiff(dateFormat, 
 job.getLong(Keys.FINISH_TIME), job.getLong(Keys.LAUNCH_TIME)) %><br/>
 {code}
 I think it should be 
 {code:title=analysisjobhistory.jsp|borderStyle=solid}
 <b>Finished At : </b>  <%=StringUtils.getFormattedTimeWithDiff(dateFormat, 
 job.getLong(Keys.FINISH_TIME), job.getLong(Keys.SUBMIT_TIME)) %><br/>
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-484) Logos for Hive and JobTracker

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-484.


Resolution: Fixed

Stale. Closing.

 Logos for Hive and JobTracker
 -

 Key: MAPREDUCE-484
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-484
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Aaron Newton
Priority: Trivial
 Attachments: hive  job tracker icons (font outlines).ai, hive  job 
 tracker icons (font outlines).pdf, hive  job tracker icons (font 
 outlines).pdf, hive.png, hive.png, jobtracker.png


 Greetings fine Hadoop peoples,
 While working on a few projects here at Cloudera we found ourselves wanting 
 for some sort of icon for both the JobTracker and for Hive. After checking on 
 the project page for Hive (the JobTracker doesn't really have one) and 
 finding that these items have no icons, we rolled up our sleeves and made 
 some. We'd like to contribute these to the project, so if you want 'em, 
 they're all yours. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-700) Too many copies of job-conf with the jobtracker

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-700.


Resolution: Fixed

Lots of changes here already. Closing this as stale.

 Too many copies of job-conf with the jobtracker
 ---

 Key: MAPREDUCE-700
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-700
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobtracker
Reporter: Amar Kamat
Assignee: Amar Kamat

 As of today the jobtracker has job-conf copies in
 # mapred.system.dir : created while job-submission 
 # jobtracker-subdir (created by JobInProgress upon creation)
 # log-dir : created upon job-init
 # history-dir : created upon job-init
 Its difficult to manage these conf files. The problem aggravates under 
 restart.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-173) JobConf should also load resources from hdfs (or other filesystems)

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-173.


Resolution: Fixed

This is almost certainly fixed by now.

 JobConf should also load resources from hdfs (or other filesystems)
 ---

 Key: MAPREDUCE-173
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-173
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Amar Kamat

 {{JobConf conf = new JobConf(path)}} doesn't load the configuration if _path_ 
 points to a resource on hdfs. 
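
 A possible workaround sketch, assuming the configuration file lives on HDFS and the 
 client has hadoop-common on its classpath; the path is illustrative:
{code:java}
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapred.JobConf;

public class HdfsJobConfSketch {
  // Workaround sketch: read the XML through the FileSystem API (which
  // understands hdfs:// URIs) and hand the stream to addResource(), instead of
  // relying on new JobConf(path) to resolve the path itself.
  static JobConf loadFromHdfs(String confPath) throws IOException {
    Configuration base = new Configuration();
    Path path = new Path(confPath);              // e.g. "hdfs://namenode:8020/conf/job.xml"
    FileSystem fs = path.getFileSystem(base);
    JobConf conf = new JobConf(base);
    conf.addResource(fs.open(path));             // stream-based resource loading
    return conf;
  }
}
{code}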



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-322) TaskTracker should run user tasks nicely in the local machine

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-322.


Resolution: Fixed

This has been fixed with both cgroups and task level niceness.

 TaskTracker should run user tasks nicely in the local machine
 -

 Key: MAPREDUCE-322
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-322
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Tsz Wo Nicholas Sze

 If one task tries to use all CPUs on a local machine, all other tasks or 
 processes (including the tasktracker and datanode daemons) may hardly get a chance 
 to run.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-275) Display lost tracker information on the jobtracker webui and persist it across restarts

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-275.


Resolution: Won't Fix

I'm going to close this as Won't Fix.

 Display lost tracker information on the jobtracker webui and persist it 
 across restarts
 ---

 Key: MAPREDUCE-275
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-275
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Amar Kamat
Assignee: Amar Kamat

 As of today it's difficult to distinguish between active trackers and lost 
 trackers (lost trackers are considered active). It would be nice if the 
 jobtracker could display which trackers are lost and maintain this information across 
 restarts. HADOOP-5643 does something similar for decommissioned trackers.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-156) ProcessTree.destroy() is sleeping for 5 seconds holding the task slot

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-156.


Resolution: Won't Fix

This is intentional to potentially give time for the process to clean up. 
Closing as won't fix.

 ProcessTree.destroy() is sleeping for 5 seconds holding the task slot
 -

 Key: MAPREDUCE-156
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-156
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Ravi Gummadi

 Currently, in ProcessTree.destroy(), after sending SIGTERM to the task JVM, 
 TT sleeps for 5 seconds(default value of 
 mapred.tasktracker.tasks.sleeptime-before-sigkill) before sending SIGKILL. 
 This seems to be blocking the task slot(not getting released) for 5 seconds. 
 We should avoid this so that another task could be launched in that slot 
 immediately.
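
 A rough sketch of the non-blocking alternative the report hints at, assuming a 
 SIGTERM/SIGKILL sequence driven from Java; the class and method names are made up 
 for illustration and this is not TaskTracker code:
{code:java}
import java.io.IOException;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

public class DeferredSigkillSketch {
  private static final ScheduledExecutorService KILLER =
      Executors.newSingleThreadScheduledExecutor();

  // Send SIGTERM now, schedule SIGKILL after the grace period on a background
  // thread, and return immediately so the caller does not hold the slot.
  static void destroyProcessGroup(final String pgrpId, long gracePeriodMs) throws IOException {
    Runtime.getRuntime().exec(new String[] {"kill", "-15", "-" + pgrpId});
    KILLER.schedule(new Runnable() {
      public void run() {
        try {
          Runtime.getRuntime().exec(new String[] {"kill", "-9", "-" + pgrpId});
        } catch (IOException ignored) {
          // the process group may already be gone
        }
      }
    }, gracePeriodMs, TimeUnit.MILLISECONDS);
  }
}
{code}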



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-327) Add explicit remote map count JobTracker metrics

2014-07-22 Thread Allen Wittenauer (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070734#comment-14070734
 ] 

Allen Wittenauer commented on MAPREDUCE-327:


In real-world scenarios, we've discovered that task locality as reported by the 
system can effectively be a lie because of CFIF/MFIF. Given 4 input splits, if 
the first is local but the rest are not, the task will still be considered 
local even though 3/4'ths of the data came off rack!

 Add explicit remote map count JobTracker metrics
 

 Key: MAPREDUCE-327
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-327
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Hong Tang
  Labels: newbie

 I am proposing to add a counter REMOTE_MAPS in addition to the following 
 counters: TOTAL_MAPS, DATA_LOCAL_MAPS, RACK_LOCAL_MAPS. A Map Task is 
 considered a remote-map iff the input split returns a set of locations, but 
 none is chosen to execute the map task.
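
 A tiny sketch of the proposed accounting rule, with made-up method names (this is 
 not JobTracker code):
{code:java}
import java.util.Collection;

public class RemoteMapSketch {
  // A map task counts toward REMOTE_MAPS iff its split reported candidate
  // locations but the host the attempt ran on is not among them.
  static boolean isRemoteMap(Collection<String> splitLocations, String runHost) {
    return !splitLocations.isEmpty() && !splitLocations.contains(runHost);
  }
}
{code}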



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-327) Add explicit remote map count JobTracker metrics

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-327:
---

Labels: newbie  (was: )

 Add explicit remote map count JobTracker metrics
 

 Key: MAPREDUCE-327
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-327
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Hong Tang
  Labels: newbie

 I am proposing to add a counter REMOTE_MAPS in addition to the following 
 counters: TOTAL_MAPS, DATA_LOCAL_MAPS, RACK_LOCAL_MAPS. A Map Task is 
 considered a remote-map iff the input split returns a set of locations, but 
 none is chosen to execute the map task.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5990) If output directory can not be created, error message on stdout does not provide any clue.

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-5990:


Component/s: examples

 If output directory can not be created, error message on stdout does not 
 provide any clue.
 --

 Key: MAPREDUCE-5990
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5990
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: examples
Reporter: Suhas Gogate
  Labels: newbie

 In the following wordcount example the output directory path cannot be created 
 because /temp does not exist and the user has no privileges to create the output 
 path at /. 
 hadoop --config ./clustdir/ jar /homes/gogate/wordcount.jar 
 com..wordcount.WordCount /in-path/gogate/myfile /temp/mywc-gogate 
 09/04/28 23:00:32 WARN mapred.JobClient: Use GenericOptionsParser for parsing 
 the arguments. Applications should implement Tool for the same.
 09/04/28 23:00:32 INFO mapred.FileInputFormat: Total input paths to process : 
 1
 09/04/28 23:00:32 INFO mapred.FileInputFormat: Total input paths to process : 
 1
 09/04/28 23:00:33 INFO mapred.JobClient: Running job: job_200904282249_0004
 java.io.IOException: Job failed!
   at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1113)
   at com..wordcount.WordCount.main(WordCount.java:55)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.apache.hadoop.util.RunJar.main(RunJar.java:155)
   at org.apache.hadoop.mapred.JobShell.run(JobShell.java:54)
   at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
   at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
   at org.apache.hadoop.mapred.JobShell.main(JobShell.java:68)
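
 A hedged sketch of a driver-side pre-flight check that would name the unusable path 
 instead of the bare "Job failed!" message; the class and method names are illustrative, 
 not a proposed patch:
{code:java}
import java.io.IOException;

import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapred.JobConf;

public class OutputDirPreflightSketch {
  // Check the output location up front so the user sees which path could not
  // be created instead of only a generic "Job failed!" from JobClient.runJob().
  static void checkOutputDir(JobConf conf, String outputDir) throws IOException {
    Path out = new Path(outputDir);                    // e.g. "/temp/mywc-gogate"
    FileSystem fs = out.getFileSystem(conf);
    Path parent = out.getParent();
    if (parent != null && !fs.exists(parent) && !fs.mkdirs(parent)) {
      throw new IOException("Cannot create parent of output directory: " + parent);
    }
  }
}
{code}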
  



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5963) ShuffleHandler DB schema should be versioned with compatible/incompatible changes

2014-07-22 Thread Jason Lowe (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070737#comment-14070737
 ] 

Jason Lowe commented on MAPREDUCE-5963:
---

+1 lgtm.  Committing this.

 ShuffleHandler DB schema should be versioned with compatible/incompatible 
 changes
 -

 Key: MAPREDUCE-5963
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5963
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Affects Versions: 2.4.1
Reporter: Junping Du
Assignee: Junping Du
 Attachments: MAPREDUCE-5963-v2.1.patch, MAPREDUCE-5963-v2.patch, 
 MAPREDUCE-5963.patch


 ShuffleHandler persists job shuffle info into a DB schema, which should be 
 versioned with compatible/incompatible changes to support rolling upgrade.
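
 A generic sketch of the compatible/incompatible check such a version record usually 
 enables (the major version must match, minor differences are tolerated); the class and 
 method signature are assumptions, not the actual patch:
{code:java}
import java.io.IOException;

public class ShuffleSchemaVersionSketch {
  // Same major version => compatible, load the existing state in place.
  // Different major version => incompatible, refuse to load.
  static void checkVersion(int storedMajor, int storedMinor,
                           int currentMajor, int currentMinor) throws IOException {
    if (storedMajor != currentMajor) {
      throw new IOException("Incompatible shuffle state-store version " + storedMajor + "."
          + storedMinor + ", expecting " + currentMajor + "." + currentMinor);
    }
    // A differing minor version is treated as compatible; the stored version
    // record can simply be rewritten to the current one.
  }
}
{code}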



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Moved] (MAPREDUCE-5990) If output directory can not be created, error message on stdout does not provide any clue.

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer moved HADOOP-5756 to MAPREDUCE-5990:
-

Affects Version/s: (was: 0.18.3)
   Issue Type: Improvement  (was: Bug)
  Key: MAPREDUCE-5990  (was: HADOOP-5756)
  Project: Hadoop Map/Reduce  (was: Hadoop Common)

 If output directory can not be created, error message on stdout does not 
 provide any clue.
 --

 Key: MAPREDUCE-5990
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5990
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: examples
Reporter: Suhas Gogate
  Labels: newbie

 In the following wordcount example the output directory path cannot be created 
 because /temp does not exist and the user has no privileges to create the output 
 path at /. 
 hadoop --config ./clustdir/ jar /homes/gogate/wordcount.jar 
 com..wordcount.WordCount /in-path/gogate/myfile /temp/mywc-gogate 
 09/04/28 23:00:32 WARN mapred.JobClient: Use GenericOptionsParser for parsing 
 the arguments. Applications should implement Tool for the same.
 09/04/28 23:00:32 INFO mapred.FileInputFormat: Total input paths to process : 
 1
 09/04/28 23:00:32 INFO mapred.FileInputFormat: Total input paths to process : 
 1
 09/04/28 23:00:33 INFO mapred.JobClient: Running job: job_200904282249_0004
 java.io.IOException: Job failed!
   at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1113)
   at com..wordcount.WordCount.main(WordCount.java:55)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.apache.hadoop.util.RunJar.main(RunJar.java:155)
   at org.apache.hadoop.mapred.JobShell.run(JobShell.java:54)
   at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
   at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
   at org.apache.hadoop.mapred.JobShell.main(JobShell.java:68)
  



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-197) add new options to mapred job -list-attempt-ids to dump counters and diagnostic messages

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-197:
---

Description: 
It would be very nice when tracking down tasks that have strange values for 
their counters, if there was a command line tool to print out the task attempts 
and their counters and diagnostic messages. I propose adding switches to 
-list-attempt-ids to accomplish that:

{quote}
mapred job -list-attempt-ids [-counters] [-diagnostics] <job> <type> <state>
{quote}

  was:
It would be very nice when tracking down tasks that have strange values for 
their counters, if there was a command line tool to print out the task attempts 
and their counters and diagnostic messages. I propose adding switches to 
-list-attempt-ids to accomplish that:

{quote}
hadoop job -list-attempt-ids [-counters] [-diagnostics] <job> <type> <state>
{quote}


 add new options to mapred job -list-attempt-ids to dump counters and 
 diagnostic messages
 

 Key: MAPREDUCE-197
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-197
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: Owen O'Malley
Assignee: Owen O'Malley
  Labels: newbie

 It would be very nice when tracking down tasks that have strange values for 
 their counters, if there was a command line tool to print out the task 
 attempts and their counters and diagnostic messages. I propose adding 
 switches to -list-attempt-ids to accomplish that:
 {quote}
 mapred job -list-attempt-ids [-counters] [-diagnostics] <job> <type> <state>
 {quote}
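
 Roughly the information the proposed switches would surface, expressed against the 
 existing client API; the job id is illustrative and this is plain client-API code, 
 not the CLI patch itself:
{code:java}
import org.apache.hadoop.mapred.JobClient;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.JobID;
import org.apache.hadoop.mapred.TaskReport;

public class AttemptDumpSketch {
  public static void main(String[] args) throws Exception {
    JobClient client = new JobClient(new JobConf());
    JobID jobId = JobID.forName(args[0]);            // e.g. "job_200906151249_0001"
    for (TaskReport report : client.getMapTaskReports(jobId)) {
      System.out.println(report.getTaskID());
      System.out.println(report.getCounters());      // what a -counters switch might print
      for (String diag : report.getDiagnostics()) {  // what -diagnostics might print
        System.out.println("  " + diag);
      }
    }
  }
}
{code}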



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-197) add new options to mapred job -list-attempt-ids to dump counters and diagnostic messages

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-197:
---

Summary: add new options to mapred job -list-attempt-ids to dump counters 
and diagnostic messages  (was: add new options to hadoop job -list-attempt-ids 
to dump counters and diagnostic messages)

 add new options to mapred job -list-attempt-ids to dump counters and 
 diagnostic messages
 

 Key: MAPREDUCE-197
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-197
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: Owen O'Malley
Assignee: Owen O'Malley
  Labels: newbie

 It would be very nice when tracking down tasks that have strange values for 
 their counters, if there was a command line tool to print out the task 
 attempts and their counters and diagnostic messages. I propose adding 
 switches to -list-attempt-ids to accomplish that:
 {quote}
 hadoop job -list-attempt-ids [-counters] [-diagnostics] <job> <type> <state>
 {quote}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-168) hadoop job -list all should display the code for Killed also.

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-168:
---

Labels: newbie  (was: )

 hadoop job -list all should display the code for Killed also.
 -

 Key: MAPREDUCE-168
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-168
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Hemanth Yamijala
  Labels: newbie

 hadoop job -list all shows a legend for the states: PREP, SUCCEEDED, FAILED 
 and RUNNING. It should also display the state for KILLED (as 5).



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-168) mapred job -list all should display the code for Killed also.

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-168:
---

Description: mapred job -list all shows a legend for the states: PREP, 
SUCCEEDED, FAILED and RUNNING. It should also display the state for KILLED (as 
5).  (was: hadoop job -list all shows a legend for the states: PREP, SUCCEEDED, 
FAILED and RUNNING. It should also display the state for KILLED (as 5).)

 mapred job -list all should display the code for Killed also.
 -

 Key: MAPREDUCE-168
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-168
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Hemanth Yamijala
  Labels: newbie

 mapred job -list all shows a legend for the states: PREP, SUCCEEDED, FAILED 
 and RUNNING. It should also display the state for KILLED (as 5).



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-197) add new options to hadoop job -list-attempt-ids to dump counters and diagnostic messages

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-197:
---

Labels: newbie  (was: )

 add new options to hadoop job -list-attempt-ids to dump counters and 
 diagnostic messages
 

 Key: MAPREDUCE-197
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-197
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: Owen O'Malley
Assignee: Owen O'Malley
  Labels: newbie

 It would be very nice when tracking down tasks that have strange values for 
 their counters, if there was a command line tool to print out the task 
 attempts and their counters and diagnostic messages. I propose adding 
 switches to -list-attempt-ids to accomplish that:
 {quote}
 hadoop job -list-attempt-ids [-counters] [-diagnostics] <job> <type> <state>
 {quote}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-168) mapred job -list all should display the code for Killed also.

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-168:
---

Summary: mapred job -list all should display the code for Killed also.  
(was: hadoop job -list all should display the code for Killed also.)

 mapred job -list all should display the code for Killed also.
 -

 Key: MAPREDUCE-168
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-168
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Hemanth Yamijala
  Labels: newbie

 hadoop job -list all shows a legend for the states: PREP, SUCCEEDED, FAILED 
 and RUNNING. It should also display the state for KILLED (as 5).



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-403) ProcessTree can try and kill a null PID

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-403.


Resolution: Incomplete

I'm going to close this as stale.  If this is still an issue, probably better 
to open a new jira.

 ProcessTree can try and kill a null PID
 -

 Key: MAPREDUCE-403
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-403
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Steve Loughran
Priority: Minor

 Saw this in a test run, while trying to shut down a TaskTracker
 [sf-startdaemon-debug] 09/05/07 16:42:42 [Map-events fetcher for all reduce 
 tasks on tracker_morzine.hpl.hp.com:localhost/127.0.0.1:36239] INFO 
 mapred.TaskTracker : Shutting down: Map-events fetcher for all reduce tasks 
 on tracker_morzine.hpl.hp.com:localhost/127.0.0.1:36239
 [sf-startdaemon-debug] 09/05/07 16:42:42 [TerminatorThread] WARN 
 util.ProcessTree : Error executing shell command 
 org.apache.hadoop.util.Shell$ExitCodeException: ERROR: garbage process ID 
 -null.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-403) ProcessTree can try and kill a null PID

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-403:
---

Summary: ProcessTree can try and kill a null PID  (was: ProcessTree can 
try and kill a null POD)

 ProcessTree can try and kill a null PID
 -

 Key: MAPREDUCE-403
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-403
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Steve Loughran
Priority: Minor

 Saw this in a test run, while trying to shut down a TaskTracker
 [sf-startdaemon-debug] 09/05/07 16:42:42 [Map-events fetcher for all reduce 
 tasks on tracker_morzine.hpl.hp.com:localhost/127.0.0.1:36239] INFO 
 mapred.TaskTracker : Shutting down: Map-events fetcher for all reduce tasks 
 on tracker_morzine.hpl.hp.com:localhost/127.0.0.1:36239
 [sf-startdaemon-debug] 09/05/07 16:42:42 [TerminatorThread] WARN 
 util.ProcessTree : Error executing shell command 
 org.apache.hadoop.util.Shell$ExitCodeException: ERROR: garbage process ID 
 -null.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-526) Sometimes job does not get removed from scheduler queue after it is killed

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-526.


Resolution: Won't Fix

Closing this as won't fix.

 Sometimes job does not get removed from scheduler queue after it is killed
 --

 Key: MAPREDUCE-526
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-526
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Karam Singh

 Sometimes when we kill a job, it does not get removed from the waiting queue, while 
 the job status is Killed with Job Setup and Cleanup: Successful. 
 Also the JobTracker webui shows the job under the failed jobs list, and hadoop job -list 
 all, hadoop queue <queuename> -showJobs also show the job with state=5.
 Prior to killing, the job state was Running.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-635) IllegalArgumentException is thrown if mapred local dir is not writable.

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-635:
---

Labels: newbie  (was: )

 IllegalArgumentException is thrown if mapred local dir is not writable.
 ---

 Key: MAPREDUCE-635
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-635
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: examples
Reporter: Suman Sehgal
Priority: Minor
  Labels: newbie

 If specified mapred local directory doesn't have write permission or is 
 non-existent  then IllegalArgumentException is thrown. Following error 
 message was displayed while running a sleep job with non-writable mapred 
 local directory specified in mapred-site.xml. 
 sleep job command : $hadoop_home/bin/hadoop jar hadoop-examples.jar sleep -m 
 100 -r 10 
 2009-05-12 05:36:46,491 INFO org.apache.hadoop.mapred.TaskInProgress: Error 
 from attempt_200905120525_0001_m_00_0: 
 java.lang.IllegalArgumentException: n must be positive
 at java.util.Random.nextInt(Random.java:250)
 at 
 org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.confChanged(LocalDirAllocator.java:243)
 at 
 org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:289)
 at 
 org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:124)
 at 
 org.apache.hadoop.mapred.MapOutputFile.getSpillFileForWrite(MapOutputFile.java:107)
 at 
 org.apache.hadoop.mapred.MapTask$MapOutputBuffer.sortAndSpill(MapTask.java:1115)
 at 
 org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:1028)
 at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:357)
 at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
 at org.apache.hadoop.mapred.Child.main(Child.java:170)
 This error message (i.e. IllegalArgumentException) somehow doesn't clearly 
 indicate that the problem is with the mapred local directory. The error message should be 
 more specific in this case.
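
 A hedged sketch of a friendlier up-front check that names the offending directory, 
 using the existing DiskChecker utility; the helper is illustrative, not the shipped fix:
{code:java}
import java.io.File;

import org.apache.hadoop.util.DiskChecker;
import org.apache.hadoop.util.DiskChecker.DiskErrorException;

public class LocalDirPreflightSketch {
  // Validate each configured mapred.local.dir entry and report the offending
  // path, rather than letting LocalDirAllocator fail with "n must be positive".
  static void validateLocalDirs(String[] localDirs) throws DiskErrorException {
    for (String dir : localDirs) {
      DiskChecker.checkDir(new File(dir));   // throws, naming the directory, if unusable
    }
  }
}
{code}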



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5985) native-task: Fix build on macosx

2014-07-22 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070757#comment-14070757
 ] 

Todd Lipcon commented on MAPREDUCE-5985:


+1, looks good to me. This also fixed my Ubuntu build issue with the unistd.h 
inclusion.

 native-task: Fix build on macosx
 

 Key: MAPREDUCE-5985
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5985
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: task
Reporter: Binglin Chang
Assignee: Binglin Chang
Priority: Minor
 Attachments: MAPREDUCE-5985.v1.patch






--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5963) ShuffleHandler DB schema should be versioned with compatible/incompatible changes

2014-07-22 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070760#comment-14070760
 ] 

Hudson commented on MAPREDUCE-5963:
---

FAILURE: Integrated in Hadoop-trunk-Commit #5941 (See 
[https://builds.apache.org/job/Hadoop-trunk-Commit/5941/])
MAPREDUCE-5963. ShuffleHandler DB schema should be versioned with 
compatible/incompatible changes. Contributed by Junping Du (jlowe: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1612652)
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle/src/main/java/org/apache/hadoop/mapred/ShuffleHandler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-shuffle/src/test/java/org/apache/hadoop/mapred/TestShuffleHandler.java


 ShuffleHandler DB schema should be versioned with compatible/incompatible 
 changes
 -

 Key: MAPREDUCE-5963
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5963
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Affects Versions: 2.4.1
Reporter: Junping Du
Assignee: Junping Du
 Attachments: MAPREDUCE-5963-v2.1.patch, MAPREDUCE-5963-v2.patch, 
 MAPREDUCE-5963.patch


 ShuffleHandler persists job shuffle info into a DB schema, which should be 
 versioned with compatible/incompatible changes to support rolling upgrade.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-257) Preventing node from swapping

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-257.


Resolution: Fixed

prevent a node from swapping -> don't exhaust memory -> memory limits -> 
fixed

Closing.

 Preventing node from swapping
 -

 Key: MAPREDUCE-257
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-257
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Hong Tang

 When a node swaps, it slows everything: maps running on that node, reducers 
 fetching output from the node, and DFS clients reading from the DN. We should 
 just treat it the same way as if OS exhausts memory and kill some tasks to 
 free up memory.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5963) ShuffleHandler DB schema should be versioned with compatible/incompatible changes

2014-07-22 Thread Jason Lowe (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe updated MAPREDUCE-5963:
--

   Resolution: Fixed
Fix Version/s: 2.6.0
   3.0.0
 Hadoop Flags: Reviewed
   Status: Resolved  (was: Patch Available)

Thanks, Junping!  I committed this to trunk and branch-2.

 ShuffleHandler DB schema should be versioned with compatible/incompatible 
 changes
 -

 Key: MAPREDUCE-5963
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5963
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Affects Versions: 2.4.1
Reporter: Junping Du
Assignee: Junping Du
 Fix For: 3.0.0, 2.6.0

 Attachments: MAPREDUCE-5963-v2.1.patch, MAPREDUCE-5963-v2.patch, 
 MAPREDUCE-5963.patch


 ShuffleHandler persists job shuffle info into a DB schema, which should be 
 versioned with compatible/incompatible changes to support rolling upgrade.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-1283) Support including 3rd party jars supplied in lib/ folder of eclipse project in hadoop jar

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-1283.
-

Resolution: Incomplete

Likely stale.

 Support including 3rd party jars supplied in lib/ folder of eclipse project 
 in hadoop jar
 -

 Key: MAPREDUCE-1283
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1283
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: contrib/eclipse-plugin
 Environment: Any
Reporter: Amit Nithian
Priority: Minor
 Attachments: jarmodule.patch


 Currently, the eclipse plugin only exports the generated class files to the 
 hadoop jar but if there are any 3rd party jars specified in the lib/ folder, 
 they should also get packaged in the jar for submission to the cluster. 
 Currently this has to be done manually which can slow down development. I am 
 working on a patch to the current plugin to support this.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-401) du fails on Ubuntu in TestJobHistory

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-401.


Resolution: Fixed

Likely fixed forever ago.

 du fails on Ubuntu in TestJobHistory
 

 Key: MAPREDUCE-401
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-401
 Project: Hadoop Map/Reduce
  Issue Type: Bug
 Environment: Ubuntu 8.10 x86_64, lots of RAM and HDD spare, clean 
 SVN_HEAD of trunk
Reporter: Steve Loughran
Priority: Minor

 TestJobHistory.testJobHistoryUserLogLocation is failing, and there is an 
 error in the log related to du failing in the mini MR cluster



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-111) JobTracker.getSystemDir throws NPE if it is called during intialization

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-111.


Resolution: Not a Problem

 JobTracker.getSystemDir throws NPE if it is called during intialization
 ---

 Key: MAPREDUCE-111
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-111
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobtracker
Reporter: Amareshwari Sriramadasu

 JobTracker.getSystemDir throws NPE if it is called during intialization.
 It should check if fileSystem is null and throw IllegalStateException, as in 
 getFilesystemName method.
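
 A minimal sketch of the suggested guard, mirroring the getFilesystemName behaviour; 
 the class and field names are illustrative:
{code:java}
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class SystemDirGuardSketch {
  private FileSystem fs;        // assigned during initialization
  private Path systemDir;       // assigned during initialization

  // Fail fast with a meaningful exception instead of an NPE while the
  // JobTracker is still initializing, mirroring getFilesystemName().
  public synchronized String getSystemDir() {
    if (fs == null) {
      throw new IllegalStateException("FileSystem is not ready yet!");
    }
    return fs.makeQualified(systemDir).toString();
  }
}
{code}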



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-5985) native-task: Fix build on macosx

2014-07-22 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon resolved MAPREDUCE-5985.


  Resolution: Fixed
Hadoop Flags: Reviewed

 native-task: Fix build on macosx
 

 Key: MAPREDUCE-5985
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5985
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: task
Reporter: Binglin Chang
Assignee: Binglin Chang
Priority: Minor
 Attachments: MAPREDUCE-5985.v1.patch






--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Comment Edited] (MAPREDUCE-311) JobClient should use multiple volumes as hadoop.tmp.dir

2014-07-22 Thread Allen Wittenauer (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070802#comment-14070802
 ] 

Allen Wittenauer edited comment on MAPREDUCE-311 at 7/22/14 8:11 PM:
-

It's likely too late to change hadoop.tmp.dir.  But this is still an issue.  
Debating opening a new JIRA (under YARN) that states the problem but not a 
solution so that hadoop.tmp.dir is left alone.


was (Author: aw):
It's likely too late to change hadoop.tmp.dir.  But this is still an issue.  
Debating opening a new JIRA that states the problem but not a solution so that 
hadoop.tmp.dir is left alone.

 JobClient should use multiple volumes as hadoop.tmp.dir
 ---

 Key: MAPREDUCE-311
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-311
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
 Environment: All
Reporter: Milind Bhandarkar

 Currently, hadoop.tmp.dir configuration variable allows specification of only 
 a single directory to be used as scratch space. In particular, on the job 
 launcher nodes with multiple volumes, this fails the entire job if the 
 tmp.dir is somehow unusable. When the job launcher nodes have multiple 
 volumes, the tmp space availability can be improved by using multiple volumes 
 (either randomly or in round-robin.) The code for choosing a volume from a 
 comma-separated list of multiple volumes is already there for 
 mapred.local.dir etc. That needs to be used by job client as well.
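
 A small sketch of the requested behaviour, assuming hadoop.tmp.dir were allowed to 
 hold a comma-separated list (it is not today); the names are illustrative:
{code:java}
import java.util.Random;

public class TmpDirPickerSketch {
  private static final Random RANDOM = new Random();

  // Accept a comma-separated list of scratch directories and pick one entry,
  // in the same spirit as the existing mapred.local.dir handling.
  static String pickTmpDir(String commaSeparatedDirs) {
    String[] dirs = commaSeparatedDirs.split(",");
    return dirs[RANDOM.nextInt(dirs.length)].trim();
  }
}
{code}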



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-1775) Streaming should use hadoop.tmp.dir instead of stream.tmpdir

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-1775:


Labels: newbie  (was: )

 Streaming should use hadoop.tmp.dir instead of stream.tmpdir
 

 Key: MAPREDUCE-1775
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1775
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: contrib/streaming
 Environment: All
Reporter: Milind Bhandarkar
Priority: Minor
  Labels: newbie

 Hadoop streaming currently uses stream.tmpdir (on the job-client side) to 
 create jars to be submitted etc. This only adds complexity to site-specific 
 configuration files. Instead, it should use hadoop.tmp.dir configuration 
 variable.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (MAPREDUCE-5991) native-task should not run unit tests if native profile is not enabled

2014-07-22 Thread Todd Lipcon (JIRA)
Todd Lipcon created MAPREDUCE-5991:
--

 Summary: native-task should not run unit tests if native profile 
is not enabled
 Key: MAPREDUCE-5991
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5991
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Todd Lipcon


Currently, running mvn test without the 'native' profile enabled causes all 
of the native-task tests to fail. In order to integrate to trunk, we need to 
fix this - either using JUnit Assume commands in each test that depends on 
native code, or disabling the tests from the pom unless -Pnative is specified
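
A minimal sketch of the JUnit Assume option, assuming NativeCodeLoader is a reasonable
proxy for the native library having been built with -Pnative; the test class is illustrative:
{code:java}
import static org.junit.Assume.assumeTrue;

import org.apache.hadoop.util.NativeCodeLoader;
import org.junit.Before;
import org.junit.Test;

public class TestNativeTaskGuardSketch {
  // If the native library is not loaded (e.g. the build ran without -Pnative),
  // the tests below are reported as skipped instead of failed.
  @Before
  public void requireNativeCode() {
    assumeTrue(NativeCodeLoader.isNativeCodeLoaded());
  }

  @Test
  public void testSomethingNative() {
    // placeholder; real native-task assertions would go here
  }
}
{code}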



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-311) JobClient should use multiple volumes as hadoop.tmp.dir

2014-07-22 Thread Allen Wittenauer (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070802#comment-14070802
 ] 

Allen Wittenauer commented on MAPREDUCE-311:


It's likely too late to change hadoop.tmp.dir.  But this is still an issue.  
Debating opening a new JIRA that states the problem but not a solution so that 
hadoop.tmp.dir is left alone.

 JobClient should use multiple volumes as hadoop.tmp.dir
 ---

 Key: MAPREDUCE-311
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-311
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
 Environment: All
Reporter: Milind Bhandarkar

 Currently, hadoop.tmp.dir configuration variable allows specification of only 
 a single directory to be used as scratch space. In particular, on the job 
 launcher nodes with multiple volumes, this fails the entire job if the 
 tmp.dir is somehow unusable. When the job launcher nodes have multiple 
 volumes, the tmp space availability can be improved by using multiple volumes 
 (either randomly or in round-robin.) The code for choosing a volume from a 
 comma-separated list of multiple volumes is already there for 
 mapred.local.dir etc. That needs to be used by job client as well.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (MAPREDUCE-5992) native-task test logs should not write to console

2014-07-22 Thread Todd Lipcon (JIRA)
Todd Lipcon created MAPREDUCE-5992:
--

 Summary: native-task test logs should not write to console
 Key: MAPREDUCE-5992
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5992
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Todd Lipcon
Assignee: Todd Lipcon


Most of our unit tests are configured with a log4j.properties test resource so 
they don't spout a bunch of output to the console. We need to do the same for 
native-task.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-537) Instrument events in the capacity scheduler for collecting metrics information

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-537.


Resolution: Incomplete

I'm going to close this as stale.

 Instrument events in the capacity scheduler for collecting metrics information
 --

 Key: MAPREDUCE-537
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-537
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Hemanth Yamijala
 Attachments: metrics_implementation_With_time_window.patch


 We need to instrument various events in the capacity scheduler so that we can 
 collect metrics about them. This data will help us determine improvements to 
 scheduling strategies itself.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-528) NPE in jobqueue_details.jsp page if scheduler has not started

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-528.


Resolution: Won't Fix

 NPE in jobqueue_details.jsp page if scheduler has not started
 -

 Key: MAPREDUCE-528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Ramya Sunil
Priority: Minor
 Attachments: screenshot-1.jpg


 NullPointerException is observed in jobqueue_details.jsp page if the 
 scheduler has not yet started



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (MAPREDUCE-5993) native-task: simplify/remove dead code

2014-07-22 Thread Todd Lipcon (JIRA)
Todd Lipcon created MAPREDUCE-5993:
--

 Summary: native-task: simplify/remove dead code
 Key: MAPREDUCE-5993
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5993
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Todd Lipcon


The native task code has a bunch of code in it which isn't related to the map 
output collector. I suspect much of this is dead code. Let's remove it before 
we merge, so that the amount of code we have to maintain going forward is more 
limited.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-2166) map.input.file is not set

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-2166.
-

Resolution: Not a Problem

 map.input.file is not set
 ---

 Key: MAPREDUCE-2166
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2166
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: task
Reporter: Rares Vernica
Priority: Minor

 Hadoop does not set the map.input.file variable. I tried the following and 
 all I get is null.
 public class Map extends Mapper<Object, Text, LongWritable, Text> {
     public void map(Object key, Text value, Context context)
             throws IOException, InterruptedException {
         Configuration conf = context.getConfiguration();
         System.out.println(conf.get("map.input.file"));
     }
     protected void setup(Context context) throws IOException,
             InterruptedException {
         Configuration conf = context.getConfiguration();
         System.out.println(conf.get("map.input.file"));
     }
 }
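
 With the new (org.apache.hadoop.mapreduce) API, the usual way to get the current input 
 file is from the task's InputSplit rather than a configuration property; a small sketch, 
 with an illustrative class name:
{code:java}
import java.io.IOException;

import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileSplit;

public class InputFileAwareMapper extends Mapper<Object, Text, LongWritable, Text> {
  // With the new API the current input file is usually taken from the task's
  // InputSplit rather than the map.input.file property, which (as this report
  // shows) is not reliably set.
  @Override
  protected void setup(Context context) throws IOException, InterruptedException {
    if (context.getInputSplit() instanceof FileSplit) {
      System.out.println(((FileSplit) context.getInputSplit()).getPath());
    }
  }
}
{code}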



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-301) mapred.child.classpath.extension property

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-301.


Resolution: Fixed

Already fixed via other means.

 mapred.child.classpath.extension property
 -

 Key: MAPREDUCE-301
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-301
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Klaas Bosteels

 It would be useful to be able to extend the classpath for the task processes 
 on a job per job basis via a {{mapred.child.classpath.extension}} property.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-352) Avoid creating JobInProgress objects before Access checks and Queues checks are done in JobTracker submitJob

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-352.


Resolution: Incomplete

Stale with YAWN.  I mean YARN.

 Avoid creating JobInProgress objects before Access checks and Queues checks 
 are done in JobTracker submitJob 
 -

 Key: MAPREDUCE-352
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-352
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: rahul k singh

 In JobTracker.submitJob, a JobInProgress instance gets created; after this, 
 checks are done for access and queue state. If the checks fail, there 
 isn't any use for these JIP objects, so in the event of failure the only reason 
 these objects were created was to get conf data before being deleted.
 We need to fetch only the information required to do the checks instead of 
 creating a JobInProgress object.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5992) native-task test logs should not write to console

2014-07-22 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070843#comment-14070843
 ] 

Todd Lipcon commented on MAPREDUCE-5992:


I realized it's not a log4j issue at all. The native code logs directly to 
stderr without going through log4j. We should see if we can tie it into log4j 
via JNI.

 native-task test logs should not write to console
 -

 Key: MAPREDUCE-5992
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5992
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: task
Reporter: Todd Lipcon
Assignee: Todd Lipcon

 Most of our unit tests are configured with a log4j.properties test resource 
 so they don't spout a bunch of output to the console. We need to do the same 
 for native-task.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-562) A single slow (but not dead) map TaskTracker impedes MapReduce progress

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-562.


Resolution: Incomplete

This is still an interesting issue, but at this point, I feel the need to close 
this one.  The big reason being that this problem needs to be generalized for 
YARN and made much less MR specific.


 A single slow (but not dead) map TaskTracker impedes MapReduce progress
 ---

 Key: MAPREDUCE-562
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-562
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Aaron Kimball

 We see cases where there may be a large number of mapper nodes running many 
 tasks (e.g., a thousand). The reducers will pull 980 of the map task 
 intermediate files down, but will be unable to retrieve the final 
 intermediate shards from the last node. The TaskTracker on that node returns 
 data to reducers either slowly or not at all, but its heartbeat messages make 
 it back to the JobTracker -- so the JobTracker doesn't mark the tasks as 
 failed. Manually stopping the offending TaskTracker works to migrate the 
 tasks to other nodes, where the shuffling process finishes very quickly. Left 
 on its own, it can take hours to unjam itself otherwise.
 We need a mechanism for reducers to provide feedback to the JobTracker that 
 one of the mapper nodes should be regarded as lost.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (MAPREDUCE-5994) native-task: TestBytesUtil fails

2014-07-22 Thread Todd Lipcon (JIRA)
Todd Lipcon created MAPREDUCE-5994:
--

 Summary: native-task: TestBytesUtil fails
 Key: MAPREDUCE-5994
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5994
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Todd Lipcon


This class appears to have some bugs. Two tests fail consistently on my system. 
BytesUtil itself appears to duplicate a lot of code from guava - we should 
probably just use the Guava functions.
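
A small example of the Guava primitives helpers the comment refers to; whether they can
replace BytesUtil directly depends on the byte order the native side expects, which is an
assumption to verify:
{code:java}
import com.google.common.primitives.Ints;
import com.google.common.primitives.Longs;

public class GuavaBytesSketch {
  // Guava's primitive helpers cover the common int/long <-> byte[] conversions;
  // note they are big-endian, which needs to match what the native side expects.
  public static void main(String[] args) {
    byte[] intBytes = Ints.toByteArray(42);        // 4-byte big-endian encoding
    byte[] longBytes = Longs.toByteArray(42L);     // 8-byte big-endian encoding
    System.out.println(Ints.fromByteArray(intBytes));
    System.out.println(Longs.fromByteArray(longBytes));
  }
}
{code}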



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-385) pipes does not allow jobconf values containing commas

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-385.


Resolution: Won't Fix

-jobconf be dead, yo.

 pipes does not allow jobconf values containing commas
 -

 Key: MAPREDUCE-385
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-385
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Christian Kunz
Assignee: Christian Kunz
 Attachments: patch.HADOOP-6006, patch.HADOOP-6006.0.18


 Currently hadoop pipes does not allow a
 -jobconf key=value,key=value...
 commandline parameter with one or more commas in one of the values of the 
 key-value pairs.
 One use case is key=mapred.join.expr, where the value is required to have 
 commas.
 And it is not always convenient to add this to a configuration file.
 Submitter.java could easily be changed to check for backslash in front of a 
 comma before using it as a delimiter.
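
A hedged sketch of the kind of change being suggested; the helper and regex below are illustrative, not the actual Submitter.java code. The idea is to split only on commas that are not preceded by a backslash, then drop the escape:

{code}
import java.util.ArrayList;
import java.util.List;

public class JobconfSplitSketch {
  // Split "key=value" pairs on commas, unless the comma is escaped as "\,".
  static List<String> splitUnescaped(String jobconf) {
    List<String> pairs = new ArrayList<>();
    for (String part : jobconf.split("(?<!\\\\),")) {
      pairs.add(part.replace("\\,", ","));  // remove the escape character
    }
    return pairs;
  }

  public static void main(String[] args) {
    // A mapred.join.expr value containing commas, escaped on the command line.
    System.out.println(splitUnescaped("a=1,mapred.join.expr=inner(x\\,y),b=2"));
    // prints [a=1, mapred.join.expr=inner(x,y), b=2]
  }
}
{code}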



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-3) Set mapred.child.ulimit automatically to the value of the RAM limits for a job, if they are set

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-3.
--

Resolution: Fixed

Who sets ulimit anymore? No one. Why? cgroups and /proc-based memory limits. 
Closing as stale.

 Set mapred.child.ulimit automatically to the value of the RAM limits for a 
 job, if they are set
 ---

 Key: MAPREDUCE-3
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Hemanth Yamijala

 Memory based monitoring and scheduling allow users to set memory limits for 
 the tasks of their jobs. This parameter is the total memory taken by the 
 task, and any children it may launch (e.g. in the case of streaming). A 
 related parameter is mapred.child.ulimit which is a hard limit on the memory 
 used by a single process of the entire task tree. For user convenience, it 
 would be sensible for the system to set the ulimit to at least the memory 
 required by the task, if the user has specified the latter.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-171) TestJobTrackerRestartWithLostTracker sometimes fails while validating history.

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-171.


Resolution: Fixed

I'm closing this as stale at this point.

 TestJobTrackerRestartWithLostTracker sometimes fails while validating history.
 --

 Key: MAPREDUCE-171
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-171
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobtracker, test
Reporter: Amareshwari Sriramadasu
 Attachments: 
 TEST-org.apache.hadoop.mapred.TestJobTrackerRestartWithLostTracker.txt


 TestJobTrackerRestartWithLostTracker fails with following error
 Duplicate START_TIME seen for task task_200906151249_0001_m_01 in history 
 file at line 54
 junit.framework.AssertionFailedError: Duplicate START_TIME seen for task 
 task_200906151249_0001_m_01 in history file at line 54
   at 
 org.apache.hadoop.mapred.TestJobHistory$TestListener.handle(TestJobHistory.java:161)
   at org.apache.hadoop.mapred.JobHistory.parseLine(JobHistory.java:335)
   at 
 org.apache.hadoop.mapred.JobHistory.parseHistoryFromFS(JobHistory.java:299)
   at 
 org.apache.hadoop.mapred.TestJobHistory.validateJobHistoryFileFormat(TestJobHistory.java:478)
   at 
 org.apache.hadoop.mapred.TestJobTrackerRestartWithLostTracker.testRecoveryWithLostTracker(TestJobTrackerRestartWithLostTracker.java:116)
   at 
 org.apache.hadoop.mapred.TestJobTrackerRestartWithLostTracker.testRestartWithLostTracker(TestJobTrackerRestartWithLostTracker.java:162)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-103) The TaskTracker's shell environment should not be passed to the children.

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-103.


Resolution: Fixed

task-controller/container-executor should have fixed this. Closing.

 The TaskTracker's shell environment should not be passed to the children.
 -

 Key: MAPREDUCE-103
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-103
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: security
Reporter: Owen O'Malley

 HADOOP-2838 and HADOOP-5981 added support to make the TaskTracker's shell 
 environment available to the tasks. This has two problems:
   1. It makes the task tracker's environment part of the interface to the 
 task, which is fairly brittle.
   2. Security code typically only passes along whitelisted environment 
 variables instead of everything to prevent accidental leakage from the 
 administrator's account.
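
A hedged sketch of the whitelisting approach described in point 2 (the whitelist contents are assumptions; a real task launcher would build the child environment itself rather than shell out to env):

{code}
import java.util.Arrays;
import java.util.List;
import java.util.Map;

public class EnvWhitelistSketch {
  // Only these parent variables are forwarded to the child task; everything
  // else inherited from the TaskTracker's shell environment is dropped.
  private static final List<String> WHITELIST =
      Arrays.asList("PATH", "JAVA_HOME", "HADOOP_HOME", "LANG", "TZ");

  public static void main(String[] args) throws Exception {
    ProcessBuilder pb = new ProcessBuilder("env");
    Map<String, String> childEnv = pb.environment();
    childEnv.keySet().retainAll(WHITELIST);  // strip non-whitelisted variables
    pb.inheritIO().start().waitFor();        // child now sees only the whitelist
  }
}
{code}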



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-459) Elegant decommission of lightly loaded tasktrackers from a map-reduce cluster

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-459.


Resolution: Incomplete

Me too. Closing.

 Elegant decommission of lightly loaded tasktrackers from a map-reduce cluster
 

 Key: MAPREDUCE-459
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-459
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: dhruba borthakur
Assignee: Namit Jain

 There is a need to elegantly move some machines from one map-reduce cluster 
 to another. This JIRA is to discuss how to find lightly loaded tasktrackers 
 that are candidates for decommissioning and then to elegantly decommission 
 them by waiting for existing tasks to finish.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-442) Ability to re-configure hadoop daemons online

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-442.


Resolution: Duplicate

I'm going to dupe this to HADOOP-7001, since it's closer to reality.  Other 
jiras tend to point to it as well.

 Ability to re-configure hadoop daemons online
 -

 Key: MAPREDUCE-442
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-442
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: Amar Kamat

 Example : 
 Like we have _bin hadoop mradmin -refreshNodes_, we should also have _bin 
 hadoop mradmin -reconfigure_, which re-configures MR while the cluster is 
 online. A few parameters, like job-expiry-interval, can be changed in this 
 way without having to restart the whole cluster. 
 The master, once reconfigured, can ask the slaves to reconfigure (reload their 
 config) from a well-defined location on HDFS or via heartbeat. 
 We can have some whitelisted configs that carry a _reloadable_ property. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-388) pipes combiner has a large memory footprint

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-388.


Resolution: Incomplete

Closing this as stale.

 pipes combiner has a large memory footprint
 ---

 Key: MAPREDUCE-388
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-388
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Christian Kunz

 Pipes combiner implementation can have a huge memory overhead compared to the 
 spill size. How much depends on the record size. E.g., an application asks 
 for 2GB memory when io.sort.mb=500, key is 16 bytes, and value is 4 bytes.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-441) TestMapReduceJobControl.testJobControlWithKillJob timedout in of the hudson runs

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-441.


Resolution: Incomplete

Almost certainly stale.

 TestMapReduceJobControl.testJobControlWithKillJob timedout in of the hudson 
 runs
 

 Key: MAPREDUCE-441
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-441
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: test
Affects Versions: 0.21.0
Reporter: Amareshwari Sriramadasu

 TestMapReduceJobControl.testJobControlWithKillJob timedout in of the hudson 
 runs @
 http://hudson.zones.apache.org/hudson/job/Hadoop-Patch-vesta.apache.org/530/testReport/org.apache.hadoop.mapreduce.lib.jobcontrol/TestMapReduceJobControl/testJobControlWithKillJob/



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-653) distcp can support bandwidth limiting

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-653.


Resolution: Won't Fix

DistCp v2 does this now. Closing as won't fix.

 distcp can support bandwidth limiting
 -

 Key: MAPREDUCE-653
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-653
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: distcp
Reporter: Ravi Gummadi
Assignee: Ravi Gummadi
 Attachments: d_bw.patch, d_bw.v1.patch, d_bw.v2.patch


 distcp should support an option for the user to specify the bandwidth limit for 
 the distcp job.
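
For reference, a hedged usage sketch of the DistCp v2 option referred to above (the value is illustrative; check the distcp help output for the exact flag on your version):

{code}
# Limit each map to roughly 10 MB/s while copying
hadoop distcp -bandwidth 10 hdfs://nn1/src hdfs://nn2/dst
{code}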



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-533) Support task preemption in Capacity Scheduler

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-533.


Resolution: Duplicate

 Support task preemption in Capacity Scheduler
 -

 Key: MAPREDUCE-533
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-533
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: capacity-sched
Reporter: Tsz Wo Nicholas Sze

 Without preemption, it is not possible to guarantee capacity since long 
 running jobs may occupy task slots for an arbitrarily long time.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-563) Security features for Map/Reduce

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-563.


Resolution: Fixed

All your jobs are belong to someone who has a Kerberos principal.

 Security features for Map/Reduce
 

 Key: MAPREDUCE-563
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-563
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: Owen O'Malley

 This is a top-level tracking JIRA for security work we are doing in 
 Map/reduce. Please add reference to this when opening new security related 
 JIRAs.
  
 Logically a subpiece of HADOOP-4487.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-564) Provide a way for the client to get the number of currently running maps/reduces

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-564.


Resolution: Incomplete

Probably stale.  See also comments about new API.

 Provide a way for the client to get the number of currently running 
 maps/reduces
 

 Key: MAPREDUCE-564
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-564
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: jobtracker
Affects Versions: 0.21.0
Reporter: Ravi Gummadi
Assignee: Ravi Gummadi
 Attachments: MR-564.patch, MR-564.v1.patch, MR-564.v2.patch, 
 MR-564.v3.patch, MR-564.v4.1.patch, MR-564.v4.2.patch, MR-564.v4.patch


 Add counters for Number of Succeeded Maps and Number of Succeeded Reduces so 
 that the client can get these numbers without iterating through all the task 
 reports while the job is in progress.
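
A hedged sketch of how a client reads job-level counters today; the succeeded-task counters this JIRA asks for are not assumed to exist, so the example only uses JobCounter values that are already part of the new API:

{code}
import org.apache.hadoop.mapreduce.Counters;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.JobCounter;

public class JobCountersSketch {
  // Assumes 'job' is an already-submitted org.apache.hadoop.mapreduce.Job.
  static void printMapCounters(Job job) throws Exception {
    Counters counters = job.getCounters();
    long launched = counters.findCounter(JobCounter.TOTAL_LAUNCHED_MAPS).getValue();
    long failed = counters.findCounter(JobCounter.NUM_FAILED_MAPS).getValue();
    System.out.println("launched maps = " + launched + ", failed maps = " + failed);
  }
}
{code}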



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAPREDUCE-736) Undefined variable is treated as string.

2014-07-22 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-736.


Resolution: Incomplete

Stale?

 Undefined variable is treated as string.
 

 Key: MAPREDUCE-736
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-736
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Suman Sehgal
Priority: Minor
 Attachments: hadoop_env.txt


 This issue is related to HADOOP-2838.
 For X=$X:Y (append Y to X, where X should be taken from the tasktracker): if 
 we append to an undefined variable, then the value of the undefined variable 
 should be treated as blank. 
 e.g. NEW_PATH=$NEW_PATH2:/tmp should be displayed as 
 :/tmp in the child's environment, 
 but the variable is instead displayed as the literal string ($NEW_PATH2:/tmp) in the 
 environment.
  This happens only with the default task-controller. The same scenario 
 works fine with the linux task-controller.
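
A hedged sketch (illustrative only, not the task-controller code) of the intended expansion: an undefined parent variable should expand to the empty string rather than survive as a literal $NAME token:

{code}
import java.util.Collections;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class EnvExpandSketch {
  // Replace every $NAME with its value from the parent environment, or "" if undefined.
  static String expand(String value, Map<String, String> parentEnv) {
    Matcher m = Pattern.compile("\\$(\\w+)").matcher(value);
    StringBuffer sb = new StringBuffer();
    while (m.find()) {
      String replacement = parentEnv.getOrDefault(m.group(1), "");
      m.appendReplacement(sb, Matcher.quoteReplacement(replacement));
    }
    m.appendTail(sb);
    return sb.toString();
  }

  public static void main(String[] args) {
    // NEW_PATH2 is undefined, so the expected output is ":/tmp".
    System.out.println(expand("$NEW_PATH2:/tmp", Collections.<String, String>emptyMap()));
  }
}
{code}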



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-1767) Streaming infrastructure should provide statistics about jobs

2014-07-22 Thread Antonio Piccolboni (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071061#comment-14071061
 ] 

Antonio Piccolboni commented on MAPREDUCE-1767:
---

Such as?

 Streaming infrastructure should provide statistics about jobs
 ---

 Key: MAPREDUCE-1767
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1767
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: contrib/streaming
Reporter: arkady borkovsky

 This should include
 -- the commands (mapper and reducer commands) executed
 -- time information (e.g. min, max, and avg start time, end time, elapsed 
 time for tasks, total elapsed time )
 -- sizes -- bytes and records, min, max, avg per task and total, input and 
 output
 -- information about input and output data sets (all output data sets, if 
 there are several)
 -- all user counters (when they are implemented for streaming)
 the information should be stored in a file -- e.g. in the working directory 
 from where the job was launched, with a name derived from the job name



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5979) MR1 FairScheduler zero weight can cause sort failures

2014-07-22 Thread Karthik Kambatla (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071082#comment-14071082
 ] 

Karthik Kambatla commented on MAPREDUCE-5979:
-

+1

 MR1 FairScheduler zero weight can cause sort failures
 -

 Key: MAPREDUCE-5979
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5979
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: scheduler
Affects Versions: 1.2.1
Reporter: Anubhav Dhoot
Assignee: Anubhav Dhoot
 Attachments: MAPREDUCE-5979.001.patch, MAPREDUCE-5979.002.patch


 When the weight is set to zero (which is possible with a custom weight 
 adjuster) we can get failures in comparing schedulables.
 This is because calculating the running-tasks-to-weight ratio can result 
 in 0.0/0.0, which ends up as NaN. Comparisons with NaN are undefined, such 
 that (int) Math.signum(NaN - anyNumber) will be 0, causing different criteria 
 to be used in the comparison, which may not be consistent. This results in 
 {{IllegalArgumentException: Comparison method violates its general contract!}}
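
A hedged, self-contained illustration of the failure mode (not FairScheduler code):

{code}
public class NanSignumSketch {
  public static void main(String[] args) {
    double ratio = 0.0 / 0.0;  // runningTasks / weight when the weight is 0.0
    System.out.println(ratio);                            // NaN
    System.out.println((int) Math.signum(ratio - 1.0));   // 0, i.e. "equal"
    System.out.println((int) Math.signum(ratio - 99.0));  // 0 again, so the ordering
                                                          // is not a consistent total order
  }
}
{code}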
  



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5979) FairScheduler: zero weight can cause sort failures

2014-07-22 Thread Karthik Kambatla (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karthik Kambatla updated MAPREDUCE-5979:


Summary: FairScheduler: zero weight can cause sort failures  (was: MR1 
FairScheduler zero weight can cause sort failures)

 FairScheduler: zero weight can cause sort failures
 --

 Key: MAPREDUCE-5979
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5979
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: scheduler
Affects Versions: 1.2.1
Reporter: Anubhav Dhoot
Assignee: Anubhav Dhoot
 Attachments: MAPREDUCE-5979.001.patch, MAPREDUCE-5979.002.patch


 When the weight is set to zero (which is possible with a custom weight 
 adjuster) we can get failures in comparing schedulables.
 This is because calculating the running-tasks-to-weight ratio can result 
 in 0.0/0.0, which ends up as NaN. Comparisons with NaN are undefined, such 
 that (int) Math.signum(NaN - anyNumber) will be 0, causing different criteria 
 to be used in the comparison, which may not be consistent. This results in 
 {{IllegalArgumentException: Comparison method violates its general contract!}}
  



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5979) FairScheduler: zero weight can cause sort failures

2014-07-22 Thread Karthik Kambatla (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karthik Kambatla updated MAPREDUCE-5979:


   Resolution: Fixed
Fix Version/s: 1.3.0
 Hadoop Flags: Reviewed
   Status: Resolved  (was: Patch Available)

Thanks Anubhav. Just committed this to branch-1. 

 FairScheduler: zero weight can cause sort failures
 --

 Key: MAPREDUCE-5979
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5979
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: scheduler
Affects Versions: 1.2.1
Reporter: Anubhav Dhoot
Assignee: Anubhav Dhoot
 Fix For: 1.3.0

 Attachments: MAPREDUCE-5979.001.patch, MAPREDUCE-5979.002.patch


 When the weight is set to zero (which is possible with a custom weight 
 adjuster) we can get failures in comparing schedulables.
 This is because calculating the running-tasks-to-weight ratio can result 
 in 0.0/0.0, which ends up as NaN. Comparisons with NaN are undefined, such 
 that (int) Math.signum(NaN - anyNumber) will be 0, causing different criteria 
 to be used in the comparison, which may not be consistent. This results in 
 {{IllegalArgumentException: Comparison method violates its general contract!}}
  



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5966) MR1 FairScheduler use of custom weight adjuster is not thread safe for comparisons

2014-07-22 Thread Karthik Kambatla (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071106#comment-14071106
 ] 

Karthik Kambatla commented on MAPREDUCE-5966:
-

Looks like this patch doesn't apply anymore, possibly due to MAPREDUCE-5979. Can 
you please update it? 

Also, I have the following minor comments:
# Reword the following comment to say "Update demands and weights of jobs and 
pools":
{code}
+  // Update demands of jobs and pools and update weights
{code}
# In the test case, I don't think Math.max is required anymore.
{code}
+
+// Until MAPREDUCE-5966 gets fixed we cannot have zero weight set
+return Math.max(curWeight * random, 0.001);
{code}
# We should be able to fit the following in two lines, with {{throws}} on the line 
after the method name:
{code}
+  public void testJobSchedulableSortingWithCustomWeightAdjuster() throws
+  IOException,
+  InterruptedException {
{code}
# Can we make all these variables final and give them capitalized, constant-style 
names? Also, I don't see the need for numRacks and numNodesPerRack.
{code}
+final int iterations = 100;
+int jobCount = 100;
+int numRacks = 100;
+int numNodesPerRack = 2;
+final int totalTaskTrackers = numNodesPerRack * numRacks;
{code}
# We should probably use plain camel case for this variable - {{randomTtid}}

 MR1 FairScheduler use of custom weight adjuster is not thread safe for 
 comparisons
 --

 Key: MAPREDUCE-5966
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5966
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: scheduler
Affects Versions: 1.2.1
Reporter: Anubhav Dhoot
Assignee: Anubhav Dhoot
 Attachments: MAPREDUCE-5966.001.patch


 When comparing JobSchedulables one of the factors is the weight. If someone 
 uses a custom weight adjuster, that may be called multiple times during a 
 sort causing different values to return. That causes a failure in sorting 
 because the weight may change during the sort.
 This reproduces as 
 {code}
 java.io.IOException: java.lang.IllegalArgumentException: Comparison method 
 violates its general contract!
 at java.util.TimSort.mergeHi(TimSort.java:868)
 at java.util.TimSort.mergeAt(TimSort.java:485)
 at java.util.TimSort.mergeCollapse(TimSort.java:410)
 at java.util.TimSort.sort(TimSort.java:214)
 at java.util.TimSort.sort(TimSort.java:173)
 at java.util.Arrays.sort(Arrays.java:659)
 at java.util.Collections.sort(Collections.java:217)
 at 
 org.apache.hadoop.mapred.PoolSchedulable.assignTask(PoolSchedulable.java:163)
 at org.apache.hadoop.mapred.FairScheduler.assignTasks(FairScheduler.java:499)
 at org.apache.hadoop.mapred.JobTracker.heartbeat(JobTracker.java:2961)
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5974) Allow map output collector fallback

2014-07-22 Thread Chris Douglas (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071174#comment-14071174
 ] 

Chris Douglas commented on MAPREDUCE-5974:
--

bq. Doing fallback as the records are emitted would be pretty neat, but may 
also be somewhat difficult. [snip]

*nod* Fair enough, though if each MapTask is making independent decisions about 
the collector, they still need to agree on the format for the shuffle. Spilling 
one collector to disk and changing strategies should be compatible, assuming 
there isn't a different format for intermediate spills. But yeah, this is very 
abstract, given the use cases we have.

If the goal is to support a fallback collector when native libs aren't 
available then, given the dependency on the intermediate format, should the swap be 
internal to the native collector, even in init? If the interface were like the 
serialization, then one might use the key type, etc. to pick the 
most-appropriate collector. As for failover, I'm struggling to come up with a case 
that's not covered by making this an internal detail of the native collector.

 Allow map output collector fallback
 ---

 Key: MAPREDUCE-5974
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5974
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: task
Affects Versions: 2.6.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
 Attachments: mapreduce-5974.txt


 Currently we only allow specifying a single MapOutputCollector implementation 
 class in a job. It would be nice to allow a comma-separated list of classes: 
 we should try each collector implementation in the user-specified order until 
 we find one that can be successfully instantiated and initted.
 This is useful for cases where a particular optimized collector 
 implementation cannot operate on all key/value types, or requires native 
 code. The cluster administrator can configure the cluster to try to use the 
 optimized collector and fall back to the default collector.
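
A hedged sketch of the fallback loop described above (the config key, default class name, and error handling are illustrative, not the attached patch):

{code}
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.util.ReflectionUtils;

public class CollectorFallbackSketch {
  // Try each configured collector class in order; keep the first one that can
  // be instantiated. A real implementation would also call init() and treat an
  // init failure the same way as an instantiation failure.
  static Object createCollector(Configuration conf) {
    String[] classNames = conf.getStrings(
        "mapreduce.job.map.output.collector.class",  // hypothetical: allow a comma-separated list here
        "org.apache.hadoop.mapred.MapTask$MapOutputBuffer");
    for (String name : classNames) {
      try {
        Class<?> clazz = conf.getClassByName(name);
        return ReflectionUtils.newInstance(clazz, conf);
      } catch (Exception e) {
        // log and fall through to the next candidate
      }
    }
    throw new RuntimeException("No usable map output collector could be created");
  }
}
{code}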



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (MAPREDUCE-5995) native-task: revert changes which expose Text internals

2014-07-22 Thread Todd Lipcon (JIRA)
Todd Lipcon created MAPREDUCE-5995:
--

 Summary: native-task: revert changes which expose Text internals
 Key: MAPREDUCE-5995
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5995
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor


The current branch has some changes to the Text writable which allow it to 
manually set the backing array, capacity, etc. Rather than exposing these 
internals, we should use the newly-committed facility from HADOOP-10855 to 
implement this.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Assigned] (MAPREDUCE-5994) native-task: TestBytesUtil fails

2014-07-22 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon reassigned MAPREDUCE-5994:
--

Assignee: Todd Lipcon

Working on removing the redundant functions here

 native-task: TestBytesUtil fails
 

 Key: MAPREDUCE-5994
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5994
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: task
Reporter: Todd Lipcon
Assignee: Todd Lipcon

 This class appears to have some bugs. Two tests fail consistently on my 
 system. BytesUtil itself appears to duplicate a lot of code from guava - we 
 should probably just use the Guava functions.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (MAPREDUCE-5996) native-task: Rename system tests into standard directory layout

2014-07-22 Thread Todd Lipcon (JIRA)
Todd Lipcon created MAPREDUCE-5996:
--

 Summary: native-task: Rename system tests into standard directory 
layout
 Key: MAPREDUCE-5996
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5996
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Todd Lipcon
Assignee: Todd Lipcon


Currently there are a number of tests in src/java/system. This confuses IDEs 
which think that the package should then be system.org.apache.hadoop instead of 
just org.apache.hadoop.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5996) native-task: Rename system tests into standard directory layout

2014-07-22 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071224#comment-14071224
 ] 

Todd Lipcon commented on MAPREDUCE-5996:


There's also a random file called testGlibcBugSpill.out which appears to be 
unused by any tests. I'll remove it in this patch as well.

 native-task: Rename system tests into standard directory layout
 ---

 Key: MAPREDUCE-5996
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5996
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: task
Reporter: Todd Lipcon
Assignee: Todd Lipcon

 Currently there are a number of tests in src/java/system. This confuses IDEs 
 which think that the package should then be system.org.apache.hadoop instead 
 of just org.apache.hadoop.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5994) native-task: TestBytesUtil fails

2014-07-22 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon updated MAPREDUCE-5994:
---

Attachment: mapreduce-5994.txt

 native-task: TestBytesUtil fails
 

 Key: MAPREDUCE-5994
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5994
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: task
Reporter: Todd Lipcon
Assignee: Todd Lipcon
 Attachments: mapreduce-5994.txt


 This class appears to have some bugs. Two tests fail consistently on my 
 system. BytesUtil itself appears to duplicate a lot of code from guava - we 
 should probably just use the Guava functions.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAPREDUCE-5996) native-task: Rename system tests into standard directory layout

2014-07-22 Thread Todd Lipcon (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon updated MAPREDUCE-5996:
---

Attachment: mapreduce-5996.txt

 native-task: Rename system tests into standard directory layout
 ---

 Key: MAPREDUCE-5996
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5996
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: task
Reporter: Todd Lipcon
Assignee: Todd Lipcon
 Attachments: mapreduce-5996.txt


 Currently there are a number of tests in src/java/system. This confuses IDEs 
 which think that the package should then be system.org.apache.hadoop instead 
 of just org.apache.hadoop.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (MAPREDUCE-5997) native-task: Use DirectBufferPool from Hadoop Common

2014-07-22 Thread Todd Lipcon (JIRA)
Todd Lipcon created MAPREDUCE-5997:
--

 Summary: native-task: Use DirectBufferPool from Hadoop Common
 Key: MAPREDUCE-5997
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5997
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Minor


The native task code has its own direct buffer pool, but Hadoop already has an 
implementation. HADOOP-10882 will move that implementation into Common, and 
this JIRA is to remove the duplicate code and use that one instead.
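
A hedged usage sketch of the Common implementation (the package and method names reflect my reading of HADOOP-10882; treat them as assumptions):

{code}
import java.nio.ByteBuffer;
import org.apache.hadoop.util.DirectBufferPool;

public class DirectBufferPoolSketch {
  public static void main(String[] args) {
    DirectBufferPool pool = new DirectBufferPool();
    ByteBuffer buf = pool.getBuffer(64 * 1024);  // borrow a 64 KB direct buffer
    try {
      buf.putInt(42);  // ... fill and drain the buffer ...
    } finally {
      pool.returnBuffer(buf);  // recycle it instead of allocating a new one
    }
  }
}
{code}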



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAPREDUCE-5974) Allow map output collector fallback

2014-07-22 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071247#comment-14071247
 ] 

Todd Lipcon commented on MAPREDUCE-5974:


In the case of the native collector, it's still the same IFile format on disk, 
and the same reducer. I'm not sure whether that's the case with other map 
collectors out there (e.g. from vendors), but I seem to recall some folks working 
on a specialized collector for memcmp-able keys. In that case, it might be nice 
to have a priority list like 
MemcmpableKeyCollector,NativeCollector,DefaultCollector, and each one would 
just throw an exception if it didn't support the types involved.

Implementing this inside the native collector init() method itself might be 
messy -- you'd have to essentially write a wrapper collector and have every 
method delegate to the real implementation. I would hope that the delegation 
would get devirtualized and inlined, but I'm not certain about that. If you're -0 
or -1 on the current approach though, I'm willing to give it a go.

 Allow map output collector fallback
 ---

 Key: MAPREDUCE-5974
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5974
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: task
Affects Versions: 2.6.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
 Attachments: mapreduce-5974.txt


 Currently we only allow specifying a single MapOutputCollector implementation 
 class in a job. It would be nice to allow a comma-separated list of classes: 
 we should try each collector implementation in the user-specified order until 
 we find one that can be successfully instantiated and initted.
 This is useful for cases where a particular optimized collector 
 implementation cannot operate on all key/value types, or requires native 
 code. The cluster administrator can configure the cluster to try to use the 
 optimized collector and fall back to the default collector.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

