[jira] [Created] (MAPREDUCE-5486) Potential file handler leak in JobHistoryServer web ui.

2013-08-29 Thread Rohith Sharma K S (JIRA)
Rohith Sharma K S created MAPREDUCE-5486:


 Summary: Potential file handler leak in JobHistoryServer web ui.
 Key: MAPREDUCE-5486
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5486
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver
Affects Versions: 2.0.5-alpha, 2.1.1-beta
Reporter: Rohith Sharma K S


Any problem in getting aggregated logs for rendering on web ui, then LogReader 
is not closed. 

Now, it reader is not closed which causing many connections in close_wait state.

hadoopuser@hadoopuser: jps
*27909* JobHistoryServer

DataNode port is 50010. When greped with DataNode port, many connections are in 
CLOSE_WAIT from JHS.
hadoopuser@hadoopuser: netstat -tanlp |grep 50010
tcp0  0 10.18.40.48:50010   0.0.0.0:*   LISTEN  
21453/java  
tcp1  0 10.18.40.48:20596   10.18.40.48:50010   CLOSE_WAIT  
*27909*/java  
tcp1  0 10.18.40.48:19667   10.18.40.152:50010  CLOSE_WAIT  
*27909*/java  
tcp1  0 10.18.40.48:20593   10.18.40.48:50010   CLOSE_WAIT  
*27909*/java  
tcp1  0 10.18.40.48:12290   10.18.40.48:50010   CLOSE_WAIT  
*27909*/java  
tcp1  0 10.18.40.48:19662   10.18.40.152:50010  CLOSE_WAIT  
*27909*/java  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


Hadoop-Mapreduce-trunk - Build # 1533 - Still Failing

2013-08-29 Thread Apache Jenkins Server
See https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1533/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 31948 lines...]
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 17.203 sec
Running org.apache.hadoop.mapreduce.lib.partition.TestKeyFieldHelper
Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.138 sec
Running org.apache.hadoop.mapreduce.lib.partition.TestBinaryPartitioner
Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.477 sec
Running org.apache.hadoop.mapreduce.lib.partition.TestMRKeyFieldBasedPartitioner
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.423 sec
Running org.apache.hadoop.mapreduce.TestChild
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 61.707 sec
Running org.apache.hadoop.mapreduce.filecache.TestURIFragments
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.062 sec
Running org.apache.hadoop.mapreduce.TestMapReduce
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 8.373 sec

Results :

Tests run: 466, Failures: 0, Errors: 0, Skipped: 11

[INFO] 
[INFO] Reactor Summary:
[INFO] 
[INFO] hadoop-mapreduce-client ... SUCCESS [1.755s]
[INFO] hadoop-mapreduce-client-core .. SUCCESS [38.053s]
[INFO] hadoop-mapreduce-client-common  SUCCESS [24.098s]
[INFO] hadoop-mapreduce-client-shuffle ... SUCCESS [2.356s]
[INFO] hadoop-mapreduce-client-app ... SUCCESS [5:48.826s]
[INFO] hadoop-mapreduce-client-hs  SUCCESS [1:38.173s]
[INFO] hadoop-mapreduce-client-jobclient . FAILURE 
[1:16:23.696s]
[INFO] hadoop-mapreduce-client-hs-plugins  SKIPPED
[INFO] Apache Hadoop MapReduce Examples .. SKIPPED
[INFO] hadoop-mapreduce .. SKIPPED
[INFO] 
[INFO] BUILD FAILURE
[INFO] 
[INFO] Total time: 1:24:57.596s
[INFO] Finished at: Thu Aug 29 14:42:49 UTC 2013
[INFO] Final Memory: 40M/104M
[INFO] 
[ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-surefire-plugin:2.12.3:test (default-test) on 
project hadoop-mapreduce-client-jobclient: ExecutionException; nested exception 
is java.util.concurrent.ExecutionException: java.lang.RuntimeException: The 
forked VM terminated without saying properly goodbye. VM crash or System.exit 
called ? - [Help 1]
[ERROR] 
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR] 
[ERROR] For more information about the errors and possible solutions, please 
read the following articles:
[ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException
[ERROR] 
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR]   mvn goals -rf :hadoop-mapreduce-client-jobclient
Build step 'Execute shell' marked build as failure
[FINDBUGS] Skipping publisher since build result is FAILURE
Archiving artifacts
Updating HADOOP-9910
Updating YARN-1101
Updating YARN-1034
Updating HADOOP-9894
Updating HDFS-5078
Updating HADOOP-9906
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
No tests ran.

[jira] [Resolved] (MAPREDUCE-3900) mr-jobhistory-daemon.sh should rely on MAPREDUCE env. variables instead of the YARN ones

2013-08-29 Thread Sean Mackrory (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Mackrory resolved MAPREDUCE-3900.
--

Resolution: Duplicate

 mr-jobhistory-daemon.sh should rely on MAPREDUCE env. variables instead of 
 the YARN ones
 

 Key: MAPREDUCE-3900
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3900
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: jobhistoryserver
Affects Versions: 0.23.0
Reporter: Roman Shaposhnik
Assignee: Roman Shaposhnik

 It nice to see yarn-deamo.sh be split into a separate script for managing MR 
 service(s), but once that has happened we should go all the way and make it 
 configurable as an MR entity.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (MAPREDUCE-5472) reducer of sort job restarts from scratch in between after RM restart

2013-08-29 Thread Omkar Vinit Joshi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Omkar Vinit Joshi resolved MAPREDUCE-5472.
--

Resolution: Cannot Reproduce

 reducer of sort job restarts from scratch in between after RM restart
 -

 Key: MAPREDUCE-5472
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5472
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: yeshavora
Assignee: Omkar Vinit Joshi
Priority: Blocker

 Steps Followed:
 1) Run a sort job. As soon as it finishes all the map tasks. [100% map], 
 restart resource manager.
 2) Analyse the progress of the sort job.
 It starts with 100% map 0% reduce
 100% map 32% reduce
 100% map 0% reduce
 Reducer stays at 30% reduce for around 5-10 minutes. and again start reducer 
 from scratch.
 Log from failed reducer attempt:
 Error: java.io.IOException: Error while reading compressed data at 
 org.apache.hadoop.io.IOUtils.wrappedReadForCompressedData(IOUtils.java:174) 
 at org.apache.hadoop.mapred.IFile$Reader.readData(IFile.java:383) at 
 org.apache.hadoop.mapred.IFile$Reader.nextRawValue(IFile.java:444) at 
 org.apache.hadoop.mapred.Merger$Segment.nextRawValue(Merger.java:327) at 
 org.apache.hadoop.mapred.Merger$Segment.getValue(Merger.java:309) at 
 org.apache.hadoop.mapred.Merger$MergeQueue.next(Merger.java:533) at 
 org.apache.hadoop.mapred.ReduceTask$4.next(ReduceTask.java:619) at 
 org.apache.hadoop.mapreduce.task.ReduceContextImpl.nextKeyValue(ReduceContextImpl.java:154)
  at 
 org.apache.hadoop.mapreduce.task.ReduceContextImpl.nextKey(ReduceContextImpl.java:121)
  at 
 org.apache.hadoop.mapreduce.lib.reduce.WrappedReducer$Context.nextKey(WrappedReducer.java:297)
  at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:170) at 
 org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:645) at 
 org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:405) at 
 org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:162) at 
 java.security.AccessController.doPrivileged(Native Method) at 
 javax.security.auth.Subject.doAs(Subject.java:396) at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1477)
  at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:157) Caused by: 
 org.apache.hadoop.fs.FSError: java.io.IOException: Input/output error at 
 org.apache.hadoop.fs.RawLocalFileSystem$LocalFSFileInputStream.read(RawLocalFileSystem.java:177)
  at java.io.BufferedInputStream.read1(BufferedInputStream.java:256) at 
 java.io.BufferedInputStream.read(BufferedInputStream.java:317) at 
 java.io.DataInputStream.read(DataInputStream.java:132) at 
 org.apache.hadoop.mapred.IFileInputStream.doRead(IFileInputStream.java:209) 
 at org.apache.hadoop.mapred.IFileInputStream.read(IFileInputStream.java:152) 
 at 
 org.apache.hadoop.io.compress.BlockDecompressorStream.getCompressedData(BlockDecompressorStream.java:127)
  at 
 org.apache.hadoop.io.compress.BlockDecompressorStream.decompress(BlockDecompressorStream.java:98)
  at 
 org.apache.hadoop.io.compress.DecompressorStream.read(DecompressorStream.java:85)
  at 
 org.apache.hadoop.io.IOUtils.wrappedReadForCompressedData(IOUtils.java:170) 
 ... 17 more Caused by: java.io.IOException: Input/output error at 
 java.io.FileInputStream.readBytes(Native Method) at 
 java.io.FileInputStream.read(FileInputStream.java:220) at 
 org.apache.hadoop.fs.RawLocalFileSystem$TrackingFileInputStream.read(RawLocalFileSystem.java:110)
  at 
 org.apache.hadoop.fs.RawLocalFileSystem$LocalFSFileInputStream.read(RawLocalFileSystem.java:171)
  ... 26 more

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (MAPREDUCE-5487) In task processes, JobConf is unnecessarily loaded again in Limits

2013-08-29 Thread Sandy Ryza (JIRA)
Sandy Ryza created MAPREDUCE-5487:
-

 Summary: In task processes, JobConf is unnecessarily loaded again 
in Limits
 Key: MAPREDUCE-5487
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5487
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: performance, task
Affects Versions: 2.1.0-beta
Reporter: Sandy Ryza


Limits statically loads a JobConf, which incurs costs of reading files from 
disk and parsing XML.  The contents of this JobConf are identical to the one 
loaded by YarnChild (before adding job.xml as a resource).  Allowing Limits to 
initialize with the JobConf loaded in YarnChild would reduce task startup time.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira