Re: [VOTE] Release Apache Hadoop 2.2.0

2013-10-14 Thread Tsuyoshi OZAWA
+1 (non-binding)

- Verified md5 checksums and signature.
- Build from source, and run some example MR jobs in single node setup.

On Sun, Oct 13, 2013 at 7:42 PM, Siddharth Seth ss...@apache.org wrote:
 +1 (binding)

 Verified checksums and signature. Built from source and ran some simple MR
 and Tez jobs.

 - Sid

 On Mon, Oct 7, 2013 at 12:00 AM, Arun C Murthy a...@hortonworks.com wrote:

 Folks,

 I've created a release candidate (rc0) for hadoop-2.2.0 that I would like
 to get released - this release fixes a small number of bugs and some
 protocol/api issues which should ensure they are now stable and will not
 change in hadoop-2.x.

 The RC is available at:
 http://people.apache.org/~acmurthy/hadoop-2.2.0-rc0
 The RC tag in svn is here:
 http://svn.apache.org/repos/asf/hadoop/common/tags/release-2.2.0-rc0

 The maven artifacts are available via repository.apache.org.

 Please try the release and vote; the vote will run for the usual 7 days.

 thanks,
 Arun

 P.S.: Thanks to Colin, Andrew, Daryn, Chris and others for helping nail
 down the symlinks-related issues. I'll release note the fact that we have
 disabled it in 2.2. Also, thanks to Vinod for some heavy-lifting on the
 YARN side in the last couple of weeks.





 --
 Arun C. Murthy
 Hortonworks Inc.
 http://hortonworks.com/



 --
 CONFIDENTIALITY NOTICE
 NOTICE: This message is intended for the use of the individual or entity to
 which it is addressed and may contain information that is confidential,
 privileged and exempt from disclosure under applicable law. If the reader
 of this message is not the intended recipient, you are hereby notified that
 any printing, copying, dissemination, distribution, disclosure or
 forwarding of this communication is strictly prohibited. If you have
 received this communication in error, please contact the sender immediately
 and delete it from your system. Thank You.




-- 
- Tsuyoshi


[jira] [Resolved] (MAPREDUCE-5581) killing jobs which have failed causes log missing

2013-10-14 Thread Jason Lowe (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe resolved MAPREDUCE-5581.
---

Resolution: Duplicate

This is a duplicate of MAPREDUCE-5502.

 killing jobs which have failed causes log missing
 -

 Key: MAPREDUCE-5581
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5581
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: client
Affects Versions: 2.1.1-beta
Reporter: Nemon Lou

 In hive code,when a job failed,they invoke the RunningJob.killJob() API 
 immediately.
 From mapreduce client side,when job is at failed state,the YARNRunner will 
 invoke resMgrDelegate.killApplication to kill that job.And this prevent AM 
 from writing logs to job history server.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


Hadoop-Mapreduce-trunk - Build # 1578 - Still Failing

2013-10-14 Thread Apache Jenkins Server
See https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1578/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 33671 lines...]
  
TestEncryptedShuffle.encryptedShuffleWithoutClientCerts:169-encryptedShuffleWithCerts:156
 null
  TestChild.testChild:151-submitAndValidateJob:137 null

Tests in error: 
  TestMiniMRWithDFSWithDistinctUsers.setUp:97 » YarnRuntime 
java.lang.OutOfMemor...
  TestMiniMRWithDFSWithDistinctUsers.setUp:97 » YarnRuntime 
java.lang.OutOfMemor...
  TestReduceFetchFromPartialMem.testReduceFromPartialMem:93-runJob:300 » IO 
Job...
  TestJobSysDirWithDFS.testWithDFS:130 » YarnRuntime 
java.lang.OutOfMemoryError:...
  TestReduceFetchFromPartialMem.testReduceFromPartialMem:93-runJob:300 » IO 
Job...
  TestLazyOutput.testLazyOutput:146 » YarnRuntime java.lang.OutOfMemoryError: 
un...
  TestSpecialCharactersInOutputPath.testJobWithDFS:112 » YarnRuntime 
java.lang.O...
  TestMapReduceLazyOutput.testLazyOutput:136 » YarnRuntime 
java.lang.OutOfMemory...
  TestSpeculativeExecution.setup:122 » IO Cannot run program stat: 
java.io.IOE...
  TestMRJobs.setup:130 » YarnRuntime java.lang.OutOfMemoryError: unable to 
creat...
  TestRMNMInfo.setup:84 » IO Cannot run program stat: java.io.IOException: 
err...
  TestUberAM.setup:45-TestMRJobs.setup:130 » YarnRuntime 
java.lang.OutOfMemoryE...

Tests run: 455, Failures: 8, Errors: 12, Skipped: 11

[INFO] 
[INFO] Reactor Summary:
[INFO] 
[INFO] hadoop-mapreduce-client ... SUCCESS [2.515s]
[INFO] hadoop-mapreduce-client-core .. SUCCESS [45.525s]
[INFO] hadoop-mapreduce-client-common  SUCCESS [24.742s]
[INFO] hadoop-mapreduce-client-shuffle ... SUCCESS [2.438s]
[INFO] hadoop-mapreduce-client-app ... SUCCESS [6:47.509s]
[INFO] hadoop-mapreduce-client-hs  SUCCESS [2:00.518s]
[INFO] hadoop-mapreduce-client-jobclient . FAILURE [44:55.175s]
[INFO] hadoop-mapreduce-client-hs-plugins  SKIPPED
[INFO] Apache Hadoop MapReduce Examples .. SKIPPED
[INFO] hadoop-mapreduce .. SKIPPED
[INFO] 
[INFO] BUILD FAILURE
[INFO] 
[INFO] Total time: 54:59.081s
[INFO] Finished at: Mon Oct 14 14:14:28 UTC 2013
[INFO] Final Memory: 22M/84M
[INFO] 
[ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-surefire-plugin:2.16:test (default-test) on 
project hadoop-mapreduce-client-jobclient: ExecutionException; nested exception 
is java.util.concurrent.ExecutionException: java.lang.RuntimeException: The 
forked VM terminated without saying properly goodbye. VM crash or System.exit 
called ?
[ERROR] Command was/bin/sh -c cd 
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-trunk/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient
  /home/jenkins/tools/java/jdk1.6.0_26/jre/bin/java -Xmx1024m 
-XX:+HeapDumpOnOutOfMemoryError -jar 
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-trunk/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/target/surefire/surefirebooter457716163653505892.jar
 
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-trunk/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/target/surefire/surefire795825364178818457tmp
 
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-trunk/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/target/surefire/surefire_1128262753521404065768tmp
[ERROR] - [Help 1]
[ERROR] 
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR] 
[ERROR] For more information about the errors and possible solutions, please 
read the following articles:
[ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException
[ERROR] 
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR]   mvn goals -rf :hadoop-mapreduce-client-jobclient
Build step 'Execute shell' marked build as failure
[FINDBUGS] Skipping publisher since build result is FAILURE
Archiving artifacts
Updating MAPREDUCE-5329
Updating MAPREDUCE-5463
Updating YARN-305
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
No tests ran.

[jira] [Resolved] (MAPREDUCE-5546) mapred.cmd on Windows set HADOOP_OPTS incorrectly

2013-10-14 Thread Chris Nauroth (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Nauroth resolved MAPREDUCE-5546.
--

   Resolution: Fixed
Fix Version/s: 2.2.1
   3.0.0

I've committed this to trunk, branch-2, and branch-2.2.  Chuan, thank you for 
the patch.

 mapred.cmd on Windows set HADOOP_OPTS incorrectly
 -

 Key: MAPREDUCE-5546
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5546
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 3.0.0, 2.2.0
Reporter: Chuan Liu
Assignee: Chuan Liu
 Fix For: 3.0.0, 2.2.1

 Attachments: MAPREDUCE-5546-trunk.patch


 The mapred command on Windows does not set HADOOP_OPTS correctly. As a 
 result, some options and settings will miss in the final command, and this 
 will lead to some desired behavior missing. One example is the logging file 
 setting will miss, i.e. even if one set HADOOP_ROOT_LOGGER to DRFA, there is 
 no history server log at HADOOP_LOGFILE.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (MAPREDUCE-5583) Ability to limit running map and reduce tasks

2013-10-14 Thread Jason Lowe (JIRA)
Jason Lowe created MAPREDUCE-5583:
-

 Summary: Ability to limit running map and reduce tasks
 Key: MAPREDUCE-5583
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5583
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mr-am, mrv2
Affects Versions: 2.1.1-beta, 0.23.9
Reporter: Jason Lowe


It would be nice if users could specify a limit to the number of map or reduce 
tasks that are running simultaneously.  Occasionally users are performing 
operations in tasks that can lead to DDoS scenarios if too many tasks run 
simultaneously (e.g.: accessing a database, web service, etc.).  Having the 
ability to throttle the number of tasks simultaneously running would provide 
users a way to mitigate issues with too many tasks on a large cluster 
attempting to access a serivce at any one time.

This is similar to the functionality requested by MAPREDUCE-224 and implemented 
by HADOOP-3412 but was dropped in mrv2.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


streaming documentation in Hadoop 2?

2013-10-14 Thread Sandy Ryza
Hi All,

I noticed that the hadoop streaming documentation does not exist in the
Hadoop 2 source tree, and also cannot be found on the internet.   Is this
on purpose?  I found this wiki page
http://wiki.apache.org/hadoop/HadoopStreaming - is that where doc is
supposed to go?  As this page isn't tied to a specific version, how does it
work if new options are added?

thanks,
-Sandy


Re: streaming documentation in Hadoop 2?

2013-10-14 Thread Eli Collins
It probably just needs doc, I'd go ahead and file a jira for it. The
wiki content here could be a good starting point.

On Mon, Oct 14, 2013 at 2:56 PM, Sandy Ryza sandy.r...@cloudera.com wrote:
 Hi All,

 I noticed that the hadoop streaming documentation does not exist in the
 Hadoop 2 source tree, and also cannot be found on the internet.   Is this
 on purpose?  I found this wiki page
 http://wiki.apache.org/hadoop/HadoopStreaming - is that where doc is
 supposed to go?  As this page isn't tied to a specific version, how does it
 work if new options are added?

 thanks,
 -Sandy


Re: streaming documentation in Hadoop 2?

2013-10-14 Thread Sandy Ryza
Doc existed in MR1 http://hadoop.apache.org/docs/stable/streaming.html, but
it looks like it and a bunch of other stuff (e.g. Rumen and the MapReduce
Tutorial) weren't ported over.


On Mon, Oct 14, 2013 at 3:20 PM, Eli Collins e...@cloudera.com wrote:

 It probably just needs doc, I'd go ahead and file a jira for it. The
 wiki content here could be a good starting point.

 On Mon, Oct 14, 2013 at 2:56 PM, Sandy Ryza sandy.r...@cloudera.com
 wrote:
  Hi All,
 
  I noticed that the hadoop streaming documentation does not exist in the
  Hadoop 2 source tree, and also cannot be found on the internet.   Is this
  on purpose?  I found this wiki page
  http://wiki.apache.org/hadoop/HadoopStreaming - is that where doc is
  supposed to go?  As this page isn't tied to a specific version, how does
 it
  work if new options are added?
 
  thanks,
  -Sandy