[jira] [Created] (MAPREDUCE-3409) Incorrect custom task status when running on MR2

2011-11-15 Thread Ahmed Radwan (Created) (JIRA)
Incorrect custom task status when running on MR2


 Key: MAPREDUCE-3409
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3409
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Ahmed Radwan


To reproduce this problem:

1- In your mapper setup() set: 

{code}
context.setStatus("myStatus")
{code}

2- When the job finishes:

{code} 
TaskReport[] reports = job.getTaskReports(TaskType.MAP);
assertEquals("myStatus", reports[0].getState());
{code}

The returned status from reports[0].getState() is "SUCCEEDED" as opposed to the 
expected "myStatus" value.

This exact code work fine on MR1. I saw this issue when tried running the 
TestTaskContext test cases on MR2.



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Resolved] (MAPREDUCE-3396) Mumak compilation is broken (for a while?)

2011-11-15 Thread Konstantin Boudnik (Resolved) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Konstantin Boudnik resolved MAPREDUCE-3396.
---

Resolution: Duplicate

It really dups MAPREDUCE-3311, so closing this one.

> Mumak compilation is broken (for a while?)
> --
>
> Key: MAPREDUCE-3396
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3396
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/mumak
>Affects Versions: 0.22.0
>Reporter: Konstantin Boudnik
>Assignee: Konstantin Boudnik
> Fix For: 0.22.0
>
> Attachments: MAPREDUCE-3396.patch
>
>
> {{contrib/mumak}} compilation is broken because of the missing 
> {{commons-lang}} dependency

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3408) yarn-daemon.sh unconditionnaly sets yarn.root.logger

2011-11-15 Thread Created
yarn-daemon.sh unconditionnaly sets yarn.root.logger


 Key: MAPREDUCE-3408
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3408
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2, nodemanager, resourcemanager
Affects Versions: 0.23.0, 0.23.1
Reporter: Bruno Mahé
Assignee: Bruno Mahé


yarn-daemon.sh unconditionnaly sets yarn.root.logger which then prevent any 
override from happening.
>From ./hadoop-mapreduce-project/hadoop-yarn/bin/yarn-daemon.sh:
> export YARN_ROOT_LOGGER="INFO,DRFA"
> export YARN_JHS_LOGGER="INFO,JSA"

and then yarn-daemon.sh will call "$YARN_HOME"/bin/yarn which does the 
following:
> YARN_OPTS="$YARN_OPTS -Dhadoop.root.logger=${YARN_ROOT_LOGGER:-INFO,console}"
> YARN_OPTS="$YARN_OPTS -Dyarn.root.logger=${YARN_ROOT_LOGGER:-INFO,console}"

This has at least 2 issues:
* I cannot override hadoop.root.logger when using the yarn-daemon.sh script
* I cannot have different values for hadoop.root.logger and yarn.root.logger

I currently see two different ways to proceed forward:
1/ Make the script yarn-daemon.sh only sets a default value for 
YARN_ROOT_LOGGER if this variable is not defined
2/ Remove the quoted code from yarn-daemon.sh since yarn already does something 
similar
3/ Entirely remove that chunk and let people define their logging however they 
want through some properties files (see log4j.properties in the conf 
directories for instance)

I would also use the variable HADOOP_ROOT_LOGGER for hadoop.root.logger if 
either option 1/ or 2/ would be taken.

I don't really have any preference toward any of these solutions. What would 
you recommend? What is the Apache Hadoop way for this matter?

Note: This is probably happening as well for the other daemons, and I will take 
a look at it once this issue is resolved.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3407) Wrong jar getting used in TestMR*Jobs* for MiniMRYarnCluster

2011-11-15 Thread Hitesh Shah (Created) (JIRA)
Wrong jar getting used in TestMR*Jobs* for MiniMRYarnCluster


 Key: MAPREDUCE-3407
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3407
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0, 0.24.0
Reporter: Hitesh Shah
Assignee: Hitesh Shah
Priority: Minor
 Fix For: 0.23.1


pom for mapreduce-client-jobclient sets system property to incorrect jar name. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3406) Add node information to bin/mapred job -list-attempt-ids and other improvements

2011-11-15 Thread Ravi Prakash (Created) (JIRA)
Add node information to bin/mapred job -list-attempt-ids and other improvements
---

 Key: MAPREDUCE-3406
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3406
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Ravi Prakash
Assignee: Ravi Prakash
 Fix For: 0.23.1


>From [~rramya]
Providing the NM information where the containers are scheduled in bin/mapred 
job -list-attempt-ids will be helpful in automation, debugging and to avoid 
grepping through the AM logs.

>From my own observation, the list-attempt-ids should list the attempt ids and 
>not require the arguments. The arguments if given, can be used to filter the 
>results. From the usage:
bq. [-list-attempt-ids   ]. Valid values for 
 are MAP REDUCE JOB_SETUP JOB_CLEANUP TASK_CLEANUP. Valid values for 
 are running, completed



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Resolved] (MAPREDUCE-3405) MAPREDUCE-3015 broke compilation of contrib scheduler tests

2011-11-15 Thread Todd Lipcon (Resolved) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon resolved MAPREDUCE-3405.


   Resolution: Fixed
Fix Version/s: 0.20.206.0

> MAPREDUCE-3015 broke compilation of contrib scheduler tests
> ---
>
> Key: MAPREDUCE-3405
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3405
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/capacity-sched, contrib/fair-share
>Affects Versions: 0.20.206.0
>Reporter: Todd Lipcon
>Assignee: Todd Lipcon
>Priority: Critical
> Fix For: 0.20.206.0
>
> Attachments: mr-3405.txt
>
>
> MAPREDUCE-3015 added a new argument to the TaskTrackerStatus constructor, 
> which is used by a few of the scheduler tests, but didn't update those tests. 
> So, the contrib test build is now failing on 0.20-security

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Resolved] (MAPREDUCE-3027) MR-279: Completed container exit state needs to be enhanced to differentiate between container aborts/failures and actual application process exit codes

2011-11-15 Thread Hitesh Shah (Resolved) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hitesh Shah resolved MAPREDUCE-3027.


Resolution: Invalid

> MR-279: Completed container exit state needs to be enhanced to differentiate 
> between container aborts/failures and actual application process exit codes
> 
>
> Key: MAPREDUCE-3027
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3027
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Hitesh Shah
>Assignee: Hitesh Shah
> Fix For: 0.24.0
>
>
> Currently, a completed container's exit status is set to -100 to denote the 
> container being killed by the framework either as a result of the application 
> releasing the container or a node failure. An application process may also 
> return an exit code of -100 creating an ambiguity. 
>  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3405) MAPREDUCE-3015 broke compilation of contrib scheduler tests

2011-11-15 Thread Todd Lipcon (Created) (JIRA)
MAPREDUCE-3015 broke compilation of contrib scheduler tests
---

 Key: MAPREDUCE-3405
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3405
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/capacity-sched, contrib/fair-share
Affects Versions: 0.20.206.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon
Priority: Critical


MAPREDUCE-3015 added a new argument to the TaskTrackerStatus constructor, which 
is used by a few of the scheduler tests, but didn't update those tests. So, the 
contrib test build is now failing on 0.20-security

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




Re: Purpose of COMMIT_PENDING

2011-11-15 Thread Harsh J
Pedro,

Simply put: Speculative execution.

When a task enters that state, it means that it has completed the M/R 
execution, and its awaiting the tracker to commit it so that it can run the 
OutputCommitter process and finalize the outputs (outputs lie in temporary 
directories until committed, if you check with FileOutputCommitter, the default 
OutputCommitter in Hadoop MR).

This is to avoid conflicting outputs when you have speculatives turned on. Two 
tasks can complete at the same time and you do not want both to be committed. 
So the TT will commit the first one that reports back, and kill away the other 
COMMIT_PENDING waiting one in this case.

You might notice (3) cause speculative execution does affect the tail of a job 
run.

On 15-Nov-2011, at 10:35 PM, Pedro Costa wrote:

> Hi,
> 
> Hadoop MR tasks can have the state COMMIT_PENDING.
> 
> 1- What's the purpose of that state?
> 2- What's the reason for a task being in this state?
> 3- It's only the last task before finishing a job that enters this state?
> 
> -- 
> Thanks,



[jira] [Created] (MAPREDUCE-3404) Speculative Execution: speculative map tasks launched even if -Dmapreduce.job.maps.speculative.execution=false

2011-11-15 Thread patrick white (Created) (JIRA)
Speculative Execution: speculative map tasks launched even if 
-Dmapreduce.job.maps.speculative.execution=false
--

 Key: MAPREDUCE-3404
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3404
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: job submission
Affects Versions: 0.23.0
 Environment: Hadoop version is: Hadoop 0.23.0.1110031628
10 node test cluster

Reporter: patrick white


When forcing a mapper to take significantly longer than other map tasks, 
speculative map tasks are
launched even if the mapreduce.job.maps.speculative.execution parameter is set 
to 'false'.

Testcase: ran default WordCount job with spec execution set to false for both 
map and reduce but still saw a fifth mapper
task launch, ran job as follows:

hadoop --config   jar   /tmp/testphw/wordcount.jar   WordCount  
-Dmapreduce.job.maps.speculative.execution=false  
-Dmapreduce.job.reduces.speculative.execution=false 
/tmp/test_file_of_words* /tmp/file_of_words.out

Input data was 4 text files >hdfs blocksize, with same word pattern plus one 
diff text line in each file, fourth
file was 4 times as large as others:

hadoop --config   fs -ls  /tmp
Found 5 items
drwxr-xr-x   - user hdfs  0 2011-10-20 16:17 /tmp/file_of_words.out
-rw-r--r--   3 user hdfs   62800021 2011-10-20 14:45 /tmp/test_file_of_words1
-rw-r--r--   3 user hdfs   62800024 2011-10-20 14:46 /tmp/test_file_of_words2
-rw-r--r--   3 user hdfs   62800024 2011-10-20 14:46 /tmp/test_file_of_words3
-rw-r--r--   3 user hdfs  271708312 2011-10-20 15:50 /tmp/test_file_of_words4

Job launched 5 mappers despite spec exec set to false, output snippet:

org.apache.hadoop.mapreduce.JobCounter
NUM_FAILED_MAPS=1
TOTAL_LAUNCHED_MAPS=5
TOTAL_LAUNCHED_REDUCES=1
RACK_LOCAL_MAPS=5
SLOTS_MILLIS_MAPS=273540
SLOTS_MILLIS_REDUCES=212876


Reran same case as above only set both spec exec params to 'true', same results 
only this time the fifth task being
launched is expected since spec exec = true.

job run:

hadoop --config   jar   /tmp/testphw/wordcount.jar   WordCount  
-Dmapreduce.job.maps.speculative.execution=true  
-Dmapreduce.job.reduces.speculative.execution=true 
/tmp/test_file_of_words* /tmp/file_of_words.out

output snippet:

org.apache.hadoop.mapreduce.JobCounter
NUM_FAILED_MAPS=1
TOTAL_LAUNCHED_MAPS=5
TOTAL_LAUNCHED_REDUCES=1
RACK_LOCAL_MAPS=5
SLOTS_MILLIS_MAPS=279653
SLOTS_MILLIS_REDUCES=211474

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3403) Speculative Execution: enabling multiple reduce tasks inhibit spec exec launch of mappers

2011-11-15 Thread patrick white (Created) (JIRA)
Speculative Execution: enabling multiple reduce tasks inhibit spec exec launch 
of mappers 
--

 Key: MAPREDUCE-3403
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3403
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: job submission
Affects Versions: 0.23.0
 Environment: Hadoop version is: Hadoop 0.23.0.1110031628
10 node test cluster
Reporter: patrick white


When forcing multiple reduce tasks to be launched by applying the 
setNumReduceTasks() method on a Job object, and
running on input data which has one significantly longer map (and consequently 
reduce) task;

- a speculative reduce task was not launched, even with a longer running 
reducer only 4 reduce tasks were launched

- the spec launch of map tasks was inhibited by the setNumReduceTasks() method 
applied, so even with
-Dmapreduce.job.maps.speculative.execution=true we only had 4 map tasks 
launched. The exact same code with the
setNumReduceTasks() method taken out, and on the same input data set, 
consistently launched 5 mappers as expected.

Testing info:

3. modified WordCount to force 4 reducers being launched, by adding:

job.setNumReduceTasks(4); // hardwire 4 reducers for now
System.out.println("\nTESTDEBUG: using 4 reduce tasks for now\n\n");

to the Job object. This causes 4 reduce tasks to be launched, oddly though it 
inhibits the map task from speculative
launch. So the same job code, without the setNumReduceTasks() method, will 
launch 5 mappers as described in case #2.
When this method is added, that same job will only launch 4 mappers, as well as 
4 reducers, otherwise the job
successfully completes.

output snippet with setNumReduceTasks():

org.apache.hadoop.mapreduce.JobCounter
TOTAL_LAUNCHED_MAPS=4
TOTAL_LAUNCHED_REDUCES=4
RACK_LOCAL_MAPS=4
SLOTS_MILLIS_MAPS=190787
SLOTS_MILLIS_REDUCES=572554

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




Purpose of COMMIT_PENDING

2011-11-15 Thread Pedro Costa
Hi,

Hadoop MR tasks can have the state COMMIT_PENDING.

1- What's the purpose of that state?
2- What's the reason for a task being in this state?
3- It's only the last task before finishing a job that enters this state?

-- 
Thanks,


[jira] [Created] (MAPREDUCE-3402) AMScalability test of Sleep job with 100K 1-sec maps regressed into running very slowly

2011-11-15 Thread Vinod Kumar Vavilapalli (Created) (JIRA)
AMScalability test of Sleep job with 100K 1-sec maps regressed into running 
very slowly
---

 Key: MAPREDUCE-3402
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3402
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: applicationmaster, mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
 Fix For: 0.23.1


The world was rosier before October 19-25, [~karams] says.

The 100K 1 second sleep job used to take around 800mins or 13-14 mins. It now 
runs till 45 mins and still manages to complete only about 45K tasks.

One/more of the flurry of commits for 0.23.0 deserve(s) the blame.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3401) Make single node secure cluster setup documentation for 0.23

2011-11-15 Thread Anupam Seth (Created) (JIRA)
Make single node secure cluster setup documentation for 0.23


 Key: MAPREDUCE-3401
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3401
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Reporter: Anupam Seth
Assignee: Anupam Seth
Priority: Minor
 Fix For: 0.23.1


This JIRA is to track some minor corrections and suggestions for improvement 
for the documentation for the setup of a single node cluster using 0.23 
currently available at 
http://people.apache.org/~acmurthy/hadoop-0.23/hadoop-yarn/hadoop-yarn-site/SingleCluster.html



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira