[jira] [Commented] (MAPREDUCE-6440) Duplicate Key in Json Output for Job details

2015-08-17 Thread Ajith S (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699221#comment-14699221
 ] 

Ajith S commented on MAPREDUCE-6440:


Hi [~bibinchundatt]

Thanks for reporting issue and analysis. You are right, the two *type* is 
ofcourse confusing unless someone knows the implementation details because 
*type* usually refers to datatype in json (*type:reduceTaskAttemptInfo* is 
right) . I suggest we can change the name of 
*org.apache.hadoop.mapreduce.v2.app.webapp.dao.TaskAttemptInfo.type*. Ofcourse 
a incompatible change

 Duplicate Key in Json Output for Job details
 

 Key: MAPREDUCE-6440
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6440
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver
Reporter: Anushri
Assignee: Bibin A Chundatt
Priority: Minor

 Duplicate key in Json Output for Job details for the url : 
 http://jhs_ip:jhs_port/ws/v1/history/mapreduce/jobs/job_id/tasks/task_id/attempts
 If the task type is REDUCE the json output for this url contains duplicate 
 key for type.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MAPREDUCE-6440) Duplicate Key in Json Output for Job details

2015-08-17 Thread zhihai xu (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699850#comment-14699850
 ] 

zhihai xu commented on MAPREDUCE-6440:
--

Yes, it looks like a good suggestion to avoid name conflict with reserved 
keywords.

 Duplicate Key in Json Output for Job details
 

 Key: MAPREDUCE-6440
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6440
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver
Reporter: Anushri
Assignee: Bibin A Chundatt
Priority: Minor

 Duplicate key in Json Output for Job details for the url : 
 http://jhs_ip:jhs_port/ws/v1/history/mapreduce/jobs/job_id/tasks/task_id/attempts
 If the task type is REDUCE the json output for this url contains duplicate 
 key for type.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (MAPREDUCE-6453) Repeatable Input File Format

2015-08-17 Thread AbdulRahman AlHamali (JIRA)
AbdulRahman AlHamali created MAPREDUCE-6453:
---

 Summary: Repeatable Input File Format
 Key: MAPREDUCE-6453
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6453
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: AbdulRahman AlHamali
Assignee: AbdulRahman AlHamali
Priority: Minor


We are interested in running the training process of deep learning 
architectures on Hadoop clusters. We developed an algorithm that can carry out 
this training process in a MapReduce fashion. However, there is still a problem 
that we can improve.

In deep learning, training data is usually repeated multiple times (10 or even 
more). However, we were not able to find a way to go through the input training 
file multiple times without having to reduce first and then go back and then 
map and reduce and so on so forth. So, to carry on the experiments, we were 
forced to phyiscally repeat the files 10 or 20 times. This is not the best 
solution, obviously, because first the file size is becoming much larger, and 
second, it is not a neat way to carry out the job.


Thus, what we aim to do is to create an interface that input file formats can 
implement that would provide them with the ability to repeat a file n times 
before eventually reducing, which will solve the problem and make Hadoop more 
suitable for the training of deep learning algorithms, or for such problems 
that require going over the data multiple times before reducing.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MAPREDUCE-6440) Duplicate Key in Json Output for Job details

2015-08-17 Thread zhihai xu (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699739#comment-14699739
 ] 

zhihai xu commented on MAPREDUCE-6440:
--

It looks like app master web service url 
http://am_ip:am_port/ws/v1/mapreduce/jobs/{jobid}/tasks/{taskid}/attempts 
also has this issue. Yes, it will be an incompatible change to fix this issue. 
it will be better to fix it at 3.0.0.

 Duplicate Key in Json Output for Job details
 

 Key: MAPREDUCE-6440
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6440
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver
Reporter: Anushri
Assignee: Bibin A Chundatt
Priority: Minor

 Duplicate key in Json Output for Job details for the url : 
 http://jhs_ip:jhs_port/ws/v1/history/mapreduce/jobs/job_id/tasks/task_id/attempts
 If the task type is REDUCE the json output for this url contains duplicate 
 key for type.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MAPREDUCE-6440) Duplicate Key in Json Output for Job details

2015-08-17 Thread zhihai xu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-6440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

zhihai xu updated MAPREDUCE-6440:
-
Target Version/s: 3.0.0

 Duplicate Key in Json Output for Job details
 

 Key: MAPREDUCE-6440
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6440
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver
Reporter: Anushri
Assignee: Bibin A Chundatt
Priority: Minor

 Duplicate key in Json Output for Job details for the url : 
 http://jhs_ip:jhs_port/ws/v1/history/mapreduce/jobs/job_id/tasks/task_id/attempts
 If the task type is REDUCE the json output for this url contains duplicate 
 key for type.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MAPREDUCE-6452) NPE when intermediate encrypt enabled for LocalRunner

2015-08-17 Thread zhihai xu (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699771#comment-14699771
 ] 

zhihai xu commented on MAPREDUCE-6452:
--

Hi [~ajithshetty], Can I take this issue if you didn't start working on it yet? 
I have a patch ready for this issue. thanks

 NPE when intermediate encrypt enabled for LocalRunner
 -

 Key: MAPREDUCE-6452
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6452
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Bibin A Chundatt
Assignee: Ajith S
Priority: Minor

 Enable the below properties try running mapreduce job
 mapreduce.framework.name=local
 mapreduce.job.encrypted-intermediate-data=true
 {code}
 2015-08-14 16:27:25,248 WARN  [Thread-21] mapred.LocalJobRunner 
 (LocalJobRunner.java:run(561)) - job_local473843898_0001
 java.lang.Exception: java.lang.NullPointerException
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:463)
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:523)
 Caused by: java.lang.NullPointerException
 at 
 org.apache.hadoop.crypto.CryptoOutputStream.init(CryptoOutputStream.java:92)
 at 
 org.apache.hadoop.fs.crypto.CryptoFSDataOutputStream.init(CryptoFSDataOutputStream.java:31)
 at 
 org.apache.hadoop.mapreduce.CryptoUtils.wrapIfNecessary(CryptoUtils.java:112)
 at 
 org.apache.hadoop.mapred.MapTask$MapOutputBuffer.sortAndSpill(MapTask.java:1611)
 at 
 org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:1492)
 at 
 org.apache.hadoop.mapred.MapTask$NewOutputCollector.close(MapTask.java:723)
 at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:793)
 at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:244)
 at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
 at java.util.concurrent.FutureTask.run(FutureTask.java:266)
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
 at java.lang.Thread.run(Thread.java:745)
 {code}
 Jobs are failing always



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MAPREDUCE-6452) NPE when intermediate encrypt enabled for LocalRunner

2015-08-17 Thread Bibin A Chundatt (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699620#comment-14699620
 ] 

Bibin A Chundatt commented on MAPREDUCE-6452:
-

[~aw] Any thoughts on this issue?

 NPE when intermediate encrypt enabled for LocalRunner
 -

 Key: MAPREDUCE-6452
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6452
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Bibin A Chundatt
Assignee: Ajith S
Priority: Minor

 Enable the below properties try running mapreduce job
 mapreduce.framework.name=local
 mapreduce.job.encrypted-intermediate-data=true
 {code}
 2015-08-14 16:27:25,248 WARN  [Thread-21] mapred.LocalJobRunner 
 (LocalJobRunner.java:run(561)) - job_local473843898_0001
 java.lang.Exception: java.lang.NullPointerException
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:463)
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:523)
 Caused by: java.lang.NullPointerException
 at 
 org.apache.hadoop.crypto.CryptoOutputStream.init(CryptoOutputStream.java:92)
 at 
 org.apache.hadoop.fs.crypto.CryptoFSDataOutputStream.init(CryptoFSDataOutputStream.java:31)
 at 
 org.apache.hadoop.mapreduce.CryptoUtils.wrapIfNecessary(CryptoUtils.java:112)
 at 
 org.apache.hadoop.mapred.MapTask$MapOutputBuffer.sortAndSpill(MapTask.java:1611)
 at 
 org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:1492)
 at 
 org.apache.hadoop.mapred.MapTask$NewOutputCollector.close(MapTask.java:723)
 at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:793)
 at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:244)
 at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
 at java.util.concurrent.FutureTask.run(FutureTask.java:266)
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
 at java.lang.Thread.run(Thread.java:745)
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MAPREDUCE-6452) NPE when intermediate encrypt enabled for LocalRunner

2015-08-17 Thread Bibin A Chundatt (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-6452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bibin A Chundatt updated MAPREDUCE-6452:

Description: 
Enable the below properties try running mapreduce job

mapreduce.framework.name=local
mapreduce.job.encrypted-intermediate-data=true

{code}
2015-08-14 16:27:25,248 WARN  [Thread-21] mapred.LocalJobRunner 
(LocalJobRunner.java:run(561)) - job_local473843898_0001
java.lang.Exception: java.lang.NullPointerException
at 
org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:463)
at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:523)
Caused by: java.lang.NullPointerException
at 
org.apache.hadoop.crypto.CryptoOutputStream.init(CryptoOutputStream.java:92)
at 
org.apache.hadoop.fs.crypto.CryptoFSDataOutputStream.init(CryptoFSDataOutputStream.java:31)
at 
org.apache.hadoop.mapreduce.CryptoUtils.wrapIfNecessary(CryptoUtils.java:112)
at 
org.apache.hadoop.mapred.MapTask$MapOutputBuffer.sortAndSpill(MapTask.java:1611)
at 
org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:1492)
at 
org.apache.hadoop.mapred.MapTask$NewOutputCollector.close(MapTask.java:723)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:793)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
at 
org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:244)
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)

{code}

Jobs are failing always

  was:

Enable the below properties try running mapreduce job

mapreduce.framework.name=local
mapreduce.job.encrypted-intermediate-data=true

{code}
2015-08-14 16:27:25,248 WARN  [Thread-21] mapred.LocalJobRunner 
(LocalJobRunner.java:run(561)) - job_local473843898_0001
java.lang.Exception: java.lang.NullPointerException
at 
org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:463)
at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:523)
Caused by: java.lang.NullPointerException
at 
org.apache.hadoop.crypto.CryptoOutputStream.init(CryptoOutputStream.java:92)
at 
org.apache.hadoop.fs.crypto.CryptoFSDataOutputStream.init(CryptoFSDataOutputStream.java:31)
at 
org.apache.hadoop.mapreduce.CryptoUtils.wrapIfNecessary(CryptoUtils.java:112)
at 
org.apache.hadoop.mapred.MapTask$MapOutputBuffer.sortAndSpill(MapTask.java:1611)
at 
org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:1492)
at 
org.apache.hadoop.mapred.MapTask$NewOutputCollector.close(MapTask.java:723)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:793)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
at 
org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:244)
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)

{code}




 NPE when intermediate encrypt enabled for LocalRunner
 -

 Key: MAPREDUCE-6452
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6452
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Bibin A Chundatt
Assignee: Ajith S
Priority: Minor

 Enable the below properties try running mapreduce job
 mapreduce.framework.name=local
 mapreduce.job.encrypted-intermediate-data=true
 {code}
 2015-08-14 16:27:25,248 WARN  [Thread-21] mapred.LocalJobRunner 
 (LocalJobRunner.java:run(561)) - job_local473843898_0001
 java.lang.Exception: java.lang.NullPointerException
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:463)
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:523)
 Caused by: java.lang.NullPointerException
 at 
 org.apache.hadoop.crypto.CryptoOutputStream.init(CryptoOutputStream.java:92)
 at 
 org.apache.hadoop.fs.crypto.CryptoFSDataOutputStream.init(CryptoFSDataOutputStream.java:31)
 at 
 org.apache.hadoop.mapreduce.CryptoUtils.wrapIfNecessary(CryptoUtils.java:112)
 at 
 

[jira] [Assigned] (MAPREDUCE-6452) NPE when intermediate encrypt enabled for LocalRunner

2015-08-17 Thread Ajith S (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-6452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ajith S reassigned MAPREDUCE-6452:
--

Assignee: Ajith S

 NPE when intermediate encrypt enabled for LocalRunner
 -

 Key: MAPREDUCE-6452
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6452
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Bibin A Chundatt
Assignee: Ajith S
Priority: Minor

 Enable the below properties try running mapreduce job
 mapreduce.framework.name=local
 mapreduce.job.encrypted-intermediate-data=true
 {code}
 2015-08-14 16:27:25,248 WARN  [Thread-21] mapred.LocalJobRunner 
 (LocalJobRunner.java:run(561)) - job_local473843898_0001
 java.lang.Exception: java.lang.NullPointerException
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:463)
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:523)
 Caused by: java.lang.NullPointerException
 at 
 org.apache.hadoop.crypto.CryptoOutputStream.init(CryptoOutputStream.java:92)
 at 
 org.apache.hadoop.fs.crypto.CryptoFSDataOutputStream.init(CryptoFSDataOutputStream.java:31)
 at 
 org.apache.hadoop.mapreduce.CryptoUtils.wrapIfNecessary(CryptoUtils.java:112)
 at 
 org.apache.hadoop.mapred.MapTask$MapOutputBuffer.sortAndSpill(MapTask.java:1611)
 at 
 org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:1492)
 at 
 org.apache.hadoop.mapred.MapTask$NewOutputCollector.close(MapTask.java:723)
 at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:793)
 at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
 at 
 org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:244)
 at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
 at java.util.concurrent.FutureTask.run(FutureTask.java:266)
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
 at java.lang.Thread.run(Thread.java:745)
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MAPREDUCE-2454) Allow external sorter plugin for MR

2015-08-17 Thread Chris Douglas (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Douglas updated MAPREDUCE-2454:
-
Assignee: Mariappan Asokan  (was: Bharat Jha)

 Allow external sorter plugin for MR
 ---

 Key: MAPREDUCE-2454
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2454
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Affects Versions: 2.0.0-alpha, 3.0.0, 2.0.2-alpha
Reporter: Mariappan Asokan
Assignee: Mariappan Asokan
Priority: Minor
  Labels: features, performance, plugin, sort
 Fix For: 2.0.3-alpha

 Attachments: HadoopSortPlugin.pdf, HadoopSortPlugin.pdf, 
 KeyValueIterator.java, MR-2454-trunkPatchPreview.gz, MapOutputSorter.java, 
 MapOutputSorterAbstract.java, ReduceInputSorter.java, 
 mapreduce-2454-modified-code.patch, mapreduce-2454-modified-test.patch, 
 mapreduce-2454-new-test.patch, mapreduce-2454-protection-change.patch, 
 mapreduce-2454.patch, mapreduce-2454.patch, mapreduce-2454.patch, 
 mapreduce-2454.patch, mapreduce-2454.patch, mapreduce-2454.patch, 
 mapreduce-2454.patch, mapreduce-2454.patch, mapreduce-2454.patch, 
 mapreduce-2454.patch, mapreduce-2454.patch, mapreduce-2454.patch, 
 mapreduce-2454.patch, mapreduce-2454.patch, mapreduce-2454.patch, 
 mapreduce-2454.patch, mapreduce-2454.patch, mapreduce-2454.patch, 
 mapreduce-2454.patch, mapreduce-2454.patch, mapreduce-2454.patch, 
 mr-2454-on-mr-279-build82.patch.gz


 Define interfaces and some abstract classes in the Hadoop framework to 
 facilitate external sorter plugins both on the Map and Reduce sides.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)