[jira] [Commented] (MAPREDUCE-6223) TestJobConf#testNegativeValueForTaskVmem failures
[ https://issues.apache.org/jira/browse/MAPREDUCE-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14308760#comment-14308760 ]

Masatake Iwasaki commented on MAPREDUCE-6223:
---------------------------------------------

s/not local value but//

TestJobConf#testNegativeValueForTaskVmem failures
-------------------------------------------------

Key: MAPREDUCE-6223
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6223
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: test
Affects Versions: 3.0.0
Reporter: Gera Shegalov
Assignee: Varun Saxena
Attachments: MAPREDUCE-6223.001.patch, MAPREDUCE-6223.002.patch

{code}
Tests run: 8, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 3.328 sec <<< FAILURE! - in org.apache.hadoop.conf.TestJobConf
testNegativeValueForTaskVmem(org.apache.hadoop.conf.TestJobConf)  Time elapsed: 0.089 sec  <<< FAILURE!
java.lang.AssertionError: expected:<1024> but was:<-1>
    at org.junit.Assert.fail(Assert.java:88)
    at org.junit.Assert.failNotEquals(Assert.java:743)
    at org.junit.Assert.assertEquals(Assert.java:118)
    at org.junit.Assert.assertEquals(Assert.java:555)
    at org.junit.Assert.assertEquals(Assert.java:542)
    at org.apache.hadoop.conf.TestJobConf.testNegativeValueForTaskVmem(TestJobConf.java:111)
{code}

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6165) [JDK8] TestCombineFileInputFormat failed on JDK8
[ https://issues.apache.org/jira/browse/MAPREDUCE-6165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14308545#comment-14308545 ]

Hadoop QA commented on MAPREDUCE-6165:
--------------------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12685053/MAPREDUCE-6165-001.patch
against trunk revision 6583ad1.

{color:green}+1 @author{color}. The patch does not contain any @author tags.

{color:green}+1 tests included{color}. The patch appears to include 1 new or modified test file.

{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.

{color:green}+1 javadoc{color}. There were no new javadoc warning messages.

{color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse.

{color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.

{color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient:

    org.apache.hadoop.conf.TestJobConf

Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5169//testReport/
Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5169//console

This message is automatically generated.

[JDK8] TestCombineFileInputFormat failed on JDK8
------------------------------------------------

Key: MAPREDUCE-6165
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6165
Project: Hadoop Map/Reduce
Issue Type: Bug
Reporter: Wei Yan
Assignee: Akira AJISAKA
Priority: Minor
Attachments: MAPREDUCE-6165-001.patch, MAPREDUCE-6165-reproduce.patch

The error msg:
{noformat}
testSplitPlacementForCompressedFiles(org.apache.hadoop.mapreduce.lib.input.TestCombineFileInputFormat)  Time elapsed: 2.487 sec  <<< FAILURE!
junit.framework.AssertionFailedError: expected:<2> but was:<1>
    at junit.framework.Assert.fail(Assert.java:57)
    at junit.framework.Assert.failNotEquals(Assert.java:329)
    at junit.framework.Assert.assertEquals(Assert.java:78)
    at junit.framework.Assert.assertEquals(Assert.java:234)
    at junit.framework.Assert.assertEquals(Assert.java:241)
    at junit.framework.TestCase.assertEquals(TestCase.java:409)
    at org.apache.hadoop.mapreduce.lib.input.TestCombineFileInputFormat.testSplitPlacementForCompressedFiles(TestCombineFileInputFormat.java:911)

testSplitPlacement(org.apache.hadoop.mapreduce.lib.input.TestCombineFileInputFormat)  Time elapsed: 0.985 sec  <<< FAILURE!
junit.framework.AssertionFailedError: expected:<2> but was:<1>
    at junit.framework.Assert.fail(Assert.java:57)
    at junit.framework.Assert.failNotEquals(Assert.java:329)
    at junit.framework.Assert.assertEquals(Assert.java:78)
    at junit.framework.Assert.assertEquals(Assert.java:234)
    at junit.framework.Assert.assertEquals(Assert.java:241)
    at junit.framework.TestCase.assertEquals(TestCase.java:409)
    at org.apache.hadoop.mapreduce.lib.input.TestCombineFileInputFormat.testSplitPlacement(TestCombineFileInputFormat.java:368)
{noformat}

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6227) DFSIO for truncate
[ https://issues.apache.org/jira/browse/MAPREDUCE-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14308606#comment-14308606 ]

Hadoop QA commented on MAPREDUCE-6227:
--------------------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12696938/DFSIO-truncate-00.patch
against trunk revision 6583ad1.

{color:green}+1 @author{color}. The patch does not contain any @author tags.

{color:green}+1 tests included{color}. The patch appears to include 1 new or modified test file.

{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.

{color:green}+1 javadoc{color}. There were no new javadoc warning messages.

{color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse.

{color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.

{color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient:

    org.apache.hadoop.conf.TestJobConf

Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5170//testReport/
Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5170//console

This message is automatically generated.

DFSIO for truncate
------------------

Key: MAPREDUCE-6227
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6227
Project: Hadoop Map/Reduce
Issue Type: New Feature
Components: benchmarks, test
Affects Versions: 2.7.0
Reporter: Konstantin Shvachko
Assignee: Konstantin Shvachko
Attachments: DFSIO-truncate-00.patch

Create a benchmark and a test for truncate within the framework of TestDFSIO.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6234) MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-6234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Masatake Iwasaki updated MAPREDUCE-6234:
----------------------------------------
Attachment: MAPREDUCE-6234.002.patch

.002 addresses my comment in MAPREDUCE-6223. Tests needing the default value in conf can use {{MRJobConfig.DEFAULT_MAP_MEMORY_MB}}, and tests needing the value processed by JobConf#getMemoryRequired can use {{JobConf.DEFAULT_MAP_MEMORY_REQUIRED}}.

MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml
----------------------------------------------------------------------------

Key: MAPREDUCE-6234
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6234
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: contrib/gridmix, mrv2
Reporter: Masatake Iwasaki
Assignee: Masatake Iwasaki
Attachments: MAPREDUCE-6234.001.patch, MAPREDUCE-6234.002.patch

TestHighRamJob fails because of this.

{code}
-------------------------------------------------------
 T E S T S
-------------------------------------------------------
Running org.apache.hadoop.mapred.gridmix.TestHighRamJob
Tests run: 1, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 1.162 sec <<< FAILURE! - in org.apache.hadoop.mapred.gridmix.TestHighRamJob
testHighRamFeatureEmulation(org.apache.hadoop.mapred.gridmix.TestHighRamJob)  Time elapsed: 1.102 sec  <<< FAILURE!
java.lang.AssertionError: expected:<1024> but was:<-1>
    at org.junit.Assert.fail(Assert.java:88)
    at org.junit.Assert.failNotEquals(Assert.java:743)
    at org.junit.Assert.assertEquals(Assert.java:118)
    at org.junit.Assert.assertEquals(Assert.java:555)
    at org.junit.Assert.assertEquals(Assert.java:542)
    at org.apache.hadoop.mapred.gridmix.TestHighRamJob.testHighRamConfig(TestHighRamJob.java:98)
    at org.apache.hadoop.mapred.gridmix.TestHighRamJob.testHighRamFeatureEmulation(TestHighRamJob.java:117)
{code}

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6234) MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-6234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14308750#comment-14308750 ]

Tsuyoshi OZAWA commented on MAPREDUCE-6234:
-------------------------------------------

Makes sense. [~jira.shegalov], do you know the reason that DEFAULT_MAP_MEMORY_MB was not updated in MAPREDUCE-5785? If there is no reason, I think we can apply this patch to trunk.

MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml
----------------------------------------------------------------------------

Key: MAPREDUCE-6234
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6234
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: contrib/gridmix, mrv2
Reporter: Masatake Iwasaki
Assignee: Masatake Iwasaki
Attachments: MAPREDUCE-6234.001.patch

TestHighRamJob fails because of this.

{code}
-------------------------------------------------------
 T E S T S
-------------------------------------------------------
Running org.apache.hadoop.mapred.gridmix.TestHighRamJob
Tests run: 1, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 1.162 sec <<< FAILURE! - in org.apache.hadoop.mapred.gridmix.TestHighRamJob
testHighRamFeatureEmulation(org.apache.hadoop.mapred.gridmix.TestHighRamJob)  Time elapsed: 1.102 sec  <<< FAILURE!
java.lang.AssertionError: expected:<1024> but was:<-1>
    at org.junit.Assert.fail(Assert.java:88)
    at org.junit.Assert.failNotEquals(Assert.java:743)
    at org.junit.Assert.assertEquals(Assert.java:118)
    at org.junit.Assert.assertEquals(Assert.java:555)
    at org.junit.Assert.assertEquals(Assert.java:542)
    at org.apache.hadoop.mapred.gridmix.TestHighRamJob.testHighRamConfig(TestHighRamJob.java:98)
    at org.apache.hadoop.mapred.gridmix.TestHighRamJob.testHighRamFeatureEmulation(TestHighRamJob.java:117)
{code}

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6223) TestJobConf#testNegativeValueForTaskVmem failures
[ https://issues.apache.org/jira/browse/MAPREDUCE-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14308748#comment-14308748 ]

Masatake Iwasaki commented on MAPREDUCE-6223:
---------------------------------------------

I think JobConf#getMemoryRequired should get 1024 from a constant in MRJobConfig other than DEFAULT_*_MEMORY_MB, because 1024 is never set in Configuration. [~ajisakaa] / [~ozawa], please commit the patch of this issue first. I will update the patch of MAPREDUCE-6234 later.

TestJobConf#testNegativeValueForTaskVmem failures
-------------------------------------------------

Key: MAPREDUCE-6223
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6223
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: test
Affects Versions: 3.0.0
Reporter: Gera Shegalov
Assignee: Varun Saxena
Attachments: MAPREDUCE-6223.001.patch, MAPREDUCE-6223.002.patch

{code}
Tests run: 8, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 3.328 sec <<< FAILURE! - in org.apache.hadoop.conf.TestJobConf
testNegativeValueForTaskVmem(org.apache.hadoop.conf.TestJobConf)  Time elapsed: 0.089 sec  <<< FAILURE!
java.lang.AssertionError: expected:<1024> but was:<-1>
    at org.junit.Assert.fail(Assert.java:88)
    at org.junit.Assert.failNotEquals(Assert.java:743)
    at org.junit.Assert.assertEquals(Assert.java:118)
    at org.junit.Assert.assertEquals(Assert.java:555)
    at org.junit.Assert.assertEquals(Assert.java:542)
    at org.apache.hadoop.conf.TestJobConf.testNegativeValueForTaskVmem(TestJobConf.java:111)
{code}

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
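The separation Masatake describes, a positive fallback constant kept apart from the -1 "unset" default that mirrors mapred-default.xml, can be sketched in isolation. This is an illustrative standalone sketch, not the actual Hadoop code; the class name and the constant `MAP_MEMORY_REQUIRED_MB` are hypothetical stand-ins.

```java
// Illustrative sketch (not actual Hadoop code): keep the "unset" sentinel that
// mirrors mapred-default.xml separate from the hard-coded 1024 MB fallback,
// so the fallback value is never stored in or read from Configuration.
public class MemoryDefaults {
    public static final long DEFAULT_MAP_MEMORY_MB = -1L;    // matches mapred-default.xml
    public static final long MAP_MEMORY_REQUIRED_MB = 1024L; // fallback, never set in conf

    // Return the configured value if positive, otherwise the fallback constant.
    public static long getMemoryRequired(long configuredMb) {
        return configuredMb > 0 ? configuredMb : MAP_MEMORY_REQUIRED_MB;
    }
}
```

With two constants, a test asserting the conf default can check -1 while a test asserting the processed value can check 1024, which is exactly the split proposed for MAPREDUCE-6234.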
[jira] [Updated] (MAPREDUCE-6227) DFSIO for truncate
[ https://issues.apache.org/jira/browse/MAPREDUCE-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Konstantin Shvachko updated MAPREDUCE-6227:
-------------------------------------------
Attachment: DFSIO-truncate-01.patch

Moved TestDFSIO_results.log under {{target/test-dir}} for tests.

DFSIO for truncate
------------------

Key: MAPREDUCE-6227
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6227
Project: Hadoop Map/Reduce
Issue Type: New Feature
Components: benchmarks, test
Affects Versions: 2.7.0
Reporter: Konstantin Shvachko
Assignee: Konstantin Shvachko
Attachments: DFSIO-truncate-00.patch, DFSIO-truncate-01.patch

Create a benchmark and a test for truncate within the framework of TestDFSIO.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
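As a rough illustration of the operation such a benchmark exercises, the sketch below writes a file and truncates it to a shorter length, using a local java.nio FileChannel as a stand-in for HDFS truncate. All names here are illustrative; this is not the TestDFSIO code.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.channels.FileChannel;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

public class TruncateSketch {
    // Write `total` bytes, truncate the file down to `keep` bytes,
    // and return the resulting file length.
    public static long writeThenTruncate(Path file, int total, long keep) throws IOException {
        Files.write(file, new byte[total]);
        try (FileChannel ch = FileChannel.open(file, StandardOpenOption.WRITE)) {
            ch.truncate(keep);
        }
        return Files.size(file);
    }

    // Convenience wrapper on a temp file so the sketch is easy to run.
    public static long demo() {
        try {
            Path tmp = Files.createTempFile("truncate-sketch", ".dat");
            long len = writeThenTruncate(tmp, 4096, 1024L);
            Files.delete(tmp);
            return len;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

A DFSIO-style truncate benchmark would time many such write-then-truncate rounds against HDFS instead of the local filesystem.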
[jira] [Commented] (MAPREDUCE-6234) MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-6234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14308764#comment-14308764 ]

Gera Shegalov commented on MAPREDUCE-6234:
------------------------------------------

I apologize, I am too tied up right now to do a thorough review. Looking into resolving this is on my list. I was thinking that direct references to DEFAULT_*_MEMORY_MB should be wrapped in a single method. Maybe [~kasha] can chime in in the meantime.

MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml
----------------------------------------------------------------------------

Key: MAPREDUCE-6234
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6234
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: contrib/gridmix, mrv2
Reporter: Masatake Iwasaki
Assignee: Masatake Iwasaki
Attachments: MAPREDUCE-6234.001.patch

TestHighRamJob fails because of this.

{code}
-------------------------------------------------------
 T E S T S
-------------------------------------------------------
Running org.apache.hadoop.mapred.gridmix.TestHighRamJob
Tests run: 1, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 1.162 sec <<< FAILURE! - in org.apache.hadoop.mapred.gridmix.TestHighRamJob
testHighRamFeatureEmulation(org.apache.hadoop.mapred.gridmix.TestHighRamJob)  Time elapsed: 1.102 sec  <<< FAILURE!
java.lang.AssertionError: expected:<1024> but was:<-1>
    at org.junit.Assert.fail(Assert.java:88)
    at org.junit.Assert.failNotEquals(Assert.java:743)
    at org.junit.Assert.assertEquals(Assert.java:118)
    at org.junit.Assert.assertEquals(Assert.java:555)
    at org.junit.Assert.assertEquals(Assert.java:542)
    at org.apache.hadoop.mapred.gridmix.TestHighRamJob.testHighRamConfig(TestHighRamJob.java:98)
    at org.apache.hadoop.mapred.gridmix.TestHighRamJob.testHighRamFeatureEmulation(TestHighRamJob.java:117)
{code}

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6223) TestJobConf#testNegativeValueForTaskVmem failures
[ https://issues.apache.org/jira/browse/MAPREDUCE-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14308671#comment-14308671 ]

Varun Saxena commented on MAPREDUCE-6223:
-----------------------------------------

[~ajisakaa] / [~ozawa], as you wish. Test failures will keep appearing until MAPREDUCE-6223 is committed. Ideally {{MRJobConfig.DEFAULT_MAP_MEMORY_MB}} should not be changed; I feel we should not be taking the default value from a local variable. MAPREDUCE-6234 would hence be a redundant fix, as we would have to revert its changes again. It is somewhat confusing that the default value in mapred-default.xml is -1 while in code we take it as 1024, but if somebody reads the config description, which should be done, it is quite clear what the behavior of this config is.

{code}
<description>The amount of memory to request from the scheduler for each map task.
If this is not specified or is non-positive, it is inferred from
mapreduce.map.java.opts and mapreduce.job.heap.memory-mb.ratio.
If java-opts are also not specified, we set it to 1024.</description>
{code}

You can take a call on whether to commit that or not. Alternatively, you can review and commit this as well.

TestJobConf#testNegativeValueForTaskVmem failures
-------------------------------------------------

Key: MAPREDUCE-6223
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6223
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: test
Affects Versions: 3.0.0
Reporter: Gera Shegalov
Assignee: Varun Saxena
Attachments: MAPREDUCE-6223.001.patch, MAPREDUCE-6223.002.patch

{code}
Tests run: 8, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 3.328 sec <<< FAILURE! - in org.apache.hadoop.conf.TestJobConf
testNegativeValueForTaskVmem(org.apache.hadoop.conf.TestJobConf)  Time elapsed: 0.089 sec  <<< FAILURE!
java.lang.AssertionError: expected:<1024> but was:<-1>
    at org.junit.Assert.fail(Assert.java:88)
    at org.junit.Assert.failNotEquals(Assert.java:743)
    at org.junit.Assert.assertEquals(Assert.java:118)
    at org.junit.Assert.assertEquals(Assert.java:555)
    at org.junit.Assert.assertEquals(Assert.java:542)
    at org.apache.hadoop.conf.TestJobConf.testNegativeValueForTaskVmem(TestJobConf.java:111)
{code}

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
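The inference order quoted from the config description above can be sketched as standalone Java. This is a simplified illustration under stated assumptions, not the real JobConf logic: it only recognizes a plain `-Xmx<N>m` option, and all class and method names are hypothetical.

```java
// Simplified sketch of the inference order in mapreduce.map.memory.mb's
// description: a positive setting wins; otherwise derive heap/ratio from
// java.opts; otherwise fall back to 1024. Not the actual JobConf code.
public class MapMemoryInference {
    static final long FALLBACK_MB = 1024L;

    // Extract an -Xmx value in MB from a java.opts string; -1 if absent.
    static long parseXmxMb(String javaOpts) {
        if (javaOpts == null) return -1L;
        for (String opt : javaOpts.split("\\s+")) {
            if (opt.startsWith("-Xmx") && opt.endsWith("m") && opt.length() > 5) {
                return Long.parseLong(opt.substring(4, opt.length() - 1));
            }
        }
        return -1L;
    }

    // memoryMb > 0 -> use it; else infer heap / ratio; else 1024.
    public static long inferMemoryMb(long memoryMb, String javaOpts, double heapRatio) {
        if (memoryMb > 0) return memoryMb;
        long xmxMb = parseXmxMb(javaOpts);
        return xmxMb > 0 ? Math.round(xmxMb / heapRatio) : FALLBACK_MB;
    }
}
```

This also makes the source of the test confusion concrete: with nothing configured, the conf still holds -1, and 1024 only appears after the inference step runs.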
[jira] [Commented] (MAPREDUCE-6235) Bundle and compress files passed with -libjars prior to uploading and distributing
[ https://issues.apache.org/jira/browse/MAPREDUCE-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14307179#comment-14307179 ]

Dustin Cote commented on MAPREDUCE-6235:
----------------------------------------

Thanks folks, I believe I was seeing a time difference because of the time to compress. I'll go ahead and close this out since no code change should be made here.

Bundle and compress files passed with -libjars prior to uploading and distributing
----------------------------------------------------------------------------------

Key: MAPREDUCE-6235
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6235
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: distributed-cache, mrv2
Affects Versions: 2.6.0
Reporter: Dustin Cote
Assignee: Dustin Cote
Priority: Minor

To improve performance, we should upload jars flagged by -libjars as a single bundle and expand on arrival instead of uploading the jars one by one. This would also reduce the network overhead of using the -libjars option.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6235) Bundle and compress files passed with -libjars prior to uploading and distributing
[ https://issues.apache.org/jira/browse/MAPREDUCE-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14307181#comment-14307181 ]

Dustin Cote commented on MAPREDUCE-6235:
----------------------------------------

Time to *zip*, not compress... ok, now closing it.

Bundle and compress files passed with -libjars prior to uploading and distributing
----------------------------------------------------------------------------------

Key: MAPREDUCE-6235
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6235
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: distributed-cache, mrv2
Affects Versions: 2.6.0
Reporter: Dustin Cote
Assignee: Dustin Cote
Priority: Minor

To improve performance, we should upload jars flagged by -libjars as a single bundle and expand on arrival instead of uploading the jars one by one. This would also reduce the network overhead of using the -libjars option.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (MAPREDUCE-6235) Bundle and compress files passed with -libjars prior to uploading and distributing
[ https://issues.apache.org/jira/browse/MAPREDUCE-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dustin Cote resolved MAPREDUCE-6235.
------------------------------------
Resolution: Invalid

Bundle and compress files passed with -libjars prior to uploading and distributing
----------------------------------------------------------------------------------

Key: MAPREDUCE-6235
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6235
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: distributed-cache, mrv2
Affects Versions: 2.6.0
Reporter: Dustin Cote
Assignee: Dustin Cote
Priority: Minor

To improve performance, we should upload jars flagged by -libjars as a single bundle and expand on arrival instead of uploading the jars one by one. This would also reduce the network overhead of using the -libjars option.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6245) Fixed split shuffling.
[ https://issues.apache.org/jira/browse/MAPREDUCE-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14307331#comment-14307331 ]

Eric Payne commented on MAPREDUCE-6245:
---------------------------------------

[~lbkzman], can you please describe the problem that this Jira is trying to resolve?

Fixed split shuffling.
----------------------

Key: MAPREDUCE-6245
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6245
Project: Hadoop Map/Reduce
Issue Type: Bug
Affects Versions: 2.6.0
Reporter: lbkzman
Assignee: lbkzman

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14307434#comment-14307434 ]

Allen Wittenauer commented on MAPREDUCE-6059:
---------------------------------------------

It wasn't committed to branch-2 because I generally don't.

Speed up history server startup time
------------------------------------

Key: MAPREDUCE-6059
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059
Project: Hadoop Map/Reduce
Issue Type: Improvement
Affects Versions: 2.4.0
Reporter: Siqi Li
Assignee: Siqi Li
Fix For: 3.0.0
Attachments: YARN-2366.v1.patch

When the history server starts up, it scans every history directory and puts all history files into a cache, even though the cache only stores the 20K most recent history files. It therefore wastes a large portion of the startup time loading old history files into the cache, and the startup time will keep increasing if we don't trim the number of history files. For example, when the history server started up with 2.5M history files in HDFS, it took ~5 minutes.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14307452#comment-14307452 ]

Jason Lowe commented on MAPREDUCE-6059:
---------------------------------------

If you have no objections, I'd like to commit this to branch-2 as well. I'd like to keep the trunk and branch-2 lines as reasonably close as we can to minimize the pain of maintaining the two lines.

Speed up history server startup time
------------------------------------

Key: MAPREDUCE-6059
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059
Project: Hadoop Map/Reduce
Issue Type: Improvement
Affects Versions: 2.4.0
Reporter: Siqi Li
Assignee: Siqi Li
Fix For: 3.0.0
Attachments: YARN-2366.v1.patch

When the history server starts up, it scans every history directory and puts all history files into a cache, even though the cache only stores the 20K most recent history files. It therefore wastes a large portion of the startup time loading old history files into the cache, and the startup time will keep increasing if we don't trim the number of history files. For example, when the history server started up with 2.5M history files in HDFS, it took ~5 minutes.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14307471#comment-14307471 ]

Allen Wittenauer commented on MAPREDUCE-6059:
---------------------------------------------

No objection from me if you want to be Sisyphus. :)

Speed up history server startup time
------------------------------------

Key: MAPREDUCE-6059
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059
Project: Hadoop Map/Reduce
Issue Type: Improvement
Affects Versions: 2.4.0
Reporter: Siqi Li
Assignee: Siqi Li
Fix For: 3.0.0
Attachments: YARN-2366.v1.patch

When the history server starts up, it scans every history directory and puts all history files into a cache, even though the cache only stores the 20K most recent history files. It therefore wastes a large portion of the startup time loading old history files into the cache, and the startup time will keep increasing if we don't trim the number of history files. For example, when the history server started up with 2.5M history files in HDFS, it took ~5 minutes.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
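The trimming idea in the issue description, keeping only the newest files rather than loading every old one into the cache, can be sketched in a few lines. This is a standalone illustration, not the JobHistory code; the method name and array-of-timestamps representation are assumptions for the example.

```java
import java.util.Arrays;

public class HistoryScanSketch {
    // Given the modification times of all history files found in a scan,
    // return only the `cacheSize` newest ones (newest first) -- the rest
    // would never fit in the cache, so there is no point loading them.
    public static long[] newestOnly(long[] modTimes, int cacheSize) {
        long[] sorted = modTimes.clone();
        Arrays.sort(sorted); // ascending
        int keep = Math.min(cacheSize, sorted.length);
        long[] newest = new long[keep];
        for (int i = 0; i < keep; i++) {
            newest[i] = sorted[sorted.length - 1 - i]; // take from the top
        }
        return newest;
    }
}
```

At the scale in the report (2.5M files, 20K cache slots), skipping the other ~2.48M loads is where the startup-time savings would come from.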
[jira] [Updated] (MAPREDUCE-5847) Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask
[ https://issues.apache.org/jira/browse/MAPREDUCE-5847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer updated MAPREDUCE-5847:
----------------------------------------
Status: Open  (was: Patch Available)

Cancelling patch as it no longer applies.

Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask
-------------------------------------------------------------------------

Key: MAPREDUCE-5847
URL: https://issues.apache.org/jira/browse/MAPREDUCE-5847
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: mrv1, mrv2, task
Affects Versions: 2.4.0
Reporter: Gera Shegalov
Assignee: Gera Shegalov
Attachments: MAPREDUCE-5847.v01.patch, MAPREDUCE-5847.v02.patch

Both MapTask and ReduceTask carry redundant code to update the BYTES_WRITTEN counter. However, {{Task.updateCounters}} uses file system stats for this.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-207) Computing Input Splits on the MR Cluster
[ https://issues.apache.org/jira/browse/MAPREDUCE-207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer updated MAPREDUCE-207:
---------------------------------------
Status: Open  (was: Patch Available)

Computing Input Splits on the MR Cluster
----------------------------------------

Key: MAPREDUCE-207
URL: https://issues.apache.org/jira/browse/MAPREDUCE-207
Project: Hadoop Map/Reduce
Issue Type: New Feature
Components: applicationmaster, mrv2
Reporter: Philip Zeyliger
Assignee: Gera Shegalov
Attachments: MAPREDUCE-207.patch, MAPREDUCE-207.v02.patch, MAPREDUCE-207.v03.patch, MAPREDUCE-207.v05.patch, MAPREDUCE-207.v06.patch, MAPREDUCE-207.v07.patch

Instead of computing the input splits as part of job submission, Hadoop could have a separate job task type that computes the input splits, therefore allowing that computation to happen on the cluster.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-207) Computing Input Splits on the MR Cluster
[ https://issues.apache.org/jira/browse/MAPREDUCE-207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer updated MAPREDUCE-207:
---------------------------------------
Status: Patch Available  (was: Open)

Computing Input Splits on the MR Cluster
----------------------------------------

Key: MAPREDUCE-207
URL: https://issues.apache.org/jira/browse/MAPREDUCE-207
Project: Hadoop Map/Reduce
Issue Type: New Feature
Components: applicationmaster, mrv2
Reporter: Philip Zeyliger
Assignee: Gera Shegalov
Attachments: MAPREDUCE-207.patch, MAPREDUCE-207.v02.patch, MAPREDUCE-207.v03.patch, MAPREDUCE-207.v05.patch, MAPREDUCE-207.v06.patch, MAPREDUCE-207.v07.patch

Instead of computing the input splits as part of job submission, Hadoop could have a separate job task type that computes the input splits, therefore allowing that computation to happen on the cluster.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-207) Computing Input Splits on the MR Cluster
[ https://issues.apache.org/jira/browse/MAPREDUCE-207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14308003#comment-14308003 ]

Hadoop QA commented on MAPREDUCE-207:
-------------------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12655331/MAPREDUCE-207.v07.patch
against trunk revision e1990ab.

{color:red}-1 patch{color}. The patch command could not apply the patch.

Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5167//console

This message is automatically generated.

Computing Input Splits on the MR Cluster
----------------------------------------

Key: MAPREDUCE-207
URL: https://issues.apache.org/jira/browse/MAPREDUCE-207
Project: Hadoop Map/Reduce
Issue Type: New Feature
Components: applicationmaster, mrv2
Reporter: Philip Zeyliger
Assignee: Gera Shegalov
Attachments: MAPREDUCE-207.patch, MAPREDUCE-207.v02.patch, MAPREDUCE-207.v03.patch, MAPREDUCE-207.v05.patch, MAPREDUCE-207.v06.patch, MAPREDUCE-207.v07.patch

Instead of computing the input splits as part of job submission, Hadoop could have a separate job task type that computes the input splits, therefore allowing that computation to happen on the cluster.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-5044) Have AM trigger jstack on task attempts that timeout before killing them
[ https://issues.apache.org/jira/browse/MAPREDUCE-5044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer updated MAPREDUCE-5044:
----------------------------------------
Status: Open  (was: Patch Available)

Have AM trigger jstack on task attempts that timeout before killing them
------------------------------------------------------------------------

Key: MAPREDUCE-5044
URL: https://issues.apache.org/jira/browse/MAPREDUCE-5044
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: mr-am
Affects Versions: 2.1.0-beta
Reporter: Jason Lowe
Assignee: Gera Shegalov
Attachments: MAPREDUCE-5044.v01.patch, MAPREDUCE-5044.v02.patch, MAPREDUCE-5044.v03.patch, MAPREDUCE-5044.v04.patch, MAPREDUCE-5044.v05.patch, MAPREDUCE-5044.v06.patch, Screen Shot 2013-11-12 at 1.05.32 PM.png, Screen Shot 2013-11-12 at 1.06.04 PM.png

When an AM expires a task attempt, it would be nice if it triggered a jstack output via SIGQUIT before killing the task attempt. This would be invaluable for helping users debug their hung tasks, especially if they do not have shell access to the nodes.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-5044) Have AM trigger jstack on task attempts that timeout before killing them
[ https://issues.apache.org/jira/browse/MAPREDUCE-5044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer updated MAPREDUCE-5044:
----------------------------------------
Status: Patch Available  (was: Open)

Have AM trigger jstack on task attempts that timeout before killing them
------------------------------------------------------------------------

Key: MAPREDUCE-5044
URL: https://issues.apache.org/jira/browse/MAPREDUCE-5044
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: mr-am
Affects Versions: 2.1.0-beta
Reporter: Jason Lowe
Assignee: Gera Shegalov
Attachments: MAPREDUCE-5044.v01.patch, MAPREDUCE-5044.v02.patch, MAPREDUCE-5044.v03.patch, MAPREDUCE-5044.v04.patch, MAPREDUCE-5044.v05.patch, MAPREDUCE-5044.v06.patch, Screen Shot 2013-11-12 at 1.05.32 PM.png, Screen Shot 2013-11-12 at 1.06.04 PM.png

When an AM expires a task attempt, it would be nice if it triggered a jstack output via SIGQUIT before killing the task attempt. This would be invaluable for helping users debug their hung tasks, especially if they do not have shell access to the nodes.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5044) Have AM trigger jstack on task attempts that timeout before killing them
[ https://issues.apache.org/jira/browse/MAPREDUCE-5044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14307791#comment-14307791 ]

Hadoop QA commented on MAPREDUCE-5044:
--------------------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12645521/MAPREDUCE-5044.v06.patch
against trunk revision c4980a2.

{color:red}-1 patch{color}. The patch command could not apply the patch.

Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5164//console

This message is automatically generated.

Have AM trigger jstack on task attempts that timeout before killing them
------------------------------------------------------------------------

Key: MAPREDUCE-5044
URL: https://issues.apache.org/jira/browse/MAPREDUCE-5044
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: mr-am
Affects Versions: 2.1.0-beta
Reporter: Jason Lowe
Assignee: Gera Shegalov
Attachments: MAPREDUCE-5044.v01.patch, MAPREDUCE-5044.v02.patch, MAPREDUCE-5044.v03.patch, MAPREDUCE-5044.v04.patch, MAPREDUCE-5044.v05.patch, MAPREDUCE-5044.v06.patch, Screen Shot 2013-11-12 at 1.05.32 PM.png, Screen Shot 2013-11-12 at 1.06.04 PM.png

When an AM expires a task attempt, it would be nice if it triggered a jstack output via SIGQUIT before killing the task attempt. This would be invaluable for helping users debug their hung tasks, especially if they do not have shell access to the nodes.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-5044) Have AM trigger jstack on task attempts that timeout before killing them
[ https://issues.apache.org/jira/browse/MAPREDUCE-5044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer updated MAPREDUCE-5044:
----------------------------------------
Status: Open  (was: Patch Available)

Cancelling patch as it no longer applies.

Have AM trigger jstack on task attempts that timeout before killing them
------------------------------------------------------------------------

Key: MAPREDUCE-5044
URL: https://issues.apache.org/jira/browse/MAPREDUCE-5044
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: mr-am
Affects Versions: 2.1.0-beta
Reporter: Jason Lowe
Assignee: Gera Shegalov
Attachments: MAPREDUCE-5044.v01.patch, MAPREDUCE-5044.v02.patch, MAPREDUCE-5044.v03.patch, MAPREDUCE-5044.v04.patch, MAPREDUCE-5044.v05.patch, MAPREDUCE-5044.v06.patch, Screen Shot 2013-11-12 at 1.05.32 PM.png, Screen Shot 2013-11-12 at 1.06.04 PM.png

When an AM expires a task attempt, it would be nice if it triggered a jstack output via SIGQUIT before killing the task attempt. This would be invaluable for helping users debug their hung tasks, especially if they do not have shell access to the nodes.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6237) DBRecordReader is not thread safe
[ https://issues.apache.org/jira/browse/MAPREDUCE-6237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307813#comment-14307813 ] Hadoop QA commented on MAPREDUCE-6237: -- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12696811/mapreduce-6237.patch against trunk revision d27439f. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:red}-1 javac{color}. The applied patch generated 1154 javac compiler warnings (more than the trunk's current 1149 warnings). {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:red}-1 findbugs{color}. The patch appears to introduce 13 new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 core tests{color}. The patch passed unit tests in hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core. Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5162//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5162//artifact/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-core.html Javac warnings: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5162//artifact/patchprocess/diffJavacWarnings.txt Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5162//console This message is automatically generated. 
DBRecordReader is not thread safe - Key: MAPREDUCE-6237 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6237 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 2.5.0 Reporter: Kannan Rajah Assignee: Kannan Rajah Attachments: mapreduce-6237.patch, mapreduce-6237.patch, mapreduce-6237.patch DBInputFormat.createDBRecorder is reusing JDBC connections across instances of DBRecordReader. This is not a good idea. We should be creating a separate connection. If performance is a concern, then we should be using connection pooling instead. I looked at DBOutputFormat.getRecordReader. It actually creates a new Connection object for each DBRecordReader. So can we just change DBInputFormat to create a new Connection every time? The connection reuse code was added as part of a connection leak fix in MAPREDUCE-1443. Any reason for caching the connection? We observed this issue in a customer setup where they were reading data from MySQL using Pig. According to the customer, the query returns two records, which causes Pig to create two instances of DBRecordReader. These two instances share the database connection instance. The first DBRecordReader runs and extracts the first record from MySQL just fine, but then closes the shared connection instance. When the second DBRecordReader runs, it tries to execute a query to retrieve the second record on the closed shared connection instance, which fails. If we set mapred.map.tasks to 1, the query succeeds. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
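The failure mode described above can be shown with a toy sketch in plain Java (these are illustrative stand-in classes, not the Hadoop `DBRecordReader`/`DBInputFormat` code): when two readers share one connection, the first reader's cleanup closes the connection out from under the second.

```java
class SharedConnectionDemo {
    // Stand-in for a JDBC connection; real code would use java.sql.Connection.
    static class Conn {
        private boolean closed = false;
        void close() { closed = true; }
        String query() {
            if (closed) throw new IllegalStateException("connection already closed");
            return "row";
        }
    }

    // A reader that closes its connection when done, as a record reader does.
    static class Reader {
        private final Conn conn;
        Reader(Conn conn) { this.conn = conn; }
        String readAndClose() {
            String row = conn.query();
            conn.close();  // harmless only if the connection is ours alone
            return row;
        }
    }

    public static void main(String[] args) {
        // Shared connection: the second reader fails, as in the reported bug.
        Conn shared = new Conn();
        new Reader(shared).readAndClose();
        try {
            new Reader(shared).readAndClose();
        } catch (IllegalStateException e) {
            System.out.println("shared: " + e.getMessage());
        }
        // One connection per reader: both reads succeed.
        System.out.println(new Reader(new Conn()).readAndClose());
        System.out.println(new Reader(new Conn()).readAndClose());
    }
}
```

This is why creating a fresh connection per reader (or borrowing from a pool, so each reader returns rather than destroys its connection) avoids the bug.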
[jira] [Commented] (MAPREDUCE-6243) Fix findbugs warnings in hadoop-rumen
[ https://issues.apache.org/jira/browse/MAPREDUCE-6243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307267#comment-14307267 ] Hudson commented on MAPREDUCE-6243: --- SUCCESS: Integrated in Hadoop-Hdfs-trunk #2027 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/2027/]) MAPREDUCE-6243. Fix findbugs warnings in hadoop-rumen. Contributed by Masatake Iwasaki. (aajisaka: rev 34fe11c987730932f99dec6eb458a22624eb075b) * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/RandomSeedGenerator.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/MapAttempt20LineHistoryEventEmitter.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ParsedConfigFile.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/Hadoop20JHParser.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ReduceAttempt20LineHistoryEventEmitter.java * hadoop-mapreduce-project/CHANGES.txt * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/HadoopLogsAnalyzer.java Fix findbugs warnings in hadoop-rumen - Key: MAPREDUCE-6243 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6243 Project: Hadoop Map/Reduce Issue Type: Bug Components: tools/rumen Affects Versions: 2.6.0 Reporter: Akira AJISAKA Assignee: Masatake Iwasaki Priority: Minor Labels: newbie Fix For: 2.7.0 Attachments: MAPREDUCE-6243.001.patch, MAPREDUCE-6243.002.patch, findbugs.xml There are 7 findbugs warnings in hadoop-rumen modules. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6245) Fixed split shuffling.
[ https://issues.apache.org/jira/browse/MAPREDUCE-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lbkzman updated MAPREDUCE-6245: --- Status: Patch Available (was: Open)
index 72b47f2..8b89782 100644
--- src/java/org/apache/hadoop/mapreduce/lib/partition/InputSampler.java
+++ src/java/org/apache/hadoop/mapreduce/lib/partition/InputSampler.java
@@ -203,12 +203,8 @@ public class InputSampler<K,V> extends Configured implements Tool {
       r.setSeed(seed);
       LOG.debug("seed: " + seed);
       // shuffle splits
-      for (int i = 0; i < splits.size(); ++i) {
-        InputSplit tmp = splits.get(i);
-        int j = r.nextInt(splits.size());
-        splits.set(i, splits.get(j));
-        splits.set(j, tmp);
-      }
+      Collections.shuffle(splits);
+
       // our target rate is in terms of the maximum number of sample splits,
       // but we accept the possibility of sampling additional splits to hit
       // the target sample keyset
Fixed split shuffling. -- Key: MAPREDUCE-6245 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6245 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.6.0 Reporter: lbkzman Assignee: lbkzman -- This message was sent by Atlassian JIRA (v6.3.4#6332)
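One behavioral detail of this patch worth noting: the removed loop used the seeded `Random r` from the surrounding code, while the no-argument `Collections.shuffle(list)` uses its own internal `Random`. The two-argument overload `Collections.shuffle(list, r)` would preserve the seeded, reproducible shuffling. A standalone sketch (the extracted method form is illustrative, not the patch itself):

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

class ShuffleSketch {
    // The loop removed by the patch, extracted into a standalone method.
    static <T> void manualShuffle(List<T> list, Random r) {
        for (int i = 0; i < list.size(); ++i) {
            T tmp = list.get(i);
            int j = r.nextInt(list.size());
            list.set(i, list.get(j));
            list.set(j, tmp);
        }
    }

    public static void main(String[] args) {
        List<Integer> a = new ArrayList<>(List.of(1, 2, 3, 4, 5));
        List<Integer> b = new ArrayList<>(a);
        // Same seed => same permutation: the two-arg overload keeps the
        // reproducibility the original seeded loop had.
        Collections.shuffle(a, new Random(42));
        Collections.shuffle(b, new Random(42));
        System.out.println(a.equals(b));  // true
    }
}
```

Both forms produce a permutation of the same elements; the difference is only whether the shuffle order is controlled by the sampler's seed.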
[jira] [Updated] (MAPREDUCE-6245) Fixed split shuffling.
[ https://issues.apache.org/jira/browse/MAPREDUCE-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lbkzman updated MAPREDUCE-6245: --- Status: Open (was: Patch Available) Fixed split shuffling. -- Key: MAPREDUCE-6245 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6245 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.6.0 Reporter: lbkzman Assignee: lbkzman -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project
[ https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307270#comment-14307270 ] Hudson commented on MAPREDUCE-5988: --- SUCCESS: Integrated in Hadoop-Hdfs-trunk #2027 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/2027/]) MAPREDUCE-5988. Fix dead links to the javadocs in mapreduce project. (aajisaka) (aajisaka: rev cc6bbfceae1cddfae6a3892cb7e7104531a689be) * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/package-info.java * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/counters/package-info.java * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/api/protocolrecords/package-info.java * hadoop-mapreduce-project/CHANGES.txt Fix dead links to the javadocs in mapreduce project --- Key: MAPREDUCE-5988 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988 Project: Hadoop Map/Reduce Issue Type: Bug Components: documentation Affects Versions: 2.4.1 Reporter: Akira AJISAKA Assignee: Akira AJISAKA Priority: Minor Fix For: 2.7.0 Attachments: MAPREDUCE-5988.2.patch, MAPREDUCE-5988.patch In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some classes are listed, but not documented. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307278#comment-14307278 ] Hudson commented on MAPREDUCE-6059: --- SUCCESS: Integrated in Hadoop-Hdfs-trunk #2027 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/2027/]) MAPREDUCE-6059. Speed up history server startup time (Siqi Li via aw) (aw: rev fd57ab2002f97dcc83d455a5e0c770c8efde77a4) * hadoop-mapreduce-project/CHANGES.txt * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/HistoryFileManager.java Speed up history server startup time Key: MAPREDUCE-6059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 2.4.0 Reporter: Siqi Li Assignee: Siqi Li Fix For: 3.0.0 Attachments: YARN-2366.v1.patch When the history server starts up, it scans every history directory and puts all history files into a cache, but this cache only stores the 20K most recent history files. Therefore, a large portion of time is wasted loading old history files into the cache, and the startup time will keep increasing if we don't trim the number of history files. For example, when the history server started up with 2.5M history files in HDFS, it took ~5 minutes. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
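The fix described amounts to not parsing files that can never fit in the bounded cache: since only the 20K most recent files are kept, older ones need not be loaded at startup at all. A minimal sketch of "keep only the N newest by modification time" (the names and signature are illustrative, not HistoryFileManager's actual API):

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

class HistoryTrimSketch {
    // A (modTime, path) pair standing in for a scanned history file.
    record HistFile(long modTime, String path) {}

    // Return only the n most recently modified files, so older ones are
    // never parsed or cached during startup.
    static List<HistFile> newestN(List<HistFile> files, int n) {
        List<HistFile> sorted = new ArrayList<>(files);
        sorted.sort(Comparator.comparingLong(HistFile::modTime).reversed());
        return sorted.subList(0, Math.min(n, sorted.size()));
    }

    public static void main(String[] args) {
        List<HistFile> all = List.of(
            new HistFile(100, "job_1"), new HistFile(300, "job_3"),
            new HistFile(200, "job_2"));
        // Keeps job_3 and job_2, the two newest by modification time.
        System.out.println(newestN(all, 2));
    }
}
```

With 2.5M files in HDFS, sorting the directory listing is cheap compared with opening and parsing millions of history files that would be evicted anyway.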
[jira] [Created] (MAPREDUCE-6245) Fixed split shuffling.
lbkzman created MAPREDUCE-6245: -- Summary: Fixed split shuffling. Key: MAPREDUCE-6245 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6245 Project: Hadoop Map/Reduce Issue Type: Bug Reporter: lbkzman -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307309#comment-14307309 ] Hudson commented on MAPREDUCE-6059: --- SUCCESS: Integrated in Hadoop-Hdfs-trunk-Java8 #92 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/92/]) MAPREDUCE-6059. Speed up history server startup time (Siqi Li via aw) (aw: rev fd57ab2002f97dcc83d455a5e0c770c8efde77a4) * hadoop-mapreduce-project/CHANGES.txt * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/HistoryFileManager.java Speed up history server startup time Key: MAPREDUCE-6059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 2.4.0 Reporter: Siqi Li Assignee: Siqi Li Fix For: 3.0.0 Attachments: YARN-2366.v1.patch When history server starts up, It scans every history directories and put all history files into a cache, whereas this cache only stores 20K recent history files. Therefore, it is wasting a large portion of time loading old history files into the cache, and the startup time will keep increasing if we don't trim the number of history files. For example, when history server starts up with 2.5M history files in HDFS, it took ~5 minutes. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project
[ https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307301#comment-14307301 ] Hudson commented on MAPREDUCE-5988: --- SUCCESS: Integrated in Hadoop-Hdfs-trunk-Java8 #92 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/92/]) MAPREDUCE-5988. Fix dead links to the javadocs in mapreduce project. (aajisaka) (aajisaka: rev cc6bbfceae1cddfae6a3892cb7e7104531a689be) * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/api/protocolrecords/package-info.java * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/package-info.java * hadoop-mapreduce-project/CHANGES.txt * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/counters/package-info.java Fix dead links to the javadocs in mapreduce project --- Key: MAPREDUCE-5988 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988 Project: Hadoop Map/Reduce Issue Type: Bug Components: documentation Affects Versions: 2.4.1 Reporter: Akira AJISAKA Assignee: Akira AJISAKA Priority: Minor Fix For: 2.7.0 Attachments: MAPREDUCE-5988.2.patch, MAPREDUCE-5988.patch In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some classes are listed, but not documented. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6243) Fix findbugs warnings in hadoop-rumen
[ https://issues.apache.org/jira/browse/MAPREDUCE-6243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307298#comment-14307298 ] Hudson commented on MAPREDUCE-6243: --- SUCCESS: Integrated in Hadoop-Hdfs-trunk-Java8 #92 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/92/]) MAPREDUCE-6243. Fix findbugs warnings in hadoop-rumen. Contributed by Masatake Iwasaki. (aajisaka: rev 34fe11c987730932f99dec6eb458a22624eb075b) * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/Hadoop20JHParser.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/RandomSeedGenerator.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ReduceAttempt20LineHistoryEventEmitter.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/HadoopLogsAnalyzer.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ParsedConfigFile.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/MapAttempt20LineHistoryEventEmitter.java * hadoop-mapreduce-project/CHANGES.txt Fix findbugs warnings in hadoop-rumen - Key: MAPREDUCE-6243 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6243 Project: Hadoop Map/Reduce Issue Type: Bug Components: tools/rumen Affects Versions: 2.6.0 Reporter: Akira AJISAKA Assignee: Masatake Iwasaki Priority: Minor Labels: newbie Fix For: 2.7.0 Attachments: MAPREDUCE-6243.001.patch, MAPREDUCE-6243.002.patch, findbugs.xml There are 7 findbugs warnings in hadoop-rumen modules. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6245) Fixed split shuffling.
[ https://issues.apache.org/jira/browse/MAPREDUCE-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lbkzman updated MAPREDUCE-6245: --- Affects Version/s: 2.6.0 Release Note:
index 72b47f2..8b89782 100644
--- src/java/org/apache/hadoop/mapreduce/lib/partition/InputSampler.java
+++ src/java/org/apache/hadoop/mapreduce/lib/partition/InputSampler.java
@@ -203,12 +203,8 @@ public class InputSampler<K,V> extends Configured implements Tool {
       r.setSeed(seed);
       LOG.debug("seed: " + seed);
       // shuffle splits
-      for (int i = 0; i < splits.size(); ++i) {
-        InputSplit tmp = splits.get(i);
-        int j = r.nextInt(splits.size());
-        splits.set(i, splits.get(j));
-        splits.set(j, tmp);
-      }
+      Collections.shuffle(splits);
+
       // our target rate is in terms of the maximum number of sample splits,
       // but we accept the possibility of sampling additional splits to hit
       // the target sample keyset
Status: Patch Available (was: Open) Fixed split shuffling. -- Key: MAPREDUCE-6245 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6245 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.6.0 Reporter: lbkzman -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6245) Fixed split shuffling.
[ https://issues.apache.org/jira/browse/MAPREDUCE-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lbkzman updated MAPREDUCE-6245: --- Assignee: lbkzman Target Version/s: 2.6.0 Status: Open (was: Patch Available) Fixed split shuffling. -- Key: MAPREDUCE-6245 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6245 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.6.0 Reporter: lbkzman Assignee: lbkzman -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6245) Fixed split shuffling.
[ https://issues.apache.org/jira/browse/MAPREDUCE-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lbkzman updated MAPREDUCE-6245: --- Release Note: (was:
index 72b47f2..8b89782 100644
--- src/java/org/apache/hadoop/mapreduce/lib/partition/InputSampler.java
+++ src/java/org/apache/hadoop/mapreduce/lib/partition/InputSampler.java
@@ -203,12 +203,8 @@ public class InputSampler<K,V> extends Configured implements Tool {
       r.setSeed(seed);
       LOG.debug("seed: " + seed);
       // shuffle splits
-      for (int i = 0; i < splits.size(); ++i) {
-        InputSplit tmp = splits.get(i);
-        int j = r.nextInt(splits.size());
-        splits.set(i, splits.get(j));
-        splits.set(j, tmp);
-      }
+      Collections.shuffle(splits);
+
       // our target rate is in terms of the maximum number of sample splits,
       // but we accept the possibility of sampling additional splits to hit
       // the target sample keyset
) Fixed split shuffling. -- Key: MAPREDUCE-6245 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6245 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.6.0 Reporter: lbkzman Assignee: lbkzman -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6245) Fixed split shuffling.
[ https://issues.apache.org/jira/browse/MAPREDUCE-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lbkzman updated MAPREDUCE-6245: --- Status: Patch Available (was: Open) Fixed split shuffling. -- Key: MAPREDUCE-6245 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6245 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.6.0 Reporter: lbkzman Assignee: lbkzman -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6243) Fix findbugs warnings in hadoop-rumen
[ https://issues.apache.org/jira/browse/MAPREDUCE-6243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307344#comment-14307344 ] Hudson commented on MAPREDUCE-6243: --- FAILURE: Integrated in Hadoop-Mapreduce-trunk-Java8 #96 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Java8/96/]) MAPREDUCE-6243. Fix findbugs warnings in hadoop-rumen. Contributed by Masatake Iwasaki. (aajisaka: rev 34fe11c987730932f99dec6eb458a22624eb075b) * hadoop-mapreduce-project/CHANGES.txt * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ParsedConfigFile.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/MapAttempt20LineHistoryEventEmitter.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/Hadoop20JHParser.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ReduceAttempt20LineHistoryEventEmitter.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/HadoopLogsAnalyzer.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/RandomSeedGenerator.java Fix findbugs warnings in hadoop-rumen - Key: MAPREDUCE-6243 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6243 Project: Hadoop Map/Reduce Issue Type: Bug Components: tools/rumen Affects Versions: 2.6.0 Reporter: Akira AJISAKA Assignee: Masatake Iwasaki Priority: Minor Labels: newbie Fix For: 2.7.0 Attachments: MAPREDUCE-6243.001.patch, MAPREDUCE-6243.002.patch, findbugs.xml There are 7 findbugs warnings in hadoop-rumen modules. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project
[ https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307347#comment-14307347 ] Hudson commented on MAPREDUCE-5988: --- FAILURE: Integrated in Hadoop-Mapreduce-trunk-Java8 #96 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Java8/96/]) MAPREDUCE-5988. Fix dead links to the javadocs in mapreduce project. (aajisaka) (aajisaka: rev cc6bbfceae1cddfae6a3892cb7e7104531a689be) * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/api/protocolrecords/package-info.java * hadoop-mapreduce-project/CHANGES.txt * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/package-info.java * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/counters/package-info.java Fix dead links to the javadocs in mapreduce project --- Key: MAPREDUCE-5988 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988 Project: Hadoop Map/Reduce Issue Type: Bug Components: documentation Affects Versions: 2.4.1 Reporter: Akira AJISAKA Assignee: Akira AJISAKA Priority: Minor Fix For: 2.7.0 Attachments: MAPREDUCE-5988.2.patch, MAPREDUCE-5988.patch In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some classes are listed, but not documented. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307356#comment-14307356 ] Hudson commented on MAPREDUCE-6059: --- FAILURE: Integrated in Hadoop-Mapreduce-trunk-Java8 #96 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Java8/96/]) MAPREDUCE-6059. Speed up history server startup time (Siqi Li via aw) (aw: rev fd57ab2002f97dcc83d455a5e0c770c8efde77a4) * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/HistoryFileManager.java * hadoop-mapreduce-project/CHANGES.txt Speed up history server startup time Key: MAPREDUCE-6059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 2.4.0 Reporter: Siqi Li Assignee: Siqi Li Fix For: 3.0.0 Attachments: YARN-2366.v1.patch When history server starts up, It scans every history directories and put all history files into a cache, whereas this cache only stores 20K recent history files. Therefore, it is wasting a large portion of time loading old history files into the cache, and the startup time will keep increasing if we don't trim the number of history files. For example, when history server starts up with 2.5M history files in HDFS, it took ~5 minutes. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project
[ https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307388#comment-14307388 ] Hudson commented on MAPREDUCE-5988: --- FAILURE: Integrated in Hadoop-Mapreduce-trunk #2046 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2046/]) MAPREDUCE-5988. Fix dead links to the javadocs in mapreduce project. (aajisaka) (aajisaka: rev cc6bbfceae1cddfae6a3892cb7e7104531a689be) * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/counters/package-info.java * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/package-info.java * hadoop-mapreduce-project/CHANGES.txt * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/api/protocolrecords/package-info.java Fix dead links to the javadocs in mapreduce project --- Key: MAPREDUCE-5988 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988 Project: Hadoop Map/Reduce Issue Type: Bug Components: documentation Affects Versions: 2.4.1 Reporter: Akira AJISAKA Assignee: Akira AJISAKA Priority: Minor Fix For: 2.7.0 Attachments: MAPREDUCE-5988.2.patch, MAPREDUCE-5988.patch In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some classes are listed, but not documented. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307397#comment-14307397 ] Hudson commented on MAPREDUCE-6059: --- FAILURE: Integrated in Hadoop-Mapreduce-trunk #2046 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2046/]) MAPREDUCE-6059. Speed up history server startup time (Siqi Li via aw) (aw: rev fd57ab2002f97dcc83d455a5e0c770c8efde77a4) * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/HistoryFileManager.java * hadoop-mapreduce-project/CHANGES.txt Speed up history server startup time Key: MAPREDUCE-6059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 2.4.0 Reporter: Siqi Li Assignee: Siqi Li Fix For: 3.0.0 Attachments: YARN-2366.v1.patch When history server starts up, It scans every history directories and put all history files into a cache, whereas this cache only stores 20K recent history files. Therefore, it is wasting a large portion of time loading old history files into the cache, and the startup time will keep increasing if we don't trim the number of history files. For example, when history server starts up with 2.5M history files in HDFS, it took ~5 minutes. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6243) Fix findbugs warnings in hadoop-rumen
[ https://issues.apache.org/jira/browse/MAPREDUCE-6243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307385#comment-14307385 ] Hudson commented on MAPREDUCE-6243: --- FAILURE: Integrated in Hadoop-Mapreduce-trunk #2046 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2046/]) MAPREDUCE-6243. Fix findbugs warnings in hadoop-rumen. Contributed by Masatake Iwasaki. (aajisaka: rev 34fe11c987730932f99dec6eb458a22624eb075b) * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ReduceAttempt20LineHistoryEventEmitter.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/RandomSeedGenerator.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/HadoopLogsAnalyzer.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/Hadoop20JHParser.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/MapAttempt20LineHistoryEventEmitter.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ParsedConfigFile.java * hadoop-mapreduce-project/CHANGES.txt Fix findbugs warnings in hadoop-rumen - Key: MAPREDUCE-6243 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6243 Project: Hadoop Map/Reduce Issue Type: Bug Components: tools/rumen Affects Versions: 2.6.0 Reporter: Akira AJISAKA Assignee: Masatake Iwasaki Priority: Minor Labels: newbie Fix For: 2.7.0 Attachments: MAPREDUCE-6243.001.patch, MAPREDUCE-6243.002.patch, findbugs.xml There are 7 findbugs warnings in hadoop-rumen modules. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307293#comment-14307293 ] Jason Lowe commented on MAPREDUCE-6059: --- Any reason this should not be committed to branch-2? Most patches are committed there, so I'm curious about the criteria wrt. this patch. Speed up history server startup time Key: MAPREDUCE-6059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 2.4.0 Reporter: Siqi Li Assignee: Siqi Li Fix For: 3.0.0 Attachments: YARN-2366.v1.patch When history server starts up, It scans every history directories and put all history files into a cache, whereas this cache only stores 20K recent history files. Therefore, it is wasting a large portion of time loading old history files into the cache, and the startup time will keep increasing if we don't trim the number of history files. For example, when history server starts up with 2.5M history files in HDFS, it took ~5 minutes. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6242) Progress report log is incredibly excessive in application master
[ https://issues.apache.org/jira/browse/MAPREDUCE-6242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308060#comment-14308060 ] Jian Fang commented on MAPREDUCE-6242: -- Thanks for your quick fix. Progress report log is incredibly excessive in application master - Key: MAPREDUCE-6242 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6242 Project: Hadoop Map/Reduce Issue Type: Bug Components: applicationmaster Affects Versions: 2.4.0 Reporter: Jian Fang Assignee: Varun Saxena Attachments: MAPREDUCE-6242.001.patch We saw incredibly excessive logs in application master for a long running one with many task attempts. The log write rate is around 1MB/sec in some cases. Most of the log entries were from the progress report such as the following ones. 2015-02-03 17:46:14,321 INFO [IPC Server handler 56 on 37661] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1422985365246_0001_m_00_0 is : 0.15605757 2015-02-03 17:46:17,581 INFO [IPC Server handler 2 on 37661] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1422985365246_0001_m_00_0 is : 0.4108217 2015-02-03 17:46:20,426 INFO [IPC Server handler 0 on 37661] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1422985365246_0001_m_02_0 is : 0.06634143 2015-02-03 17:46:20,807 INFO [IPC Server handler 4 on 37661] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1422985365246_0001_m_00_0 is : 0.6506 2015-02-03 17:46:21,013 INFO [IPC Server handler 6 on 37661] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1422985365246_0001_m_01_0 is : 0.21723115 Looks like the report interval is controlled by a hard-coded variable PROGRESS_INTERVAL as 3 seconds in class org.apache.hadoop.mapred.Task. We should allow users to set the appropriate progress interval for their applications. 
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
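The change the reporter asks for is straightforward: read the report interval from configuration, with the current hard-coded 3 seconds as the default. A sketch under stated assumptions (the property name is hypothetical, and a plain `Properties` stands in here for Hadoop's `Configuration`):

```java
import java.util.Properties;

class ProgressIntervalSketch {
    // Hypothetical key; an eventual patch may choose a different name.
    static final String PROGRESS_INTERVAL_KEY =
        "mapreduce.task.progress-report.interval";
    // The value hard-coded today as PROGRESS_INTERVAL in o.a.h.mapred.Task.
    static final long DEFAULT_PROGRESS_INTERVAL_MS = 3000L;

    // Return the configured interval, falling back to the current default.
    static long progressIntervalMs(Properties conf) {
        String v = conf.getProperty(PROGRESS_INTERVAL_KEY);
        return v == null ? DEFAULT_PROGRESS_INTERVAL_MS : Long.parseLong(v);
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        System.out.println(progressIntervalMs(conf));   // 3000
        conf.setProperty(PROGRESS_INTERVAL_KEY, "30000");
        System.out.println(progressIntervalMs(conf));   // 30000
    }
}
```

Raising the interval to, say, 30 seconds for long-running jobs would cut the TaskAttemptListenerImpl log volume roughly tenfold without losing progress visibility.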
[jira] [Commented] (MAPREDUCE-6233) org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk
[ https://issues.apache.org/jira/browse/MAPREDUCE-6233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308094#comment-14308094 ] Robert Kanter commented on MAPREDUCE-6233: -- +1 org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk --- Key: MAPREDUCE-6233 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6233 Project: Hadoop Map/Reduce Issue Type: Bug Components: test Reporter: Yongjun Zhang Assignee: zhihai xu Attachments: MAPREDUCE-6233.000.patch https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2039/ {code} Stack Trace: java.lang.AssertionError: Large sort failed for 128 expected:0 but was:1 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort(TestLargeSort.java:61) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5847) Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask
[ https://issues.apache.org/jira/browse/MAPREDUCE-5847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308116#comment-14308116 ] Allen Wittenauer commented on MAPREDUCE-5847: - Incompatible changes can go into trunk. Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask -- Key: MAPREDUCE-5847 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5847 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv1, mrv2, task Affects Versions: 2.4.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5847.v01.patch, MAPREDUCE-5847.v02.patch Both MapTask and ReduceTask carry redundant code to update BYTES_WRITTEN counter. However, {{Task.updateCounters}} uses file system stats for this. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5847) Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask
[ https://issues.apache.org/jira/browse/MAPREDUCE-5847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308129#comment-14308129 ] Jason Lowe commented on MAPREDUCE-5847: --- bq. Incompatible changes can go into trunk. Understood, but I'm arguing we shouldn't break incompatibility without sufficient merit. Each incompatibility instance is a hurdle someone needs to jump to move from Hadoop 2.x to Hadoop 3.x. Hence I'm wondering if others feel this is worth adding another hurdle or not. Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask -- Key: MAPREDUCE-5847 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5847 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv1, mrv2, task Affects Versions: 2.4.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5847.v01.patch, MAPREDUCE-5847.v02.patch Both MapTask and ReduceTask carry redundant code to update BYTES_WRITTEN counter. However, {{Task.updateCounters}} uses file system stats for this. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5847) Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask
[ https://issues.apache.org/jira/browse/MAPREDUCE-5847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308140#comment-14308140 ] Allen Wittenauer commented on MAPREDUCE-5847: - This seems like such a low risk and, as it is today, aren't we actually reporting wrong information? That's significantly worse! (I know of one vendor that is actually mentions that they report correct values for some metrics since we blow it so badly in lots of places...) While I understand the concerns about moving from 2.x to 3.x, users should expect some degree of pain when moving major versions. Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask -- Key: MAPREDUCE-5847 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5847 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv1, mrv2, task Affects Versions: 2.4.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5847.v01.patch, MAPREDUCE-5847.v02.patch Both MapTask and ReduceTask carry redundant code to update BYTES_WRITTEN counter. However, {{Task.updateCounters}} uses file system stats for this. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6233) org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk
[ https://issues.apache.org/jira/browse/MAPREDUCE-6233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated MAPREDUCE-6233: - Resolution: Fixed Fix Version/s: 2.7.0 Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Thanks Zhihai. Committed to trunk and branch-2! org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk --- Key: MAPREDUCE-6233 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6233 Project: Hadoop Map/Reduce Issue Type: Bug Components: test Reporter: Yongjun Zhang Assignee: zhihai xu Fix For: 2.7.0 Attachments: MAPREDUCE-6233.000.patch https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2039/ {code} Stack Trace: java.lang.AssertionError: Large sort failed for 128 expected:0 but was:1 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort(TestLargeSort.java:61) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5847) Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask
[ https://issues.apache.org/jira/browse/MAPREDUCE-5847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308171#comment-14308171 ] Jason Lowe commented on MAPREDUCE-5847: --- If the counters are wrong then that's a separate JIRA that I think would be very well worth fixing in 2.x. However IIUC this isn't about fixing incorrect counter values, rather it's about removing counters. I can see the value of storing the separate counters, since they are not exactly equivalent. One of them records the amount of bytes written to the filesystem overall during the life of the task, while the other records the amount of data written to the filesystem during the output collector's write method. For many jobs these will be the same values, however if the task was doing out-of-band I/O with the filesystems outside of the output collector write method then they will not be equivalent. Comparing these counters could be used to audit tasks that aren't writing data through the normal framework channels. Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask -- Key: MAPREDUCE-5847 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5847 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv1, mrv2, task Affects Versions: 2.4.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5847.v01.patch, MAPREDUCE-5847.v02.patch Both MapTask and ReduceTask carry redundant code to update BYTES_WRITTEN counter. However, {{Task.updateCounters}} uses file system stats for this. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
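The audit Jason Lowe describes, comparing the filesystem-level bytes-written statistic against the bytes that went through the output collector, can be sketched as a simple check: any positive difference is I/O the task did outside the normal framework write path. The class and method names below are hypothetical illustrations, not part of Hadoop's counter API.

```java
// Hypothetical sketch of the counter comparison described above: if the
// bytes the task wrote to the filesystem overall exceed the bytes written
// via the output collector, the difference is out-of-band I/O.
public class CounterAudit {
    /** Bytes written outside the normal output-collector path. */
    public static long outOfBandBytes(long fsBytesWritten, long collectorBytesWritten) {
        return Math.max(0L, fsBytesWritten - collectorBytesWritten);
    }

    public static boolean wroteOutOfBand(long fsBytesWritten, long collectorBytesWritten) {
        return outOfBandBytes(fsBytesWritten, collectorBytesWritten) > 0L;
    }
}
```

For most jobs the two values match; a nonzero difference flags tasks writing through side channels, which is the diagnostic value that argues for keeping both counters.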
[jira] [Updated] (MAPREDUCE-5965) Hadoop streaming throws error if list of input files is high. Error is: error=7, Argument list too long at if number of input file is high
[ https://issues.apache.org/jira/browse/MAPREDUCE-5965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arup Malakar updated MAPREDUCE-5965: Attachment: MAPREDUCE-5965.1.patch Reattaching updated patch. Hadoop streaming throws error if list of input files is high. Error is: error=7, Argument list too long at if number of input file is high Key: MAPREDUCE-5965 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5965 Project: Hadoop Map/Reduce Issue Type: Bug Reporter: Arup Malakar Assignee: Arup Malakar Attachments: MAPREDUCE-5965.1.patch, MAPREDUCE-5965.patch Hadoop streaming exposes all the key values in job conf as environment variables when it forks a process for streaming code to run. Unfortunately the variable mapreduce_input_fileinputformat_inputdir contains the list of input files, and Linux has a limit on size of environment variables + arguments. Based on how long the list of files and their full path is this could be pretty huge. And given all of these variables are not even used it stops user from running hadoop job with large number of files, even though it could be run. Linux throws E2BIG if the size is greater than certain size which is error code 7. And java translates that to error=7, Argument list too long. More: http://man7.org/linux/man-pages/man2/execve.2.html I suggest skipping variables if it is greater than certain length. That way if user code requires the environment variable it would fail. It should also introduce a config variable to skip long variables, and set it to false by default. That way user has to specifically set it to true to invoke this feature. 
Here is the exception:
{code}
Error: java.lang.RuntimeException: Error in configuring object
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
    at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
    at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
    at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
    at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
    at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
Caused by: java.lang.reflect.InvocationTargetException
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
    ... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
    at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
    at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
    at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
    ... 14 more
Caused by: java.lang.reflect.InvocationTargetException
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
    ... 17 more
Caused by: java.lang.RuntimeException: configuration exception
    at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:222)
    at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
    ... 22 more
Caused by: java.io.IOException: Cannot run program /data/hadoop/hadoop-yarn/cache/yarn/nm-local-dir/usercache/oo-analytics/appcache/application_1403599726264_13177/container_1403599726264_13177_01_06/./rbenv_runner.sh: error=7, Argument list too long
    at java.lang.ProcessBuilder.start(ProcessBuilder.java:1041)
    at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:209)
    ... 23 more
Caused by: java.io.IOException: error=7, Argument list too long
    at java.lang.UNIXProcess.forkAndExec(Native Method)
    at java.lang.UNIXProcess.<init>(UNIXProcess.java:135)
    at java.lang.ProcessImpl.start(ProcessImpl.java:130)
    at java.lang.ProcessBuilder.start(ProcessBuilder.java:1022)
    ... 24 more
Container killed by the ApplicationMaster.
{code}
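The MAPREDUCE-5965 proposal (skip job-conf entries over a length threshold before exporting them as environment variables, behind an opt-in flag) could look roughly like the sketch below. The class name, method signature, and the 4096-character limit used in the test are hypothetical, illustrating the idea rather than the actual patch.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: when the opt-in flag is set, drop entries whose
// values exceed a length limit before they become environment variables,
// so the forked streaming process stays under the kernel's E2BIG limit.
public class EnvFilter {
    public static Map<String, String> filter(Map<String, String> env,
                                             boolean skipLongValues,
                                             int maxValueLength) {
        if (!skipLongValues) {
            return env;  // default behavior: export everything, as today
        }
        Map<String, String> out = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : env.entrySet()) {
            if (e.getValue().length() <= maxValueLength) {
                out.put(e.getKey(), e.getValue());
            }
        }
        return out;
    }
}
```

Keeping the flag off by default matches the description: user code that depends on a long variable keeps failing loudly unless the user explicitly opts in to the filtering.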
[jira] [Commented] (MAPREDUCE-5847) Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask
[ https://issues.apache.org/jira/browse/MAPREDUCE-5847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308114#comment-14308114 ] Jason Lowe commented on MAPREDUCE-5847: --- Looks like the patch still applies, but I'm not sure this should go in per the incompatibility concerns I raised earlier. I don't think the benefits of this change are worth that cost, even if this just goes into trunk. Thoughts? Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask -- Key: MAPREDUCE-5847 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5847 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv1, mrv2, task Affects Versions: 2.4.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5847.v01.patch, MAPREDUCE-5847.v02.patch Both MapTask and ReduceTask carry redundant code to update BYTES_WRITTEN counter. However, {{Task.updateCounters}} uses file system stats for this. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6233) org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk
[ https://issues.apache.org/jira/browse/MAPREDUCE-6233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308159#comment-14308159 ] Hudson commented on MAPREDUCE-6233: --- FAILURE: Integrated in Hadoop-trunk-Commit #7028 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/7028/]) MAPREDUCE-6233. org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk (zxu via rkanter) (rkanter: rev e2ee2ff7d7ca429487d7e3883daedffbb269ebd4) * hadoop-mapreduce-project/CHANGES.txt * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/TestLargeSort.java org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk --- Key: MAPREDUCE-6233 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6233 Project: Hadoop Map/Reduce Issue Type: Bug Components: test Reporter: Yongjun Zhang Assignee: zhihai xu Fix For: 2.7.0 Attachments: MAPREDUCE-6233.000.patch https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2039/ {code} Stack Trace: java.lang.AssertionError: Large sort failed for 128 expected:0 but was:1 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort(TestLargeSort.java:61) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-5965) Hadoop streaming throws error if list of input files is high. Error is: error=7, Argument list too long at if number of input file is high
[ https://issues.apache.org/jira/browse/MAPREDUCE-5965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arup Malakar updated MAPREDUCE-5965: Status: Patch Available (was: Open) Hadoop streaming throws error if list of input files is high. Error is: error=7, Argument list too long at if number of input file is high Key: MAPREDUCE-5965 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5965 Project: Hadoop Map/Reduce Issue Type: Bug Reporter: Arup Malakar Assignee: Arup Malakar Attachments: MAPREDUCE-5965.1.patch, MAPREDUCE-5965.patch Hadoop streaming exposes all the key values in job conf as environment variables when it forks a process for streaming code to run. Unfortunately the variable mapreduce_input_fileinputformat_inputdir contains the list of input files, and Linux has a limit on size of environment variables + arguments. Based on how long the list of files and their full path is this could be pretty huge. And given all of these variables are not even used it stops user from running hadoop job with large number of files, even though it could be run. Linux throws E2BIG if the size is greater than certain size which is error code 7. And java translates that to error=7, Argument list too long. More: http://man7.org/linux/man-pages/man2/execve.2.html I suggest skipping variables if it is greater than certain length. That way if user code requires the environment variable it would fail. It should also introduce a config variable to skip long variables, and set it to false by default. That way user has to specifically set it to true to invoke this feature. 
Here is the exception:
{code}
Error: java.lang.RuntimeException: Error in configuring object
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
    at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
    at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
    at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
    at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
    at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
Caused by: java.lang.reflect.InvocationTargetException
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
    ... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
    at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
    at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
    at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
    ... 14 more
Caused by: java.lang.reflect.InvocationTargetException
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
    ... 17 more
Caused by: java.lang.RuntimeException: configuration exception
    at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:222)
    at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
    ... 22 more
Caused by: java.io.IOException: Cannot run program /data/hadoop/hadoop-yarn/cache/yarn/nm-local-dir/usercache/oo-analytics/appcache/application_1403599726264_13177/container_1403599726264_13177_01_06/./rbenv_runner.sh: error=7, Argument list too long
    at java.lang.ProcessBuilder.start(ProcessBuilder.java:1041)
    at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:209)
    ... 23 more
Caused by: java.io.IOException: error=7, Argument list too long
    at java.lang.UNIXProcess.forkAndExec(Native Method)
    at java.lang.UNIXProcess.<init>(UNIXProcess.java:135)
    at java.lang.ProcessImpl.start(ProcessImpl.java:130)
    at java.lang.ProcessBuilder.start(ProcessBuilder.java:1022)
    ... 24 more
Container killed by the ApplicationMaster.
{code}
[jira] [Commented] (MAPREDUCE-5965) Hadoop streaming throws error if list of input files is high. Error is: error=7, Argument list too long at if number of input file is high
[ https://issues.apache.org/jira/browse/MAPREDUCE-5965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308092#comment-14308092 ] Hadoop QA commented on MAPREDUCE-5965: -- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12696883/MAPREDUCE-5965.1.patch against trunk revision e1990ab. {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5168//console This message is automatically generated. Hadoop streaming throws error if list of input files is high. Error is: error=7, Argument list too long at if number of input file is high Key: MAPREDUCE-5965 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5965 Project: Hadoop Map/Reduce Issue Type: Bug Reporter: Arup Malakar Assignee: Arup Malakar Attachments: MAPREDUCE-5965.1.patch, MAPREDUCE-5965.patch Hadoop streaming exposes all the key values in job conf as environment variables when it forks a process for streaming code to run. Unfortunately the variable mapreduce_input_fileinputformat_inputdir contains the list of input files, and Linux has a limit on size of environment variables + arguments. Based on how long the list of files and their full path is this could be pretty huge. And given all of these variables are not even used it stops user from running hadoop job with large number of files, even though it could be run. Linux throws E2BIG if the size is greater than certain size which is error code 7. And java translates that to error=7, Argument list too long. More: http://man7.org/linux/man-pages/man2/execve.2.html I suggest skipping variables if it is greater than certain length. That way if user code requires the environment variable it would fail. It should also introduce a config variable to skip long variables, and set it to false by default. 
That way user has to specifically set it to true to invoke this feature. Here is the exception:
{code}
Error: java.lang.RuntimeException: Error in configuring object
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
    at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
    at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
    at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
    at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
    at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
Caused by: java.lang.reflect.InvocationTargetException
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
    ... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
    at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
    at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
    at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
    ... 14 more
Caused by: java.lang.reflect.InvocationTargetException
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
    ... 17 more
Caused by: java.lang.RuntimeException: configuration exception
    at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:222)
    at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
    ... 22 more
Caused by: java.io.IOException: Cannot run program /data/hadoop/hadoop-yarn/cache/yarn/nm-local-dir/usercache/oo-analytics/appcache/application_1403599726264_13177/container_1403599726264_13177_01_06/./rbenv_runner.sh: error=7, Argument list too long
    at java.lang.ProcessBuilder.start(ProcessBuilder.java:1041)
    at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:209)
    ... 23 more
Caused by: java.io.IOException: error=7, Argument list too long
    at java.lang.UNIXProcess.forkAndExec(Native Method)
    at java.lang.UNIXProcess.<init>(UNIXProcess.java:135)
    at java.lang.ProcessImpl.start(ProcessImpl.java:130)
    at java.lang.ProcessBuilder.start(ProcessBuilder.java:1022)
    ... 24 more
Container killed by the ApplicationMaster.
{code}
[jira] [Commented] (MAPREDUCE-5847) Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask
[ https://issues.apache.org/jira/browse/MAPREDUCE-5847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308132#comment-14308132 ] Gera Shegalov commented on MAPREDUCE-5847: -- Agreed, let us close it 'Won't fix' Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask -- Key: MAPREDUCE-5847 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5847 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv1, mrv2, task Affects Versions: 2.4.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5847.v01.patch, MAPREDUCE-5847.v02.patch Both MapTask and ReduceTask carry redundant code to update BYTES_WRITTEN counter. However, {{Task.updateCounters}} uses file system stats for this. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (MAPREDUCE-5847) Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask
[ https://issues.apache.org/jira/browse/MAPREDUCE-5847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gera Shegalov resolved MAPREDUCE-5847. -- Resolution: Won't Fix Remove redundant code for fileOutputByteCounter in MapTask and ReduceTask -- Key: MAPREDUCE-5847 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5847 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv1, mrv2, task Affects Versions: 2.4.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5847.v01.patch, MAPREDUCE-5847.v02.patch Both MapTask and ReduceTask carry redundant code to update BYTES_WRITTEN counter. However, {{Task.updateCounters}} uses file system stats for this. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6233) org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk
[ https://issues.apache.org/jira/browse/MAPREDUCE-6233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308152#comment-14308152 ] zhihai xu commented on MAPREDUCE-6233: -- thanks [~rkanter] for the review and commit. org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk --- Key: MAPREDUCE-6233 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6233 Project: Hadoop Map/Reduce Issue Type: Bug Components: test Reporter: Yongjun Zhang Assignee: zhihai xu Fix For: 2.7.0 Attachments: MAPREDUCE-6233.000.patch https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2039/ {code} Stack Trace: java.lang.AssertionError: Large sort failed for 128 expected:0 but was:1 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort(TestLargeSort.java:61) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6244) Hadoop examples when run without an argument, gives ERROR instead of just usage info
[ https://issues.apache.org/jira/browse/MAPREDUCE-6244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308237#comment-14308237 ] Akira AJISAKA commented on MAPREDUCE-6244: -- bq. We should inspect all job to make their behavior consistent. My thought is that it's enough to print usages when the number of given arguments are wrong. I'm okay with just printing usages. Consistency is more important. Hadoop examples when run without an argument, gives ERROR instead of just usage info Key: MAPREDUCE-6244 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6244 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 0.23.0, trunk-win, 2.6.0 Reporter: Robert Justice Assignee: Abhishek Kapoor Priority: Minor Attachments: HADOOP-8834.patch, HADOOP-8834.patch Hadoop sort example should not give an ERROR and only should display usage when run with no parameters. {code} $ hadoop jar /usr/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar sort ERROR: Wrong number of parameters: 0 instead of 2. sort [-m maps] [-r reduces] [-inFormat input format class] [-outFormat output format class] [-outKey output key class] [-outValue output value class] [-totalOrder pcnt num samples max splits] input output Generic options supported are -conf configuration file specify an application configuration file -D property=valueuse value for given property -fs local|namenode:port specify a namenode -jt local|jobtracker:portspecify a job tracker -files comma separated list of filesspecify comma separated files to be copied to the map reduce cluster -libjars comma separated list of jarsspecify comma separated jar files to include in the classpath. -archives comma separated list of archivesspecify comma separated archives to be unarchived on the compute machines. The general command line syntax is bin/hadoop command [genericOptions] [commandOptions] {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6233) org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk
[ https://issues.apache.org/jira/browse/MAPREDUCE-6233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308205#comment-14308205 ] Yongjun Zhang commented on MAPREDUCE-6233: -- Thanks [~zxu] and [~rkanter]! org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort failed in trunk --- Key: MAPREDUCE-6233 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6233 Project: Hadoop Map/Reduce Issue Type: Bug Components: test Reporter: Yongjun Zhang Assignee: zhihai xu Fix For: 2.7.0 Attachments: MAPREDUCE-6233.000.patch https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2039/ {code} Stack Trace: java.lang.AssertionError: Large sort failed for 128 expected:0 but was:1 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.apache.hadoop.mapreduce.TestLargeSort.testLargeSort(TestLargeSort.java:61) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-2293) Enhance MultipleOutputs to allow additional characters in the named output name
[ https://issues.apache.org/jira/browse/MAPREDUCE-2293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-2293: Status: Patch Available (was: Open) Enhance MultipleOutputs to allow additional characters in the named output name --- Key: MAPREDUCE-2293 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2293 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 0.21.0 Reporter: David Rosenstrauch Assignee: Harsh J Priority: Minor Attachments: mapreduce.mo.removecheck.r1.diff, mapreduce.mo.removecheck.r2.diff, mapreduce.mo.removecheck.r3.diff, mapreduce.mo.removecheck.r4.diff, mapreduce.mo.removecheck.r5.diff Currently you are only allowed to use alpha-numeric characters in a named output name in the MultipleOutputs class. This is a bit of an onerous restriction, as it would be extremely convenient to be able to use non alpha-numerics in the name too. (E.g., a '.' character would be very helpful, so that you can use the named output name for holding a file name/extension. Perhaps '-' and a '_' characters as well.) The restriction seems to be somewhat arbitrary - it appears to be only enforced in the checkTokenName method. (Though I don't know if there's any downstream impact by loosening this restriction.) Would be extremely helpful/useful to have this fixed though! -- This message was sent by Atlassian JIRA (v6.3.4#6332)
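The relaxation requested for MultipleOutputs, allowing '.', '-' and '_' alongside alphanumerics in named output names, could be sketched as below. The regex and the standalone class are illustrative assumptions; the real checkTokenName lives inside MultipleOutputs and the attached patches may take a different approach (e.g. removing the check entirely).

```java
import java.util.regex.Pattern;

// Hypothetical sketch of a relaxed MultipleOutputs name check:
// alphanumerics plus '.', '-' and '_' are accepted; anything else
// (spaces, slashes, empty names) is rejected as before.
public class NamedOutputCheck {
    private static final Pattern VALID = Pattern.compile("[A-Za-z0-9._-]+");

    public static void checkTokenName(String name) {
        if (name == null || name.isEmpty() || !VALID.matcher(name).matches()) {
            throw new IllegalArgumentException("Invalid named output: " + name);
        }
    }
}
```

This keeps the validation point in one place, so any downstream impact of loosening the character set (the concern raised in the description) is still confined to a single method.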
[jira] [Updated] (MAPREDUCE-2293) Enhance MultipleOutputs to allow additional characters in the named output name
[ https://issues.apache.org/jira/browse/MAPREDUCE-2293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-2293: Status: Open (was: Patch Available) Cancelling, as patch no longer applies. Enhance MultipleOutputs to allow additional characters in the named output name --- Key: MAPREDUCE-2293 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2293 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 0.21.0 Reporter: David Rosenstrauch Assignee: Harsh J Priority: Minor Attachments: mapreduce.mo.removecheck.r1.diff, mapreduce.mo.removecheck.r2.diff, mapreduce.mo.removecheck.r3.diff, mapreduce.mo.removecheck.r4.diff, mapreduce.mo.removecheck.r5.diff Currently you are only allowed to use alpha-numeric characters in a named output name in the MultipleOutputs class. This is a bit of an onerous restriction, as it would be extremely convenient to be able to use non alpha-numerics in the name too. (E.g., a '.' character would be very helpful, so that you can use the named output name for holding a file name/extension. Perhaps '-' and a '_' characters as well.) The restriction seems to be somewhat arbitrary - it appears to be only enforced in the checkTokenName method. (Though I don't know if there's any downstream impact by loosening this restriction.) Would be extremely helpful/useful to have this fixed though! -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-1554) If user name contains '_', then searching of jobs based on user name on job history web UI doesn't work
[ https://issues.apache.org/jira/browse/MAPREDUCE-1554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-1554: Resolution: Won't Fix Status: Resolved (was: Patch Available) Only applies to a dead version of Hadoop. Closing as won't fix. If user name contains '_', then searching of jobs based on user name on job history web UI doesn't work --- Key: MAPREDUCE-1554 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1554 Project: Hadoop Map/Reduce Issue Type: Bug Reporter: Ravi Gummadi Assignee: Devaraj K Fix For: 0.22.1 Attachments: MAPREDUCE-1554-0.22.patch, MAPREDUCE-1554.patch If the user name contains an underscore, then searching for jobs by user name on the job history web UI doesn't work. This is because the code calls {code}split("_"){code} on the history file name everywhere to extract the user name. The other parts of the history file name also should *not* be obtained via {code}split("_"){code}. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
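A minimal sketch of the failure mode, assuming an old-style history file name laid out as job_&lt;timestamp&gt;_&lt;id&gt;_&lt;user&gt;_&lt;jobname&gt; (the layout and the parsing code are illustrative, not the actual Hadoop source):

```java
// Demonstrates why split("_") cannot reliably extract the user name
// from a history file name when the user name itself contains '_'.
public class HistoryNameParse {
    // Naive parsing: field 3 after split("_") is assumed to be the user.
    static String naiveUser(String fileName) {
        return fileName.split("_")[3];
    }

    public static void main(String[] args) {
        // User "ravi_gummadi" contains an underscore.
        String name = "job_201002141130_0001_ravi_gummadi_wordcount";
        // split("_") yields [job, 201002141130, 0001, ravi, gummadi, wordcount],
        // so the "user" field collapses to "ravi". Because the user/jobname
        // boundary is itself an underscore, no split limit can recover
        // "ravi_gummadi" unambiguously; the delimiter must be escaped or changed.
        System.out.println(naiveUser(name)); // prints "ravi", not "ravi_gummadi"
    }
}
```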
[jira] [Updated] (MAPREDUCE-2293) Enhance MultipleOutputs to allow additional characters in the named output name
[ https://issues.apache.org/jira/browse/MAPREDUCE-2293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-2293: Status: Open (was: Patch Available) Enhance MultipleOutputs to allow additional characters in the named output name --- Key: MAPREDUCE-2293 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2293 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 0.21.0 Reporter: David Rosenstrauch Assignee: Harsh J Priority: Minor Attachments: mapreduce.mo.removecheck.r1.diff, mapreduce.mo.removecheck.r2.diff, mapreduce.mo.removecheck.r3.diff, mapreduce.mo.removecheck.r4.diff, mapreduce.mo.removecheck.r5.diff -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-2293) Enhance MultipleOutputs to allow additional characters in the named output name
[ https://issues.apache.org/jira/browse/MAPREDUCE-2293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307608#comment-14307608 ] Hadoop QA commented on MAPREDUCE-2293: -- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12555681/mapreduce.mo.removecheck.r5.diff against trunk revision afbecbb. {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5157//console This message is automatically generated. Enhance MultipleOutputs to allow additional characters in the named output name --- Key: MAPREDUCE-2293 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2293 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 0.21.0 Reporter: David Rosenstrauch Assignee: Harsh J Priority: Minor Attachments: mapreduce.mo.removecheck.r1.diff, mapreduce.mo.removecheck.r2.diff, mapreduce.mo.removecheck.r3.diff, mapreduce.mo.removecheck.r4.diff, mapreduce.mo.removecheck.r5.diff -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated MAPREDUCE-6059: -- Fix Version/s: (was: 3.0.0) 2.7.0 Hadoop Flags: Reviewed Thanks Siqi and Allen! I committed this to branch-2 as well. Speed up history server startup time Key: MAPREDUCE-6059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 2.4.0 Reporter: Siqi Li Assignee: Siqi Li Fix For: 2.7.0 Attachments: YARN-2366.v1.patch When the history server starts up, it scans every history directory and puts all history files into a cache, but the cache only stores the 20K most recent history files. Therefore, a large portion of startup time is wasted loading old history files into the cache, and the startup time will keep increasing if we don't trim the number of history files. For example, when the history server started up with 2.5M history files in HDFS, it took ~5 minutes. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6165) [JDK8] TestCombineFileInputFormat failed on JDK8
[ https://issues.apache.org/jira/browse/MAPREDUCE-6165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akira AJISAKA updated MAPREDUCE-6165: - Status: Patch Available (was: Open) Resubmitting. [JDK8] TestCombineFileInputFormat failed on JDK8 Key: MAPREDUCE-6165 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6165 Project: Hadoop Map/Reduce Issue Type: Bug Reporter: Wei Yan Assignee: Akira AJISAKA Priority: Minor Attachments: MAPREDUCE-6165-001.patch, MAPREDUCE-6165-reproduce.patch The error msg: {noformat} testSplitPlacementForCompressedFiles(org.apache.hadoop.mapreduce.lib.input.TestCombineFileInputFormat) Time elapsed: 2.487 sec FAILURE! junit.framework.AssertionFailedError: expected:2 but was:1 at junit.framework.Assert.fail(Assert.java:57) at junit.framework.Assert.failNotEquals(Assert.java:329) at junit.framework.Assert.assertEquals(Assert.java:78) at junit.framework.Assert.assertEquals(Assert.java:234) at junit.framework.Assert.assertEquals(Assert.java:241) at junit.framework.TestCase.assertEquals(TestCase.java:409) at org.apache.hadoop.mapreduce.lib.input.TestCombineFileInputFormat.testSplitPlacementForCompressedFiles(TestCombineFileInputFormat.java:911) testSplitPlacement(org.apache.hadoop.mapreduce.lib.input.TestCombineFileInputFormat) Time elapsed: 0.985 sec FAILURE! junit.framework.AssertionFailedError: expected:2 but was:1 at junit.framework.Assert.fail(Assert.java:57) at junit.framework.Assert.failNotEquals(Assert.java:329) at junit.framework.Assert.assertEquals(Assert.java:78) at junit.framework.Assert.assertEquals(Assert.java:234) at junit.framework.Assert.assertEquals(Assert.java:241) at junit.framework.TestCase.assertEquals(TestCase.java:409) at org.apache.hadoop.mapreduce.lib.input.TestCombineFileInputFormat.testSplitPlacement(TestCombineFileInputFormat.java:368) {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6165) [JDK8] TestCombineFileInputFormat failed on JDK8
[ https://issues.apache.org/jira/browse/MAPREDUCE-6165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akira AJISAKA updated MAPREDUCE-6165: - Status: Open (was: Patch Available) [JDK8] TestCombineFileInputFormat failed on JDK8 Key: MAPREDUCE-6165 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6165 Project: Hadoop Map/Reduce Issue Type: Bug Reporter: Wei Yan Assignee: Akira AJISAKA Priority: Minor Attachments: MAPREDUCE-6165-001.patch, MAPREDUCE-6165-reproduce.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5657) [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments
[ https://issues.apache.org/jira/browse/MAPREDUCE-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308457#comment-14308457 ] Andrew Purtell commented on MAPREDUCE-5657: --- Go for it! [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments - Key: MAPREDUCE-5657 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5657 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.7.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Attachments: 5657-branch-2.patch, 5657-branch-2.patch, 5657-trunk.patch, 5657-trunk.patch Javadoc is more strict by default in JDK8 and will error out on malformed or illegal tags found in doc comments. Although tagged as JDK8 all of the required changes are generic Javadoc cleanups. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5657) [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments
[ https://issues.apache.org/jira/browse/MAPREDUCE-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308393#comment-14308393 ] Akira AJISAKA commented on MAPREDUCE-5657: -- Hi [~apurtell], how is this issue going? If you don't have time to rebase your patch, I'd like to succeed your work. [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments - Key: MAPREDUCE-5657 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5657 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.7.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Attachments: 5657-branch-2.patch, 5657-branch-2.patch, 5657-trunk.patch, 5657-trunk.patch Javadoc is more strict by default in JDK8 and will error out on malformed or illegal tags found in doc comments. Although tagged as JDK8 all of the required changes are generic Javadoc cleanups. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6234) MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-6234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308424#comment-14308424 ] Tsuyoshi OZAWA commented on MAPREDUCE-6234: --- [~iwasakims] thank you for taking this JIRA. Currently, -1 for this fix. I think we should fix test side to follow default configuration since the assertion only check whether check if the high ram properties are not set. {code} // check if the high ram properties are not set assertEquals(expectedMapMB, simulatedConf.getLong(MRJobConfig.MAP_MEMORY_MB, MRJobConfig.DEFAULT_MAP_MEMORY_MB)); assertEquals(expectedReduceMB, simulatedConf.getLong(MRJobConfig.REDUCE_MEMORY_MB, MRJobConfig.DEFAULT_MAP_MEMORY_MB)); {code} We should also rethink what we should test in TestHighRamJob - it refers JT_MAX_MAPMEMORY_MB or some old configurations. Do we really need this tests? MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml Key: MAPREDUCE-6234 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6234 Project: Hadoop Map/Reduce Issue Type: Bug Components: contrib/gridmix, mrv2 Reporter: Masatake Iwasaki Assignee: Masatake Iwasaki Attachments: MAPREDUCE-6234.001.patch TestHighRamJob fails by this. {code} --- T E S T S --- Running org.apache.hadoop.mapred.gridmix.TestHighRamJob Tests run: 1, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 1.162 sec FAILURE! - in org.apache.hadoop.mapred.gridmix.TestHighRamJob testHighRamFeatureEmulation(org.apache.hadoop.mapred.gridmix.TestHighRamJob) Time elapsed: 1.102 sec FAILURE! 
java.lang.AssertionError: expected:1024 but was:-1 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.hadoop.mapred.gridmix.TestHighRamJob.testHighRamConfig(TestHighRamJob.java:98) at org.apache.hadoop.mapred.gridmix.TestHighRamJob.testHighRamFeatureEmulation(TestHighRamJob.java:117) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
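The {{expected:1024 but was:-1}} failure above is the classic symptom of a code-side default constant disagreeing with the file-side default. A minimal sketch of the mechanism, with plain Java standing in for Hadoop's Configuration (key name and values are illustrative):

```java
import java.util.HashMap;
import java.util.Map;

// Sketch: a code-side default that disagrees with the file-side default
// silently yields different answers depending on whether the defaults
// file was loaded, even though neither caller set the key explicitly.
public class DefaultMismatch {
    static final long CODE_DEFAULT_MAP_MEMORY_MB = -1;   // e.g. an MRJobConfig constant
    static final long FILE_DEFAULT_MAP_MEMORY_MB = 1024; // e.g. mapred-default.xml

    // Stand-in for Configuration.getLong(key, codeDefault).
    static long get(Map<String, String> conf, String key, long codeDefault) {
        String v = conf.get(key);
        return v == null ? codeDefault : Long.parseLong(v);
    }

    public static void main(String[] args) {
        Map<String, String> withXmlDefaults = new HashMap<>();
        withXmlDefaults.put("mapreduce.map.memory.mb",
                Long.toString(FILE_DEFAULT_MAP_MEMORY_MB));
        Map<String, String> bareConf = new HashMap<>();

        // Same key, same code-side default, different answers:
        System.out.println(get(withXmlDefaults, "mapreduce.map.memory.mb",
                CODE_DEFAULT_MAP_MEMORY_MB)); // 1024
        System.out.println(get(bareConf, "mapreduce.map.memory.mb",
                CODE_DEFAULT_MAP_MEMORY_MB)); // -1
    }
}
```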
[jira] [Commented] (MAPREDUCE-6234) MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-6234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308430#comment-14308430 ] Tsuyoshi OZAWA commented on MAPREDUCE-6234: --- s/whether check if the high ram properties are not set./whether the high ram properties are set./ MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml Key: MAPREDUCE-6234 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6234 Project: Hadoop Map/Reduce Issue Type: Bug Components: contrib/gridmix, mrv2 Reporter: Masatake Iwasaki Assignee: Masatake Iwasaki Attachments: MAPREDUCE-6234.001.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5657) [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments
[ https://issues.apache.org/jira/browse/MAPREDUCE-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308460#comment-14308460 ] Akira AJISAKA commented on MAPREDUCE-5657: -- Thank you Andrew! [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments - Key: MAPREDUCE-5657 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5657 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.7.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Attachments: 5657-branch-2.patch, 5657-branch-2.patch, 5657-trunk.patch, 5657-trunk.patch Javadoc is more strict by default in JDK8 and will error out on malformed or illegal tags found in doc comments. Although tagged as JDK8 all of the required changes are generic Javadoc cleanups. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Assigned] (MAPREDUCE-5657) [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments
[ https://issues.apache.org/jira/browse/MAPREDUCE-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akira AJISAKA reassigned MAPREDUCE-5657: Assignee: Akira AJISAKA (was: Andrew Purtell) [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments - Key: MAPREDUCE-5657 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5657 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.7.0 Reporter: Andrew Purtell Assignee: Akira AJISAKA Priority: Minor Attachments: 5657-branch-2.patch, 5657-branch-2.patch, 5657-trunk.patch, 5657-trunk.patch Javadoc is more strict by default in JDK8 and will error out on malformed or illegal tags found in doc comments. Although tagged as JDK8 all of the required changes are generic Javadoc cleanups. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project
[ https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307020#comment-14307020 ] Hudson commented on MAPREDUCE-5988: --- FAILURE: Integrated in Hadoop-Yarn-trunk #829 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/829/]) MAPREDUCE-5988. Fix dead links to the javadocs in mapreduce project. (aajisaka) (aajisaka: rev cc6bbfceae1cddfae6a3892cb7e7104531a689be) * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/counters/package-info.java * hadoop-mapreduce-project/CHANGES.txt * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/package-info.java * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/api/protocolrecords/package-info.java Fix dead links to the javadocs in mapreduce project --- Key: MAPREDUCE-5988 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988 Project: Hadoop Map/Reduce Issue Type: Bug Components: documentation Affects Versions: 2.4.1 Reporter: Akira AJISAKA Assignee: Akira AJISAKA Priority: Minor Fix For: 2.7.0 Attachments: MAPREDUCE-5988.2.patch, MAPREDUCE-5988.patch In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some classes are listed, but not documented. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6243) Fix findbugs warnings in hadoop-rumen
[ https://issues.apache.org/jira/browse/MAPREDUCE-6243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307017#comment-14307017 ] Hudson commented on MAPREDUCE-6243: --- FAILURE: Integrated in Hadoop-Yarn-trunk #829 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/829/]) MAPREDUCE-6243. Fix findbugs warnings in hadoop-rumen. Contributed by Masatake Iwasaki. (aajisaka: rev 34fe11c987730932f99dec6eb458a22624eb075b) * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/MapAttempt20LineHistoryEventEmitter.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ReduceAttempt20LineHistoryEventEmitter.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/Hadoop20JHParser.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/RandomSeedGenerator.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/HadoopLogsAnalyzer.java * hadoop-mapreduce-project/CHANGES.txt * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ParsedConfigFile.java Fix findbugs warnings in hadoop-rumen - Key: MAPREDUCE-6243 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6243 Project: Hadoop Map/Reduce Issue Type: Bug Components: tools/rumen Affects Versions: 2.6.0 Reporter: Akira AJISAKA Assignee: Masatake Iwasaki Priority: Minor Labels: newbie Fix For: 2.7.0 Attachments: MAPREDUCE-6243.001.patch, MAPREDUCE-6243.002.patch, findbugs.xml There are 7 findbugs warnings in hadoop-rumen modules. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307027#comment-14307027 ] Hudson commented on MAPREDUCE-6059: --- FAILURE: Integrated in Hadoop-Yarn-trunk #829 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/829/]) MAPREDUCE-6059. Speed up history server startup time (Siqi Li via aw) (aw: rev fd57ab2002f97dcc83d455a5e0c770c8efde77a4) * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/HistoryFileManager.java * hadoop-mapreduce-project/CHANGES.txt Speed up history server startup time Key: MAPREDUCE-6059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 2.4.0 Reporter: Siqi Li Assignee: Siqi Li Fix For: 3.0.0 Attachments: YARN-2366.v1.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6242) Progress report log is incredibly excessive in application master
[ https://issues.apache.org/jira/browse/MAPREDUCE-6242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307081#comment-14307081 ] Varun Saxena commented on MAPREDUCE-6242: - Oh its urgent. Thanks for letting me know. Will fix this on priority and upload a patch today. Progress report log is incredibly excessive in application master - Key: MAPREDUCE-6242 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6242 Project: Hadoop Map/Reduce Issue Type: Bug Components: applicationmaster Affects Versions: 2.4.0 Reporter: Jian Fang Assignee: Varun Saxena We saw incredibly excessive logs in application master for a long running one with many task attempts. The log write rate is around 1MB/sec in some cases. Most of the log entries were from the progress report such as the following ones. 2015-02-03 17:46:14,321 INFO [IPC Server handler 56 on 37661] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1422985365246_0001_m_00_0 is : 0.15605757 2015-02-03 17:46:17,581 INFO [IPC Server handler 2 on 37661] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1422985365246_0001_m_00_0 is : 0.4108217 2015-02-03 17:46:20,426 INFO [IPC Server handler 0 on 37661] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1422985365246_0001_m_02_0 is : 0.06634143 2015-02-03 17:46:20,807 INFO [IPC Server handler 4 on 37661] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1422985365246_0001_m_00_0 is : 0.6506 2015-02-03 17:46:21,013 INFO [IPC Server handler 6 on 37661] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1422985365246_0001_m_01_0 is : 0.21723115 Looks like the report interval is controlled by a hard-coded variable PROGRESS_INTERVAL as 3 seconds in class org.apache.hadoop.mapred.Task. We should allow users to set the appropriate progress interval for their applications. 
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
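The comment above points at a hard-coded PROGRESS_INTERVAL of 3 seconds. A minimal sketch of the suggested fix, reading the interval from configuration with the old constant as the fallback (the property name is illustrative; the issue had not settled on one at this point):

```java
import java.util.Properties;

// Sketch: make the task progress-report interval configurable instead
// of hard-coding it. Properties stands in for Hadoop's Configuration;
// the key name below is a hypothetical example, not the final property.
public class ProgressInterval {
    static final long DEFAULT_PROGRESS_INTERVAL_MS = 3000L; // old hard-coded value

    static long progressInterval(Properties conf) {
        String v = conf.getProperty("mapreduce.task.progress-report.interval");
        return v == null ? DEFAULT_PROGRESS_INTERVAL_MS : Long.parseLong(v);
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        System.out.println(progressInterval(conf)); // 3000: unchanged default

        // A long-running job can now throttle its own progress logging:
        conf.setProperty("mapreduce.task.progress-report.interval", "60000");
        System.out.println(progressInterval(conf)); // 60000
    }
}
```

At ~1 MB/sec of progress-report log, raising the interval per application is a much safer knob than lowering the log level for the whole TaskAttemptListenerImpl class.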
[jira] [Commented] (MAPREDUCE-5988) Fix dead links to the javadocs in mapreduce project
[ https://issues.apache.org/jira/browse/MAPREDUCE-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14306993#comment-14306993 ] Hudson commented on MAPREDUCE-5988: --- FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #95 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/95/]) MAPREDUCE-5988. Fix dead links to the javadocs in mapreduce project. (aajisaka) (aajisaka: rev cc6bbfceae1cddfae6a3892cb7e7104531a689be) * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/api/protocolrecords/package-info.java * hadoop-mapreduce-project/CHANGES.txt * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/counters/package-info.java * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/package-info.java Fix dead links to the javadocs in mapreduce project --- Key: MAPREDUCE-5988 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5988 Project: Hadoop Map/Reduce Issue Type: Bug Components: documentation Affects Versions: 2.4.1 Reporter: Akira AJISAKA Assignee: Akira AJISAKA Priority: Minor Fix For: 2.7.0 Attachments: MAPREDUCE-5988.2.patch, MAPREDUCE-5988.patch In http://hadoop.apache.org/docs/r2.4.1/api/allclasses-frame.html, some classes are listed, but not documented. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6243) Fix findbugs warnings in hadoop-rumen
[ https://issues.apache.org/jira/browse/MAPREDUCE-6243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14306990#comment-14306990 ] Hudson commented on MAPREDUCE-6243: --- FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #95 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/95/]) MAPREDUCE-6243. Fix findbugs warnings in hadoop-rumen. Contributed by Masatake Iwasaki. (aajisaka: rev 34fe11c987730932f99dec6eb458a22624eb075b) * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/MapAttempt20LineHistoryEventEmitter.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ParsedConfigFile.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/RandomSeedGenerator.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/Hadoop20JHParser.java * hadoop-mapreduce-project/CHANGES.txt * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/ReduceAttempt20LineHistoryEventEmitter.java * hadoop-tools/hadoop-rumen/src/main/java/org/apache/hadoop/tools/rumen/HadoopLogsAnalyzer.java Fix findbugs warnings in hadoop-rumen - Key: MAPREDUCE-6243 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6243 Project: Hadoop Map/Reduce Issue Type: Bug Components: tools/rumen Affects Versions: 2.6.0 Reporter: Akira AJISAKA Assignee: Masatake Iwasaki Priority: Minor Labels: newbie Fix For: 2.7.0 Attachments: MAPREDUCE-6243.001.patch, MAPREDUCE-6243.002.patch, findbugs.xml There are 7 findbugs warnings in hadoop-rumen modules. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6059) Speed up history server startup time
[ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307000#comment-14307000 ] Hudson commented on MAPREDUCE-6059: --- FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #95 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/95/]) MAPREDUCE-6059. Speed up history server startup time (Siqi Li via aw) (aw: rev fd57ab2002f97dcc83d455a5e0c770c8efde77a4) * hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/HistoryFileManager.java * hadoop-mapreduce-project/CHANGES.txt Speed up history server startup time Key: MAPREDUCE-6059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 2.4.0 Reporter: Siqi Li Assignee: Siqi Li Fix For: 3.0.0 Attachments: YARN-2366.v1.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6227) DFSIO for truncate
[ https://issues.apache.org/jira/browse/MAPREDUCE-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Shvachko updated MAPREDUCE-6227: --- Attachment: DFSIO-truncate-00.patch Adding truncate to DFSIO. DFSIO for truncate -- Key: MAPREDUCE-6227 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6227 Project: Hadoop Map/Reduce Issue Type: New Feature Components: benchmarks, test Reporter: Konstantin Shvachko Attachments: DFSIO-truncate-00.patch Create a benchmark and a test for truncate within the framework of TestDFSIO. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6234) MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-6234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308487#comment-14308487 ] Masatake Iwasaki commented on MAPREDUCE-6234: - I agree that the gridmix code should be updated. But I think it is very confusing that MRJobConfig.DEFAULT_MAP_MEMORY_MB is not the same as the value of mapreduce.map.memory.mb in mapred-default.xml, and it should be fixed regardless of the gridmix test failure. MRJobConfig.DEFAULT_*_MEMORY_MB should be consistent with mapred-default.xml Key: MAPREDUCE-6234 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6234 Project: Hadoop Map/Reduce Issue Type: Bug Components: contrib/gridmix, mrv2 Reporter: Masatake Iwasaki Assignee: Masatake Iwasaki Attachments: MAPREDUCE-6234.001.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6227) DFSIO for truncate
[ https://issues.apache.org/jira/browse/MAPREDUCE-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Shvachko updated MAPREDUCE-6227: --- Assignee: Konstantin Shvachko Affects Version/s: 2.7.0 Status: Patch Available (was: Open) DFSIO for truncate -- Key: MAPREDUCE-6227 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6227 Project: Hadoop Map/Reduce Issue Type: New Feature Components: benchmarks, test Affects Versions: 2.7.0 Reporter: Konstantin Shvachko Assignee: Konstantin Shvachko Attachments: DFSIO-truncate-00.patch Create a benchmark and a test for truncate within the framework of TestDFSIO. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-6240) Hadoop client displays confusing error message
[ https://issues.apache.org/jira/browse/MAPREDUCE-6240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308509#comment-14308509 ] Mohammad Kamrul Islam commented on MAPREDUCE-6240: -- [~jira.shegalov] if the message "Please check your configuration for mapreduce.framework.name and the correspond server addresses." is shown, please also include the current values of those properties. It will help users find out whether their configuration is effective. Hadoop client displays confusing error message -- Key: MAPREDUCE-6240 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6240 Project: Hadoop Map/Reduce Issue Type: Bug Components: client Reporter: Mohammad Kamrul Islam Assignee: Mohammad Kamrul Islam Attachments: MAPREDUCE-6240-gera.001.patch, MAPREDUCE-6240-gera.001.patch, MAPREDUCE-6240-gera.002.patch, MAPREDUCE-6240.1.patch The Hadoop client often throws an exception with java.io.IOException: Cannot initialize Cluster. Please check your configuration for mapreduce.framework.name and the correspond server addresses. This is a misleading and generic message for any cluster initialization problem. It can take many hours of debugging to identify the root cause. A more precise error message would resolve this problem quickly. In one such instance, the Oozie log showed the following exception while the root cause was a ClassNotFoundException (CNF) that the Hadoop client didn't surface in the exception. {noformat} JA009: Cannot initialize Cluster. Please check your configuration for mapreduce.framework.name and the correspond server addresses. 
at org.apache.oozie.action.ActionExecutor.convertExceptionHelper(ActionExecutor.java:412) at org.apache.oozie.action.ActionExecutor.convertException(ActionExecutor.java:392) at org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActionExecutor.java:979) at org.apache.oozie.action.hadoop.JavaActionExecutor.start(JavaActionExecutor.java:1134) at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:228) at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:63) at org.apache.oozie.command.XCommand.call(XCommand.java:281) at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:323) at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:252) at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:174) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:744) Caused by: java.io.IOException: Cannot initialize Cluster. Please check your configuration for mapreduce.framework.name and the correspond server addresses. 
at org.apache.hadoop.mapreduce.Cluster.initialize(Cluster.java:120) at org.apache.hadoop.mapreduce.Cluster.init(Cluster.java:82) at org.apache.hadoop.mapreduce.Cluster.init(Cluster.java:75) at org.apache.hadoop.mapred.JobClient.init(JobClient.java:470) at org.apache.hadoop.mapred.JobClient.init(JobClient.java:449) at org.apache.oozie.service.HadoopAccessorService$1.run(HadoopAccessorService.java:372) at org.apache.oozie.service.HadoopAccessorService$1.run(HadoopAccessorService.java:370) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548) at org.apache.oozie.service.HadoopAccessorService.createJobClient(HadoopAccessorService.java:379) at org.apache.oozie.action.hadoop.JavaActionExecutor.createJobClient(JavaActionExecutor.java:1185) at org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActionExecutor.java:927) ... 10 more {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
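The fix direction discussed above can be sketched as follows. This is an illustrative snippet, not the actual Hadoop patch; the method name `cannotInitialize` and its parameters are hypothetical. The two ideas are: echo the effective value of mapreduce.framework.name in the message, and chain the underlying failure (e.g. a ClassNotFoundException) as the exception's cause so it is not lost:

```java
import java.io.IOException;

// Sketch of a more informative cluster-initialization error: include the
// effective configuration value and preserve the root cause.
public class ClusterInitError {
    // Hypothetical helper; the wording mirrors the existing Hadoop message.
    static IOException cannotInitialize(String frameworkName, Throwable rootCause) {
        return new IOException(
            "Cannot initialize Cluster. Please check your configuration for "
                + "mapreduce.framework.name (=" + frameworkName + ") "
                + "and the correspond server addresses.",
            rootCause); // keep e.g. the ClassNotFoundException visible
    }

    public static void main(String[] args) {
        IOException e = cannotInitialize("yarn",
            new ClassNotFoundException("org.example.MissingClientProtocolProvider"));
        System.out.println(e.getMessage());
        System.out.println("caused by: " + e.getCause());
    }
}
```

With the cause chained, a stack trace like the Oozie one above would end in "Caused by: java.lang.ClassNotFoundException: ...", pointing directly at the missing class instead of stopping at the generic message.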
[jira] [Updated] (MAPREDUCE-5492) Suppress expected log output stated on MAPREDUCE-5
[ https://issues.apache.org/jira/browse/MAPREDUCE-5492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-5492: Status: Open (was: Patch Available) Suppress expected log output stated on MAPREDUCE-5 -- Key: MAPREDUCE-5492 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5492 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 0.20.2 Reporter: Harsh J Assignee: bc Wong Priority: Trivial Attachments: 0001-MAPREDUCE-5492.-Do-not-reuse-a-committed-ServletResp.patch, mr-5492-2.patch Jetty in MR1 may produce an expected EOFException during its operation that we shouldn't log out in ERROR form. This shouldn't affect MR2, however, as it uses Netty. See MAPREDUCE-5 (Jothi's comments) for more info. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (MAPREDUCE-5492) Suppress expected log output stated on MAPREDUCE-5
[ https://issues.apache.org/jira/browse/MAPREDUCE-5492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer resolved MAPREDUCE-5492. - Resolution: Won't Fix Closing as Won't Fix since this is no longer an issue in 2.x and up. Suppress expected log output stated on MAPREDUCE-5 -- Key: MAPREDUCE-5492 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5492 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 0.20.2 Reporter: Harsh J Assignee: bc Wong Priority: Trivial Attachments: 0001-MAPREDUCE-5492.-Do-not-reuse-a-committed-ServletResp.patch, mr-5492-2.patch Jetty in MR1 may produce an expected EOFException during its operation that we shouldn't log out in ERROR form. This shouldn't affect MR2, however, as it uses Netty. See MAPREDUCE-5 (Jothi's comments) for more info. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-5696) Add Localization counters to MR
[ https://issues.apache.org/jira/browse/MAPREDUCE-5696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-5696: Status: Patch Available (was: Open) Add Localization counters to MR --- Key: MAPREDUCE-5696 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5696 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mrv2 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: LocalizationCounters.png, MAPREDUCE-5696.v01.patch, MAPREDUCE-5696.v02.patch Users are often unaware of the localization cost that their jobs incur. To measure the effectiveness of localization caches it is necessary to expose the overhead in the form of user-visible metrics. The purpose of this JIRA is to complement YARN-1529. While YARN-1529 attempts to provide a cluster-wide view to cluster admins, this JIRA focuses on exposing the localization overhead on a per-job basis to the job owner/user. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-5696) Add Localization counters to MR
[ https://issues.apache.org/jira/browse/MAPREDUCE-5696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-5696: Status: Open (was: Patch Available) Add Localization counters to MR --- Key: MAPREDUCE-5696 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5696 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mrv2 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: LocalizationCounters.png, MAPREDUCE-5696.v01.patch, MAPREDUCE-5696.v02.patch Users are often unaware of the localization cost that their jobs incur. To measure the effectiveness of localization caches it is necessary to expose the overhead in the form of user-visible metrics. The purpose of this JIRA is to complement YARN-1529. While YARN-1529 attempts to provide a cluster-wide view to cluster admins, this JIRA focuses on exposing the localization overhead on a per-job basis to the job owner/user. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-5648) Allow user-specified diagnostics for killed tasks and jobs
[ https://issues.apache.org/jira/browse/MAPREDUCE-5648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-5648: Status: Open (was: Patch Available) Allow user-specified diagnostics for killed tasks and jobs -- Key: MAPREDUCE-5648 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5648 Project: Hadoop Map/Reduce Issue Type: Improvement Components: client, mr-am, mrv2 Affects Versions: 2.2.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5648.v01.patch, MAPREDUCE-5648.v02.patch, MAPREDUCE-5648.v03.patch, MAPREDUCE-5648.v04.patch, MAPREDUCE-5648.v05.patch, Screen Shot 2013-11-23 at 11.12.15 AM.png Our users and tools want to be able to supply additional custom diagnostic messages to mapreduce ClientProtocol killTask. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5648) Allow user-specified diagnostics for killed tasks and jobs
[ https://issues.apache.org/jira/browse/MAPREDUCE-5648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307712#comment-14307712 ] Hadoop QA commented on MAPREDUCE-5648: -- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12634947/MAPREDUCE-5648.v05.patch against trunk revision b6466de. {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5158//console This message is automatically generated. Allow user-specified diagnostics for killed tasks and jobs -- Key: MAPREDUCE-5648 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5648 Project: Hadoop Map/Reduce Issue Type: Improvement Components: client, mr-am, mrv2 Affects Versions: 2.2.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5648.v01.patch, MAPREDUCE-5648.v02.patch, MAPREDUCE-5648.v03.patch, MAPREDUCE-5648.v04.patch, MAPREDUCE-5648.v05.patch, Screen Shot 2013-11-23 at 11.12.15 AM.png Our users and tools want to be able to supply additional custom diagnostic messages to mapreduce ClientProtocol killTask. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-5839) Provide a boolean switch to enable LazyOutputFormat
[ https://issues.apache.org/jira/browse/MAPREDUCE-5839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-5839: Status: Patch Available (was: Open) Provide a boolean switch to enable LazyOutputFormat --- Key: MAPREDUCE-5839 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5839 Project: Hadoop Map/Reduce Issue Type: Improvement Components: client Affects Versions: 2.4.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5839.v01.patch, MAPREDUCE-5839.v02.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-5839) Provide a boolean switch to enable LazyOutputFormat
[ https://issues.apache.org/jira/browse/MAPREDUCE-5839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-5839: Status: Open (was: Patch Available) Provide a boolean switch to enable LazyOutputFormat --- Key: MAPREDUCE-5839 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5839 Project: Hadoop Map/Reduce Issue Type: Improvement Components: client Affects Versions: 2.4.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5839.v01.patch, MAPREDUCE-5839.v02.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-5696) Add Localization counters to MR
[ https://issues.apache.org/jira/browse/MAPREDUCE-5696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-5696: Status: Open (was: Patch Available) Cancelling patch as it no longer applies. Add Localization counters to MR --- Key: MAPREDUCE-5696 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5696 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mrv2 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: LocalizationCounters.png, MAPREDUCE-5696.v01.patch, MAPREDUCE-5696.v02.patch Users are often unaware of the localization cost that their jobs incur. To measure the effectiveness of localization caches it is necessary to expose the overhead in the form of user-visible metrics. The purpose of this JIRA is to complement YARN-1529. While YARN-1529 attempts to provide a cluster-wide view to cluster admins, this JIRA focuses on exposing the localization overhead on a per-job basis to the job owner/user. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAPREDUCE-5839) Provide a boolean switch to enable LazyOutputFormat
[ https://issues.apache.org/jira/browse/MAPREDUCE-5839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307729#comment-14307729 ] Hadoop QA commented on MAPREDUCE-5839: -- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12640581/MAPREDUCE-5839.v02.patch against trunk revision b6466de. {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/5160//console This message is automatically generated. Provide a boolean switch to enable LazyOutputFormat --- Key: MAPREDUCE-5839 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5839 Project: Hadoop Map/Reduce Issue Type: Improvement Components: client Affects Versions: 2.4.0 Reporter: Gera Shegalov Assignee: Gera Shegalov Attachments: MAPREDUCE-5839.v01.patch, MAPREDUCE-5839.v02.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAPREDUCE-6237) DBRecordReader is not thread safe
[ https://issues.apache.org/jira/browse/MAPREDUCE-6237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kannan Rajah updated MAPREDUCE-6237: Attachment: mapreduce-6237.patch DBRecordReader is not thread safe - Key: MAPREDUCE-6237 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6237 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 2.5.0 Reporter: Kannan Rajah Assignee: Kannan Rajah Attachments: mapreduce-6237.patch, mapreduce-6237.patch, mapreduce-6237.patch DBInputFormat.createDBRecordReader is reusing JDBC connections across instances of DBRecordReader. This is not a good idea. We should be creating a separate connection. If performance is a concern, then we should be using connection pooling instead. I looked at DBOutputFormat.getRecordReader. It actually creates a new Connection object for each DBRecordReader. So can we just change DBInputFormat to create a new Connection every time? The connection reuse code was added as part of the connection leak fix in MAPREDUCE-1443. Is there any reason for caching the connection? We observed this issue in a customer setup where they were reading data from MySQL using Pig. As per the customer, the query returns two records, which causes Pig to create two instances of DBRecordReader. These two instances share the database connection instance. The first DBRecordReader runs and extracts the first record from MySQL just fine, but then closes the shared connection instance. When the second DBRecordReader runs, it tries to execute a query to retrieve the second record on the closed shared connection instance, which fails. If we set mapred.map.tasks to 1, the query will be successful. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
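The failure sequence described above can be modeled with a minimal toy sketch (no JDBC, not Hadoop code; FakeConnection and Reader are illustrative stand-ins for a JDBC Connection and DBRecordReader): when two readers share one connection, the first reader's close() invalidates the connection under the second reader, while per-reader connections work fine.

```java
// Toy model of the MAPREDUCE-6237 failure mode: a reader closing a shared
// connection breaks the next reader that still holds the same connection.
public class SharedConnectionSketch {
    static class FakeConnection {
        private boolean closed = false;
        void close() { closed = true; }
        String query() {
            if (closed) throw new IllegalStateException("connection is closed");
            return "row";
        }
    }

    static class Reader {
        private final FakeConnection conn;
        Reader(FakeConnection conn) { this.conn = conn; }
        String read() { return conn.query(); }
        void close() { conn.close(); } // closes the (possibly shared) connection
    }

    public static void main(String[] args) {
        // Shared connection: the second reader fails after the first closes it.
        FakeConnection shared = new FakeConnection();
        Reader first = new Reader(shared);
        Reader second = new Reader(shared);
        first.read();
        first.close();
        try {
            second.read();
        } catch (IllegalStateException e) {
            System.out.println("second reader failed: " + e.getMessage());
        }

        // Per-reader connections (the change proposed above): both succeed.
        Reader a = new Reader(new FakeConnection());
        Reader b = new Reader(new FakeConnection());
        a.read();
        a.close();
        System.out.println(b.read()); // prints "row"
    }
}
```

This is why the single-mapper case (mapred.map.tasks=1) masks the bug: with only one reader, nothing reuses the connection after it is closed.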
[jira] [Updated] (MAPREDUCE-6237) DBRecordReader is not thread safe
[ https://issues.apache.org/jira/browse/MAPREDUCE-6237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kannan Rajah updated MAPREDUCE-6237: Attachment: (was: mapreduce-6237.patch) DBRecordReader is not thread safe - Key: MAPREDUCE-6237 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6237 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 2.5.0 Reporter: Kannan Rajah Assignee: Kannan Rajah Attachments: mapreduce-6237.patch, mapreduce-6237.patch, mapreduce-6237.patch DBInputFormat.createDBRecordReader is reusing JDBC connections across instances of DBRecordReader. This is not a good idea. We should be creating a separate connection. If performance is a concern, then we should be using connection pooling instead. I looked at DBOutputFormat.getRecordReader. It actually creates a new Connection object for each DBRecordReader. So can we just change DBInputFormat to create a new Connection every time? The connection reuse code was added as part of the connection leak fix in MAPREDUCE-1443. Is there any reason for caching the connection? We observed this issue in a customer setup where they were reading data from MySQL using Pig. As per the customer, the query returns two records, which causes Pig to create two instances of DBRecordReader. These two instances share the database connection instance. The first DBRecordReader runs and extracts the first record from MySQL just fine, but then closes the shared connection instance. When the second DBRecordReader runs, it tries to execute a query to retrieve the second record on the closed shared connection instance, which fails. If we set mapred.map.tasks to 1, the query will be successful. -- This message was sent by Atlassian JIRA (v6.3.4#6332)