from:"Tom White"

[jira] [Resolved] (MAPREDUCE-3936) Clients should not enforce counter limits

2015-05-11 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-3936.
--
Resolution: Won't Fix

 Clients should not enforce counter limits 
 --

 Key: MAPREDUCE-3936
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3936
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv1
Reporter: Tom White
Assignee: Tom White
  Labels: BB2015-05-TBR
 Attachments: MAPREDUCE-3936.patch, MAPREDUCE-3936.patch


 The code for enforcing counter limits (from MAPREDUCE-1943) creates a static 
 JobConf instance to load the limits, which may throw an exception if the 
 client limit is set to be lower than the limit on the cluster (perhaps 
 because the cluster limit was raised from the default).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Re: [VOTE] Release Apache Hadoop 2.0.4-alpha

2013-04-19 Thread Tom White

+1

Checked sigs and checksums, source tag, and built from source.

Cheers,
Tom

On Fri, Apr 12, 2013 at 2:56 PM, Arun C Murthy a...@hortonworks.com wrote:
 Folks,

 I've created a release candidate (RC2) for hadoop-2.0.4-alpha that I would 
 like to release.

 The RC is available at: 
 http://people.apache.org/~acmurthy/hadoop-2.0.4-alpha-rc2/
 The RC tag in svn is here: 
 http://svn.apache.org/repos/asf/hadoop/common/tags/release-2.0.4-alpha-rc2

 The maven artifacts are available via repository.apache.org.

 Please try the release and vote; the vote will run for the usual 7 days.

 thanks,
 Arun


 --
 Arun C. Murthy
 Hortonworks Inc.
 http://hortonworks.com/

[jira] [Created] (MAPREDUCE-5035) Update MR1 memory configuration docs

2013-02-27 Thread Tom White (JIRA)

Tom White created MAPREDUCE-5035:


 Summary: Update MR1 memory configuration docs
 Key: MAPREDUCE-5035
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5035
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Reporter: Tom White
Assignee: Tom White


The pmem/vmem settings in the docs 
(http://hadoop.apache.org/docs/r1.1.1/cluster_setup.html#Memory+monitoring) 
have not been supported for a long time. The docs should be updated to reflect 
the new settings (mapred.cluster.map.memory.mb etc).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-5035) Update MR1 memory configuration docs

2013-02-27 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-5035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-5035.
--

   Resolution: Fixed
Fix Version/s: 1.2.0
 Hadoop Flags: Reviewed

I just committed this.

 Update MR1 memory configuration docs
 

 Key: MAPREDUCE-5035
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5035
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Reporter: Tom White
Assignee: Tom White
 Fix For: 1.2.0

 Attachments: cluster_setup.html, MAPREDUCE-5035.patch


 The pmem/vmem settings in the docs 
 (http://hadoop.apache.org/docs/r1.1.1/cluster_setup.html#Memory+monitoring) 
 have not been supported for a long time. The docs should be updated to 
 reflect the new settings (mapred.cluster.map.memory.mb etc).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Re: Release numbering for branch-2 releases

2013-02-01 Thread Tom White

Possibly the reason for Stack's consternation is that this is a
Hadoop-specific versioning scheme, rather than a standard one like
Semantic Versioning (http://semver.org/) which is more widely
understood.

With that scheme we would have something like

  2.0.0-alpha, 2.0.0-alpha.1, 2.0.0-alpha.2, 2.0.0-alpha.3, 2.0.0-beta, 2.0.0

so that the alpha and beta tags all precede the 2.0.0 GA release,
which is the one that we make compatibility promises for.

Whereas Arun is proposing

  2.0.0-alpha, 2.0.1-alpha, 2.0.2-alpha, 2.1.0-alpha, 2.2.0-beta, 2.3.0

and the casual observer might expect there to be a stable 2.0.1 (say)
on seeing the existence of 2.0.2-alpha.

The first three of these are already released, so I don't think we
could switch to the Semantic Versioning scheme at this stage. We could
for release 3 though.

Tom

On Thu, Jan 31, 2013 at 8:12 PM, Arun C Murthy a...@hortonworks.com wrote:
 Stack,

 On Jan 30, 2013, at 9:25 PM, Stack wrote:

 I find the above opaque and written in a cryptic language that I might grok
 if I spent a day or two running over cited issues trying to make some
 distillation of the esotericia debated therein.  If you want feedback from
 other than the cognescenti, I would suggest a better summation of what all
 is involved.


 I apologize if there was too much technical details.

 The simplified version is that hadoop-2 isn't baked as it stands today, and 
 is not viable to be supported by this community in a stable manner. In 
 particular, it is due to the move to PB for HDFS protocols and the freshly 
 minted YARN apis/protocols. As a result, we have been forced to make 
 (incompatible) changes in every hadoop-2 release so far (2.0.0, 2.0.2 etc.). 
 Since we released the previous bits we have found security issues, bugs and 
 other issues which will cause long-term maintenance harm (details are in the 
 HADOOP/HDFS/YARN jiras in the original email).

 My aim, as the RM, is to try nudge (nay, force) all contributors to spend 
 time over the next couple of months focussing on fixing known issues and to 
 look for other surprises - this way I hope to ensure we do not have further 
 incompatible changes for downstream projects and we can support hadoop-2 for 
 at least a couple of years. I hope this makes sense to you. I don't think 
 turning around and calling these 3.x or 4.x makes things better since no 
 amount of numbering lipstick will make the software better or viable for the 
 long-term for both users and other projects. Worse, it will force HBase and 
 other projects to deal with *even more* major Hadoop releases... which seems 
 like a royal pita.

 I hope that clarifies things. Thanks Stack.

 Arun

Re: Release numbering for branch-2 releases

2013-02-01 Thread Tom White

On Wed, Jan 30, 2013 at 11:32 PM, Vinod Kumar Vavilapalli
vino...@hortonworks.com wrote:
 I still have a list of pending API/protocol cleanup in YARN that need to be
 in before we even attempt supporting compatibility further down the road.

To let others track these it would be useful if they were tagged in
JIRA with a label (e.g. apichange).

 There's no way we can support wire compatibility with the APIs in the state
 that they are in now. So, +1 for a beta sometime in March.

 There are some early adopters, I am particularly speaking of YARN, who have
 been instrumental in helping ironing out the alpha software, some with very
 large clusters and end-user base. These users will continue to be affected
 with these API/protocol changes, but the alpha tag was clearly meant to
 clarify this. I think we should graciously send out a note (on general@)
 about an impending beta from where everyone can except a high degree of
 compatibility.

 Just caught up with the discussion on the referred JIRAs. I can clearly see
 how a single release with an umbrella alpha/beta tag is causing tensions
 *only* because we have a single project and product. More reinforcement for
 my proclivity towards separate releases and by extension towards the
 projects' split.

Good point. There's nothing to stop us doing separate releases of
sub-project components now. Doing so might help us find
incompatibilities between the different components in a release line
(2.x at the moment).


 Thanks,
 +Vinod



 On Tue, Jan 29, 2013 at 2:40 PM, Arun C Murthy a...@hortonworks.com wrote:

 Thanks Suresh. Adding back other *-dev lists.

 On Jan 29, 2013, at 1:58 PM, Suresh Srinivas wrote:

  +1 for a release with all the changes that are committed. That way it
  carries all the important bug fixes.
 
 
  So, rather than debate more, I had a brief chat with Suresh and Todd.
 Todd
  suggested calling the next release as hadoop-2.1.0-alpha to indicate the
  incompatibility a little better. This makes sense to me, as long as we
 are
  clear that we won't make any further *feature* releases in hadoop-2.0.x
  series (obviously we might be forced to do security/bug-fix release).
 
 
 
  We have been incorrectly using point releases to introduce features.
 Given
  there are many features in this release, calling it 2.1.0 instead of
 2.0.3
  makes sense. As you said, I am okay with the proposed plan as long as we
 do
  not lapse back to introducing new features in point releases meant for
  critical bugs.





 --
 +Vinod
 Hortonworks Inc.
 http://hortonworks.com/

[jira] [Resolved] (MAPREDUCE-4850) Job recovery may fail if staging directory has been deleted

2013-01-09 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-4850.
--

   Resolution: Fixed
Fix Version/s: 1.2.0
 Hadoop Flags: Reviewed

I ran test-patch and it came back clean. I just committed this.

 Job recovery may fail if staging directory has been deleted
 ---

 Key: MAPREDUCE-4850
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4850
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Affects Versions: 1.1.1
Reporter: Tom White
Assignee: Tom White
 Fix For: 1.2.0

 Attachments: MAPREDUCE-4850.patch, MAPREDUCE-4850.patch


 The job staging directory is deleted in the job cleanup task, which happens 
 before the job-info file is deleted from the system directory (by the 
 JobInProgress garbageCollect() method). If the JT shuts down between these 
 two operations, then when the JT restarts and tries to recover the job, it 
 fails since the job.xml and splits are no longer available.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4850) Job recovery may fail if staging directory has been deleted

2012-12-05 Thread Tom White (JIRA)

Tom White created MAPREDUCE-4850:


 Summary: Job recovery may fail if staging directory has been 
deleted
 Key: MAPREDUCE-4850
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4850
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Affects Versions: 1.1.1
Reporter: Tom White
Assignee: Tom White


The job staging directory is deleted in the job cleanup task, which happens 
before the job-info file is deleted from the system directory (by the 
JobInProgress garbageCollect() method). If the JT shuts down between these two 
operations, then when the JT restarts and tries to recover the job, it fails 
since the job.xml and splits are no longer available.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4824) Provide a mechanism for jobs to indicate they should not be recovered on restart

2012-11-27 Thread Tom White (JIRA)

Tom White created MAPREDUCE-4824:


 Summary: Provide a mechanism for jobs to indicate they should not 
be recovered on restart
 Key: MAPREDUCE-4824
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4824
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: mrv1
Affects Versions: 1.1.0
Reporter: Tom White
Assignee: Tom White


Some jobs (like Sqoop or HBase jobs) are not idempotent, so should not be 
recovered on jobtracker restart. MAPREDUCE-2702 solves this problem for MR2, 
however the approach there is not applicable for MR1, since even if we only use 
the job-level part of the patch and add a isRecoverySupported method to 
OutputCommitter, there is no way to use that information from the JT (which 
initiates recovery), since the JT does not instantiate OutputCommitters - and 
it shouldn't since they are user-level code. (In MR2 it's OK since the MR AM 
calls the method.)

Instead, we can add a MR configuration property to say that a job is not 
recoverable, and the JT could safely read this from the job conf.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4632) Make sure MapReduce declares correct set of dependencies

2012-09-04 Thread Tom White (JIRA)

Tom White created MAPREDUCE-4632:


 Summary: Make sure MapReduce declares correct set of dependencies
 Key: MAPREDUCE-4632
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4632
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 2.0.0-alpha
Reporter: Tom White


This is the equivalent of HADOOP-8278 for MapReduce.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Reopened] (MAPREDUCE-4567) Fix failing TestJobKillAndFail in branch-1

2012-08-31 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White reopened MAPREDUCE-4567:
--


Reverted and reopened along with MAPREDUCE-4488 - see that issue for details.

 Fix failing TestJobKillAndFail in branch-1
 --

 Key: MAPREDUCE-4567
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4567
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Affects Versions: 1.2.0
Reporter: Tom White
Assignee: Tom White
 Fix For: 1.2.0

 Attachments: MAPREDUCE-4567.patch


 This was introduced in MAPREDUCE-4488.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4610) Support deprecated mapreduce.job.counters.limit property in MR2

2012-08-30 Thread Tom White (JIRA)

Tom White created MAPREDUCE-4610:


 Summary: Support deprecated mapreduce.job.counters.limit property 
in MR2
 Key: MAPREDUCE-4610
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4610
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 2.0.0-alpha
Reporter: Tom White
Assignee: Tom White


The property mapreduce.job.counters.limit was introduced in MAPREDUCE-1943, but 
the mechanism was changed in MAPREDUCE-901 where the property name was changed 
to mapreduce.job.counters.max without supporting the old name. We should 
deprecate but honour the old name to make it easier for folks to move from 
Hadoop 1 to Hadoop 2.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-4567) Fix failing TestJobKillAndFail in branch-1

2012-08-21 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-4567.
--

  Resolution: Fixed
Hadoop Flags: Reviewed

I committed this to branch-1.

 Fix failing TestJobKillAndFail in branch-1
 --

 Key: MAPREDUCE-4567
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4567
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Affects Versions: 1.2.0
Reporter: Tom White
Assignee: Tom White
 Fix For: 1.2.0

 Attachments: MAPREDUCE-4567.patch


 This was introduced in MAPREDUCE-4488.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-4488) Port MAPREDUCE-463 (The job setup and cleanup tasks should be optional) to branch-1

2012-08-13 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-4488.
--

  Resolution: Fixed
Hadoop Flags: Reviewed

I just committed this to branch-1. (Ran unit tests and test-patch successfully.)

 Port MAPREDUCE-463 (The job setup and cleanup tasks should be optional) to 
 branch-1
 ---

 Key: MAPREDUCE-4488
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4488
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: mrv1, performance
Affects Versions: 1.0.3
Reporter: Tom White
Assignee: Tom White
 Attachments: MAPREDUCE-4488.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-4257) Support fair-sharing option within a MR2 Capacity Scheduler queue

2012-07-11 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-4257.
--

Resolution: Invalid

Yes, I don't think we need this.

 Support fair-sharing option within a MR2 Capacity Scheduler queue
 -

 Key: MAPREDUCE-4257
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4257
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: capacity-sched, mrv2
Reporter: Tom White
Assignee: Karthik Kambatla

 The fair scheduler can run jobs in a single pool (queue) in FIFO or fair 
 share mode. In FIFO mode one job runs at a time, in priority order, while in 
 fair share mode multiple jobs can run at the same time, and they share the 
 capacity of the pool. This JIRA is to add the latter feature to Capacity 
 Scheduler as an option - the default would remain FIFO.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-3837) Hadoop 22 Job tracker is not able to recover job in case of crash and after that no user can submit job.

2012-07-03 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-3837.
--

   Resolution: Fixed
Fix Version/s: (was: 1.1.1)
   1.2.0

I just committed this to branch-1. Thanks Mayank!

 Hadoop 22 Job tracker is not able to recover job in case of crash and after 
 that no user can submit job.
 

 Key: MAPREDUCE-3837
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3837
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.22.0, 1.1.1
Reporter: Mayank Bansal
Assignee: Mayank Bansal
 Fix For: 0.24.0, 1.2.0, 0.22.1, 0.23.2

 Attachments: PATCH-HADOOP-1-MAPREDUCE-3837-1.patch, 
 PATCH-HADOOP-1-MAPREDUCE-3837-2.patch, PATCH-HADOOP-1-MAPREDUCE-3837-3.patch, 
 PATCH-HADOOP-1-MAPREDUCE-3837-4.patch, PATCH-HADOOP-1-MAPREDUCE-3837.patch, 
 PATCH-MAPREDUCE-3837.patch, PATCH-TRUNK-MAPREDUCE-3837.patch


 If job tracker is crashed while running , and there were some jobs are 
 running , so if job tracker's property mapreduce.jobtracker.restart.recover 
 is true then it should recover the job.
 However the current behavior is as follows
 jobtracker try to restore the jobs but it can not . And after that jobtracker 
 closes its handle to hdfs and nobody else can submit job. 
 Thanks,
 Mayank

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4337) Support multiple reducers in 'uber' jobs

2012-06-11 Thread Tom White (JIRA)

Tom White created MAPREDUCE-4337:


 Summary: Support multiple reducers in 'uber' jobs
 Key: MAPREDUCE-4337
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4337
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Reporter: Tom White


This is MAPREDUCE-434 for 'uber' jobs (jobs that run in the same container as 
the AM).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4299) Terasort hangs with MR2 FifoScheduler

2012-05-31 Thread Tom White (JIRA)

Tom White created MAPREDUCE-4299:


 Summary: Terasort hangs with MR2 FifoScheduler
 Key: MAPREDUCE-4299
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4299
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 2.0.0-alpha
Reporter: Tom White


What happens is that the number of reducers ramp up until they occupy all of 
the job's containers, at which point the maps no longer make any progress and 
the job hangs.

When the same job is run with the CapacityScheduler it succeeds, so this looks 
like a FifoScheduler bug.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4252) MR2 job never completes with 1 pending task

2012-05-11 Thread Tom White (JIRA)

Tom White created MAPREDUCE-4252:


 Summary: MR2 job never completes with 1 pending task
 Key: MAPREDUCE-4252
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4252
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.1
Reporter: Tom White
Assignee: Tom White


This was found by ATM:

bq. I ran a teragen with 1000 map tasks. Many task attempts failed, but after 
999 of the tasks had completed, the job is now sitting forever with 1 task 
pending.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4230) Ensure framework counter group synchronization is correct for serialization/deserialization

2012-05-07 Thread Tom White (JIRA)

Tom White created MAPREDUCE-4230:


 Summary: Ensure framework counter group synchronization is correct 
for serialization/deserialization
 Key: MAPREDUCE-4230
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4230
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Tom White
 Fix For: 2.0.0


FrameworkCounterGroup and FileSystemCounterGroup may be susceptible to a race 
outlined in 
https://issues.apache.org/jira/browse/MAPREDUCE-4226?focusedCommentId=13269657page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13269657
 by Robert Joseph Evans.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4226) ConcurrentModificationException in FileSystemCounterGroup

2012-05-04 Thread Tom White (JIRA)

Tom White created MAPREDUCE-4226:


 Summary: ConcurrentModificationException in FileSystemCounterGroup
 Key: MAPREDUCE-4226
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4226
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.1
Reporter: Tom White
Assignee: Tom White


This was seen in a Hive job. I'll attach a failing test case.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4217) Task commit waits for up to 3 seconds

2012-05-01 Thread Tom White (JIRA)

Tom White created MAPREDUCE-4217:


 Summary: Task commit waits for up to 3 seconds
 Key: MAPREDUCE-4217
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4217
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: task
Affects Versions: 0.23.1
Reporter: Tom White
Assignee: Tom White


Following MAPREDUCE-2450, TaskReporter#stopCommunicationThread may wait up to 3 
seconds (the value of PROGRESS_INTERVAL) before it returns. This can be 
noticeable for short running tasks.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-4217) Task commit waits for up to 3 seconds

2012-05-01 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-4217.
--

Resolution: Duplicate

MAPREDUCE-3809 is the same. Thanks for pointing it out Siddharth.

 Task commit waits for up to 3 seconds
 -

 Key: MAPREDUCE-4217
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4217
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: task
Affects Versions: 0.23.1
Reporter: Tom White
Assignee: Tom White

 Following MAPREDUCE-2450, TaskReporter#stopCommunicationThread may wait up to 
 3 seconds (the value of PROGRESS_INTERVAL) before it returns. This can be 
 noticeable for short running tasks.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4168) Support multiple network interfaces

2012-04-20 Thread Tom White (Created) (JIRA)

Support multiple network interfaces
---

 Key: MAPREDUCE-4168
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4168
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: Tom White


Umbrella jira to track the MapReduce side of HADOOP-8198.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4158) Port mapreduce.MapFileOutputFormat to branch-1

2012-04-16 Thread Tom White (Created) (JIRA)

Port mapreduce.MapFileOutputFormat to branch-1
--

 Key: MAPREDUCE-4158
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4158
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: client
Affects Versions: 1.0.2
Reporter: Tom White
Assignee: Tom White


MapFileOutputFormat was missed in MAPREDUCE-3607.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-3378) Create a single 'hadoop-mapreduce' Maven artifact

2012-04-13 Thread Tom White (Resolved) (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-3378.
--

Resolution: Won't Fix

I've opened HADOOP-8278 to track 1. HADOOP-8009 addressed 2. So I'm closing 
this JIRA now.



 Create a single 'hadoop-mapreduce' Maven artifact
 -

 Key: MAPREDUCE-3378
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3378
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.23.0
Reporter: Tom White
 Attachments: MAPREDUCE-3378.patch


 In 0.23.0 there are multiple artifacts (hadoop-mapreduce-client-app, 
 hadoop-mapreduce-client-common, hadoop-mapreduce-client-core, etc). It would 
 be simpler for users to declare a dependency on hadoop-mapreduce (much like 
 there's hadoop-common and hadoop-hdfs). (This would also be a step towards 
 MAPREDUCE-2600.)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4146) Support limits on task status string length and number of block locations in branch-2

2012-04-12 Thread Tom White (Created) (JIRA)

Support limits on task status string length and number of block locations in 
branch-2
-

 Key: MAPREDUCE-4146
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4146
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Tom White


This brings MAPREDUCE-1943 to branch-2. Counter limits were introduced in 
MAPREDUCE-901.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4147) YARN should not have a compile-time dependency on HDFS

2012-04-12 Thread Tom White (Created) (JIRA)

YARN should not have a compile-time dependency on HDFS
--

 Key: MAPREDUCE-4147
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4147
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.23.1
Reporter: Tom White
Assignee: Tom White
 Attachments: MAPREDUCE-4147.patch

YARN doesn't (and shouldn't) use any HDFS-specific APIs, so it should not 
declare HDFS as a compile-time dependency.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4148) MapReduce should not have a compile-time dependency on HDFS

2012-04-12 Thread Tom White (Created) (JIRA)

MapReduce should not have a compile-time dependency on HDFS
---

 Key: MAPREDUCE-4148
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4148
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Tom White
Assignee: Tom White


MapReduce depends on HDFS's DelegationTokenIdentifier (for printing token debug 
information). We should remove this dependency and MapReduce's compile-time 
dependency on HDFS.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-4138) Reduce memory usage of counters due to non-static nested classes

2012-04-11 Thread Tom White (Created) (JIRA)

Reduce memory usage of counters due to non-static nested classes


 Key: MAPREDUCE-4138
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4138
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Tom White
Assignee: Tom White


FrameworkCounter is a non-static nested class of FrameworkCounterGroup which 
means it retains a reference to the outer class, which isn't really needed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-2835) Make per-job counter limits configurable

2012-03-14 Thread Tom White (Resolved) (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-2835.
--

   Resolution: Fixed
Fix Version/s: 1.1.0
 Hadoop Flags: Reviewed

I've just committed this.

 Make per-job counter limits configurable
 

 Key: MAPREDUCE-2835
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2835
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.20.204.0
Reporter: Tom White
Assignee: Tom White
 Fix For: 1.1.0

 Attachments: MAPREDUCE-2835.patch, MAPREDUCE-2835.patch


 The per-job counter limits introduced in MAPREDUCE-1943 are fixed, except for 
 the total number allowed per job (mapreduce.job.counters.limit). It would be 
 useful to make them all configurable.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3935) Annotate Counters.Counter and Counters.Group as @Public

2012-02-28 Thread Tom White (Created) (JIRA)

Annotate Counters.Counter and Counters.Group as @Public
---

 Key: MAPREDUCE-3935
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3935
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: client
Reporter: Tom White
Assignee: Tom White


For clarity these inner classes should be marked as public stable.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3936) Improve counter limits behaviour in 1.x

2012-02-28 Thread Tom White (Created) (JIRA)

Improve counter limits behaviour in 1.x
---

 Key: MAPREDUCE-3936
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3936
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv1
Reporter: Tom White


The code for enforcing counter limits (from MAPREDUCE-1943) creates a static 
JobConf instance to load the limits, which may throw an exception if the client 
limit is set to be lower than the limit on the cluster (perhaps because the 
cluster limit was raised from the default).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3871) Allow symlinking in LocalJobRunner DistributedCache

2012-02-16 Thread Tom White (Created) (JIRA)

Allow symlinking in LocalJobRunner DistributedCache
---

 Key: MAPREDUCE-3871
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3871
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: distributed-cache
Reporter: Tom White
Assignee: Tom White
 Attachments: MAPREDUCE-3871.patch

Currently the LocalJobRunner doesn't create symlinks for files in the 
DistributedCache. It is safe to create symlinks if files of the same name don't 
exist. LocalJobRunner should also delete the symlinks when the job has 
completed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3858) Task attempt failure during commit results in task never completing

2012-02-13 Thread Tom White (Created) (JIRA)

Task attempt failure during commit results in task never completing
---

 Key: MAPREDUCE-3858
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3858
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Reporter: Tom White
Assignee: Tom White
Priority: Critical
 Attachments: MAPREDUCE-3858.patch

On a terasort job a task attempt failed during the commit phase. Another 
attempt was rescheduled, but when it tried to commit it failed.

{noformat}
attempt_1329019187148_0083_r_000586_0 already given a go for committing the 
task output, so killing attempt_1329019187148_0083_r_000586_1
{noformat}

The job hung as new attempts kept getting scheduled only to fail during commit.



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3854) Reinstate environment variable tests in TestMiniMRChildTask

2012-02-10 Thread Tom White (Created) (JIRA)

Reinstate environment variable tests in TestMiniMRChildTask
---

 Key: MAPREDUCE-3854
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3854
 Project: Hadoop Map/Reduce
  Issue Type: Test
  Components: mrv2
Reporter: Tom White
 Fix For: 0.23.2


MAPREDUCE-3716 reinstated one of the tests in TestMiniMRChildTask, but there 
are two more which should be run.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3794) Support mapred.Task.Counter and mapred.JobInProgress.Counter enums for compatibility

2012-02-02 Thread Tom White (Created) (JIRA)

Support mapred.Task.Counter and mapred.JobInProgress.Counter enums for 
compatibility


 Key: MAPREDUCE-3794
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3794
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Tom White
Assignee: Tom White


The new counters are mapreduce.TaskCounter and mapreduce.JobCounter, but we 
should support the old ones too since they are public in Hadoop 1.x.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3749) ConcurrentModificationException in counter groups

2012-01-27 Thread Tom White (Created) (JIRA)

ConcurrentModificationException in counter groups
-

 Key: MAPREDUCE-3749
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3749
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Tom White
Priority: Blocker


Iterating over a counter's groups while adding more groups will cause a 
ConcurrentModificationException.

This was found while running Hive unit tests against a recent 0.23 version.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3684) LocalDistributedCacheManager does not shut down its thread pool

2012-01-17 Thread Tom White (Created) (JIRA)

LocalDistributedCacheManager does not shut down its thread pool
---

 Key: MAPREDUCE-3684
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3684
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: client
Affects Versions: 0.23.0
Reporter: Tom White
Assignee: Tom White


This was observed by running a Hive job in local mode. The job completed but 
the client process did not exit for 60 seconds.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3636) Apply audience and stability annotations to classes in MapReduce for 1.x

2012-01-06 Thread Tom White (Created) (JIRA)

Apply audience and stability annotations to classes in MapReduce for 1.x


 Key: MAPREDUCE-3636
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3636
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: documentation
Reporter: Tom White


Port MAPREDUCE-1623 to branch-1.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3607) Port missing new API mapreduce lib classes to 1.x

2011-12-28 Thread Tom White (Created) (JIRA)

Port missing new API mapreduce lib classes to 1.x
-

 Key: MAPREDUCE-3607
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3607
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: client
Affects Versions: 1.0.0
Reporter: Tom White
Assignee: Tom White


There are a number of classes under mapreduce.lib that are not present in the 
1.x series. Including these would help users and downstream projects using the 
new MapReduce API migrate to later versions of Hadoop in the future.

A few examples of where this would help:
* Sqoop uses mapreduce.lib.db.DBWritable and 
mapreduce.lib.input.CombineFileInputFormat (SQOOP-384).
* Mahout uses mapreduce.lib.output.MultipleOutputs (MAHOUT-822).
* HBase has a backport of mapreduce.lib.partition.InputSampler and 
TotalOrderPartitioner (in org.apache.hadoop.hbase.mapreduce.hadoopbackport) - 
it would be better if it used the ones in Hadoop.


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-3467) Mavenizing har

2011-12-22 Thread Tom White (Resolved) (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-3467.
--

Resolution: Duplicate

This was fixed in HADOOP-7810.

 Mavenizing har
 --

 Key: MAPREDUCE-3467
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3467
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.23.1
Reporter: John George
Priority: Critical

 As part of mapreduce mavenization, har should also be mavenized and added to 
 maven repo

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3595) Add missing TestCounters#testCounterValue test from branch 1 to 0.23

2011-12-22 Thread Tom White (Created) (JIRA)

Add missing TestCounters#testCounterValue test from branch 1 to 0.23


 Key: MAPREDUCE-3595
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3595
 Project: Hadoop Map/Reduce
  Issue Type: Test
  Components: test
Reporter: Tom White
Assignee: Tom White
 Attachments: MAPREDUCE-3595.patch



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3542) Support FileSystemCounter legacy counter group name for compatibility

2011-12-13 Thread Tom White (Created) (JIRA)

Support FileSystemCounter legacy counter group name for compatibility
---

 Key: MAPREDUCE-3542
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3542
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Tom White
Assignee: Tom White


The group name changed from FileSystemCounter to 
org.apache.hadoop.mapreduce.FileSystemCounter, but we should support the old 
one for compatibility's sake. This came up in PIG-2347. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3527) Fix minor API incompatibilities between 1.0 and 0.23

2011-12-09 Thread Tom White (Created) (JIRA)

Fix minor API incompatibilities between 1.0 and 0.23


 Key: MAPREDUCE-3527
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3527
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Tom White
Assignee: Tom White


There are a few minor incompatibilities that were found in HADOOP-7738 and are 
straightforward to fix.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-3428) MR AppMaster CLASSPATH is dependent on the compile-time environment

2011-12-02 Thread Tom White (Resolved) (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-3428.
--

Resolution: Duplicate

 MR AppMaster CLASSPATH is dependent on the compile-time environment 
 

 Key: MAPREDUCE-3428
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3428
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Ahmed Radwan
  Labels: bigtop

 The CLASSPATH for the MapReduce Application master is set using compile time 
 path information, which is typically different from run-time. This will cause 
 failure when running on different environments.
 Specifically, the YarnRunner, and as part ApplicationSubmissionContext 
 creation, sets the classpath for the application master using 
 MRApps.setClasspath(environment), and then the setMRFrameworkClasspath(..) 
 method uses compile time path information present in the 
 mrapp-generated-classpath file (created at compile-time).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3479) JobClient#getJob cannot find local jobs

2011-11-28 Thread Tom White (Created) (JIRA)

JobClient#getJob cannot find local jobs
---

 Key: MAPREDUCE-3479
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3479
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: client
Affects Versions: 0.23.0
Reporter: Tom White
Assignee: Tom White


The problem is that JobClient#submitJob doesn't pass the Cluster object to Job 
for the submission process, which means that two Cluster objects and two 
LocalJobRunner objects are created. LocalJobRunner keeps an instance map of job 
IDs to Jobs, and when JobClient#getJob is called the LocalJobRunner with the 
unpopulated map is used which results in the job not being found.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3416) Rename {start,stop}-all.sh to {start,stop}-yarn.sh for consistency with HDFS

2011-11-16 Thread Tom White (Created) (JIRA)

Rename {start,stop}-all.sh to {start,stop}-yarn.sh for consistency with HDFS


 Key: MAPREDUCE-3416
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3416
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Tom White


There are already {start,stop}-all.sh scripts for starting and stopping all 
Hadoop daemons which conflict with the YARN {start,stop}-all.sh scripts. The 
latter should be renamed {start,stop}-yarn.sh.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3422) Counter display names are not being picked up

2011-11-16 Thread Tom White (Created) (JIRA)

Counter display names are not being picked up
-

 Key: MAPREDUCE-3422
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3422
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Tom White


When running a job I see MAP_INPUT_RECORDS rather than Map input records 
for the counter name. To fix this the resource bundle properties files need to 
be moved to the src/main/resources tree. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3378) Create a single 'hadoop-mapreduce' Maven artifact

2011-11-08 Thread Tom White (Created) (JIRA)

Create a single 'hadoop-mapreduce' Maven artifact
-

 Key: MAPREDUCE-3378
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3378
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Affects Versions: 0.23.0
Reporter: Tom White


In 0.23.0 there are multiple artifacts (hadoop-mapreduce-client-app, 
hadoop-mapreduce-client-common, hadoop-mapreduce-client-core, etc). It would be 
simpler for users to declare a dependency on hadoop-mapreduce (much like 
there's hadoop-common and hadoop-hdfs). (This would also be a step towards 
MAPREDUCE-2600.)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

Re: hadoop-0.23.0 jiras

2011-10-31 Thread Tom White

Hi Arun,

A couple more issues that I think are blockers:

HADOOP-7782 (Aggregate project javadocs) should be a blocker since
otherwise private APIs are published as a part of the public API.
MAPREDUCE-3237 (Move LocalJobRunner to hadoop-mapreduce-client-core
module) is also a blocker since without it I don't think it's possible
to run local jobs.

Also, what do you want to do about the documentation? Some is in
Forrest and some is in APT.

Here are the current 0.23 blockers:
https://issues.apache.org/jira/secure/IssueNavigator.jspa?mode=hiderequestId=12317667

Cheers,
Tom

On Mon, Oct 31, 2011 at 12:11 AM, Arun C Murthy a...@hortonworks.com wrote:
 Done. Ping me if you find anything astray.

 thanks,
 Arun

 On Oct 30, 2011, at 9:18 PM, Arun C Murthy wrote:

 FYI - I'm starting to change 'fix version' for all non-blockers and do a 
 final pass on open blockers.

 I'll send a note out when done.

 thanks,
 Arun

[jira] [Created] (MAPREDUCE-3237) Move LocalJobRunner to hadoop-mapreduce-client-core module

2011-10-20 Thread Tom White (Created) (JIRA)

Move LocalJobRunner to hadoop-mapreduce-client-core module
--

 Key: MAPREDUCE-3237
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3237
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: client
Reporter: Tom White
 Fix For: 0.23.0


LocalJobRunner works independently of MR1 (jobtracker and tasktrackers) and MR2 
(YARN). The MR1 directory is being kept around only to support unit tests, so 
LocalJobRunner should be moved out to somewhere more permanent.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-3003) Publish Yarn and MapReduce artifacts to Maven snapshot repository

2011-09-13 Thread Tom White (JIRA)

Publish Yarn and MapReduce artifacts to Maven snapshot repository
-

 Key: MAPREDUCE-3003
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3003
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build
Reporter: Tom White
Assignee: Tom White


Currently this is failing since no distribution management section is defined 
in the POM.

https://builds.apache.org/view/G-L/view/Hadoop/job/Hadoop-Common-trunk-Commit/883/consoleFull

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Re: Problem getting the code from SVN

2011-08-22 Thread Tom White

On Sun, Aug 21, 2011 at 11:13 AM, Konstantin Boudnik c...@apache.org wrote:
 Seems like it has been broken in the recent re-shuffle of the workspace due to
 the maven changes. This external now lives under hadoop-common, apparently.

It's actually still under common, however it's no longer needed for
HDFS since it has been replaced by test-patch.sh in dev-support. I'll
file a ticket to remove the externals.


 On Sun, Aug 21, 2011 at 07:20PM, Praveen Sripati wrote:
 Hi,

 When I try to get the code from svn, I get the below error.

 svn co http://svn.apache.org/repos/asf/hadoop/common/trunk/

 A    trunk/hadoop-mapreduce/bin/mapred-config.sh
 A    trunk/hadoop-mapreduce/bin/stop-mapred.sh
 A    trunk/hadoop-mapreduce/bin/mapred
 A    trunk/hadoop-mapreduce/bin/start-mapred.sh
 A    trunk/hadoop-project
 A    trunk/hadoop-project/pom.xml
  U   trunk

 Fetching external item into 'trunk/hadoop-hdfs/src/test/bin'
 svn: warning: OPTIONS of '
 https://svn.apache.org/repos/asf/hadoop/common/trunk/common/src/test/bin':
 Could not resolve hostname `svn.apache.org': No address associated with
 hostname (https://svn.apache.org)

Is this a DNS issue?

Cheers,
Tom


 svn: warning: Error handling externals definition for
 'trunk/hadoop-mapreduce/src/test/bin':
 svn: warning: OPTIONS of '
 https://svn.apache.org/repos/asf/hadoop/common/trunk/common/src/test/bin':
 Could not resolve hostname `svn.apache.org': No address associated with
 hostname (https://svn.apache.org)
 Checked out revision 1159979.

 Thanks,
 Praveen

Re: Mavenizing the HDFS build

2011-08-19 Thread Tom White

HDFS-2096 is now committed to trunk. The instructions for building
HDFS can be found in the top-level BUILDING.txt file.

I added a script to https://issues.apache.org/jira/browse/HADOOP-7500
to help with migrating HDFS patches to the new layout.

There are a few follow-up patches that need doing soon (e.g.
HADOOP-7498, HADOOP-7496, MAPREDUCE-2856), but these shouldn't stop
folks from doing development as usual.

Thanks to everyone who helped with this!

Cheers,
Tom

On Thu, Aug 18, 2011 at 11:30 AM, Tom White t...@cloudera.com wrote:
 Now that MR-279 has been merged into trunk, I plan to commit the HDFS
 mavenization changes tomorrow (Friday) at 9am PDT.

 Cheers,
 Tom

 On Mon, Aug 15, 2011 at 1:24 PM, Arun C Murthy a...@hortonworks.com wrote:
 Thanks Tom.

 I'm running the final set of tests with the 'MR-279 rebased on trunk' and 
 should be done by tmrw.

 Also, can you guys please ensure that secure HDFS works after mvn'ization?

 thanks,
 Arun

 On Aug 13, 2011, at 9:39 PM, Tom White wrote:

 Hi Arun,

 I'm fine with that. When do you expect to start the vote?

 Cheers,
 Tom

 On Fri, Aug 12, 2011 at 11:41 PM, Arun C Murthy a...@hortonworks.com 
 wrote:
 Hi Tom,

  Can I request you to wait on this commit until we merge MR-279? As Vinod 
 pointed out in his mail to mapreduce-dev@ we are very close to getting the 
 merge done. We should call a vote asap. By holding off it the mvn patch it 
 will save us a bit of time - we spent at more than a couple of days on 
 resolving after the common mvn'ization.

  Thanks for understanding.

 Arun

 On Aug 12, 2011, at 4:18 PM, Tom White wrote:

 The work in https://issues.apache.org/jira/browse/HDFS-2096 is ready
 to be committed, so unless there are any objections I will do so on
 Monday at 5pm UTC (that's 10am PDT, http://s.apache.org/o6F).

 I'll also create a script to convert patches to the new layout, and
 switch over the Jenkins jobs that test and build HDFS.

 Cheers,
 Tom

Re: Mavenizing the HDFS build

2011-08-18 Thread Tom White

Now that MR-279 has been merged into trunk, I plan to commit the HDFS
mavenization changes tomorrow (Friday) at 9am PDT.

Cheers,
Tom

On Mon, Aug 15, 2011 at 1:24 PM, Arun C Murthy a...@hortonworks.com wrote:
 Thanks Tom.

 I'm running the final set of tests with the 'MR-279 rebased on trunk' and 
 should be done by tmrw.

 Also, can you guys please ensure that secure HDFS works after mvn'ization?

 thanks,
 Arun

 On Aug 13, 2011, at 9:39 PM, Tom White wrote:

 Hi Arun,

 I'm fine with that. When do you expect to start the vote?

 Cheers,
 Tom

 On Fri, Aug 12, 2011 at 11:41 PM, Arun C Murthy a...@hortonworks.com wrote:
 Hi Tom,

  Can I request you to wait on this commit until we merge MR-279? As Vinod 
 pointed out in his mail to mapreduce-dev@ we are very close to getting the 
 merge done. We should call a vote asap. By holding off it the mvn patch it 
 will save us a bit of time - we spent at more than a couple of days on 
 resolving after the common mvn'ization.

  Thanks for understanding.

 Arun

 On Aug 12, 2011, at 4:18 PM, Tom White wrote:

 The work in https://issues.apache.org/jira/browse/HDFS-2096 is ready
 to be committed, so unless there are any objections I will do so on
 Monday at 5pm UTC (that's 10am PDT, http://s.apache.org/o6F).

 I'll also create a script to convert patches to the new layout, and
 switch over the Jenkins jobs that test and build HDFS.

 Cheers,
 Tom

Re: [VOTE] Merge MR-279 to trunk.

2011-08-17 Thread Tom White

Excellent work. +1

Tom

On Tue, Aug 16, 2011 at 2:14 PM, Mahadev Konar maha...@hortonworks.com wrote:
 Hi all,

  We are excited to let you know that we have MR-279 ready to be merged to 
 trunk. I have uploaded necessary details on 
 https://issues.apache.org/jira/browse/MAPREDUCE-279.

  Please take a look and vote.

  Clearly I am +1 on it.

 thanks
 mahadev

Re: Mavenizing the HDFS build

2011-08-13 Thread Tom White

Hi Arun,

I'm fine with that. When do you expect to start the vote?

Cheers,
Tom

On Fri, Aug 12, 2011 at 11:41 PM, Arun C Murthy a...@hortonworks.com wrote:
 Hi Tom,

  Can I request you to wait on this commit until we merge MR-279? As Vinod 
 pointed out in his mail to mapreduce-dev@ we are very close to getting the 
 merge done. We should call a vote asap. By holding off it the mvn patch it 
 will save us a bit of time - we spent at more than a couple of days on 
 resolving after the common mvn'ization.

  Thanks for understanding.

 Arun

 On Aug 12, 2011, at 4:18 PM, Tom White wrote:

 The work in https://issues.apache.org/jira/browse/HDFS-2096 is ready
 to be committed, so unless there are any objections I will do so on
 Monday at 5pm UTC (that's 10am PDT, http://s.apache.org/o6F).

 I'll also create a script to convert patches to the new layout, and
 switch over the Jenkins jobs that test and build HDFS.

 Cheers,
 Tom

[jira] [Created] (MAPREDUCE-2835) Make per-job counter limits configurable

2011-08-11 Thread Tom White (JIRA)

Make per-job counter limits configurable


 Key: MAPREDUCE-2835
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2835
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.20.204.0
Reporter: Tom White
Assignee: Tom White
 Fix For: 0.20.205.0


The per-job counter limits introduced in MAPREDUCE-1943 are fixed, except for 
the total number allowed per job (mapreduce.job.counters.limit). It would be 
useful to make them all configurable.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (MAPREDUCE-2741) Make ant build system work with hadoop-common JAR generated by Maven

2011-08-02 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-2741.
--

  Resolution: Fixed
Hadoop Flags: [Reviewed]

I've committed this. Thanks, Alejandro.

 Make ant build system work with hadoop-common JAR generated by Maven
 

 Key: MAPREDUCE-2741
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2741
 Project: Hadoop Map/Reduce
  Issue Type: Task
  Components: build
Affects Versions: 0.23.0
Reporter: Alejandro Abdelnur
Assignee: Alejandro Abdelnur
 Fix For: 0.23.0

 Attachments: MAPRED-2741.patch, MAPREDUCE-2741.patch


 Some tweaks must be done in MAPRED  its contribs ivy configuration to work 
 with HADOOP-6671.
 This wil be a temporary fix until MAPRED is mavenized.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Re: MR1 next steps

2011-07-15 Thread Tom White

+1 for #2 as long as the user-level MR API remains compatible.

Cheers,
Tom

On Thu, Jul 7, 2011 at 9:58 AM, Eli Collins e...@cloudera.com wrote:
 Hey gang,

 Had some discussion about what to do with MR1 with Arun at the summit,
 wanted to move it on-list.. Was thinking we should sort these out some
 on mr-dev before discussing/announcing a decision on general.

 The question is, now that we'll soon have MR2 merged (hurray!), to
 what extent do we ant to support MR1?  By MR1 I mean the JT and TT,
 not the old MR API, which MR2 supports. Ie this isn't about job API
 compatibility it's about implementation compatibility (eg existing
 systems which may depend on JT/TT interfaces like metrics). Here are
 the options as I see them:

 1. Do nothing. MR1 will continue to be a regression, both in terms of
 features and stability, against the MR in 203. Eg, MR1 in trunk still
 doesn't support security. We would continue to recommend people use
 MR1 from 20 (and MR2 from 23). Unclear what the value of having MR1 in
 trunk in this shape is.

 2. Remove the MR1 code from trunk/23, and just support MR2 in 23.
 People who want MR1 can use the current stable release (which, per
 option 1, we would recommend even if we left the code in as is).

 3. Get MR1 in trunk in shape comparable to MR in 203. This preserves
 the additional changes (to JT/TT at least) that have been added in
 trunk since 0.20. Not clear if anyone would want to invest the
 considerable effort this would take given that we have MR2 now (and
 existing releases).

 4. Put the MR1 code from 203 into trunk. This overwrites the changes
 added to trunk not in 203, and would require some integration, however
 it would give us a solid MR1 implementation that could be used in the
 same release as MR2. It would be an incompatible change wrt 21/22,
 however would be compatible in the sense that there are now both valid
 MR1 and MR2 options in a single release.

 I think #2 makes the most sense. From a developer perspective, MR2 is
 good stuff, there's no need for us to maintain two implementations in
 trunk/23 since we're already maintaining MR1 in the current releases.
 I'm skeptical that anyone would volunteer to do #3 (lot of work,
 unclear gain) or #4 (we already maintain MR1 elsewhere).  This allows
 us to focus energy on MR2 instead of investing in MR1 (eg MR-2178,
 which hasn't made much progress for ages).  From a user perspective,
 MR2 preserves Job compatibility, so it should just programs that talk
 to the JT/TT that are affected. MR2 is a little harder to run
 out-of-the-box, however we can fix that and we don't recommend people
 use MR1 from 21/22/trunk anyway.

 Thoughts?

 Thanks,
 Eli

[jira] [Created] (MAPREDUCE-2638) Create a simple stress test for the fair scheduler

2011-07-01 Thread Tom White (JIRA)

Create a simple stress test for the fair scheduler
--

 Key: MAPREDUCE-2638
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2638
 Project: Hadoop Map/Reduce
  Issue Type: Test
  Components: contrib/fair-share
Reporter: Tom White
Assignee: Tom White


This would be a test that runs against a cluster, typically with settings that 
allow preemption to be exercised.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-2455) Remove deprecated JobTracker.State in favour of JobTrackerStatus

2011-04-26 Thread Tom White (JIRA)

Remove deprecated JobTracker.State in favour of JobTrackerStatus


 Key: MAPREDUCE-2455
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2455
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Tom White
Assignee: Tom White
 Fix For: 0.23.0


MAPREDUCE-2337 deprecated getJobTrackerState() on ClusterStatus, this issue is 
to remove the getter (in favour of getJobTrackerStatus(), which will remain) so 
there is no longer a direct dependency of the public API on JobTracker. This is 
for MAPREDUCE-1638.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-2421) Remove JobHistory's dependency on JobTracker

2011-04-06 Thread Tom White (JIRA)

Remove JobHistory's dependency on JobTracker


 Key: MAPREDUCE-2421
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2421
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: client
Reporter: Tom White
Assignee: Tom White


The direct dependency can be removed by introducing a callback inteface. This 
is for MAPREDUCE-1638. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (MAPREDUCE-2400) Remove Cluster's dependency on JobTracker

2011-03-21 Thread Tom White (JIRA)

Remove Cluster's dependency on JobTracker
-

 Key: MAPREDUCE-2400
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2400
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: client
Reporter: Tom White
Assignee: Tom White


Introduce a factory using ServiceLoader to remove the direct dependency.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] Resolved: (MAPREDUCE-2392) TaskTracker shutdown in the tests sometimes take 60s

2011-03-17 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-2392.
--

   Resolution: Fixed
Fix Version/s: 0.22.0
 Hadoop Flags: [Reviewed]

I've just committed this.

 TaskTracker shutdown in the tests sometimes take 60s
 

 Key: MAPREDUCE-2392
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2392
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Tom White
Assignee: Tom White
 Fix For: 0.22.0

 Attachments: MAPREDUCE-2392.patch


 There are a lot of the following in the test logs:
 {noformat}
 2011-03-16 13:47:02,267 INFO  mapred.TaskTracker 
 (TaskTracker.java:shutdown(1275)) - Shutting down StatusHttpServer
 2011-03-16 13:48:02,349 ERROR mapred.TaskTracker 
 (TaskTracker.java:offerService(1609)) - Caught exception: 
 java.io.IOException: Call to localhost/127.0.0.1:57512 failed on local 
 exception: java.nio.channels.ClosedByInterruptException
 {noformat}
 Note there is over one minute between the first line and the second.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] Created: (MAPREDUCE-2351) mapred.job.tracker.history.completed.location should support an arbitrary filesystem URI

2011-03-01 Thread Tom White (JIRA)

mapred.job.tracker.history.completed.location should support an arbitrary 
filesystem URI


 Key: MAPREDUCE-2351
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2351
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Tom White
Assignee: Tom White


Currently, mapred.job.tracker.history.completed.location is resolved relative 
to the default filesystem. If not set it defaults to history/done in the local 
log directory. There is no way to set it to another local filesystem location 
(with a file:// URI) or an arbitrary Hadoop filesystem.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] Created: (MAPREDUCE-2338) Deprecate field selection methods on JobConf

2011-02-18 Thread Tom White (JIRA)

Deprecate field selection methods on JobConf


 Key: MAPREDUCE-2338
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2338
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Tom White


KeyFieldBasedComparator and KeyFieldBasedPartitioner are libraries so should 
not appear as getters and setters in JobConf. For the new API there are getters 
and setters on KeyFieldBasedComparator and KeyFieldBasedPartitioner themselves 
- we should add the equivalent to the old API and deprecate the references in 
JobConf.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] Created: (MAPREDUCE-2337) Remove dependence of public MapReduce API on classes in server package

2011-02-17 Thread Tom White (JIRA)

Remove dependence of public MapReduce API on classes in server package
--

 Key: MAPREDUCE-2337
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2337
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Tom White
Assignee: Tom White


Cluster#getJobTrackerState() returns a 
org.apache.hadoop.mapreduce.server.jobtracker.State enum, which makes the API 
in o.a.h.mapreduce have a dependency on the server package. It would be better 
to make the public API self-contained by using an equivalent enum in the 
Cluster class.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] Created: (MAPREDUCE-2282) MapReduce tests don't compile following HDFS-1561

2011-01-21 Thread Tom White (JIRA)

MapReduce tests don't compile following HDFS-1561
-

 Key: MAPREDUCE-2282
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2282
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Tom White
Priority: Blocker


TestMRServerPorts depends on TestHDFSServerPorts which was changed by 
HDFS-1561, resulting in a compilation failure.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-2273) TaskLogServlet does not set content type

2011-01-19 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-2273.
--

Resolution: Duplicate

 TaskLogServlet does not set content type
 

 Key: MAPREDUCE-2273
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2273
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.22.0
Reporter: Tom White
Priority: Blocker

 TaskLogServlet has never set the content type, even though it can serve plain 
 text or HTML, but since HADOOP-7093 the problem has been highlighted since it 
 serves HTML as text/plain (before it was serving plain text as text/html).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-2086) CHANGES.txt does not reflect the release of version 0.21.0.

2010-10-05 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-2086.
--

Resolution: Fixed
  Assignee: Tom White

I've fixed this.

 CHANGES.txt does not reflect the release of version 0.21.0.
 ---

 Key: MAPREDUCE-2086
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2086
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.21.0
Reporter: Konstantin Shvachko
Assignee: Tom White
 Fix For: 0.21.1


 CHANGES.txt should show the release date for 0.21.0 and include section for 
 for 0.21.1 - Unreleased. Latest changes, that did not make into 0.21.0, 
 should be moved under 0.21.1 section.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-2092) Trunk can't be compiled

2010-09-24 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-2092.
--

Resolution: Fixed

HADOOP-6951 has now been reverted and I confirmed MapReduce trunk now compiles. 
Sorry for the noise.

 Trunk can't be compiled
 ---

 Key: MAPREDUCE-2092
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2092
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: tasktracker
Reporter: Konstantin Boudnik

 Compilation of the trunk is broken because of an attempt to call
 {{ServiceAuthorizationManager.refresh}} from a static content. 
 0.21 branch seems to be Ok.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-2014) Remove task-controller from 0.21 branch

2010-08-16 Thread Tom White (JIRA)

Remove task-controller from 0.21 branch
---

 Key: MAPREDUCE-2014
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2014
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: security
Reporter: Tom White
Assignee: Tom White
 Fix For: 0.21.0




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1870) Harmonize MapReduce JAR library versions with Common and HDFS

2010-06-15 Thread Tom White (JIRA)

Harmonize MapReduce JAR library versions with Common and HDFS
-

 Key: MAPREDUCE-1870
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1870
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build
Reporter: Tom White
Assignee: Tom White
Priority: Blocker
 Fix For: 0.21.0


MapReduce part of HADOOP-6800.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Re: building 0.21

2010-06-08 Thread Tom White

Perhaps you're not getting up-to-date libraries? Try

common ant clean jar mvn-install
hdfs ant veryclean jar mvn-install -Dresolvers=internal
mapreduce ant veryclean jar -Dresolvers=internal

This works for me with the 0.21 branches.

Cheers,
Tom

On Tue, Jun 8, 2010 at 9:48 AM, Torsten Curdt tcu...@apache.org wrote:
 I've build all the various dependencies

  hadoop-commons (branch 0.21)
  hadoop-hdfs (branch 0.21)
  hadoop (trunk == 0.21?? there is no 0.21 branch)

 and then

  hadoop-mapreduce (branch 0.21)

 Using it I get the following exeception:

  Exception in thread main java.lang.NoSuchMethodError:
 org.apache.hadoop.conf.Configuration.addDeprecation(Ljava/lang/String;[Ljava/lang/String;)V
        at 
 org.apache.hadoop.mapreduce.util.ConfigUtil.addDeprecatedKeys(ConfigUtil.java:49)
        at 
 org.apache.hadoop.mapreduce.util.ConfigUtil.loadResources(ConfigUtil.java:40)
        at org.apache.hadoop.mapreduce.Cluster.clinit(Cluster.java:64)

 What's going on?

 cheers
 --
 Torsten

[jira] Resolved: (MAPREDUCE-1810) 0.21 build is broken

2010-05-27 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-1810.
--

Fix Version/s: 0.21.0
   Resolution: Fixed

I've just committed this (to the 0.21 branch).

 0.21 build is broken
 

 Key: MAPREDUCE-1810
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1810
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build
Affects Versions: 0.21.0
Reporter: Sharad Agarwal
Assignee: Tom White
 Fix For: 0.21.0

 Attachments: MAPREDUCE-1810.patch, MAPREDUCE-1810.patch


 /src/java/org/apache/hadoop/mapred/AdminOperationsProtocol.java:31: cannot 
 find symbol
 [javac] symbol  : method value()
 [javac] location: @interface org.apache.hadoop.security.KerberosInfo
 [javac] @KerberosInfo(MRJobConfig.JOB_JOBTRACKER_ID)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-1751) Change MapReduce to depend on Hadoop 'common' artifacts instead of 'core'

2010-05-27 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-1751.
--

Hadoop Flags: [Reviewed]
Assignee: Tom White
  Resolution: Fixed

I've just committed this.

 Change MapReduce to depend on Hadoop 'common' artifacts instead of 'core'
 -

 Key: MAPREDUCE-1751
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1751
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Reporter: Tom White
Assignee: Tom White
Priority: Blocker
 Fix For: 0.21.0

 Attachments: MAPREDUCE-1751.patch


 This is the MapReduce part of HADOOP-6404.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1789) MapReduce trunk fails to compile following HADOOP-6600

2010-05-13 Thread Tom White (JIRA)

MapReduce trunk fails to compile following HADOOP-6600
--

 Key: MAPREDUCE-1789
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1789
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build
Reporter: Tom White
Assignee: Tom White
Priority: Blocker
 Fix For: 0.21.0


A few classes need updating following the change to KerberosInfo introduced in 
HADOOP-6600

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Re: Minutes: Hadoop Contributor Meeting 05/06/2010

2010-05-10 Thread Tom White

Not sure why my attachment didn't make it to the list. Anyway, I've
posted Arun's notes on the wiki at
http://wiki.apache.org/hadoop/HadoopContributorsMeeting20100506, and
included the content of my slide there. (Attachments on the wiki have
been disabled - as of today apparently, see SVN commit r775220 - so I
wasn't able to post the slide there either.)

Tom

On Fri, May 7, 2010 at 9:36 AM, Tom White t...@cloudera.com wrote:
 Here's my (single) slide about the 0.21 release.

 Tom

 On Thu, May 6, 2010 at 5:38 PM, Arun C Murthy acmur...@gmail.com wrote:
 # Shared goals
  - Hadoop is HDFS  Map-Reduce in this context of this set of slides
 # Priorities
  * Yahoo
    - Correctness
    - Availability: Not the same as high-availability (6 9s. etc.) i.e. SPOFs
    - API Compatibility
    - Scalability
    - Operability
    - Performance
    - Innovation
  * Cloudera
    - Test coverage, api coverage
    - APL Licensed codec (lzo replacement)
    - Security
    - Wire compatibility
    - Cluster-wide resource availability
    - New apis (FileContext, MR Context Objs.), documentation of their
 advantages
    - HDFS to better support non-MR use-cases
    - Cluster metrics hooks
    - MR modularity (package)
  * Facebook
    - Correctness
    - Availability, High Availability, Failover, Continuous Availability
    - Scalability
 # Bar for patches/features keeps going higher as the project matures
  - Build consensus (e.g. Python Enhancement Process, JSR etc.)
  - Run/test on your own to prove the concept/feature or branch and finish
  - Early versions of libraries should be started outside of the project
 (github etc.) e.g. input-formats, compression-codecs etc.
    - github for all the above
    - Prune contrib
 # Maven for packaging
 # Tom: hadoop-0.21 (Tom - can you please post your slides? Thanks!)
 # Owen: Release Manager (see slides)
 # Agenda for next meeting
  - Eli: Hadoop Enhancement Process (modelled on PEP?)
  - Branching strategies: Development Models

 Arun

Re: Minutes: Hadoop Contributor Meeting 05/06/2010

2010-05-07 Thread Tom White

Here's my (single) slide about the 0.21 release.

Tom

On Thu, May 6, 2010 at 5:38 PM, Arun C Murthy acmur...@gmail.com wrote:
 # Shared goals
  - Hadoop is HDFS  Map-Reduce in this context of this set of slides
 # Priorities
  * Yahoo
    - Correctness
    - Availability: Not the same as high-availability (6 9s. etc.) i.e. SPOFs
    - API Compatibility
    - Scalability
    - Operability
    - Performance
    - Innovation
  * Cloudera
    - Test coverage, api coverage
    - APL Licensed codec (lzo replacement)
    - Security
    - Wire compatibility
    - Cluster-wide resource availability
    - New apis (FileContext, MR Context Objs.), documentation of their
 advantages
    - HDFS to better support non-MR use-cases
    - Cluster metrics hooks
    - MR modularity (package)
  * Facebook
    - Correctness
    - Availability, High Availability, Failover, Continuous Availability
    - Scalability
 # Bar for patches/features keeps going higher as the project matures
  - Build consensus (e.g. Python Enhancement Process, JSR etc.)
  - Run/test on your own to prove the concept/feature or branch and finish
  - Early versions of libraries should be started outside of the project
 (github etc.) e.g. input-formats, compression-codecs etc.
    - github for all the above
    - Prune contrib
 # Maven for packaging
 # Tom: hadoop-0.21 (Tom - can you please post your slides? Thanks!)
 # Owen: Release Manager (see slides)
 # Agenda for next meeting
  - Eli: Hadoop Enhancement Process (modelled on PEP?)
  - Branching strategies: Development Models

 Arun

[jira] Created: (MAPREDUCE-1751) Change MapReduce to depend on Hadoop 'common' artifacts instead of 'core'

2010-05-03 Thread Tom White (JIRA)

Change MapReduce to depend on Hadoop 'common' artifacts instead of 'core'
-

 Key: MAPREDUCE-1751
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1751
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Reporter: Tom White
Priority: Blocker
 Fix For: 0.21.0


This is the MapReduce part of HADOOP-6404.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1735) Un-deprecate the old MapReduce API in the 0.21 branch

2010-04-27 Thread Tom White (JIRA)

Un-deprecate the old MapReduce API in the 0.21 branch
-

 Key: MAPREDUCE-1735
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1735
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Tom White
Priority: Blocker
 Fix For: 0.21.0


This issue is to un-deprecate the old MapReduce API (in o.a.h.mapred) in the 
next 0.21 release, as discussed at 
http://www.mail-archive.com/mapreduce-dev@hadoop.apache.org/msg01833.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Re: Un-deprecate the old MapReduce API?

2010-04-27 Thread Tom White

It sounds like there's no strong objection to un-deprecating the old
API in 0.20 - I'll create a patch for this (see
https://issues.apache.org/jira/browse/MAPREDUCE-1734).

0.21 is less clear cut. However, if the new API were marked as
Evolving, then it's odd, to say the least, if the old API were
deprecated since it would send a confusing message to users. There
seems to be consensus that the new API is Evolving (please comment on
https://issues.apache.org/jira/browse/MAPREDUCE-1623 to discuss
whether all of the new API should be marked Evolving, which the latest
patch does). Indeed, the new API hasn't seen widespread use yet, so it
still seems premature to deprecate the old API in 0.21. I've opened
https://issues.apache.org/jira/browse/MAPREDUCE-1735 where we can
discuss this particular case further.

Cheers,
Tom

On Fri, Apr 23, 2010 at 9:21 AM, Alan Gates ga...@yahoo-inc.com wrote:
 I don't have any issue with un-deprecating the old APIs.  I agree if changes
 are needed it's better to mark the new APIs to reflect it.  I just hope
 those changes can be kept as backward compatible as possible.  In particular
 with Job, Pig uses that in some of it's APIs that it has declared stable
 (LoadFunc, StoreFunc).

 Alan.

 On Apr 22, 2010, at 11:30 PM, Arun C Murthy wrote:

 Alan,

 On Apr 22, 2010, at 12:12 PM, Alan Gates wrote:

 Speaking for one power user (Pig) that did move to the new APIs, moving
 that interface to evolving is a little unsettling.  Is there a feel for how
 much the new API is going to change?


 The intent isn't to mark the 'new' apis as 'Evolving' to change them
 willy-nilly... please don't read it so!

 This is just a pragmatic proposal to reflect that the 'old' apis will, for
 lack of stabilization of new apis, be supported.

 Given that, the new apis could mostly be stable, but for Job and Cluster -
 is that reasonable? This will ensure we send the right message all concerned
 regarding stability of o.a.h.mapreduce.{Mapper|Reducer|...}. Thoughts?

 Arun

 Alan.

[jira] Created: (MAPREDUCE-1681) MapReduce API compatibility

2010-04-06 Thread Tom White (JIRA)

MapReduce API compatibility
---

 Key: MAPREDUCE-1681
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1681
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build, documentation
Reporter: Tom White
Priority: Blocker
 Fix For: 0.21.0


This is an umbrella issue to document and test MapReduce API compatibility 
across releases.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1650) Exclude Private elements from generated MapReduce Javadoc

2010-03-30 Thread Tom White (JIRA)

Exclude Private elements from generated MapReduce Javadoc
-

 Key: MAPREDUCE-1650
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1650
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: documentation
Reporter: Tom White
Assignee: Tom White


Exclude elements annotated with InterfaceAudience.Private or 
InterfaceAudience.LimitedPrivate from Javadoc and JDiff.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1637) Create a test for API compatibility between releases

2010-03-26 Thread Tom White (JIRA)

Create a test for API compatibility between releases


 Key: MAPREDUCE-1637
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1637
 Project: Hadoop Map/Reduce
  Issue Type: Test
  Components: build, test
Reporter: Tom White
 Fix For: 0.21.0


We should have an automated test (or a set of tests) for checking that programs 
written against an old version of the API still run with a newer version. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1638) Divide MapReduce into API and implementation source trees

2010-03-26 Thread Tom White (JIRA)

Divide MapReduce into API and implementation source trees
-

 Key: MAPREDUCE-1638
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1638
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build, client
Reporter: Tom White
Assignee: Tom White


I think it makes sense to separate the MapReduce source into public API and 
implementation trees. The public API could be broken further into kernel and 
library trees.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1625) Improve grouping of packages in Javadoc

2010-03-24 Thread Tom White (JIRA)

Improve grouping of packages in Javadoc
---

 Key: MAPREDUCE-1625
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1625
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: documentation
Reporter: Tom White
Assignee: Tom White


There are a couple of problems with the current Javadoc:
* The main MapReduce package documentation on the index page appears under 
Other Packages below the fold.
* Some contrib classes and packages are interspersed in the main MapReduce 
documentation, which is very confusing for users.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1626) Publish Javadoc for all contrib packages with user-facing APIs

2010-03-24 Thread Tom White (JIRA)

Publish Javadoc for all contrib packages with user-facing APIs
--

 Key: MAPREDUCE-1626
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1626
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: documentation
Reporter: Tom White


Some packages don't appear in the Javadoc. E.g. MRUnit, Vertica.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1592) Generate Eclipse's .classpath file from Ivy config

2010-03-11 Thread Tom White (JIRA)

Generate Eclipse's .classpath file from Ivy config
--

 Key: MAPREDUCE-1592
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1592
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: build
Reporter: Tom White
Assignee: Tom White


MapReduce companion issue for HADOOP-6407.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1507) The old MapReduce API is only partially deprecated

2010-02-18 Thread Tom White (JIRA)

The old MapReduce API is only partially deprecated
--

 Key: MAPREDUCE-1507
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1507
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Tom White
Assignee: Tom White


Not all of the old API is currently marked as deprecated. E.g. 
org.apache.hadoop.mapred.OutputFormat is deprecated, but 
org.apache.hadoop.mapred.FileOutputFormat isn't.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1511) Examples should not use deprecated APIs

2010-02-18 Thread Tom White (JIRA)

Examples should not use deprecated APIs
---

 Key: MAPREDUCE-1511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1511
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: job submission
Reporter: Tom White
Assignee: Tom White


MAPREDUCE-777 deprecated some APIs which are still being used by the examples. 
This issue is to fix the examples so they use the replacements.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1452) Add a low-level MapReduce API

2010-02-03 Thread Tom White (JIRA)

Add a low-level MapReduce API
-

 Key: MAPREDUCE-1452
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1452
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: Tom White
 Fix For: 0.22.0


Add an API to MapReduce that operates at the raw bytes level. The existing 
(object-based) MapReduce APIs would be implemented on top of the raw API, and 
in future it will be easier to add new APIs (like MAPREDUCE-1183) and 
higher-level abstractions on MapReduce. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1453) Enforce the distinction between MapReduce kernel and library code

2010-02-03 Thread Tom White (JIRA)

Enforce the distinction between MapReduce kernel and library code
-

 Key: MAPREDUCE-1453
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1453
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Tom White
 Fix For: 0.22.0


Currently MapReduce kernel and library code are in the same source tree which 
makes it possible to inadvertently introduce dependencies on the library by the 
kernel. With MAPREDUCE-1452 it makes sense to enforce the compile-time 
dependencies (library depends on kernel, but not vice versa), by having two 
source trees.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1411) Infer MapReduce types where possible

2010-01-25 Thread Tom White (JIRA)

Infer MapReduce types where possible


 Key: MAPREDUCE-1411
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1411
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: client
Reporter: Tom White
Assignee: Tom White


Currently users must specify the output types of the map and reduce if they are 
different to the default (LongWritable keys, Text values). In many cases this 
information is available to the system from the user-supplied mapper and 
reducer, so it would be nice if the user didn't have to specify them in two 
places.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (MAPREDUCE-1349) Create jira component - contrib/index

2009-12-31 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-1349.
--

Resolution: Fixed
  Assignee: Tom White

Fixed.

 Create jira component - contrib/index 
 --

 Key: MAPREDUCE-1349
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1349
 Project: Hadoop Map/Reduce
  Issue Type: Task
Reporter: Kay Kay
Assignee: Tom White

 Can we have a component - contrib/index for issues related to contrib/index  
 (in jira ) 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (MAPREDUCE-1348) Package org.apache.hadoop.blockforensics does not match directory name

2009-12-30 Thread Tom White (JIRA)

Package org.apache.hadoop.blockforensics does not match directory name
--

 Key: MAPREDUCE-1348
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1348
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build
Reporter: Tom White


BlockSearch is in the package org.apache.hadoop.blockforensics, but in the 
source directory org/apache/hadoop/block_forensics. While javac doesn't seem to 
mind about this mismatch, Eclipse treats it as an error.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Re: mapreduce trunk is broken

2009-12-15 Thread Tom White

This is due to https://issues.apache.org/jira/browse/HADOOP-5901,
which I've now reverted. I just did

svn up; ant veryclean compile

and managed to build successfully.

Cheers,
Tom

On Tue, Dec 15, 2009 at 11:24 AM, Zheng Shao zs...@facebook.com wrote:
 http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-h3.grid.sp2.yahoo.net/194/artifact/trunk/patchprocess/trunkReleaseAuditWarnings.txt

 compile-mapred-test:
    [mkdir] Created dir: 
 /grid/0/hudson/hudson-slave/workspace/Mapreduce-Patch-h3.grid.sp2.yahoo.net/trunk/build/test/mapred/classes
    [mkdir] Created dir: 
 /grid/0/hudson/hudson-slave/workspace/Mapreduce-Patch-h3.grid.sp2.yahoo.net/trunk/build/test/mapred/testjar
    [mkdir] Created dir: 
 /grid/0/hudson/hudson-slave/workspace/Mapreduce-Patch-h3.grid.sp2.yahoo.net/trunk/build/test/mapred/testshell
    [javac] Compiling 237 source files to 
 /grid/0/hudson/hudson-slave/workspace/Mapreduce-Patch-h3.grid.sp2.yahoo.net/trunk/build/test/mapred/classes
    [javac] 
 /grid/0/hudson/hudson-slave/workspace/Mapreduce-Patch-h3.grid.sp2.yahoo.net/trunk/src/test/mapred/org/apache/hadoop/mapred/MiniMRCluster.java:342:
  unreported exception java.io.IOException; must be caught or declared to be 
 thrown
    [javac]     FileSystem.setDefaultUri(result, namenode);
    [javac]                             ^
    [javac] Note: Some input files use or override a deprecated API.
    [javac] Note: Recompile with -Xlint:deprecation for details.
    [javac] 1 error

 BUILD FAILED
 /grid/0/hudson/hudson-slave/workspace/Mapreduce-Patch-h3.grid.sp2.yahoo.net/trunk/build.xml:487:
  Compile failed; see the compiler error output for details.

 Total time: 1 minute 16 seconds


 Zheng

[jira] Resolved: (MAPREDUCE-690) Sqoop's test hive script needs to be executable

2009-07-03 Thread Tom White (JIRA)


 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White resolved MAPREDUCE-690.
-

   Resolution: Fixed
Fix Version/s: 0.21.0
 Hadoop Flags: [Reviewed]

+1

I've just committed this. Thanks Aaron!

 Sqoop's test hive script needs to be executable
 -

 Key: MAPREDUCE-690
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-690
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/sqoop
Reporter: Aaron Kimball
Assignee: Aaron Kimball
 Fix For: 0.21.0


 The testdata/hive/bin/hive script needs to be chmod +x so that unit tests can 
 run it. This needs to be set with an svn property.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

1 2 >

100 matches

Mail list logo