[
https://issues.apache.org/jira/browse/MAPREDUCE-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12894173#action_12894173
]
Dick King commented on MAPREDUCE-323:
-------------------------------------
I need to modify {{getMatchingJob(String, String, String[])}} in my comment of
28/Jul/10 03:09 PM as follows:
{noformat}
class PathCow implements Iterator<Path> {
// Iterator<Path> methods
int numberMatches();
// returns number of matches you could get if you drive the Iterator to
// the end. Might be an approximation.
}
PathCow getMatchingJob
(String user, String jobnameSubstring, String[] dateStrings,
boolean backwards)
throws IOException
// has no remove() method
// any criterion can be null
// filtering is conjunctive
// dates are MM/DD/YYYY
// results happen approximately oldest first [or newest first,
// if backwards is true]
// a new file that gets added after the iterator is created can either be
// or not be delivered by the result
// dates are approximations of completion time
{noformat}
> Improve the way job history files are managed
> ---------------------------------------------
>
> Key: MAPREDUCE-323
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-323
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: jobtracker
> Affects Versions: 0.21.0, 0.22.0
> Reporter: Amar Kamat
> Assignee: Dick King
> Priority: Critical
>
> Today all the jobhistory files are dumped in one _job-history_ folder. This
> can cause problems when there is a need to search the history folder
> (job-recovery etc). It would be nice if we group all the jobs under a _user_
> folder. So all the jobs for user _amar_ will go in _history-folder/amar/_.
> Jobs can be categorized using various features like _jobid, date, jobname_
> etc but using _username_ will make the search much more efficient and also
> will not result into namespace explosion.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.