[ https://issues.apache.org/jira/browse/HADOOP-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
[EMAIL PROTECTED] updated HADOOP-1199: -------------------------------------- Attachment: hadoop1199-v2.patch Version 2. Keys, rather than LongWritable line numbers, are now a compound of host, taskid, and line number: e.g. debord.archive.org:task_0023_m_000000_0:11223. > want InputFormat for task logs > ------------------------------ > > Key: HADOOP-1199 > URL: https://issues.apache.org/jira/browse/HADOOP-1199 > Project: Hadoop > Issue Type: New Feature > Components: mapred > Reporter: Doug Cutting > Attachments: hadoop1199-v2.patch, hadoop1199.patch > > > We should provide an InputFormat implementation that includes all the task > logs from a job. Folks should be able to do something like: > job = new JobConf(); > job.setInputFormatClass(TaskLogInputFormat.class); > TaskLogInputFormat.setJobId(jobId); > ... > Tasks should ideally be localized to the node that each log is on. > Examining logs should be as lightweight as possible, to facilitate debugging. > It should not require a copy to HDFS. A faster debug loop is like a faster > search engine: it makes people more productive. The sooner one can find that, > e.g., most tasks failed with a NullPointerException on line 723, the better. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.