[
https://issues.apache.org/jira/browse/MAPREDUCE-2374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Todd Lipcon updated MAPREDUCE-2374:
-----------------------------------
Description:
Some very small percentage of tasks fail with a "Text file busy" error.
The following was the original diagnosis:
{quote}
Our use of PrintWriter in TaskController.writeCommand is unsafe, since that
class swallows all IO exceptions. We're not currently checking for errors,
which I'm seeing result in occasional task failures with the message "Text file
busy" - assumedly because the close() call is failing silently for some reason.
{quote}
.. but turned out to be another issue as well (see below)
was:Our use of PrintWriter in TaskController.writeCommand is unsafe, since
that class swallows all IO exceptions. We're not currently checking for errors,
which I'm seeing result in occasional task failures with the message "Text file
busy" - assumedly because the close() call is failing silently for some reason.
Summary: "Text File Busy" errors launching MR tasks (was: Should not
use PrintWriter to write taskjvm.sh)
Updated description and summary given the original description didn't turn out
to be the true underlying issue.
> "Text File Busy" errors launching MR tasks
> ------------------------------------------
>
> Key: MAPREDUCE-2374
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2374
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Affects Versions: 0.22.0
> Reporter: Todd Lipcon
> Assignee: Todd Lipcon
> Fix For: 0.22.1
>
> Attachments: failed_taskjvmsh.strace, mapreduce-2374-branch-1.patch,
> mapreduce-2374-on-20sec.txt, mapreduce-2374.txt, mapreduce-2374.txt,
> successfull_taskjvmsh.strace
>
>
> Some very small percentage of tasks fail with a "Text file busy" error.
> The following was the original diagnosis:
> {quote}
> Our use of PrintWriter in TaskController.writeCommand is unsafe, since that
> class swallows all IO exceptions. We're not currently checking for errors,
> which I'm seeing result in occasional task failures with the message "Text
> file busy" - assumedly because the close() call is failing silently for some
> reason.
> {quote}
> .. but turned out to be another issue as well (see below)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira