[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated MAPREDUCE-1098:
-----------------------------------------------

    Attachment: patch-1098-3.txt

Updated patch from Arun. The patch does the following:
1. Changes the key of the cacheArchives map to use both the URI and the 
modification time of the file.
2. Changes the localizedPath to be unique for each CacheStatus object.
3. Adds a testcase to test the freshness of the file on the FileSystem. If the 
file gets modified while the job is running, the job fails. If a second job 
sees a different modification time for the file, the file is localized in a 
different path.
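
As a rough illustration of points 1 and 2 (not the code in the patch), keying the cache on both URI and modification time could look like the sketch below. CacheKey, CacheRegistry, localizedPathFor and the /local/cache/ layout are hypothetical names chosen for the example; only the idea of a (URI, modification time) key comes from the list above.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;

// Hypothetical key type: NOT the class used in the actual patch.
// Two keys are equal only if both the URI and the modification time match.
final class CacheKey {
    private final URI uri;
    private final long modificationTime;

    CacheKey(URI uri, long modificationTime) {
        this.uri = uri;
        this.modificationTime = modificationTime;
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof CacheKey)) return false;
        CacheKey other = (CacheKey) o;
        return modificationTime == other.modificationTime && uri.equals(other.uri);
    }

    @Override
    public int hashCode() {
        return Objects.hash(uri, modificationTime);
    }
}

// Hypothetical stand-in for the cacheArchives map and its localized paths.
final class CacheRegistry {
    private final Map<CacheKey, String> localizedPaths = new HashMap<>();
    private int nextId = 0;

    // Returns the existing path for (uri, modificationTime), or assigns a new
    // unique one. The directory layout is a placeholder for illustration.
    synchronized String localizedPathFor(URI uri, long modificationTime) {
        CacheKey key = new CacheKey(uri, modificationTime);
        String path = localizedPaths.get(key);
        if (path == null) {
            path = "/local/cache/" + (nextId++);
            localizedPaths.put(key, path);
        }
        return path;
    }
}
```

With a key like this, a second job that sees the same URI with a newer modification time gets a fresh entry and a different local path, while jobs that see the same modification time share one entry.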

> Incorrect synchronization in DistributedCache causes TaskTrackers to freeze 
> up during localization of Cache for tasks.
> ----------------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1098
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1098
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: tasktracker
>            Reporter: Sreekanth Ramakrishnan
>            Assignee: Amareshwari Sriramadasu
>             Fix For: 0.21.0
>
>         Attachments: MAPREDUCE-1098.patch, MAPREDUCE-1098.patch, 
> MAPREDUCE-1098.patch, patch-1098-0.20.txt, patch-1098-1.txt, 
> patch-1098-2.txt, patch-1098-3.txt, patch-1098-ydist.txt, patch-1098.txt
>
>
> Currently {{org.apache.hadoop.filecache.DistributedCache.getLocalCache(URI, 
> Configuration, Path, FileStatus, boolean, long, Path, boolean)}} allows only 
> one {{TaskRunner}} thread in TT to localize {{DistributedCache}} across jobs. 
> Synchronization is currently across all baseDirs; this has to be changed to 
> lock only on the same baseDir.
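
One way to read the locking change requested in the description above is to replace a single global lock with one lock per baseDir, so that only threads localizing into the same baseDir wait on each other. The sketch below assumes that reading; PerDirLocalizer, lockFor and localize are hypothetical names and not part of DistributedCache.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.function.Supplier;

// Hypothetical helper: NOT the DistributedCache implementation.
final class PerDirLocalizer {
    // One lock object per baseDir, created lazily on first use.
    private final ConcurrentMap<String, Object> locks = new ConcurrentHashMap<>();

    Object lockFor(String baseDir) {
        return locks.computeIfAbsent(baseDir, k -> new Object());
    }

    // Runs the localization work while holding only the lock for this baseDir,
    // so work on other baseDirs can proceed in parallel.
    <T> T localize(String baseDir, Supplier<T> work) {
        synchronized (lockFor(baseDir)) {
            return work.get();
        }
    }
}
```

In this sketch, two threads calling localize with different baseDir values never block each other, whereas a single shared lock would serialize every localization on the TaskTracker.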

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
