[ https://issues.apache.org/jira/browse/MAPREDUCE-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840972#action_12840972 ]
Mahadev konar commented on MAPREDUCE-1548:
------------------------------------------

bq. I think it is very useful to store the information, which can possibly be used in untar, but ls should return the meta data from the part files in order to avoid inconsistency.

I think listStatus should show the information stored in the index file and not the part file permissions. It's a tricky question, but I think the information stored in the index file should be returned. Also, multiple files need to be read in order to read a real archived file (_index, _masterindex, part-*), so we cannot simply return the part-* file permissions and assume that will always work. There could be a case where the _index file is not readable. In that case we will have to throw an appropriate exception saying which files are not readable. But the file status information stored in the index file definitely seems the right thing to return.

> Hadoop archives should be able to preserve times and other properties from original files
> -----------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1548
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1548
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: harchive
>            Reporter: Rodrigo Schmidt
>            Assignee: Rodrigo Schmidt
>
> Files inside hadoop archives don't keep their original:
> - modification time
> - access time
> - permission
> - owner
> - group
> All such properties are currently taken from the file storing the archive index, and not from the stored files. This doesn't look very correct.
> It should be possible to preserve the original properties of the stored files.