[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun Suresh updated MAPREDUCE-5890:
-----------------------------------

    Attachment: MAPREDUCE-5890.4.patch

[~chris.douglas],
Updating patch.. 
As per your suggestion, I've refactored out the need for another file to store 
the IV.

But I still could not find a consistent way for seamlessly handling the IV : 
* The IV for the Spill file created at the end of the Map phase is now added to 
the spill {{IndexRecord}} and is transmitted to the {{Fetcher}} via the 
{{ShuffleHeader}} in the {{ShuffleHandler}}
* Unfortunately, the intermediate files created by the OnDisk mergers do not 
have an index file associated with them. I was thus forced to write the IV as a 
prefix into the stream. This happens when I "wrap" the outputStream, before 
passing it to the {{IFile}} writer.

> Support for encrypting Intermediate data and spills in local filesystem
> -----------------------------------------------------------------------
>
>                 Key: MAPREDUCE-5890
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5890
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>          Components: security
>    Affects Versions: 2.4.0
>            Reporter: Alejandro Abdelnur
>            Assignee: Arun Suresh
>              Labels: encryption
>         Attachments: MAPREDUCE-5890.3.patch, MAPREDUCE-5890.4.patch, 
> org.apache.hadoop.mapred.TestMRIntermediateDataEncryption-output.txt, 
> syslog.tar.gz
>
>
> For some sensitive data, encryption while in flight (network) is not 
> sufficient, it is required that while at rest it should be encrypted. 
> HADOOP-10150 & HDFS-6134 bring encryption at rest for data in filesystem 
> using Hadoop FileSystem API. MapReduce intermediate data and spills should 
> also be encrypted while at rest.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to