[
https://issues.apache.org/jira/browse/MAPREDUCE-5025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13604928#comment-13604928
]
Jerry Chen commented on MAPREDUCE-5025:
---------------------------------------
Encryption codecs and compression codecs work basically the same way, as block
or stream transforms, and therefore encryption can mostly masquerade as
compression. We think this will make encryption features easy to work with.
Consider how compression codecs plug in today at the file level, and in
MapReduce, as options to SequenceFile, etc. Dropping crypto codecs into these
existing plug points would seem to introduce the least risk and change to
existing code and applications. Of course, unlike compression algorithms crypto
algorithms cannot properly transform input without initialization with key
material. We do need to propagate that bit of extra state securely to where
user code executes during the MapReduce job.
A transparent encrypted file system would be a good feature. That is something
we should discuss on another JIRA? "EncryptingFilterFileSystem"? As a global
filter on a MapReduce job we think it heavyweight and limiting. It would
complicate configuration and code if some jobs need an untranslated view to
plain files and a translated view to encrypted ones simultaneously, which we
think will be the common case. Some input and therefore output files will be
sensitive and require encryption, but others will not, and encryption
introduces costs, so we imagine jobs would optimize its use.
> Key Distribution and Management for supporting crypto codec in Map Reduce
> -------------------------------------------------------------------------
>
> Key: MAPREDUCE-5025
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-5025
> Project: Hadoop Map/Reduce
> Issue Type: Sub-task
> Components: security
> Affects Versions: trunk
> Reporter: Jerry Chen
> Assignee: Jerry Chen
> Attachments: MAPREDUCE-5025.patch
>
> Original Estimate: 504h
> Remaining Estimate: 504h
>
> This task defines the work to enable Map Reduce to utilize the Crypto Codec
> framework to support encryption and decryption of data during MapReduce Job.
> According to the some real use case and discussions from the community, for
> encryption and decryption files in Map Reduce, we have the following
> requirements:
> 1. Different stages (input, output, intermediate output) should have the
> flexibility to choose whether encrypt or not, as well as which crypto codec
> to use.
> 2. Different stages may have different scheme of providing the keys.
> 3. Different Files (for example, different input files) may have or use
> different keys.
> 4. Support a flexible way of retrieving keys for encryption or decryption.
> So this task defines and provides the framework for supporting these
> requirements as well as the implementations for common use and key retrieving
> scenarios.
> The design document of this part is included in the Hadoop Crypto Design
> attached in HADOOP-9331.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira