[jira] [Assigned] (HDFS-2932) Under replicated block after the pipeline recovery.
[ https://issues.apache.org/jira/browse/HDFS-2932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Srikanth Upputuri reassigned HDFS-2932: --- Assignee: Srikanth Upputuri Under replicated block after the pipeline recovery. --- Key: HDFS-2932 URL: https://issues.apache.org/jira/browse/HDFS-2932 Project: Hadoop HDFS Issue Type: Bug Components: datanode Affects Versions: 0.24.0 Reporter: J.Andreina Assignee: Srikanth Upputuri Fix For: 0.24.0 Started 1 NN and DN1, DN2, DN3 on the same machine. Wrote a huge file of size 2 GB; while the write of block-id-1005 was in progress, DN3 was brought down. After pipeline recovery, the generation stamp changed to block_id_1006 on DN1 and DN2. After the write finished, DN3 was brought back up and the fsck command was issued, which displayed the following message: block-id_1006 is under-replicated. Target replicas is 3 but found 2 replicas. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HDFS-7088) Archival Storage: fix TestBalancer and TestBalancerWithMultipleNameNodes
Tsz Wo Nicholas Sze created HDFS-7088: - Summary: Archival Storage: fix TestBalancer and TestBalancerWithMultipleNameNodes Key: HDFS-7088 URL: https://issues.apache.org/jira/browse/HDFS-7088 Project: Hadoop HDFS Issue Type: Sub-task Components: balancer, test Reporter: Tsz Wo Nicholas Sze Assignee: Tsz Wo Nicholas Sze Priority: Minor {noformat} java.lang.AssertionError: expected:0 but was:-3 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.runBalancer(TestBalancerWithMultipleNameNodes.java:163) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.runTest(TestBalancerWithMultipleNameNodes.java:365) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.testBalancer(TestBalancerWithMultipleNameNodes.java:379) {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (HDFS-2932) Under replicated block after the pipeline recovery.
[ https://issues.apache.org/jira/browse/HDFS-2932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Srikanth Upputuri resolved HDFS-2932. - Resolution: Duplicate Fix Version/s: (was: 0.24.0) Closed as duplicate of HDFS-3493. Under replicated block after the pipeline recovery. --- Key: HDFS-2932 URL: https://issues.apache.org/jira/browse/HDFS-2932 Project: Hadoop HDFS Issue Type: Bug Components: datanode Affects Versions: 0.24.0 Reporter: J.Andreina Assignee: Srikanth Upputuri Started 1 NN and DN1, DN2, DN3 on the same machine. Wrote a huge file of size 2 GB; while the write of block-id-1005 was in progress, DN3 was brought down. After pipeline recovery, the generation stamp changed to block_id_1006 on DN1 and DN2. After the write finished, DN3 was brought back up and the fsck command was issued, which displayed the following message: block-id_1006 is under-replicated. Target replicas is 3 but found 2 replicas. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6824) Additional user documentation for HDFS encryption.
[ https://issues.apache.org/jira/browse/HDFS-6824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138572#comment-14138572 ] Andrew Wang commented on HDFS-6824: --- Also need to fix [~yoderme]'s comment from HDFS-6394: https://issues.apache.org/jira/browse/HDFS-6394?focusedCommentId=14087313page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14087313 Someone else also mentioned that we should emphasize that data isn't transparently encrypted on HDFS upgrade, and needs to be copied in to an EZ. I'll do this too. Additional user documentation for HDFS encryption. -- Key: HDFS-6824 URL: https://issues.apache.org/jira/browse/HDFS-6824 Project: Hadoop HDFS Issue Type: Sub-task Components: documentation Affects Versions: fs-encryption (HADOOP-10150 and HDFS-6134) Reporter: Andrew Wang Assignee: Andrew Wang Priority: Minor We'd like to better document additional things about HDFS encryption: setup and configuration, using alternate access methods (namely WebHDFS and HttpFS), other misc improvements. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Assigned] (HDFS-7077) Separate CipherSuite from crypto protocol version
[ https://issues.apache.org/jira/browse/HDFS-7077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Wang reassigned HDFS-7077: - Assignee: Andrew Wang Separate CipherSuite from crypto protocol version - Key: HDFS-7077 URL: https://issues.apache.org/jira/browse/HDFS-7077 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.6.0 Reporter: Andrew Wang Assignee: Andrew Wang Right now the CipherSuite is used for protocol version negotiation, which is wrong. We need to separate it out. An EZ should be locked to a certain CipherSuite and protocol version. A client reading and writing to the EZ then needs to negotiate based on both of these parameters. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6727) Refresh data volumes on DataNode based on configuration changes
[ https://issues.apache.org/jira/browse/HDFS-6727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu updated HDFS-6727: Attachment: HDFS-6727.007.patch Update patch to fix findbugs reports. Refresh data volumes on DataNode based on configuration changes --- Key: HDFS-6727 URL: https://issues.apache.org/jira/browse/HDFS-6727 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0, 2.4.1 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Labels: datanode Attachments: HDFS-6727.000.delta-HDFS-6775.txt, HDFS-6727.001.patch, HDFS-6727.002.patch, HDFS-6727.003.patch, HDFS-6727.004.patch, HDFS-6727.005.patch, HDFS-6727.006.patch, HDFS-6727.006.patch, HDFS-6727.007.patch, HDFS-6727.combo.patch, patchFindBugsOutputhadoop-hdfs.txt HDFS-1362 requires DataNode to reload configuration file during the runtime, so that DN can change the data volumes dynamically. This JIRA reuses the reconfiguration framework introduced by HADOOP-7001 to enable DN to reconfigure at runtime. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7073) Allow falling back to a non-SASL connection on DataTransferProtocol in several edge cases.
[ https://issues.apache.org/jira/browse/HDFS-7073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138634#comment-14138634 ] Yi Liu commented on HDFS-7073: -- Hi [~cnauroth], nice work. {quote} DataNode: There had been some mishandling in checkSecureConfig around checking the dfs.data.transfer.protection property. It's defined in hdfs-default.xml, so it always comes in with empty string as the default (not null). I changed some of this logic to check for empty string instead of null. {quote} That's great as a fix too; otherwise, with security enabled, we could still start a DN listening on an unprivileged port (>= 1024) even when {{dfs.data.transfer.protection}} is empty. {quote} Cluster is unsecured, but has block access tokens enabled. This is not something I've seen done in practice, but I've heard historically it has been allowed. The HDFS-2856 code relied on seeing an empty block access token to trigger fallback, and this doesn't work if the unsecured cluster actually is using block access tokens. {quote} In the patch, fallback for writeBlock is handled, but fallback for readBlock is not. A test case for this scenario is hard to write because {{UserGroupInformation#isSecurityEnabled()}} is static, so we can't configure a secured client against an unsecured server. But I happen to have such an environment and tested this scenario; I configured: server (unsecured, block access tokens enabled), client (security enabled, block access tokens enabled, fallback enabled). Writing a file succeeds, but *reading the file fails*. Allow falling back to a non-SASL connection on DataTransferProtocol in several edge cases. -- Key: HDFS-7073 URL: https://issues.apache.org/jira/browse/HDFS-7073 Project: Hadoop HDFS Issue Type: Bug Components: datanode, hdfs-client, security Reporter: Chris Nauroth Assignee: Chris Nauroth Attachments: HDFS-7073.1.patch HDFS-2856 implemented general SASL support on DataTransferProtocol. Part of that work also included a fallback mode in case the remote cluster is running under a different configuration without SASL. I've discovered a few edge case configurations that this did not support: * Cluster is unsecured, but has block access tokens enabled. This is not something I've seen done in practice, but I've heard historically it has been allowed. The HDFS-2856 code relied on seeing an empty block access token to trigger fallback, and this doesn't work if the unsecured cluster actually is using block access tokens. * The DataNode has an unpublicized testing configuration property that could be used to skip the privileged port check. However, the HDFS-2856 code is still enforcing requirement of SASL when the ports are not privileged, so this would force existing configurations to make changes to activate SASL. This patch will restore the old behavior so that these edge case configurations will continue to work the same way. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
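To illustrate the null-vs-empty-string pitfall discussed in the comment above, here is a minimal, self-contained sketch. It is not the actual checkSecureConfig code; the class and method names are made up, and only the Hadoop Configuration calls are real API.

{code}
import org.apache.hadoop.conf.Configuration;

public class SaslConfigCheckSketch {
  static boolean isSaslProtectionConfigured(Configuration conf) {
    // A null check is never triggered for a property that is declared in
    // hdfs-default.xml with an empty value: the lookup returns "", not null.
    // Treating empty/blank the same as unset avoids that pitfall.
    String value = conf.getTrimmed("dfs.data.transfer.protection", "");
    return !value.isEmpty();
  }

  public static void main(String[] args) {
    Configuration conf = new Configuration();
    conf.set("dfs.data.transfer.protection", "");  // simulate the hdfs-default.xml default
    System.out.println(isSaslProtectionConfigured(conf));  // prints: false
  }
}
{code}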
[jira] [Commented] (HDFS-7073) Allow falling back to a non-SASL connection on DataTransferProtocol in several edge cases.
[ https://issues.apache.org/jira/browse/HDFS-7073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138639#comment-14138639 ] Yi Liu commented on HDFS-7073: -- For the first comment, I want to add: even though the follow-on SASL handshake would fail, the error the user sees in the log is not explicit. So it's a good fix to *not* let the DN start successfully on an unprivileged port if {{dfs.data.transfer.protection}} is empty. Allow falling back to a non-SASL connection on DataTransferProtocol in several edge cases. -- Key: HDFS-7073 URL: https://issues.apache.org/jira/browse/HDFS-7073 Project: Hadoop HDFS Issue Type: Bug Components: datanode, hdfs-client, security Reporter: Chris Nauroth Assignee: Chris Nauroth Attachments: HDFS-7073.1.patch HDFS-2856 implemented general SASL support on DataTransferProtocol. Part of that work also included a fallback mode in case the remote cluster is running under a different configuration without SASL. I've discovered a few edge case configurations that this did not support: * Cluster is unsecured, but has block access tokens enabled. This is not something I've seen done in practice, but I've heard historically it has been allowed. The HDFS-2856 code relied on seeing an empty block access token to trigger fallback, and this doesn't work if the unsecured cluster actually is using block access tokens. * The DataNode has an unpublicized testing configuration property that could be used to skip the privileged port check. However, the HDFS-2856 code is still enforcing requirement of SASL when the ports are not privileged, so this would force existing configurations to make changes to activate SASL. This patch will restore the old behavior so that these edge case configurations will continue to work the same way. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7088) Archival Storage: fix TestBalancer and TestBalancerWithMultipleNameNodes
[ https://issues.apache.org/jira/browse/HDFS-7088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsz Wo Nicholas Sze updated HDFS-7088: -- Attachment: h7088_20140918.patch The failures are caused by calling hflush() when writing the id file. h7088_20140918.patch: do not write the id file for unit tests. Archival Storage: fix TestBalancer and TestBalancerWithMultipleNameNodes Key: HDFS-7088 URL: https://issues.apache.org/jira/browse/HDFS-7088 Project: Hadoop HDFS Issue Type: Sub-task Components: balancer, test Reporter: Tsz Wo Nicholas Sze Assignee: Tsz Wo Nicholas Sze Priority: Minor Attachments: h7088_20140918.patch {noformat} java.lang.AssertionError: expected:0 but was:-3 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.runBalancer(TestBalancerWithMultipleNameNodes.java:163) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.runTest(TestBalancerWithMultipleNameNodes.java:365) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.testBalancer(TestBalancerWithMultipleNameNodes.java:379) {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6584) Support Archival Storage
[ https://issues.apache.org/jira/browse/HDFS-6584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsz Wo Nicholas Sze updated HDFS-6584: -- Attachment: h6584_20140918.patch h6584_20140918.patch: with HDFS-7088. Support Archival Storage Key: HDFS-6584 URL: https://issues.apache.org/jira/browse/HDFS-6584 Project: Hadoop HDFS Issue Type: New Feature Components: balancer, namenode Reporter: Tsz Wo Nicholas Sze Assignee: Tsz Wo Nicholas Sze Attachments: HDFS-6584.000.patch, HDFSArchivalStorageDesign20140623.pdf, HDFSArchivalStorageDesign20140715.pdf, archival-storage-testplan.pdf, h6584_20140907.patch, h6584_20140908.patch, h6584_20140908b.patch, h6584_20140911.patch, h6584_20140911b.patch, h6584_20140915.patch, h6584_20140916.patch, h6584_20140916.patch, h6584_20140917.patch, h6584_20140917b.patch, h6584_20140918.patch In most of the Hadoop clusters, as more and more data is stored for longer time, the demand for storage is outstripping the compute. Hadoop needs a cost effective and easy to manage solution to meet this demand for storage. Current solution is: - Delete the old unused data. This comes at operational cost of identifying unnecessary data and deleting them manually. - Add more nodes to the clusters. This adds along with storage capacity unnecessary compute capacity to the cluster. Hadoop needs a solution to decouple growing storage capacity from compute capacity. Nodes with higher density and less expensive storage with low compute power are becoming available and can be used as cold storage in the clusters. Based on policy the data from hot storage can be moved to cold storage. Adding more nodes to the cold storage can grow the storage independent of the compute capacity in the cluster. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6584) Support Archival Storage
[ https://issues.apache.org/jira/browse/HDFS-6584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138658#comment-14138658 ] Hadoop QA commented on HDFS-6584: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669669/h6584_20140918.patch against trunk revision ee21b13. {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8076//console This message is automatically generated. Support Archival Storage Key: HDFS-6584 URL: https://issues.apache.org/jira/browse/HDFS-6584 Project: Hadoop HDFS Issue Type: New Feature Components: balancer, namenode Reporter: Tsz Wo Nicholas Sze Assignee: Tsz Wo Nicholas Sze Attachments: HDFS-6584.000.patch, HDFSArchivalStorageDesign20140623.pdf, HDFSArchivalStorageDesign20140715.pdf, archival-storage-testplan.pdf, h6584_20140907.patch, h6584_20140908.patch, h6584_20140908b.patch, h6584_20140911.patch, h6584_20140911b.patch, h6584_20140915.patch, h6584_20140916.patch, h6584_20140916.patch, h6584_20140917.patch, h6584_20140917b.patch, h6584_20140918.patch In most of the Hadoop clusters, as more and more data is stored for longer time, the demand for storage is outstripping the compute. Hadoop needs a cost effective and easy to manage solution to meet this demand for storage. Current solution is: - Delete the old unused data. This comes at operational cost of identifying unnecessary data and deleting them manually. - Add more nodes to the clusters. This adds along with storage capacity unnecessary compute capacity to the cluster. Hadoop needs a solution to decouple growing storage capacity from compute capacity. Nodes with higher density and less expensive storage with low compute power are becoming available and can be used as cold storage in the clusters. Based on policy the data from hot storage can be moved to cold storage. Adding more nodes to the cold storage can grow the storage independent of the compute capacity in the cluster. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6606) Optimize HDFS Encrypted Transport performance
[ https://issues.apache.org/jira/browse/HDFS-6606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138691#comment-14138691 ] Yi Liu commented on HDFS-6606: -- Rebased the patch against latest trunk. [~usrikanth], the JAAS GSSAPI mechanism does indeed support AES, but it's not suitable here: the client also needs to verify that the DN is legitimate. For DIGEST-MD5, the password is generated from the access token or encryption key, so the DN can validate the client and the client can also validate the DN (ensuring the {{block access token}} has not been obtained by a malicious process). With the GSSAPI mechanism we can't ensure this, and it has performance issues. Another reason is that not all users can use a third-party JCE provider; using CryptoCodec is scalable and has built-in AES-NI support in Hadoop. Optimize HDFS Encrypted Transport performance - Key: HDFS-6606 URL: https://issues.apache.org/jira/browse/HDFS-6606 Project: Hadoop HDFS Issue Type: Improvement Components: datanode, hdfs-client, security Reporter: Yi Liu Assignee: Yi Liu Attachments: HDFS-6606.001.patch, HDFS-6606.002.patch, HDFS-6606.003.patch, HDFS-6606.004.patch, OptimizeHdfsEncryptedTransportperformance.pdf In HDFS-3637, [~atm] added support for encrypting the DataTransferProtocol, it was a great work. It utilizes SASL {{Digest-MD5}} mechanism (use Qop: auth-conf), it supports three security strength: * high 3des or rc4 (128bits) * medium des or rc4(56bits) * low rc4(40bits) 3des and rc4 are slow, only *tens of MB/s*, http://www.javamex.com/tutorials/cryptography/ciphers.shtml http://www.cs.wustl.edu/~jain/cse567-06/ftp/encryption_perf/ I will give more detailed performance data in future. Absolutely it’s bottleneck and will vastly affect the end to end performance. AES(Advanced Encryption Standard) is recommended as a replacement of DES, it’s more secure; with AES-NI support, the throughput can reach nearly *2GB/s*, it won’t be the bottleneck any more, AES and CryptoCodec work is supported in HADOOP-10150, HADOOP-10603 and HADOOP-10693 (We may need to add a new mode support for AES). This JIRA will use AES with AES-NI support as encryption algorithm for DataTransferProtocol. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
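For readers unfamiliar with the SASL terminology in the description above, the following sketch shows how a DIGEST-MD5 SASL client requesting confidentiality (QOP "auth-conf", with negotiated cipher strength) is created through the standard javax.security.sasl API. It is illustrative only and is not the HDFS SaslDataTransferClient implementation; the protocol name, server name, and callback handler are placeholders.

{code}
import java.util.HashMap;
import java.util.Map;
import javax.security.auth.callback.*;
import javax.security.sasl.*;

public class DigestMd5QopSketch {
  /** Creates a DIGEST-MD5 SASL client that asks for integrity + confidentiality. */
  public static SaslClient newEncryptedSaslClient(final String user, final char[] password)
      throws SaslException {
    Map<String, String> props = new HashMap<String, String>();
    props.put(Sasl.QOP, "auth-conf");            // auth-conf = wrap the stream with encryption
    props.put(Sasl.STRENGTH, "high,medium,low"); // negotiate high/medium/low cipher strength

    CallbackHandler handler = new CallbackHandler() {
      @Override
      public void handle(Callback[] callbacks) {
        for (Callback cb : callbacks) {
          if (cb instanceof NameCallback) {
            ((NameCallback) cb).setName(user);
          } else if (cb instanceof PasswordCallback) {
            ((PasswordCallback) cb).setPassword(password);
          } else if (cb instanceof RealmCallback) {
            RealmCallback rc = (RealmCallback) cb;
            rc.setText(rc.getDefaultText());     // accept the server-proposed realm
          }
        }
      }
    };

    // "hdfs" and "datanode-host" are placeholder protocol/server names.
    return Sasl.createSaslClient(new String[] {"DIGEST-MD5"}, null, "hdfs",
        "datanode-host", props, handler);
  }
}
{code}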
[jira] [Updated] (HDFS-6606) Optimize HDFS Encrypted Transport performance
[ https://issues.apache.org/jira/browse/HDFS-6606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Liu updated HDFS-6606: - Attachment: HDFS-6606.005.patch Optimize HDFS Encrypted Transport performance - Key: HDFS-6606 URL: https://issues.apache.org/jira/browse/HDFS-6606 Project: Hadoop HDFS Issue Type: Improvement Components: datanode, hdfs-client, security Reporter: Yi Liu Assignee: Yi Liu Attachments: HDFS-6606.001.patch, HDFS-6606.002.patch, HDFS-6606.003.patch, HDFS-6606.004.patch, HDFS-6606.005.patch, OptimizeHdfsEncryptedTransportperformance.pdf In HDFS-3637, [~atm] added support for encrypting the DataTransferProtocol, it was a great work. It utilizes SASL {{Digest-MD5}} mechanism (use Qop: auth-conf), it supports three security strength: * high 3des or rc4 (128bits) * medium des or rc4(56bits) * low rc4(40bits) 3des and rc4 are slow, only *tens of MB/s*, http://www.javamex.com/tutorials/cryptography/ciphers.shtml http://www.cs.wustl.edu/~jain/cse567-06/ftp/encryption_perf/ I will give more detailed performance data in future. Absolutely it’s bottleneck and will vastly affect the end to end performance. AES(Advanced Encryption Standard) is recommended as a replacement of DES, it’s more secure; with AES-NI support, the throughput can reach nearly *2GB/s*, it won’t be the bottleneck any more, AES and CryptoCodec work is supported in HADOOP-10150, HADOOP-10603 and HADOOP-10693 (We may need to add a new mode support for AES). This JIRA will use AES with AES-NI support as encryption algorithm for DataTransferProtocol. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6970) Move startFile EDEK retries to the DFSClient
[ https://issues.apache.org/jira/browse/HDFS-6970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138692#comment-14138692 ] Hadoop QA commented on HDFS-6970: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669643/hdfs-6970.001.patch against trunk revision ee21b13. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.TestEncryptionZonesWithKMS org.apache.hadoop.hdfs.web.TestWebHdfsFileSystemContract org.apache.hadoop.hdfs.server.namenode.ha.TestPipelinesFailover {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8074//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8074//console This message is automatically generated. Move startFile EDEK retries to the DFSClient Key: HDFS-6970 URL: https://issues.apache.org/jira/browse/HDFS-6970 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.5.0 Reporter: Andrew Wang Assignee: Andrew Wang Attachments: hdfs-6970.001.patch [~sureshms] pointed out that holding on to an RPC handler while talking to the KMS is bad, since it can exhaust the available handlers. Let's avoid this by doing retries at the DFSClient rather than in the RPC handler, and moving EDEK fetching to the background. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7073) Allow falling back to a non-SASL connection on DataTransferProtocol in several edge cases.
[ https://issues.apache.org/jira/browse/HDFS-7073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138694#comment-14138694 ] Yi Liu commented on HDFS-7073: -- One security issue I can think of: if we allow this type of fallback then, as discussed in HDFS-2856 regarding the attack vector, a malicious task can easily listen on the DN's port after it dies and steal the block access token. So perhaps we'd better not allow the fallback? Allow falling back to a non-SASL connection on DataTransferProtocol in several edge cases. -- Key: HDFS-7073 URL: https://issues.apache.org/jira/browse/HDFS-7073 Project: Hadoop HDFS Issue Type: Bug Components: datanode, hdfs-client, security Reporter: Chris Nauroth Assignee: Chris Nauroth Attachments: HDFS-7073.1.patch HDFS-2856 implemented general SASL support on DataTransferProtocol. Part of that work also included a fallback mode in case the remote cluster is running under a different configuration without SASL. I've discovered a few edge case configurations that this did not support: * Cluster is unsecured, but has block access tokens enabled. This is not something I've seen done in practice, but I've heard historically it has been allowed. The HDFS-2856 code relied on seeing an empty block access token to trigger fallback, and this doesn't work if the unsecured cluster actually is using block access tokens. * The DataNode has an unpublicized testing configuration property that could be used to skip the privileged port check. However, the HDFS-2856 code is still enforcing requirement of SASL when the ports are not privileged, so this would force existing configurations to make changes to activate SASL. This patch will restore the old behavior so that these edge case configurations will continue to work the same way. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6584) Support Archival Storage
[ https://issues.apache.org/jira/browse/HDFS-6584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsz Wo Nicholas Sze updated HDFS-6584: -- Attachment: h6584_20140918b.patch h6584_20140918b.patch: excludes hdfs.cmd since the patch command does not work with dos file correctly. Support Archival Storage Key: HDFS-6584 URL: https://issues.apache.org/jira/browse/HDFS-6584 Project: Hadoop HDFS Issue Type: New Feature Components: balancer, namenode Reporter: Tsz Wo Nicholas Sze Assignee: Tsz Wo Nicholas Sze Attachments: HDFS-6584.000.patch, HDFSArchivalStorageDesign20140623.pdf, HDFSArchivalStorageDesign20140715.pdf, archival-storage-testplan.pdf, h6584_20140907.patch, h6584_20140908.patch, h6584_20140908b.patch, h6584_20140911.patch, h6584_20140911b.patch, h6584_20140915.patch, h6584_20140916.patch, h6584_20140916.patch, h6584_20140917.patch, h6584_20140917b.patch, h6584_20140918.patch, h6584_20140918b.patch In most of the Hadoop clusters, as more and more data is stored for longer time, the demand for storage is outstripping the compute. Hadoop needs a cost effective and easy to manage solution to meet this demand for storage. Current solution is: - Delete the old unused data. This comes at operational cost of identifying unnecessary data and deleting them manually. - Add more nodes to the clusters. This adds along with storage capacity unnecessary compute capacity to the cluster. Hadoop needs a solution to decouple growing storage capacity from compute capacity. Nodes with higher density and less expensive storage with low compute power are becoming available and can be used as cold storage in the clusters. Based on policy the data from hot storage can be moved to cold storage. Adding more nodes to the cold storage can grow the storage independent of the compute capacity in the cluster. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6727) Refresh data volumes on DataNode based on configuration changes
[ https://issues.apache.org/jira/browse/HDFS-6727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138726#comment-14138726 ] Hadoop QA commented on HDFS-6727: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669653/HDFS-6727.007.patch against trunk revision ee21b13. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 3 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.TestEncryptionZonesWithKMS org.apache.hadoop.hdfs.server.namenode.ha.TestPipelinesFailover {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8075//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8075//console This message is automatically generated. Refresh data volumes on DataNode based on configuration changes --- Key: HDFS-6727 URL: https://issues.apache.org/jira/browse/HDFS-6727 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0, 2.4.1 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Labels: datanode Attachments: HDFS-6727.000.delta-HDFS-6775.txt, HDFS-6727.001.patch, HDFS-6727.002.patch, HDFS-6727.003.patch, HDFS-6727.004.patch, HDFS-6727.005.patch, HDFS-6727.006.patch, HDFS-6727.006.patch, HDFS-6727.007.patch, HDFS-6727.combo.patch, patchFindBugsOutputhadoop-hdfs.txt HDFS-1362 requires DataNode to reload configuration file during the runtime, so that DN can change the data volumes dynamically. This JIRA reuses the reconfiguration framework introduced by HADOOP-7001 to enable DN to reconfigure at runtime. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7086) httpfs create files default overwrite behavior is set to true
[ https://issues.apache.org/jira/browse/HDFS-7086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated HDFS-7086: - Component/s: documentation httpfs create files default overwrite behavior is set to true - Key: HDFS-7086 URL: https://issues.apache.org/jira/browse/HDFS-7086 Project: Hadoop HDFS Issue Type: Bug Components: documentation Affects Versions: 2.0.0-alpha, 2.1.0-beta, 2.2.0, 2.3.0, 2.4.1, 2.5.1 Environment: Linux, Java Reporter: Eric Yang WebHDFS documentation says overwrite flag is default to false, but httpfs set the flag to true by default. This can be different from user's expectation and cause data to be overwritten. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Work started] (HDFS-7082) When replication factor equals number of data nodes, corrupt replica will never get substituted with good replica
[ https://issues.apache.org/jira/browse/HDFS-7082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HDFS-7082 started by Srikanth Upputuri. --- When replication factor equals number of data nodes, corrupt replica will never get substituted with good replica - Key: HDFS-7082 URL: https://issues.apache.org/jira/browse/HDFS-7082 Project: Hadoop HDFS Issue Type: Bug Components: namenode Reporter: Srikanth Upputuri Assignee: Srikanth Upputuri Priority: Minor BlockManager will not invalidate a corrupt replica if this brings down the total number of replicas below replication factor (except if the corrupt replica has a wrong genstamp). On clusters where the replication factor = total data nodes, a new replica can not be created from a live replica as all the available datanodes already have a replica each. Because of this, the corrupt replicas will never be substituted with good replicas, so will never get deleted. Sooner or later all replicas may get corrupt and there will be no live replicas in the cluster for this block. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
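As a rough illustration of the invalidation rule described in the issue above, here is a tiny predicate. It is not the actual BlockManager code; the method and parameter names are assumptions, and it only captures the decision the description spells out.

{code}
public class CorruptReplicaInvalidationSketch {
  /**
   * A corrupt replica can be invalidated right away only when enough live (good)
   * replicas remain to satisfy the replication factor, or when the replica's
   * generation stamp is stale. Otherwise it is kept around.
   */
  static boolean canInvalidateCorruptReplica(int liveReplicas, int replicationFactor,
      boolean hasStaleGenerationStamp) {
    return hasStaleGenerationStamp || liveReplicas >= replicationFactor;
  }

  public static void main(String[] args) {
    // With replication factor == number of datanodes, a corrupt replica with a
    // matching generation stamp is never invalidated (liveReplicas is at most
    // replicationFactor - 1), which is the situation the issue describes.
    System.out.println(canInvalidateCorruptReplica(2, 3, false)); // false
    System.out.println(canInvalidateCorruptReplica(2, 3, true));  // true
  }
}
{code}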
[jira] [Commented] (HDFS-7086) httpfs create files default overwrite behavior is set to true
[ https://issues.apache.org/jira/browse/HDFS-7086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138755#comment-14138755 ] Steve Loughran commented on HDFS-7086: -- This behaviour is consistent with HDFS and other implementations of {{FileSystem}}, because {{create(Path)}} defaults to overwrite: {code} /** * Create an FSDataOutputStream at the indicated Path. * Files are overwritten by default. * @param f the file to create */ public FSDataOutputStream create(Path f) throws IOException { return create(f, true); } {code} Looking at filesystem.md and {{AbstractContractCreateTest}}, I don't see where this is explicitly called out or tested. Doing both would ensure that, when someone gets round to writing contract tests for WebHDFS, its consistency with HDFS can be validated. Tagging as a documentation and test issue. httpfs create files default overwrite behavior is set to true - Key: HDFS-7086 URL: https://issues.apache.org/jira/browse/HDFS-7086 Project: Hadoop HDFS Issue Type: Bug Components: documentation, test Affects Versions: 2.0.0-alpha, 2.1.0-beta, 2.2.0, 2.3.0, 2.4.1, 2.5.1 Environment: Linux, Java Reporter: Eric Yang WebHDFS documentation says overwrite flag is default to false, but httpfs set the flag to true by default. This can be different from user's expectation and cause data to be overwritten. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
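As a usage note following the quoted {{create(Path)}} code: a client that does not want overwrite semantics can pass the flag explicitly. This sketch is not part of any patch on this issue; it just shows the two-argument {{FileSystem#create(Path, boolean)}} overload against a default filesystem.

{code}
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class CreateNoOverwriteSketch {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    FileSystem fs = FileSystem.get(conf);
    Path p = new Path("/tmp/example.txt");

    // Passing overwrite=false makes the call fail (typically with
    // FileAlreadyExistsException) if the path already exists, instead of
    // silently replacing it as the single-argument create(Path) would.
    FSDataOutputStream out = fs.create(p, false);
    try {
      out.writeUTF("hello");
    } finally {
      out.close();
    }
  }
}
{code}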
[jira] [Updated] (HDFS-7086) httpfs create files default overwrite behavior is set to true
[ https://issues.apache.org/jira/browse/HDFS-7086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated HDFS-7086: - Component/s: test httpfs create files default overwrite behavior is set to true - Key: HDFS-7086 URL: https://issues.apache.org/jira/browse/HDFS-7086 Project: Hadoop HDFS Issue Type: Bug Components: documentation, test Affects Versions: 2.0.0-alpha, 2.1.0-beta, 2.2.0, 2.3.0, 2.4.1, 2.5.1 Environment: Linux, Java Reporter: Eric Yang WebHDFS documentation says overwrite flag is default to false, but httpfs set the flag to true by default. This can be different from user's expectation and cause data to be overwritten. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu updated HDFS-6808: Attachment: HDFS-6808.006.patch Hi [~cmccabe], thanks for your great suggestions. An async API makes more sense. I've changed the patch to reflect the discussions. In summary, this patch * Changes the reconfiguration framework from HDFS-7001. It adds {{ReconfigurableBase#startReconfigureTask()}}, which starts a background thread to do the configuration reloading, so that it supports an async API. It also checks whether an active task is already running and, if so, returns an error. * Provides a CLI command (similar to {{btrfs scrub start|status}}) to start and query the status of the reconfiguration work. {noformat} dfsadmin -reconfig -datanode [start|status] host:port {noformat} No {{-reconfig cancel}} is provided, because there is no obvious way to interrupt the reconfiguration process while keeping the {{DN}} consistent. Maybe we can fix it later. * The protobuf protocol for {{-reconfig status}} basically returns a {{Map}} of conf change to error message, with task start and/or end times. It is the caller's (i.e., {{DFSAdmin}}) responsibility to print these error messages, so that it can generate CLI messages, XML, HTML... Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
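The comment above describes an asynchronous start/status pattern: a background thread applies the configuration changes, a second start is rejected while one is running, and a status query returns per-property errors plus start/end times. The sketch below illustrates that pattern only; the class body is an assumption and is not the ReconfigurableBase implementation from HDFS-7001/HDFS-6808.

{code}
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class AsyncReconfigSketch {
  /** Snapshot of one reconfiguration attempt: start/end times plus per-property errors. */
  public static final class Status {
    final long startTimeMs;
    volatile long endTimeMs;                       // 0 while the task is still running
    final Map<String, String> errors = new ConcurrentHashMap<String, String>();
    Status(long start) { this.startTimeMs = start; }
    boolean done() { return endTimeMs != 0; }
  }

  private Status current;                          // guarded by "this"

  /** Starts a background reconfiguration task; rejects the call if one is already running. */
  public synchronized Status startReconfigureTask(final Runnable reloadProperties) {
    if (current != null && !current.done()) {
      throw new IllegalStateException("A reconfiguration task is already running");
    }
    final Status status = new Status(System.currentTimeMillis());
    current = status;
    Thread worker = new Thread(new Runnable() {
      @Override
      public void run() {
        try {
          reloadProperties.run();                  // apply each changed property
        } catch (RuntimeException e) {
          status.errors.put("reload", String.valueOf(e.getMessage()));
        } finally {
          status.endTimeMs = System.currentTimeMillis();
        }
      }
    }, "reconfig-task");
    worker.setDaemon(true);
    worker.start();
    return status;
  }

  /** Returns the most recent task's status (what a "-reconfig status" query would report). */
  public synchronized Status getReconfigureStatus() {
    return current;
  }
}
{code}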
[jira] [Commented] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138764#comment-14138764 ] Hadoop QA commented on HDFS-6808: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669693/HDFS-6808.006.patch against trunk revision ee21b13. {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8080//console This message is automatically generated. Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu updated HDFS-6808: Attachment: HDFS-6808.006.combo.patch Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6705) Create an XAttr that disallows the HDFS admin from accessing a file
[ https://issues.apache.org/jira/browse/HDFS-6705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138808#comment-14138808 ] Hudson commented on HDFS-6705: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #684 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/684/]) HDFS-6705. Create an XAttr that disallows the HDFS admin from accessing a file. (clamb via wang) (wang: rev ea4e2e843ecadd8019ea35413f4a34b97a424923) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/common/HdfsServerConstants.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/FSXAttrBaseTest.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/XAttrPermissionFilter.java * hadoop-hdfs-project/hadoop-hdfs/src/site/apt/ExtendedAttributes.apt.vm * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSNamesystem.java * hadoop-hdfs-project/hadoop-hdfs/src/test/resources/testXAttrConf.xml * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSDirectory.java Create an XAttr that disallows the HDFS admin from accessing a file --- Key: HDFS-6705 URL: https://issues.apache.org/jira/browse/HDFS-6705 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode, security Affects Versions: 3.0.0 Reporter: Charles Lamb Assignee: Charles Lamb Fix For: 2.6.0 Attachments: HDFS-6705.001.patch, HDFS-6705.002.patch, HDFS-6705.003.patch, HDFS-6705.004.patch, HDFS-6705.005.patch, HDFS-6705.006.patch, HDFS-6705.007.patch, HDFS-6705.008.patch There needs to be an xattr that specifies that the HDFS admin can not access a file. This is needed for m/r delegation tokens and data at rest encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7075) hadoop-fuse-dfs fails because it cannot find JavaKeyStoreProvider$Factory
[ https://issues.apache.org/jira/browse/HDFS-7075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138812#comment-14138812 ] Hudson commented on HDFS-7075: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #684 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/684/]) HDFS-7075. hadoop-fuse-dfs fails because it cannot find JavaKeyStoreProvider$Factory. (cmccabe) (cmccabe: rev f23024852502441fc259012664e444e5e51c604a) * hadoop-common-project/hadoop-common/CHANGES.txt * hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/crypto/key/KeyProviderFactory.java hadoop-fuse-dfs fails because it cannot find JavaKeyStoreProvider$Factory - Key: HDFS-7075 URL: https://issues.apache.org/jira/browse/HDFS-7075 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.6.0 Reporter: Colin Patrick McCabe Assignee: Colin Patrick McCabe Fix For: 2.6.0 Attachments: HDFS-7075.001.patch hadoop-fuse-dfs fails complaining with: {code} java.util.ServiceConfigurationError: org.apache.hadoop.crypto.key.KeyProviderFactory: Provider org.apache.hadoop.crypto.key.JavaKeyStoreProvider$Factory not found {code} Here is an example of the hadoop-fuse-dfs debug output. {code} 14/09/04 13:49:04 WARN crypto.CryptoCodec: Crypto codec org.apache.hadoop.crypto.OpensslAesCtrCryptoCodec is not available. hdfsBuilderConnect(forceNewInstance=1, nn=hdfs://hdfs-cdh5-secure-1.vpc.cloudera.com:8020, port=0, kerbTicketCachePath=/tmp/krb5cc_0, userName=root) error: java.util.ServiceConfigurationError: org.apache.hadoop.crypto.key.KeyProviderFactory: Provider org.apache.hadoop.crypto.key.JavaKeyStoreProvider$Factory not found at java.util.ServiceLoader.fail(ServiceLoader.java:231) at java.util.ServiceLoader.access$300(ServiceLoader.java:181) at java.util.ServiceLoader$LazyIterator.next(ServiceLoader.java:365) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7004) Update KeyProvider instantiation to create by URI
[ https://issues.apache.org/jira/browse/HDFS-7004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138816#comment-14138816 ] Hudson commented on HDFS-7004: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #684 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/684/]) HDFS-7004. Update KeyProvider instantiation to create by URI. (wang) (wang: rev 10e8602f32b553a1424f1a9b5f9f74f7b68a49d1) * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZonesWithHA.java * hadoop-common-project/hadoop-kms/src/test/java/org/apache/hadoop/crypto/key/kms/server/TestKMS.java * hadoop-common-project/hadoop-kms/src/site/apt/index.apt.vm * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestReservedRawPaths.java * hadoop-common-project/hadoop-kms/src/test/java/org/apache/hadoop/crypto/key/kms/server/MiniKMS.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZones.java * hadoop-common-project/hadoop-kms/src/main/java/org/apache/hadoop/crypto/key/kms/server/KMSConfiguration.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSUtil.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/cli/TestCryptoAdminCLI.java * hadoop-common-project/hadoop-kms/src/main/conf/kms-site.xml * hadoop-common-project/hadoop-kms/src/main/java/org/apache/hadoop/crypto/key/kms/server/KMSWebApp.java * hadoop-hdfs-project/hadoop-hdfs/src/main/resources/hdfs-default.xml * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java * hadoop-hdfs-project/hadoop-hdfs/src/site/apt/TransparentEncryption.apt.vm Update KeyProvider instantiation to create by URI - Key: HDFS-7004 URL: https://issues.apache.org/jira/browse/HDFS-7004 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.6.0 Reporter: Andrew Wang Assignee: Andrew Wang Fix For: 2.6.0 Attachments: hdfs-7004.001.patch, hdfs-7004.002.patch, hdfs-7004.004.patch See HADOOP-11054, would be good to update the NN/DFSClient to fetch via this method rather than depending on the URI path lookup. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6843) Create FileStatus isEncrypted() method
[ https://issues.apache.org/jira/browse/HDFS-6843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138811#comment-14138811 ] Hudson commented on HDFS-6843: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #684 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/684/]) HDFS-6843. Create FileStatus isEncrypted() method (clamb via cmccabe) (cmccabe: rev e3803d002c660f18a5c2ecf32344fd6f3f491a5b) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/web/JsonUtil.java * hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/fs/contract/AbstractContractOpenTest.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/FsPermissionExtension.java * hadoop-common-project/hadoop-common/src/site/markdown/filesystem/filesystem.md * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/FsAclPermission.java * hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/permission/FsPermission.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/PBHelper.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSDirectory.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/FSAclBaseTest.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZones.java * hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/FileStatus.java HDFS-6843. Add to CHANGES.txt (cmccabe: rev f24ac429d102777fe021e9852cfff38312643512) * hadoop-common-project/hadoop-common/CHANGES.txt Create FileStatus isEncrypted() method -- Key: HDFS-6843 URL: https://issues.apache.org/jira/browse/HDFS-6843 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode, security Affects Versions: 3.0.0 Reporter: Charles Lamb Assignee: Charles Lamb Fix For: 2.6.0 Attachments: HDFS-6843.001.patch, HDFS-6843.002.patch, HDFS-6843.003.patch, HDFS-6843.004.patch, HDFS-6843.005.patch, HDFS-6843.005.patch, HDFS-6843.006.patch, HDFS-6843.007.patch, HDFS-6843.008.patch, HDFS-6843.009.patch, HDFS-6843.010.patch FileStatus should have a 'boolean isEncrypted()' method. (it was in the context of discussing with AndreW about FileStatus being a Writable). Having this method would allow MR JobSubmitter do the following: - BOOLEAN intermediateEncryption = false IF jobconf.contains(mr.intermidate.encryption) THEN intermediateEncryption = jobConf.getBoolean(mr.intermidate.encryption) ELSE IF (I/O)Format INSTANCEOF File(I/O)Format THEN intermediateEncryption = ANY File(I/O)Format HAS a Path with status isEncrypted()==TRUE FI jobConf.setBoolean(mr.intermidate.encryption, intermediateEncryption) FI -- This message was sent by Atlassian JIRA (v6.3.4#6332)
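Here is a rough Java rendering of the pseudocode quoted in the HDFS-6843 description above. The config key (spelled "mr.intermidate.encryption" in the description) and the helper method are hypothetical and not an actual MapReduce API; only {{FileStatus#isEncrypted()}}, which this issue adds, and the {{FileSystem}}/{{Configuration}} calls are real.

{code}
import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class IntermediateEncryptionSketch {
  // Hypothetical key name, following the pseudocode in the description.
  static final String KEY = "mr.intermediate.encryption";

  static boolean shouldEncryptIntermediate(Configuration conf, Path... inputPaths)
      throws IOException {
    if (conf.get(KEY) != null) {
      return conf.getBoolean(KEY, false);   // an explicit user setting wins
    }
    // Otherwise, turn it on if any input path reports isEncrypted() == true.
    for (Path p : inputPaths) {
      FileSystem fs = p.getFileSystem(conf);
      FileStatus status = fs.getFileStatus(p);
      if (status.isEncrypted()) {
        return true;
      }
    }
    return false;
  }
}
{code}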
[jira] [Commented] (HDFS-7078) Fix listEZs to work correctly with snapshots
[ https://issues.apache.org/jira/browse/HDFS-7078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138815#comment-14138815 ] Hudson commented on HDFS-7078: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #684 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/684/]) HDFS-7078. Fix listEZs to work correctly with snapshots. (wang) (wang: rev 0ecefe60179968984b1892a14411566b7a0c8df3) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZones.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/EncryptionZoneManager.java Fix listEZs to work correctly with snapshots Key: HDFS-7078 URL: https://issues.apache.org/jira/browse/HDFS-7078 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.6.0 Reporter: Andrew Wang Assignee: Andrew Wang Fix For: 2.6.0 Attachments: hdfs-7078.001.patch, hdfs-7078.002.patch listEZs will list encryption zones that are only present in a snapshot, rather than only the EZs in the current filesystem state. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6606) Optimize HDFS Encrypted Transport performance
[ https://issues.apache.org/jira/browse/HDFS-6606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138825#comment-14138825 ] Hadoop QA commented on HDFS-6606: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669675/HDFS-6606.005.patch against trunk revision ee21b13. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.TestEncryptionZonesWithKMS org.apache.hadoop.hdfs.server.namenode.ha.TestPipelinesFailover {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8077//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8077//console This message is automatically generated. Optimize HDFS Encrypted Transport performance - Key: HDFS-6606 URL: https://issues.apache.org/jira/browse/HDFS-6606 Project: Hadoop HDFS Issue Type: Improvement Components: datanode, hdfs-client, security Reporter: Yi Liu Assignee: Yi Liu Attachments: HDFS-6606.001.patch, HDFS-6606.002.patch, HDFS-6606.003.patch, HDFS-6606.004.patch, HDFS-6606.005.patch, OptimizeHdfsEncryptedTransportperformance.pdf In HDFS-3637, [~atm] added support for encrypting the DataTransferProtocol, it was a great work. It utilizes SASL {{Digest-MD5}} mechanism (use Qop: auth-conf), it supports three security strength: * high 3des or rc4 (128bits) * medium des or rc4(56bits) * low rc4(40bits) 3des and rc4 are slow, only *tens of MB/s*, http://www.javamex.com/tutorials/cryptography/ciphers.shtml http://www.cs.wustl.edu/~jain/cse567-06/ftp/encryption_perf/ I will give more detailed performance data in future. Absolutely it’s bottleneck and will vastly affect the end to end performance. AES(Advanced Encryption Standard) is recommended as a replacement of DES, it’s more secure; with AES-NI support, the throughput can reach nearly *2GB/s*, it won’t be the bottleneck any more, AES and CryptoCodec work is supported in HADOOP-10150, HADOOP-10603 and HADOOP-10693 (We may need to add a new mode support for AES). This JIRA will use AES with AES-NI support as encryption algorithm for DataTransferProtocol. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6995) Block should be placed in the client's 'rack-local' node if 'client-local' node is not available
[ https://issues.apache.org/jira/browse/HDFS-6995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138913#comment-14138913 ] Hadoop QA commented on HDFS-6995: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/1262/HDFS-6995-002.patch against trunk revision ee21b13. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 2 new or modified test files. {color:red}-1 javac{color}. The applied patch generated 1265 javac compiler warnings (more than the trunk's current 677 warnings). {color:red}-1 javadoc{color}. The javadoc tool appears to have generated 109 warning messages. See https://builds.apache.org/job/PreCommit-HDFS-Build/8079//artifact/trunk/patchprocess/diffJavadocWarnings.txt for details. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.crypto.random.TestOsSecureRandom org.apache.hadoop.ha.TestZKFailoverControllerStress org.apache.hadoop.hdfs.TestEncryptionZonesWithKMS org.apache.hadoop.hdfs.server.namenode.ha.TestPipelinesFailover org.apache.hadoop.hdfs.server.namenode.TestMetaSave The following test timeouts occurred in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.server.namenode.TestDecommissioningStatus {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8079//testReport/ Javac warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/8079//artifact/trunk/patchprocess/diffJavacWarnings.txt Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8079//console This message is automatically generated. Block should be placed in the client's 'rack-local' node if 'client-local' node is not available Key: HDFS-6995 URL: https://issues.apache.org/jira/browse/HDFS-6995 Project: Hadoop HDFS Issue Type: Bug Components: namenode Affects Versions: 2.5.0 Reporter: Vinayakumar B Assignee: Vinayakumar B Attachments: HDFS-6995-001.patch, HDFS-6995-002.patch The HDFS cluster is rack aware. The client is on a different node than any datanode, but the same rack contains one or more datanodes. In this case, first preference should be given to selecting a 'rack-local' node. Currently, since no node in the clusterMap corresponds to the client's location, the block placement policy chooses a *random* node as the local node and proceeds with further placements. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138914#comment-14138914 ] Hadoop QA commented on HDFS-6808: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669695/HDFS-6808.006.combo.patch against trunk revision ee21b13. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 5 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:red}-1 findbugs{color}. The patch appears to introduce 2 new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.crypto.random.TestOsSecureRandom org.apache.hadoop.hdfs.TestEncryptionZonesWithKMS org.apache.hadoop.hdfs.server.namenode.ha.TestPipelinesFailover org.apache.hadoop.hdfs.server.namenode.ha.TestInitializeSharedEdits The following test timeouts occurred in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.server.namenode.TestDecommissioningStatus {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8081//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/8081//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-hdfs.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/8081//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-common.html Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8081//console This message is automatically generated. Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7075) hadoop-fuse-dfs fails because it cannot find JavaKeyStoreProvider$Factory
[ https://issues.apache.org/jira/browse/HDFS-7075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138940#comment-14138940 ] Hudson commented on HDFS-7075: -- FAILURE: Integrated in Hadoop-Mapreduce-trunk #1900 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1900/]) HDFS-7075. hadoop-fuse-dfs fails because it cannot find JavaKeyStoreProvider$Factory. (cmccabe) (cmccabe: rev f23024852502441fc259012664e444e5e51c604a) * hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/crypto/key/KeyProviderFactory.java * hadoop-common-project/hadoop-common/CHANGES.txt hadoop-fuse-dfs fails because it cannot find JavaKeyStoreProvider$Factory - Key: HDFS-7075 URL: https://issues.apache.org/jira/browse/HDFS-7075 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.6.0 Reporter: Colin Patrick McCabe Assignee: Colin Patrick McCabe Fix For: 2.6.0 Attachments: HDFS-7075.001.patch hadoop-fuse-dfs fails complaining with: {code} java.util.ServiceConfigurationError: org.apache.hadoop.crypto.key.KeyProviderFactory: Provider org.apache.hadoop.crypto.key.JavaKeyStoreProvider$Factory not found {code} Here is an example of the hadoop-fuse-dfs debug output. {code} 14/09/04 13:49:04 WARN crypto.CryptoCodec: Crypto codec org.apache.hadoop.crypto.OpensslAesCtrCryptoCodec is not available. hdfsBuilderConnect(forceNewInstance=1, nn=hdfs://hdfs-cdh5-secure-1.vpc.cloudera.com:8020, port=0, kerbTicketCachePath=/tmp/krb5cc_0, userName=root) error: java.util.ServiceConfigurationError: org.apache.hadoop.crypto.key.KeyProviderFactory: Provider org.apache.hadoop.crypto.key.JavaKeyStoreProvider$Factory not found at java.util.ServiceLoader.fail(ServiceLoader.java:231) at java.util.ServiceLoader.access$300(ServiceLoader.java:181) at java.util.ServiceLoader$LazyIterator.next(ServiceLoader.java:365) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
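Editor's note: as background on the failure above, KeyProviderFactory implementations are discovered through java.util.ServiceLoader, which reads META-INF/services/org.apache.hadoop.crypto.key.KeyProviderFactory and loads each listed class during iteration. A listed class that the JVM cannot load, as in the FUSE case, surfaces as the ServiceConfigurationError shown in the stack trace. A minimal sketch of that discovery path:
{code}
import java.util.ServiceLoader;
import org.apache.hadoop.crypto.key.KeyProviderFactory;

public class ListKeyProviderFactories {
  public static void main(String[] args) {
    // Iteration is what triggers class loading, so this is where a missing or
    // unloadable provider class throws java.util.ServiceConfigurationError.
    for (KeyProviderFactory factory : ServiceLoader.load(KeyProviderFactory.class)) {
      System.out.println("Found factory: " + factory.getClass().getName());
    }
  }
}
{code}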
[jira] [Commented] (HDFS-7004) Update KeyProvider instantiation to create by URI
[ https://issues.apache.org/jira/browse/HDFS-7004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138944#comment-14138944 ] Hudson commented on HDFS-7004: -- FAILURE: Integrated in Hadoop-Mapreduce-trunk #1900 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1900/]) HDFS-7004. Update KeyProvider instantiation to create by URI. (wang) (wang: rev 10e8602f32b553a1424f1a9b5f9f74f7b68a49d1) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZones.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSUtil.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java * hadoop-common-project/hadoop-kms/src/test/java/org/apache/hadoop/crypto/key/kms/server/MiniKMS.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-common-project/hadoop-kms/src/main/conf/kms-site.xml * hadoop-common-project/hadoop-kms/src/main/java/org/apache/hadoop/crypto/key/kms/server/KMSWebApp.java * hadoop-hdfs-project/hadoop-hdfs/src/main/resources/hdfs-default.xml * hadoop-common-project/hadoop-kms/src/main/java/org/apache/hadoop/crypto/key/kms/server/KMSConfiguration.java * hadoop-common-project/hadoop-kms/src/site/apt/index.apt.vm * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestReservedRawPaths.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZonesWithHA.java * hadoop-hdfs-project/hadoop-hdfs/src/site/apt/TransparentEncryption.apt.vm * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/cli/TestCryptoAdminCLI.java * hadoop-common-project/hadoop-kms/src/test/java/org/apache/hadoop/crypto/key/kms/server/TestKMS.java Update KeyProvider instantiation to create by URI - Key: HDFS-7004 URL: https://issues.apache.org/jira/browse/HDFS-7004 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.6.0 Reporter: Andrew Wang Assignee: Andrew Wang Fix For: 2.6.0 Attachments: hdfs-7004.001.patch, hdfs-7004.002.patch, hdfs-7004.004.patch See HADOOP-11054, would be good to update the NN/DFSClient to fetch via this method rather than depending on the URI path lookup. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6843) Create FileStatus isEncrypted() method
[ https://issues.apache.org/jira/browse/HDFS-6843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138939#comment-14138939 ] Hudson commented on HDFS-6843: -- FAILURE: Integrated in Hadoop-Mapreduce-trunk #1900 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1900/]) HDFS-6843. Create FileStatus isEncrypted() method (clamb via cmccabe) (cmccabe: rev e3803d002c660f18a5c2ecf32344fd6f3f491a5b) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/PBHelper.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/FsAclPermission.java * hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/permission/FsPermission.java * hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/fs/contract/AbstractContractOpenTest.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/web/JsonUtil.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/FSAclBaseTest.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSDirectory.java * hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/FileStatus.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZones.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/FsPermissionExtension.java * hadoop-common-project/hadoop-common/src/site/markdown/filesystem/filesystem.md HDFS-6843. Add to CHANGES.txt (cmccabe: rev f24ac429d102777fe021e9852cfff38312643512) * hadoop-common-project/hadoop-common/CHANGES.txt Create FileStatus isEncrypted() method -- Key: HDFS-6843 URL: https://issues.apache.org/jira/browse/HDFS-6843 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode, security Affects Versions: 3.0.0 Reporter: Charles Lamb Assignee: Charles Lamb Fix For: 2.6.0 Attachments: HDFS-6843.001.patch, HDFS-6843.002.patch, HDFS-6843.003.patch, HDFS-6843.004.patch, HDFS-6843.005.patch, HDFS-6843.005.patch, HDFS-6843.006.patch, HDFS-6843.007.patch, HDFS-6843.008.patch, HDFS-6843.009.patch, HDFS-6843.010.patch FileStatus should have a 'boolean isEncrypted()' method. (it was in the context of discussing with AndreW about FileStatus being a Writable). Having this method would allow MR JobSubmitter do the following: - BOOLEAN intermediateEncryption = false IF jobconf.contains(mr.intermidate.encryption) THEN intermediateEncryption = jobConf.getBoolean(mr.intermidate.encryption) ELSE IF (I/O)Format INSTANCEOF File(I/O)Format THEN intermediateEncryption = ANY File(I/O)Format HAS a Path with status isEncrypted()==TRUE FI jobConf.setBoolean(mr.intermidate.encryption, intermediateEncryption) FI -- This message was sent by Atlassian JIRA (v6.3.4#6332)
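Editor's note: the description above is written as pseudocode; the following is a hedged Java rendering of it for readability. The property name "mr.intermidate.encryption" and the idea of inspecting the paths owned by a File(Input|Output)Format are taken verbatim from the description and are illustrative, not a real MapReduce configuration key. FileStatus#isEncrypted() is the method this JIRA adds.
{code}
import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.Path;

public class IntermediateEncryptionDecision {
  /** Decide whether intermediate data should be encrypted, per the description above. */
  public static boolean decide(Configuration jobConf, Path[] fileFormatPaths)
      throws IOException {
    if (jobConf.get("mr.intermidate.encryption") != null) {
      return jobConf.getBoolean("mr.intermidate.encryption", false);
    }
    boolean intermediateEncryption = false;
    for (Path p : fileFormatPaths) {  // paths used by a File(Input|Output)Format
      FileStatus status = p.getFileSystem(jobConf).getFileStatus(p);
      if (status.isEncrypted()) {     // the new method proposed by this JIRA
        intermediateEncryption = true;
        break;
      }
    }
    jobConf.setBoolean("mr.intermidate.encryption", intermediateEncryption);
    return intermediateEncryption;
  }
}
{code}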
[jira] [Commented] (HDFS-6705) Create an XAttr that disallows the HDFS admin from accessing a file
[ https://issues.apache.org/jira/browse/HDFS-6705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138936#comment-14138936 ] Hudson commented on HDFS-6705: -- FAILURE: Integrated in Hadoop-Mapreduce-trunk #1900 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1900/]) HDFS-6705. Create an XAttr that disallows the HDFS admin from accessing a file. (clamb via wang) (wang: rev ea4e2e843ecadd8019ea35413f4a34b97a424923) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/common/HdfsServerConstants.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/XAttrPermissionFilter.java * hadoop-hdfs-project/hadoop-hdfs/src/test/resources/testXAttrConf.xml * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSDirectory.java * hadoop-hdfs-project/hadoop-hdfs/src/site/apt/ExtendedAttributes.apt.vm * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/FSXAttrBaseTest.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSNamesystem.java Create an XAttr that disallows the HDFS admin from accessing a file --- Key: HDFS-6705 URL: https://issues.apache.org/jira/browse/HDFS-6705 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode, security Affects Versions: 3.0.0 Reporter: Charles Lamb Assignee: Charles Lamb Fix For: 2.6.0 Attachments: HDFS-6705.001.patch, HDFS-6705.002.patch, HDFS-6705.003.patch, HDFS-6705.004.patch, HDFS-6705.005.patch, HDFS-6705.006.patch, HDFS-6705.007.patch, HDFS-6705.008.patch There needs to be an xattr that specifies that the HDFS admin can not access a file. This is needed for m/r delegation tokens and data at rest encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
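Editor's note: to make the feature concrete, the work adds a protected extended attribute which, once set on a file, prevents even the HDFS superuser from reading that file's contents. A minimal usage sketch follows; the attribute name security.hdfs.unreadable.by.superuser is my recollection of what this JIRA introduced and should be verified against the committed ExtendedAttributes documentation.
{code}
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class MarkUnreadableBySuperuser {
  public static void main(String[] args) throws Exception {
    FileSystem fs = FileSystem.get(new Configuration());
    // A value-less xattr; once present, the HDFS admin cannot read the file's data.
    fs.setXAttr(new Path("/user/alice/secret.dat"),
        "security.hdfs.unreadable.by.superuser", null);
  }
}
{code}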
[jira] [Commented] (HDFS-7078) Fix listEZs to work correctly with snapshots
[ https://issues.apache.org/jira/browse/HDFS-7078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138943#comment-14138943 ] Hudson commented on HDFS-7078: -- FAILURE: Integrated in Hadoop-Mapreduce-trunk #1900 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1900/]) HDFS-7078. Fix listEZs to work correctly with snapshots. (wang) (wang: rev 0ecefe60179968984b1892a14411566b7a0c8df3) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZones.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/EncryptionZoneManager.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt Fix listEZs to work correctly with snapshots Key: HDFS-7078 URL: https://issues.apache.org/jira/browse/HDFS-7078 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.6.0 Reporter: Andrew Wang Assignee: Andrew Wang Fix For: 2.6.0 Attachments: hdfs-7078.001.patch, hdfs-7078.002.patch listEZs will list encryption zones that are only present in a snapshot, rather than only the EZs in the current filesystem state. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
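Editor's note: listing encryption zones is exposed through HdfsAdmin, and the bug here is that zones removed from the live namespace but still captured in a snapshot kept appearing in the listing. A minimal sketch of the call path, assuming the 2.6.0 HdfsAdmin API (the NameNode URI is a placeholder):
{code}
import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.RemoteIterator;
import org.apache.hadoop.hdfs.client.HdfsAdmin;
import org.apache.hadoop.hdfs.protocol.EncryptionZone;

public class ListEncryptionZones {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    HdfsAdmin admin = new HdfsAdmin(URI.create("hdfs://namenode:8020"), conf);
    RemoteIterator<EncryptionZone> zones = admin.listEncryptionZones();
    while (zones.hasNext()) {
      // With the fix, only zones present in the current filesystem state are returned.
      System.out.println(zones.next().getPath());
    }
  }
}
{code}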
[jira] [Commented] (HDFS-6843) Create FileStatus isEncrypted() method
[ https://issues.apache.org/jira/browse/HDFS-6843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138952#comment-14138952 ] Hudson commented on HDFS-6843: -- FAILURE: Integrated in Hadoop-Hdfs-trunk #1875 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1875/]) HDFS-6843. Create FileStatus isEncrypted() method (clamb via cmccabe) (cmccabe: rev e3803d002c660f18a5c2ecf32344fd6f3f491a5b) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSDirectory.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/FsAclPermission.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/web/JsonUtil.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/PBHelper.java * hadoop-common-project/hadoop-common/src/site/markdown/filesystem/filesystem.md * hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/permission/FsPermission.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/FsPermissionExtension.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/FSAclBaseTest.java * hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/fs/contract/AbstractContractOpenTest.java * hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/FileStatus.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZones.java HDFS-6843. Add to CHANGES.txt (cmccabe: rev f24ac429d102777fe021e9852cfff38312643512) * hadoop-common-project/hadoop-common/CHANGES.txt Create FileStatus isEncrypted() method -- Key: HDFS-6843 URL: https://issues.apache.org/jira/browse/HDFS-6843 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode, security Affects Versions: 3.0.0 Reporter: Charles Lamb Assignee: Charles Lamb Fix For: 2.6.0 Attachments: HDFS-6843.001.patch, HDFS-6843.002.patch, HDFS-6843.003.patch, HDFS-6843.004.patch, HDFS-6843.005.patch, HDFS-6843.005.patch, HDFS-6843.006.patch, HDFS-6843.007.patch, HDFS-6843.008.patch, HDFS-6843.009.patch, HDFS-6843.010.patch FileStatus should have a 'boolean isEncrypted()' method. (it was in the context of discussing with AndreW about FileStatus being a Writable). Having this method would allow MR JobSubmitter do the following: - BOOLEAN intermediateEncryption = false IF jobconf.contains(mr.intermidate.encryption) THEN intermediateEncryption = jobConf.getBoolean(mr.intermidate.encryption) ELSE IF (I/O)Format INSTANCEOF File(I/O)Format THEN intermediateEncryption = ANY File(I/O)Format HAS a Path with status isEncrypted()==TRUE FI jobConf.setBoolean(mr.intermidate.encryption, intermediateEncryption) FI -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7078) Fix listEZs to work correctly with snapshots
[ https://issues.apache.org/jira/browse/HDFS-7078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138956#comment-14138956 ] Hudson commented on HDFS-7078: -- FAILURE: Integrated in Hadoop-Hdfs-trunk #1875 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1875/]) HDFS-7078. Fix listEZs to work correctly with snapshots. (wang) (wang: rev 0ecefe60179968984b1892a14411566b7a0c8df3) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZones.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/EncryptionZoneManager.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt Fix listEZs to work correctly with snapshots Key: HDFS-7078 URL: https://issues.apache.org/jira/browse/HDFS-7078 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.6.0 Reporter: Andrew Wang Assignee: Andrew Wang Fix For: 2.6.0 Attachments: hdfs-7078.001.patch, hdfs-7078.002.patch listEZs will list encryption zones that are only present in a snapshot, rather than only the EZs in the current filesystem state. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7075) hadoop-fuse-dfs fails because it cannot find JavaKeyStoreProvider$Factory
[ https://issues.apache.org/jira/browse/HDFS-7075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138953#comment-14138953 ] Hudson commented on HDFS-7075: -- FAILURE: Integrated in Hadoop-Hdfs-trunk #1875 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1875/]) HDFS-7075. hadoop-fuse-dfs fails because it cannot find JavaKeyStoreProvider$Factory. (cmccabe) (cmccabe: rev f23024852502441fc259012664e444e5e51c604a) * hadoop-common-project/hadoop-common/CHANGES.txt * hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/crypto/key/KeyProviderFactory.java hadoop-fuse-dfs fails because it cannot find JavaKeyStoreProvider$Factory - Key: HDFS-7075 URL: https://issues.apache.org/jira/browse/HDFS-7075 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.6.0 Reporter: Colin Patrick McCabe Assignee: Colin Patrick McCabe Fix For: 2.6.0 Attachments: HDFS-7075.001.patch hadoop-fuse-dfs fails complaining with: {code} java.util.ServiceConfigurationError: org.apache.hadoop.crypto.key.KeyProviderFactory: Provider org.apache.hadoop.crypto.key.JavaKeyStoreProvider$Factory not found {code} Here is an example of the hadoop-fuse-dfs debug output. {code} 14/09/04 13:49:04 WARN crypto.CryptoCodec: Crypto codec org.apache.hadoop.crypto.OpensslAesCtrCryptoCodec is not available. hdfsBuilderConnect(forceNewInstance=1, nn=hdfs://hdfs-cdh5-secure-1.vpc.cloudera.com:8020, port=0, kerbTicketCachePath=/tmp/krb5cc_0, userName=root) error: java.util.ServiceConfigurationError: org.apache.hadoop.crypto.key.KeyProviderFactory: Provider org.apache.hadoop.crypto.key.JavaKeyStoreProvider$Factory not found at java.util.ServiceLoader.fail(ServiceLoader.java:231) at java.util.ServiceLoader.access$300(ServiceLoader.java:181) at java.util.ServiceLoader$LazyIterator.next(ServiceLoader.java:365) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6705) Create an XAttr that disallows the HDFS admin from accessing a file
[ https://issues.apache.org/jira/browse/HDFS-6705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138949#comment-14138949 ] Hudson commented on HDFS-6705: -- FAILURE: Integrated in Hadoop-Hdfs-trunk #1875 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1875/]) HDFS-6705. Create an XAttr that disallows the HDFS admin from accessing a file. (clamb via wang) (wang: rev ea4e2e843ecadd8019ea35413f4a34b97a424923) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/XAttrPermissionFilter.java * hadoop-hdfs-project/hadoop-hdfs/src/test/resources/testXAttrConf.xml * hadoop-hdfs-project/hadoop-hdfs/src/site/apt/ExtendedAttributes.apt.vm * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSNamesystem.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/FSXAttrBaseTest.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/common/HdfsServerConstants.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSDirectory.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt Create an XAttr that disallows the HDFS admin from accessing a file --- Key: HDFS-6705 URL: https://issues.apache.org/jira/browse/HDFS-6705 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode, security Affects Versions: 3.0.0 Reporter: Charles Lamb Assignee: Charles Lamb Fix For: 2.6.0 Attachments: HDFS-6705.001.patch, HDFS-6705.002.patch, HDFS-6705.003.patch, HDFS-6705.004.patch, HDFS-6705.005.patch, HDFS-6705.006.patch, HDFS-6705.007.patch, HDFS-6705.008.patch There needs to be an xattr that specifies that the HDFS admin can not access a file. This is needed for m/r delegation tokens and data at rest encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7004) Update KeyProvider instantiation to create by URI
[ https://issues.apache.org/jira/browse/HDFS-7004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138957#comment-14138957 ] Hudson commented on HDFS-7004: -- FAILURE: Integrated in Hadoop-Hdfs-trunk #1875 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1875/]) HDFS-7004. Update KeyProvider instantiation to create by URI. (wang) (wang: rev 10e8602f32b553a1424f1a9b5f9f74f7b68a49d1) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java * hadoop-common-project/hadoop-kms/src/site/apt/index.apt.vm * hadoop-common-project/hadoop-kms/src/test/java/org/apache/hadoop/crypto/key/kms/server/TestKMS.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestReservedRawPaths.java * hadoop-common-project/hadoop-kms/src/test/java/org/apache/hadoop/crypto/key/kms/server/MiniKMS.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/main/resources/hdfs-default.xml * hadoop-common-project/hadoop-kms/src/main/java/org/apache/hadoop/crypto/key/kms/server/KMSConfiguration.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/cli/TestCryptoAdminCLI.java * hadoop-common-project/hadoop-kms/src/main/conf/kms-site.xml * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZonesWithHA.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSUtil.java * hadoop-common-project/hadoop-kms/src/main/java/org/apache/hadoop/crypto/key/kms/server/KMSWebApp.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZones.java * hadoop-hdfs-project/hadoop-hdfs/src/site/apt/TransparentEncryption.apt.vm Update KeyProvider instantiation to create by URI - Key: HDFS-7004 URL: https://issues.apache.org/jira/browse/HDFS-7004 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.6.0 Reporter: Andrew Wang Assignee: Andrew Wang Fix For: 2.6.0 Attachments: hdfs-7004.001.patch, hdfs-7004.002.patch, hdfs-7004.004.patch See HADOOP-11054, would be good to update the NN/DFSClient to fetch via this method rather than depending on the URI path lookup. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6970) Move startFile EDEK retries to the DFSClient
[ https://issues.apache.org/jira/browse/HDFS-6970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138973#comment-14138973 ] Yi Liu commented on HDFS-6970: -- This modification LGTM, thanks [~andrew.wang] Move startFile EDEK retries to the DFSClient Key: HDFS-6970 URL: https://issues.apache.org/jira/browse/HDFS-6970 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.5.0 Reporter: Andrew Wang Assignee: Andrew Wang Attachments: hdfs-6970.001.patch [~sureshms] pointed out that holding on to an RPC handler while talking to the KMS is bad, since it can exhaust the available handlers. Let's avoid this by doing retries at the DFSClient rather than in the RPC handler, and moving EDEK fetching to the background. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
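Editor's note: the change under review moves the wait for an encryption key from the NameNode's RPC handler to the client. A hedged sketch of that client-side shape follows; it is not the patch itself. The RetryStartFileException name is taken from the later review comments on this issue, and createOnce and CREATE_RETRY_COUNT are hypothetical stand-ins for the single startFile RPC and the retry bound.
{code}
import java.io.IOException;

/** Hedged sketch of the DFSClient-side retry loop described above. */
public abstract class StartFileRetrier {
  /** Thrown by the single-attempt create when the NameNode's EDEK is not yet available. */
  public static class RetryStartFileException extends IOException {}

  private static final int CREATE_RETRY_COUNT = 10;  // illustrative bound

  /** One startFile attempt; in the real client this is an RPC to the NameNode. */
  protected abstract void createOnce(String path) throws IOException;

  public void createWithRetries(String path) throws IOException {
    for (int attempt = 0; attempt < CREATE_RETRY_COUNT; attempt++) {
      try {
        createOnce(path);
        return;
      } catch (RetryStartFileException e) {
        // The NameNode handler has already returned, so no handler thread is held
        // while the KMS catches up; the client simply tries again.
      }
    }
    throw new IOException("Could not create " + path + " after retries");
  }
}
{code}
The design point is exactly the one Suresh raised: a slow KMS should cost the client a retry, not pin a scarce NameNode RPC handler.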
[jira] [Commented] (HDFS-6584) Support Archival Storage
[ https://issues.apache.org/jira/browse/HDFS-6584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138977#comment-14138977 ] Hadoop QA commented on HDFS-6584: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669683/h6584_20140918b.patch against trunk revision ee21b13. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 28 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.ipc.TestFairCallQueue org.apache.hadoop.ipc.TestCallQueueManager org.apache.hadoop.crypto.random.TestOsSecureRandom org.apache.hadoop.hdfs.server.mover.TestStorageMover org.apache.hadoop.tracing.TestTracing org.apache.hadoop.hdfs.server.datanode.TestBPOfferService org.apache.hadoop.hdfs.tools.offlineEditsViewer.TestOfflineEditsViewer org.apache.hadoop.hdfs.TestHFlush org.apache.hadoop.hdfs.server.namenode.ha.TestPipelinesFailover org.apache.hadoop.hdfs.TestEncryptionZonesWithKMS {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8078//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8078//console This message is automatically generated. Support Archival Storage Key: HDFS-6584 URL: https://issues.apache.org/jira/browse/HDFS-6584 Project: Hadoop HDFS Issue Type: New Feature Components: balancer, namenode Reporter: Tsz Wo Nicholas Sze Assignee: Tsz Wo Nicholas Sze Attachments: HDFS-6584.000.patch, HDFSArchivalStorageDesign20140623.pdf, HDFSArchivalStorageDesign20140715.pdf, archival-storage-testplan.pdf, h6584_20140907.patch, h6584_20140908.patch, h6584_20140908b.patch, h6584_20140911.patch, h6584_20140911b.patch, h6584_20140915.patch, h6584_20140916.patch, h6584_20140916.patch, h6584_20140917.patch, h6584_20140917b.patch, h6584_20140918.patch, h6584_20140918b.patch In most of the Hadoop clusters, as more and more data is stored for longer time, the demand for storage is outstripping the compute. Hadoop needs a cost effective and easy to manage solution to meet this demand for storage. Current solution is: - Delete the old unused data. This comes at operational cost of identifying unnecessary data and deleting them manually. - Add more nodes to the clusters. This adds along with storage capacity unnecessary compute capacity to the cluster. Hadoop needs a solution to decouple growing storage capacity from compute capacity. Nodes with higher density and less expensive storage with low compute power are becoming available and can be used as cold storage in the clusters. Based on policy the data from hot storage can be moved to cold storage. Adding more nodes to the cold storage can grow the storage independent of the compute capacity in the cluster. 
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6584) Support Archival Storage
[ https://issues.apache.org/jira/browse/HDFS-6584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139122#comment-14139122 ] Hadoop QA commented on HDFS-6584: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669683/h6584_20140918b.patch against trunk revision ee21b13. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 28 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.TestEncryptionZonesWithKMS org.apache.hadoop.hdfs.web.TestWebHdfsFileSystemContract org.apache.hadoop.hdfs.server.namenode.ha.TestPipelinesFailover org.apache.hadoop.hdfs.server.mover.TestStorageMover org.apache.hadoop.hdfs.tools.offlineEditsViewer.TestOfflineEditsViewer {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8082//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8082//console This message is automatically generated. Support Archival Storage Key: HDFS-6584 URL: https://issues.apache.org/jira/browse/HDFS-6584 Project: Hadoop HDFS Issue Type: New Feature Components: balancer, namenode Reporter: Tsz Wo Nicholas Sze Assignee: Tsz Wo Nicholas Sze Attachments: HDFS-6584.000.patch, HDFSArchivalStorageDesign20140623.pdf, HDFSArchivalStorageDesign20140715.pdf, archival-storage-testplan.pdf, h6584_20140907.patch, h6584_20140908.patch, h6584_20140908b.patch, h6584_20140911.patch, h6584_20140911b.patch, h6584_20140915.patch, h6584_20140916.patch, h6584_20140916.patch, h6584_20140917.patch, h6584_20140917b.patch, h6584_20140918.patch, h6584_20140918b.patch In most of the Hadoop clusters, as more and more data is stored for longer time, the demand for storage is outstripping the compute. Hadoop needs a cost effective and easy to manage solution to meet this demand for storage. Current solution is: - Delete the old unused data. This comes at operational cost of identifying unnecessary data and deleting them manually. - Add more nodes to the clusters. This adds along with storage capacity unnecessary compute capacity to the cluster. Hadoop needs a solution to decouple growing storage capacity from compute capacity. Nodes with higher density and less expensive storage with low compute power are becoming available and can be used as cold storage in the clusters. Based on policy the data from hot storage can be moved to cold storage. Adding more nodes to the cold storage can grow the storage independent of the compute capacity in the cluster. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7088) Archival Storage: fix TestBalancer and TestBalancerWithMultipleNameNodes
[ https://issues.apache.org/jira/browse/HDFS-7088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139244#comment-14139244 ] Jing Zhao commented on HDFS-7088: - Thanks for the fix, [~szetszwo]! The patch looks good to me. I've also verified that all the balancer related tests passed with the patch. +1 Archival Storage: fix TestBalancer and TestBalancerWithMultipleNameNodes Key: HDFS-7088 URL: https://issues.apache.org/jira/browse/HDFS-7088 Project: Hadoop HDFS Issue Type: Sub-task Components: balancer, test Reporter: Tsz Wo Nicholas Sze Assignee: Tsz Wo Nicholas Sze Priority: Minor Attachments: h7088_20140918.patch {noformat} java.lang.AssertionError: expected:0 but was:-3 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.runBalancer(TestBalancerWithMultipleNameNodes.java:163) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.runTest(TestBalancerWithMultipleNameNodes.java:365) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.testBalancer(TestBalancerWithMultipleNameNodes.java:379) {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6841) Use Time.monotonicNow() wherever applicable instead of Time.now()
[ https://issues.apache.org/jira/browse/HDFS-6841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinayakumar B updated HDFS-6841: Attachment: HDFS-6841-003.patch Updated as per [~cmccabe] comments. Use Time.monotonicNow() wherever applicable instead of Time.now() - Key: HDFS-6841 URL: https://issues.apache.org/jira/browse/HDFS-6841 Project: Hadoop HDFS Issue Type: Bug Reporter: Vinayakumar B Assignee: Vinayakumar B Attachments: HDFS-6841-001.patch, HDFS-6841-002.patch, HDFS-6841-003.patch {{Time.now()}} used in many places to calculate elapsed time. This should be replaced with {{Time.monotonicNow()}} to avoid effect of System time changes on elapsed time calculations. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
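Editor's note: a small example of the pattern this patch applies throughout the codebase. Time.now() and Time.monotonicNow() are real methods on org.apache.hadoop.util.Time; the sleep stands in for whatever work is being timed.
{code}
import org.apache.hadoop.util.Time;

public class ElapsedTimeExample {
  public static void main(String[] args) throws InterruptedException {
    // Time.now() tracks the wall clock and jumps if the system time is changed;
    // Time.monotonicNow() only moves forward, so it is safe for measuring durations.
    long start = Time.monotonicNow();
    Thread.sleep(100);  // placeholder for real work
    long elapsedMs = Time.monotonicNow() - start;
    System.out.println("elapsed ms: " + elapsedMs);
  }
}
{code}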
[jira] [Updated] (HDFS-6995) Block should be placed in the client's 'rack-local' node if 'client-local' node is not available
[ https://issues.apache.org/jira/browse/HDFS-6995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinayakumar B updated HDFS-6995: Attachment: HDFS-6995-003.patch Rebased the patch Block should be placed in the client's 'rack-local' node if 'client-local' node is not available Key: HDFS-6995 URL: https://issues.apache.org/jira/browse/HDFS-6995 Project: Hadoop HDFS Issue Type: Bug Components: namenode Affects Versions: 2.5.0 Reporter: Vinayakumar B Assignee: Vinayakumar B Attachments: HDFS-6995-001.patch, HDFS-6995-002.patch, HDFS-6995-003.patch HDFS cluster is rack aware. Client is in different node than of datanode, but Same rack contains one or more datanodes. In this case first preference should be given to select 'rack-local' node. Currently, since no Node in clusterMap corresponds to client's location, blockplacement policy choosing a *random* node as local node and proceeding for further placements. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (HDFS-7088) Archival Storage: fix TestBalancer and TestBalancerWithMultipleNameNodes
[ https://issues.apache.org/jira/browse/HDFS-7088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jing Zhao resolved HDFS-7088. - Resolution: Fixed Hadoop Flags: Reviewed I've committed this. Archival Storage: fix TestBalancer and TestBalancerWithMultipleNameNodes Key: HDFS-7088 URL: https://issues.apache.org/jira/browse/HDFS-7088 Project: Hadoop HDFS Issue Type: Sub-task Components: balancer, test Reporter: Tsz Wo Nicholas Sze Assignee: Tsz Wo Nicholas Sze Priority: Minor Attachments: h7088_20140918.patch {noformat} java.lang.AssertionError: expected:0 but was:-3 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.runBalancer(TestBalancerWithMultipleNameNodes.java:163) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.runTest(TestBalancerWithMultipleNameNodes.java:365) at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.testBalancer(TestBalancerWithMultipleNameNodes.java:379) {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6581) Write to single replica in memory
[ https://issues.apache.org/jira/browse/HDFS-6581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arpit Agarwal updated HDFS-6581: Attachment: HDFS-6581.merge.04.patch Write to single replica in memory - Key: HDFS-6581 URL: https://issues.apache.org/jira/browse/HDFS-6581 Project: Hadoop HDFS Issue Type: Bug Components: datanode Reporter: Arpit Agarwal Assignee: Arpit Agarwal Attachments: HDFS-6581.merge.01.patch, HDFS-6581.merge.02.patch, HDFS-6581.merge.03.patch, HDFS-6581.merge.04.patch, HDFSWriteableReplicasInMemory.pdf, Test-Plan-for-HDFS-6581-Memory-Storage.pdf Per discussion with the community on HDFS-5851, we will implement writing to a single replica in DN memory via DataTransferProtocol. This avoids some of the issues with short-circuit writes, which we can revisit at a later time. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7073) Allow falling back to a non-SASL connection on DataTransferProtocol in several edge cases.
[ https://issues.apache.org/jira/browse/HDFS-7073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139272#comment-14139272 ] Chris Nauroth commented on HDFS-7073: - bq. In the patch, fallback for writeblock is handled, but fallback for readblock is not handled. Yes, I spotted the same thing in my testing yesterday and chose to cancel the patch to make it clear that it's not ready. I'm working on a new patch. Thank you for your testing too. bq. The test case for this scenario is hard to write because UserGroupInformation#isSecurityEnabled() is static... Yes, agreed. Unfortunately, until we refactor some of the static stuff inside {{UserGroupInformation}}, it's going to be impossible to put tests covering these kinds of cross-cluster scenarios directly into the source tree. We're having to rely on external system tests to cover this. Last time I looked at refactoring {{UserGroupInformation}}, it looked like it was going to be a big effort, and possibly backwards-incompatible. bq. If we allow this type of fallback, as discussed in HDFS-2856 about the attack vector, a malicious task can easily listen on the DN's port after it dies and steal the block access token. So we'd better not allow the fallback? Thanks, great catch. The difficulty here is that {{ipc.client.fallback-to-simple-auth-allowed}} controls fallback globally regardless of which cluster the client is connecting to. One of the big use cases motivating fallback is distcp between a secure cluster and a non-secure cluster. In that scenario, setting {{ipc.client.fallback-to-simple-auth-allowed}} could accidentally trigger fallback during communication with the secured cluster, when we really only want it for the unsecured cluster. I'm going to explore an alternative implementation that detects if fallback actually occurred during the corresponding NameNode interaction before the DataTransferProtocol call. This would tell us unambiguously if the remote DataNode was unsecured. Doing this would require some additional plumbing at the RPC layer. Allow falling back to a non-SASL connection on DataTransferProtocol in several edge cases. -- Key: HDFS-7073 URL: https://issues.apache.org/jira/browse/HDFS-7073 Project: Hadoop HDFS Issue Type: Bug Components: datanode, hdfs-client, security Reporter: Chris Nauroth Assignee: Chris Nauroth Attachments: HDFS-7073.1.patch HDFS-2856 implemented general SASL support on DataTransferProtocol. Part of that work also included a fallback mode in case the remote cluster is running under a different configuration without SASL. I've discovered a few edge case configurations that this did not support: * Cluster is unsecured, but has block access tokens enabled. This is not something I've seen done in practice, but I've heard historically it has been allowed. The HDFS-2856 code relied on seeing an empty block access token to trigger fallback, and this doesn't work if the unsecured cluster actually is using block access tokens. * The DataNode has an unpublicized testing configuration property that could be used to skip the privileged port check. However, the HDFS-2856 code is still enforcing requirement of SASL when the ports are not privileged, so this would force existing configurations to make changes to activate SASL. This patch will restore the old behavior so that these edge case configurations will continue to work the same way. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
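Editor's note: the difficulty Chris describes is that the fallback knob is a single client-wide flag, so it cannot be scoped to one remote cluster in a distcp between secure and insecure clusters. A minimal sketch of how a client would read it; the property name is taken from the comment above, and the false default is assumed.
{code}
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.security.UserGroupInformation;

public class FallbackFlagCheck {
  public static void main(String[] args) {
    Configuration conf = new Configuration();
    // One global flag: it applies to every connection the client makes.
    boolean fallbackAllowed =
        conf.getBoolean("ipc.client.fallback-to-simple-auth-allowed", false);
    System.out.println("security enabled: " + UserGroupInformation.isSecurityEnabled());
    System.out.println("fallback to simple auth allowed: " + fallbackAllowed);
  }
}
{code}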
[jira] [Resolved] (HDFS-7084) FsDatasetImpl#copyBlockFiles debug log can be improved
[ https://issues.apache.org/jira/browse/HDFS-7084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arpit Agarwal resolved HDFS-7084. - Resolution: Fixed Fix Version/s: HDFS-6581 Hadoop Flags: Reviewed +1, committed to the feature branch. Thanks Xiaoyu! The old log message was incorrect. FsDatasetImpl#copyBlockFiles debug log can be improved -- Key: HDFS-7084 URL: https://issues.apache.org/jira/browse/HDFS-7084 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: HDFS-6581 Reporter: Xiaoyu Yao Assignee: Xiaoyu Yao Priority: Minor Fix For: HDFS-6581 Attachments: HDFS-7084.0.patch "addBlock: Moved" should be replaced with "Copied" or "lazyPersistReplica: Copied" to avoid confusion. {code} static File[] copyBlockFiles(long blockId, long genStamp, File srcMeta, File srcFile, File destRoot) { ... if (LOG.isDebugEnabled()) { LOG.debug("addBlock: Moved " + srcMeta + " to " + dstMeta); LOG.debug("addBlock: Moved " + srcFile + " to " + dstFile); } } {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
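Editor's note: a hedged sketch of what the corrected messages might look like after the rename the reporter suggests; the exact wording is the committer's choice and is not quoted from the committed patch.
{code}
import java.io.File;
import org.apache.commons.logging.Log;
import org.apache.commons.logging.LogFactory;

public class CopyBlockFilesLogging {
  private static final Log LOG = LogFactory.getLog(CopyBlockFilesLogging.class);

  static void logCopied(File srcMeta, File dstMeta, File srcFile, File dstFile) {
    if (LOG.isDebugEnabled()) {
      // "Copied", not "Moved": copyBlockFiles leaves the source files in place.
      LOG.debug("Copied " + srcMeta + " to " + dstMeta);
      LOG.debug("Copied " + srcFile + " to " + dstFile);
    }
  }
}
{code}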
[jira] [Commented] (HDFS-6584) Support Archival Storage
[ https://issues.apache.org/jira/browse/HDFS-6584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139273#comment-14139273 ] Jing Zhao commented on HDFS-6584: - Failures of TestEncryptionZonesWithKMS, TestWebHdfsFileSystemContract, and TestPipelinesFailover are also seen in other Jenkins run and should be unrelated. Failure of TestOfflineEditsViewer is expected since we need to update the editsStored binary file. Failure of TestStorageMover cannot be reproduced in my local machine (I run the test 100 times but still could not reproduce the failure). Maybe it's related to the Jenkins environment. We can track it in a separate jira. I think the feature is ready to be merged into trunk once the vote is closed. [~szetszwo], can you close the vote in the dev mailing list? Support Archival Storage Key: HDFS-6584 URL: https://issues.apache.org/jira/browse/HDFS-6584 Project: Hadoop HDFS Issue Type: New Feature Components: balancer, namenode Reporter: Tsz Wo Nicholas Sze Assignee: Tsz Wo Nicholas Sze Attachments: HDFS-6584.000.patch, HDFSArchivalStorageDesign20140623.pdf, HDFSArchivalStorageDesign20140715.pdf, archival-storage-testplan.pdf, h6584_20140907.patch, h6584_20140908.patch, h6584_20140908b.patch, h6584_20140911.patch, h6584_20140911b.patch, h6584_20140915.patch, h6584_20140916.patch, h6584_20140916.patch, h6584_20140917.patch, h6584_20140917b.patch, h6584_20140918.patch, h6584_20140918b.patch In most of the Hadoop clusters, as more and more data is stored for longer time, the demand for storage is outstripping the compute. Hadoop needs a cost effective and easy to manage solution to meet this demand for storage. Current solution is: - Delete the old unused data. This comes at operational cost of identifying unnecessary data and deleting them manually. - Add more nodes to the clusters. This adds along with storage capacity unnecessary compute capacity to the cluster. Hadoop needs a solution to decouple growing storage capacity from compute capacity. Nodes with higher density and less expensive storage with low compute power are becoming available and can be used as cold storage in the clusters. Based on policy the data from hot storage can be moved to cold storage. Adding more nodes to the cold storage can grow the storage independent of the compute capacity in the cluster. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu updated HDFS-6808: Attachment: HDFS-6808.007.patch Update the patch to address findbugs warnings. Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch, HDFS-6808.007.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu updated HDFS-6808: Attachment: HDFS-6808.007.combo.patch Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch, HDFS-6808.007.combo.patch, HDFS-6808.007.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7046) HA NN can NPE upon transition to active
[ https://issues.apache.org/jira/browse/HDFS-7046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139307#comment-14139307 ] Aaron T. Myers commented on HDFS-7046: -- I agree with Kihwal and Daryn that the benefit of starting the process of leaving safemode while edits are still being processed seems negligible, so it's better to be safe here and just wait for the transition to active to complete. In a steady state cluster it's very unlikely for the standby to be in safemode anyway, since the NN will not enter safemode on its own except immediately after startup, and there's little or no reason for the admin to ever put the standby in safemode anyway. +1, the patch makes sense to me. I agree that it would be pretty difficult to write a test for this case, and now that the issue is pointed out the fix is quite straightforward, so I'm OK committing this without a test. HA NN can NPE upon transition to active --- Key: HDFS-7046 URL: https://issues.apache.org/jira/browse/HDFS-7046 Project: Hadoop HDFS Issue Type: Bug Components: namenode Affects Versions: 3.0.0, 2.5.0 Reporter: Daryn Sharp Assignee: Kihwal Lee Priority: Critical Attachments: HDFS-7046.patch, HDFS-7046_test_reproduce.patch While processing edits, the NN may decide after adjusting block totals to leave safe mode - in the middle of the edit. Going active starts the secret manager which generates a new secret key, which in turn generates an edit, which NPEs because the edit log is not open. # Transitions should _not_ occur in the middle of an edit. # The edit log appears to claim it's open for write when the stream isn't even open -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6727) Refresh data volumes on DataNode based on configuration changes
[ https://issues.apache.org/jira/browse/HDFS-6727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139311#comment-14139311 ] Colin Patrick McCabe commented on HDFS-6727: Thanks for addressing my comments about addVolumes. Can you add a comment to the declaration of DataNode#dataDirs explaining that it must be accessed while holding the DataNode lock? {code} void recoverTransitionRead(DataNode datanode, String bpID, NamespaceInfo nsInfo, - Collection<StorageLocation> dataDirs, StartupOption startOpt) throws IOException { + final Collection<StorageLocation> dataDirs, StartupOption startOpt) throws IOException { {code} It seems like the patch would be smaller without this... In this comment: {code} + * It should only be used for deactivating disks. {code} I think "It should only be used when deactivating disks" would be clearer. This method doesn't itself deactivate the disk... it's just used when deactivating disks. {code} +// If IOException raises from FsVolumeImpl() or getVolumeMap(), there is +// nothing needed to be rolled back to make various data structures, e.g., +// storageMap and asyncDiskService, consistent. +final FsVolumeImpl fsVolume = new FsVolumeImpl( +this, sd.getStorageUuid(), dir, this.conf, storageType); +final ReplicaMap tempVolumeMap = new ReplicaMap(fsVolume); + +List<IOException> exceptions = Lists.newArrayList(); +for (final String bpid : bpids) { + try { +fsVolume.addBlockPool(bpid, this.conf); +fsVolume.getVolumeMap(bpid, volumeMap); + } catch (IOException e) { {code} I like the idea behind this comment, but maybe putting it inside the catch block would make things clearer. Also, maybe it could be shortened to something like "no rollback is needed here". +1 once those changes are addressed. Thanks, Eddy. Refresh data volumes on DataNode based on configuration changes --- Key: HDFS-6727 URL: https://issues.apache.org/jira/browse/HDFS-6727 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0, 2.4.1 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Labels: datanode Attachments: HDFS-6727.000.delta-HDFS-6775.txt, HDFS-6727.001.patch, HDFS-6727.002.patch, HDFS-6727.003.patch, HDFS-6727.004.patch, HDFS-6727.005.patch, HDFS-6727.006.patch, HDFS-6727.006.patch, HDFS-6727.007.patch, HDFS-6727.combo.patch, patchFindBugsOutputhadoop-hdfs.txt HDFS-1362 requires DataNode to reload configuration file during the runtime, so that DN can change the data volumes dynamically. This JIRA reuses the reconfiguration framework introduced by HADOOP-7001 to enable DN to reconfigure at runtime. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
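Editor's note: a self-contained sketch of the error-handling pattern being reviewed above, where per-block-pool failures are collected rather than rolled back and reported together once the loop finishes. BlockPoolRegistrar and registerBlockPool are hypothetical stand-ins for the FsVolumeImpl#addBlockPool and getVolumeMap calls in the patch.
{code}
import java.io.IOException;
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;

public abstract class BlockPoolRegistrar {
  /** Hypothetical stand-in for fsVolume.addBlockPool(...) and fsVolume.getVolumeMap(...). */
  protected abstract void registerBlockPool(String bpid) throws IOException;

  public void registerAll(Collection<String> bpids) throws IOException {
    List<IOException> exceptions = new ArrayList<IOException>();
    for (String bpid : bpids) {
      try {
        registerBlockPool(bpid);
      } catch (IOException e) {
        // No rollback is needed here: nothing shared (e.g. storageMap,
        // asyncDiskService) has been touched yet for this block pool.
        exceptions.add(e);
      }
    }
    if (!exceptions.isEmpty()) {
      throw new IOException(exceptions.size() + " block pool(s) failed to register",
          exceptions.get(0));
    }
  }
}
{code}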
[jira] [Updated] (HDFS-7047) Expose FileStatus#isEncrypted in libhdfs
[ https://issues.apache.org/jira/browse/HDFS-7047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Colin Patrick McCabe updated HDFS-7047: --- Resolution: Fixed Fix Version/s: 2.6.0 Status: Resolved (was: Patch Available) Expose FileStatus#isEncrypted in libhdfs Key: HDFS-7047 URL: https://issues.apache.org/jira/browse/HDFS-7047 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.6.0 Reporter: Andrew Wang Assignee: Colin Patrick McCabe Fix For: 2.6.0 Attachments: HDFS-7047.001.patch, HDFS-7047.003.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6970) Move startFile EDEK retries to the DFSClient
[ https://issues.apache.org/jira/browse/HDFS-6970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139331#comment-14139331 ] Colin Patrick McCabe commented on HDFS-6970: This is a nice simplification. In {code} // Flip-flop between two EZs to repeatedly fail -for (int i=0; i < 10; i++) { +for (int i=0; i < 11; i++) { injector.ready.await(); {code} Can you put this 10 as a constant in DFSOutputStream, VisibleForTesting? +1 once that's addressed Move startFile EDEK retries to the DFSClient Key: HDFS-6970 URL: https://issues.apache.org/jira/browse/HDFS-6970 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.5.0 Reporter: Andrew Wang Assignee: Andrew Wang Attachments: hdfs-6970.001.patch [~sureshms] pointed out that holding on to an RPC handler while talking to the KMS is bad, since it can exhaust the available handlers. Let's avoid this by doing retries at the DFSClient rather than in the RPC handler, and moving EDEK fetching to the background. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6970) Move startFile EDEK retries to the DFSClient
[ https://issues.apache.org/jira/browse/HDFS-6970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139346#comment-14139346 ] Charles Lamb commented on HDFS-6970: LGTM. Nits only. DFSOutputStream.java newStreamForCreate, buffersize arg is no longer used so perhaps mark it as such either with a comment or by renaming to your favorite version of ignore. FSNamesystem.java Seems like there was some whitespace introduced. RetryStartFileException IntelliJ says the second ctor is unused. Is it there for posterity? TestEncryptionZones#testStartFileRetry - at first blush this seems to fix the timeout problems we've been seeing. Move startFile EDEK retries to the DFSClient Key: HDFS-6970 URL: https://issues.apache.org/jira/browse/HDFS-6970 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.5.0 Reporter: Andrew Wang Assignee: Andrew Wang Attachments: hdfs-6970.001.patch [~sureshms] pointed out that holding on to an RPC handler while talking to the KMS is bad, since it can exhaust the available handlers. Let's avoid this by doing retries at the DFSClient rather than in the RPC handler, and moving EDEK fetching to the background. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6581) Write to single replica in memory
[ https://issues.apache.org/jira/browse/HDFS-6581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139352#comment-14139352 ] Hadoop QA commented on HDFS-6581: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669765/HDFS-6581.merge.04.patch against trunk revision 485c96e. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 29 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:red}-1 javadoc{color}. The javadoc tool appears to have generated 112 warning messages. See https://builds.apache.org/job/PreCommit-HDFS-Build/8085//artifact/trunk/patchprocess/diffJavadocWarnings.txt for details. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:red}-1 findbugs{color}. The patch appears to introduce 4 new Findbugs (version 2.0.3) warnings. {color:red}-1 release audit{color}. The applied patch generated 1 release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.crypto.random.TestOsSecureRandom org.apache.hadoop.ipc.TestCallQueueManager org.apache.hadoop.ipc.TestFairCallQueue The test build failed in hadoop-hdfs-project/hadoop-hdfs {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8085//testReport/ Release audit warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/8085//artifact/trunk/patchprocess/patchReleaseAuditProblems.txt Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/8085//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-hdfs.html Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8085//console This message is automatically generated. Write to single replica in memory - Key: HDFS-6581 URL: https://issues.apache.org/jira/browse/HDFS-6581 Project: Hadoop HDFS Issue Type: Bug Components: datanode Reporter: Arpit Agarwal Assignee: Arpit Agarwal Attachments: HDFS-6581.merge.01.patch, HDFS-6581.merge.02.patch, HDFS-6581.merge.03.patch, HDFS-6581.merge.04.patch, HDFSWriteableReplicasInMemory.pdf, Test-Plan-for-HDFS-6581-Memory-Storage.pdf Per discussion with the community on HDFS-5851, we will implement writing to a single replica in DN memory via DataTransferProtocol. This avoids some of the issues with short-circuit writes, which we can revisit at a later time. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139386#comment-14139386 ] Colin Patrick McCabe commented on HDFS-6808: ReconfigurableBase: you have a bunch of comments in here that would be better as JavaDoc. For example: {code} // The timestamp when the codereconfigThread/code starts. private long startTime = 0; // The timestamp when the codereconfigThread/code finishes. private long endTime = 0; {code} instead you could have: {code} /** * The timestamp when the codereconfigThread/code starts. */ private long startTime = 0; /** * The timestamp when the codereconfigThread/code finishes. */ private long endTime = 0; {code} {{ClientDatanodeProtocol}}: this looks good overall. I wonder if StartReconfigurationRequestProto, etc. would be better than StartReconfigureRequestProto? {code} message StartReconfigureRequestProto { } message StartReconfigureResponseProto { enum StartReconfigureResult { SUCCESS = 0; SERVER_STOPPED = 1; EXISTED = 2; } required StartReconfigureResult result = 1; } {code} I guess this is a matter of style, but I don't think you need this enum. Just throw an exception (handled specially by our RPC system) when the server is stopped or when there is already an ongoing reconfiguration. If you throw an IOException, it will make it through to the other side. {code} /** * Start a reconfiguration task to reload configuration in background. */ public StartReconfigureResult startReconfigureTask() { synchronized (this) { if (!shouldRun) { LOG.warn(The server is stopping.); return StartReconfigureResult.SERVER_STOPPED; } if (reconfigThread != null) { LOG.warn(Another reconfigure task is running.); return StartReconfigureResult.EXISTED; } reconfigThread = new ReconfigureThread(this); reconfigThread.start(); startTime = Time.monotonicNow(); } return StartReconfigureResult.SUCCESS; } {code} Similar to the protobuf code, this could simply throw IOException rather than using an enum. Also, since it's pretty much all synchronized (except the return statement?) it could just be a synchronized method. In {{ClientDatanodeProtocol.java}}: {code} /** * Asynchronously reload configuration on disk and apply changes. */ StartReconfigureResult startReconfigure() throws IOException; {code} Similar to the above, this could just return void. (If there is already a reconfiguration in progress, we can throw an IOE.) {code} message GetReconfigureStatusResultProto { required string name = 1; required string oldValue = 2; required string newValue = 3; optional string errorMessage = 4; // It is empty if success. } message GetReconfigureStatusResponseProto { required int64 startTime = 1; optional int64 endTime = 2; repeated GetReconfigureStatusResultProto status = 3; } {code} {{GetReconfigureStatusResultProto}} is kind of a confusing name. This isn't really a result, it's a configuration key that we're changing. How about calling it {{GetReconfigurationStatusConfigChangeProto}}? Also, it seems like {{newValue}} should be marked {{optional}} to fit in with the idea that a configuration key could be removed (that's why you use {{Optional}} in other places, right?) In {{ReconfigurableBase#ReconfigureTaskStatus}}, we have: {code} public final MapPropertyChange, OptionalString getStatus() { return status; } {code} This is a little concerning since this map is mutable... the caller could theoretically modify this, causing chaos. Can we wrap this in an {{ImmutableCollection}} or {{ImmutableMap}} to prevent this? 
Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch, HDFS-6808.007.combo.patch, HDFS-6808.007.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by
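As a concrete illustration of two of the structural suggestions in the review above (throwing an IOException instead of returning a result enum, and handing callers an immutable view of the status map), here is a minimal sketch. The class shape, the simplified String key type, and the Guava imports are assumptions made for illustration; the actual patch defines its own ReconfigurableBase and ReconfigurationThread classes.
{code}
import java.io.IOException;
import java.util.Map;

import com.google.common.base.Optional;
import com.google.common.collect.ImmutableMap;

public abstract class ReconfigurableSketch {

  private Thread reconfigThread;                 // background reconfiguration worker
  private boolean shouldRun = true;
  private Map<String, Optional<String>> status;  // property name -> error message (absent on success)

  /** Start a background reconfiguration; failures are reported via exceptions, not an enum. */
  public synchronized void startReconfigurationTask() throws IOException {
    if (!shouldRun) {
      throw new IOException("The server is stopping.");
    }
    if (reconfigThread != null) {
      throw new IOException("Another reconfiguration task is already running.");
    }
    reconfigThread = new Thread(this::reconfigure, "Reconfiguration Task");
    reconfigThread.start();
  }

  /** Hand out an immutable copy so callers cannot mutate the internal status map. */
  public synchronized Map<String, Optional<String>> getStatus() {
    return status == null ? ImmutableMap.<String, Optional<String>>of()
                          : ImmutableMap.copyOf(status);
  }

  /** Subclasses reload the configuration and record per-property results in {@code status}. */
  protected abstract void reconfigure();
}
{code}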
[jira] [Created] (HDFS-7089) Fix findbugs and release audit warnings in the branch
Arpit Agarwal created HDFS-7089: --- Summary: Fix findbugs and release audit warnings in the branch Key: HDFS-7089 URL: https://issues.apache.org/jira/browse/HDFS-7089 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: HDFS-6581 Reporter: Arpit Agarwal Assignee: Arpit Agarwal The latest Jenkins run flagged some Findbugs and Release Audit warnings. https://builds.apache.org/job/PreCommit-HDFS-Build/8085// -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Work started] (HDFS-7089) Fix findbugs and release audit warnings in the branch
[ https://issues.apache.org/jira/browse/HDFS-7089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HDFS-7089 started by Arpit Agarwal. --- Fix findbugs and release audit warnings in the branch - Key: HDFS-7089 URL: https://issues.apache.org/jira/browse/HDFS-7089 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: HDFS-6581 Reporter: Arpit Agarwal Assignee: Arpit Agarwal The latest Jenkins run flagged some Findbugs and Release Audit warnings. https://builds.apache.org/job/PreCommit-HDFS-Build/8085// -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7089) Fix findbugs and release audit warnings in the HDFS-6581 branch
[ https://issues.apache.org/jira/browse/HDFS-7089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arpit Agarwal updated HDFS-7089: Summary: Fix findbugs and release audit warnings in the HDFS-6581 branch (was: Fix findbugs and release audit warnings in the branch) Fix findbugs and release audit warnings in the HDFS-6581 branch --- Key: HDFS-7089 URL: https://issues.apache.org/jira/browse/HDFS-7089 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: HDFS-6581 Reporter: Arpit Agarwal Assignee: Arpit Agarwal The latest Jenkins run flagged some Findbugs and Release Audit warnings. https://builds.apache.org/job/PreCommit-HDFS-Build/8085// -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7053) Failed to rollback hdfs version from 2.4.1 to 2.2.0
[ https://issues.apache.org/jira/browse/HDFS-7053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139422#comment-14139422 ] Jing Zhao commented on HDFS-7053: - I guess you're hitting HDFS-5526? Failed to rollback hdfs version from 2.4.1 to 2.2.0 --- Key: HDFS-7053 URL: https://issues.apache.org/jira/browse/HDFS-7053 Project: Hadoop HDFS Issue Type: Bug Components: ha, namenode Affects Versions: 2.4.1 Reporter: sam liu Priority: Blocker I can successfully upgrade from 2.2.0 to 2.4.1 with QJM HA enabled and with downtime, but failed to rollback from 2.4.1 to 2.2.0. The error message: 2014-09-10 16:50:29,599 FATAL org.apache.hadoop.hdfs.server.namenode.NameNode: Exception in namenode join org.apache.hadoop.HadoopIllegalArgumentException: Invalid startup option. Cannot perform DFS upgrade with HA enabled. at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1207) at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1320) 2014-09-10 16:50:29,601 INFO org.apache.hadoop.util.ExitUtil: Exiting with status 1 -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6727) Refresh data volumes on DataNode based on configuration changes
[ https://issues.apache.org/jira/browse/HDFS-6727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu updated HDFS-6727: Attachment: HDFS-6727.008.patch [~cmccabe] I've updated the patch to address your comments. Would you mind taking another look? Refresh data volumes on DataNode based on configuration changes --- Key: HDFS-6727 URL: https://issues.apache.org/jira/browse/HDFS-6727 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0, 2.4.1 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Labels: datanode Attachments: HDFS-6727.000.delta-HDFS-6775.txt, HDFS-6727.001.patch, HDFS-6727.002.patch, HDFS-6727.003.patch, HDFS-6727.004.patch, HDFS-6727.005.patch, HDFS-6727.006.patch, HDFS-6727.006.patch, HDFS-6727.007.patch, HDFS-6727.008.patch, HDFS-6727.combo.patch, patchFindBugsOutputhadoop-hdfs.txt HDFS-1362 requires DataNode to reload configuration file during the runtime, so that DN can change the data volumes dynamically. This JIRA reuses the reconfiguration framework introduced by HADOOP-7001 to enable DN to reconfigure at runtime. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6947) Document considerations of HAR and Encryption
[ https://issues.apache.org/jira/browse/HDFS-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Charles Lamb updated HDFS-6947: --- Component/s: documentation Description: Minor changes to the HAR documentation need to be made discussing HAR and encryption. Priority: Minor (was: Major) Target Version/s: 2.6.0 (was: 3.0.0) Summary: Document considerations of HAR and Encryption (was: Enhance HAR integration with encryption zones) Document considerations of HAR and Encryption - Key: HDFS-6947 URL: https://issues.apache.org/jira/browse/HDFS-6947 Project: Hadoop HDFS Issue Type: Sub-task Components: documentation Affects Versions: 3.0.0 Reporter: Andrew Wang Assignee: Charles Lamb Priority: Minor Minor changes to the HAR documentation need to be made discussing HAR and encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6727) Refresh data volumes on DataNode based on configuration changes
[ https://issues.apache.org/jira/browse/HDFS-6727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139457#comment-14139457 ] Colin Patrick McCabe commented on HDFS-6727: +1 pending jenkins Refresh data volumes on DataNode based on configuration changes --- Key: HDFS-6727 URL: https://issues.apache.org/jira/browse/HDFS-6727 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0, 2.4.1 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Labels: datanode Attachments: HDFS-6727.000.delta-HDFS-6775.txt, HDFS-6727.001.patch, HDFS-6727.002.patch, HDFS-6727.003.patch, HDFS-6727.004.patch, HDFS-6727.005.patch, HDFS-6727.006.patch, HDFS-6727.006.patch, HDFS-6727.007.patch, HDFS-6727.008.patch, HDFS-6727.combo.patch, patchFindBugsOutputhadoop-hdfs.txt HDFS-1362 requires DataNode to reload configuration file during the runtime, so that DN can change the data volumes dynamically. This JIRA reuses the reconfiguration framework introduced by HADOOP-7001 to enable DN to reconfigure at runtime. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6841) Use Time.monotonicNow() wherever applicable instead of Time.now()
[ https://issues.apache.org/jira/browse/HDFS-6841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139483#comment-14139483 ] Hadoop QA commented on HDFS-6841: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669759/HDFS-6841-003.patch against trunk revision a3d9934. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 9 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.server.namenode.ha.TestPipelinesFailover org.apache.hadoop.hdfs.TestRollingUpgrade {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8083//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8083//console This message is automatically generated. Use Time.monotonicNow() wherever applicable instead of Time.now() - Key: HDFS-6841 URL: https://issues.apache.org/jira/browse/HDFS-6841 Project: Hadoop HDFS Issue Type: Bug Reporter: Vinayakumar B Assignee: Vinayakumar B Attachments: HDFS-6841-001.patch, HDFS-6841-002.patch, HDFS-6841-003.patch {{Time.now()}} used in many places to calculate elapsed time. This should be replaced with {{Time.monotonicNow()}} to avoid effect of System time changes on elapsed time calculations. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
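For readers unfamiliar with the motivation behind HDFS-6841: wall-clock time (what {{Time.now()}} returns) can jump backwards or forwards when the system clock is adjusted, so elapsed-time arithmetic should use a monotonic source such as {{Time.monotonicNow()}}. A small self-contained sketch of the same idea, using {{System.nanoTime()}} as the monotonic source since Hadoop's {{Time}} utility is not assumed here:
{code}
public class ElapsedTimeExample {
  public static void main(String[] args) throws InterruptedException {
    // Fragile: System.currentTimeMillis() (which Time.now() wraps) can move
    // backwards or forwards if the system clock is adjusted mid-measurement.
    long wallStart = System.currentTimeMillis();

    // Robust: a monotonic clock only moves forward, so differences are always valid.
    long monoStart = System.nanoTime();

    Thread.sleep(100);

    long wallElapsedMs = System.currentTimeMillis() - wallStart;      // can be skewed
    long monoElapsedMs = (System.nanoTime() - monoStart) / 1_000_000; // always >= 0

    System.out.println("wall=" + wallElapsedMs + "ms mono=" + monoElapsedMs + "ms");
  }
}
{code}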
[jira] [Updated] (HDFS-6947) Document considerations of HAR and Encryption
[ https://issues.apache.org/jira/browse/HDFS-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Charles Lamb updated HDFS-6947: --- Status: Patch Available (was: Open) Document considerations of HAR and Encryption - Key: HDFS-6947 URL: https://issues.apache.org/jira/browse/HDFS-6947 Project: Hadoop HDFS Issue Type: Sub-task Components: documentation Affects Versions: 3.0.0 Reporter: Andrew Wang Assignee: Charles Lamb Priority: Minor Attachments: HDFS-6947.001.patch Minor changes to the HAR documentation need to be made discussing HAR and encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Work started] (HDFS-6947) Document considerations of HAR and Encryption
[ https://issues.apache.org/jira/browse/HDFS-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HDFS-6947 started by Charles Lamb. -- Document considerations of HAR and Encryption - Key: HDFS-6947 URL: https://issues.apache.org/jira/browse/HDFS-6947 Project: Hadoop HDFS Issue Type: Sub-task Components: documentation Affects Versions: 3.0.0 Reporter: Andrew Wang Assignee: Charles Lamb Priority: Minor Attachments: HDFS-6947.001.patch Minor changes to the HAR documentation need to be made discussing HAR and encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Work stopped] (HDFS-6947) Document considerations of HAR and Encryption
[ https://issues.apache.org/jira/browse/HDFS-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HDFS-6947 stopped by Charles Lamb. -- Document considerations of HAR and Encryption - Key: HDFS-6947 URL: https://issues.apache.org/jira/browse/HDFS-6947 Project: Hadoop HDFS Issue Type: Sub-task Components: documentation Affects Versions: 3.0.0 Reporter: Andrew Wang Assignee: Charles Lamb Priority: Minor Attachments: HDFS-6947.001.patch Minor changes to the HAR documentation need to be made discussing HAR and encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6947) Document considerations of HAR and Encryption
[ https://issues.apache.org/jira/browse/HDFS-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Charles Lamb updated HDFS-6947: --- Attachment: HDFS-6947.001.patch The .001 patch has the doc change. Document considerations of HAR and Encryption - Key: HDFS-6947 URL: https://issues.apache.org/jira/browse/HDFS-6947 Project: Hadoop HDFS Issue Type: Sub-task Components: documentation Affects Versions: 3.0.0 Reporter: Andrew Wang Assignee: Charles Lamb Priority: Minor Attachments: HDFS-6947.001.patch Minor changes to the HAR documentation need to be made discussing HAR and encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6970) Move startFile EDEK retries to the DFSClient
[ https://issues.apache.org/jira/browse/HDFS-6970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Wang updated HDFS-6970: -- Attachment: hdfs-6970.002.patch Thanks for reviewing! New patch attached, breaks the {{10}} out into a constant like Colin recommended. Charlie, buffersize wasn't touched in this patch, so I'm not sure if we should change it here. The fact that it's being ignored might be a separate bug. I also don't see any unnecessary whitespace introduced in FSN. The constructor in RetryStartFileException is also required for the unwrapping to work properly with RemoteException. Finally, I'm not sure if this will fix the sporadic test timeouts in testStartFileRetry, but there's the possibility that it might. Move startFile EDEK retries to the DFSClient Key: HDFS-6970 URL: https://issues.apache.org/jira/browse/HDFS-6970 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.5.0 Reporter: Andrew Wang Assignee: Andrew Wang Attachments: hdfs-6970.001.patch, hdfs-6970.002.patch [~sureshms] pointed out that holding on to an RPC handler while talking to the KMS is bad, since it can exhaust the available handlers. Let's avoid this by doing retries at the DFSClient rather than in the RPC handler, and moving EDEK fetching to the background. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
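For context on the mechanism being reviewed: the retries now live in the client as a bounded loop around the create call, with the NameNode signalling {{RetryStartFileException}} while it refetches EDEKs in the background. The sketch below is a simplified, self-contained stand-in; the stub interface, handle type, and constant name are invented for illustration and are not the patch's actual code.
{code}
import java.io.IOException;

public class CreateRetrySketch {

  // Placeholder types standing in for the real HDFS classes.
  static class RetryStartFileException extends IOException {}
  interface NamenodeStub { StreamHandle create(String src) throws IOException; }
  static class StreamHandle {}

  /** The magic number 10 pulled out into a named constant, as suggested in review. */
  private static final int CREATE_RETRY_COUNT = 10;

  static StreamHandle createWithRetry(NamenodeStub namenode, String src) throws IOException {
    for (int attempt = 0; attempt < CREATE_RETRY_COUNT; attempt++) {
      try {
        return namenode.create(src);
      } catch (RetryStartFileException e) {
        // The NameNode could not hand out an encrypted data encryption key (EDEK) yet;
        // it refills its cache in the background and asks the client to try again,
        // so no RPC handler is held while the KMS is consulted.
      }
    }
    throw new IOException("Too many retries while creating " + src);
  }
}
{code}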
[jira] [Commented] (HDFS-6947) Document considerations of HAR and Encryption
[ https://issues.apache.org/jira/browse/HDFS-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139535#comment-14139535 ] Andrew Wang commented on HDFS-6947: --- +1 pending Document considerations of HAR and Encryption - Key: HDFS-6947 URL: https://issues.apache.org/jira/browse/HDFS-6947 Project: Hadoop HDFS Issue Type: Sub-task Components: documentation Affects Versions: 3.0.0 Reporter: Andrew Wang Assignee: Charles Lamb Priority: Minor Attachments: HDFS-6947.001.patch Minor changes to the HAR documentation need to be made discussing HAR and encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139540#comment-14139540 ] Hadoop QA commented on HDFS-6808: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669769/HDFS-6808.007.combo.patch against trunk revision 485c96e. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 5 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.web.TestWebHdfsFileSystemContract org.apache.hadoop.hdfs.TestEncryptionZonesWithKMS org.apache.hadoop.hdfs.server.namenode.ha.TestPipelinesFailover The test build failed in hadoop-common-project/hadoop-common {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8086//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8086//console This message is automatically generated. Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch, HDFS-6808.007.combo.patch, HDFS-6808.007.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6987) Move CipherSuite xattr information up to the encryption zone root
[ https://issues.apache.org/jira/browse/HDFS-6987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139556#comment-14139556 ] Charles Lamb commented on HDFS-6987: Hi Zhe, I took a quick look and have some trivial comments: Several lines bust the 80 char limit. EncryptionZoneManager.java: createEncryptionZone adds an extra newline. FSDirectory.java: Should we be using NameNode.LOG instead of FSNamesystem.LOG? NameNode.LOG seems to be the norm in this file. In setFileEncryptionInfo there's a blank line you introduced which probably doesn't enhance the readability. To Andrew's point about Ind, I agree that it's ambiguous. But looking at the .proto file, it looks like you mean Individual rather than INode. If that's the case, then Indiv or Individ might be better? Personally, I like to use final a lot, but that's my own hobby horse. Move CipherSuite xattr information up to the encryption zone root - Key: HDFS-6987 URL: https://issues.apache.org/jira/browse/HDFS-6987 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Reporter: Andrew Wang Assignee: Zhe Zhang Attachments: HDFS-6987-20140917-v1.patch All files within a single EZ need to be encrypted with the same CipherSuite. Because of this, I think we can store the CipherSuite once in the EZ rather than on each file. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6970) Move startFile EDEK retries to the DFSClient
[ https://issues.apache.org/jira/browse/HDFS-6970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139563#comment-14139563 ] Charles Lamb commented on HDFS-6970: Oh, you're right about buffersize. I'm so used to keying in on the magenta for unused that I failed to notice that it wasn't a change. So, yes, I agree it shouldn't be addressed. The whitespace I was thinking of is at line 2482, but in retrospect it improves readability, so NM. Thanks for the clarification on RetryStartFileException. That makes sense. In terms of testStartFileRetry, I'm optimistic. I used to be able to reproduce the hang easily. Now I can't. Anyway, I like this client-side approach a lot better so thanks for working on this. +1, non-binding. Move startFile EDEK retries to the DFSClient Key: HDFS-6970 URL: https://issues.apache.org/jira/browse/HDFS-6970 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption Affects Versions: 2.5.0 Reporter: Andrew Wang Assignee: Andrew Wang Attachments: hdfs-6970.001.patch, hdfs-6970.002.patch [~sureshms] pointed out that holding on to an RPC handler while talking to the KMS is bad, since it can exhaust the available handlers. Let's avoid this by doing retries at the DFSClient rather than in the RPC handler, and moving EDEK fetching to the background. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6947) Document considerations of HAR and Encryption
[ https://issues.apache.org/jira/browse/HDFS-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139566#comment-14139566 ] Hadoop QA commented on HDFS-6947: - {color:green}+1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12669805/HDFS-6947.001.patch against trunk revision 1cf3198. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+0 tests included{color}. The patch appears to be a documentation patch that doesn't require tests. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 core tests{color}. The patch passed unit tests in hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core. {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/8088//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/8088//console This message is automatically generated. Document considerations of HAR and Encryption - Key: HDFS-6947 URL: https://issues.apache.org/jira/browse/HDFS-6947 Project: Hadoop HDFS Issue Type: Sub-task Components: documentation Affects Versions: 3.0.0 Reporter: Andrew Wang Assignee: Charles Lamb Priority: Minor Attachments: HDFS-6947.001.patch Minor changes to the HAR documentation need to be made discussing HAR and encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6840) Clients are always sent to the same datanode when read is off rack
[ https://issues.apache.org/jira/browse/HDFS-6840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Wang updated HDFS-6840: -- Attachment: hdfs-6840.003.patch Sorry for the delay in revving this. New patch removes the stale comment as per Jason's feedback. Daryn, do you mind if we fix any seed issues in a separate JIRA? I think we depend on this behavior in other places too, so if/when said JDK change does hit, we could address them all at once. Clients are always sent to the same datanode when read is off rack -- Key: HDFS-6840 URL: https://issues.apache.org/jira/browse/HDFS-6840 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.5.0 Reporter: Jason Lowe Assignee: Andrew Wang Priority: Critical Attachments: hdfs-6840.001.patch, hdfs-6840.002.patch, hdfs-6840.003.patch After HDFS-6268 the sorting order of block locations is deterministic for a given block and locality level (e.g.: local, rack. off-rack), so off-rack clients all see the same datanode for the same block. This leads to very poor behavior in distributed cache localization and other scenarios where many clients all want the same block data at approximately the same time. The one datanode is crushed by the load while the other replicas only handle local and rack-local requests. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
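Background on the fix under discussion: once block locations are sorted by network distance, replicas at the same distance used to come back in a fixed order, so every off-rack reader was pointed at the same DataNode. Shuffling only within each equal-distance group spreads that load while preserving locality preference. A self-contained sketch of that idea (the types and method names are illustrative, not the actual NetworkTopology code):
{code}
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Random;

public class ShuffleEqualDistanceReplicas {

  /**
   * Shuffle contiguous runs of locations that share the same distance from the reader.
   * Assumes the list is already sorted by distance, so ties form contiguous groups.
   */
  static void shuffleWithinDistanceGroups(List<String> locations, int[] distances, Random rnd) {
    int groupStart = 0;
    for (int i = 1; i <= locations.size(); i++) {
      if (i == locations.size() || distances[i] != distances[groupStart]) {
        Collections.shuffle(locations.subList(groupStart, i), rnd); // randomize ties only
        groupStart = i;
      }
    }
  }

  public static void main(String[] args) {
    // dn0 is rack-local (distance 2); dn1 and dn2 are both off-rack (distance 4).
    List<String> locs = new ArrayList<>(Arrays.asList("dn0", "dn1", "dn2"));
    int[] dist = {2, 4, 4};
    shuffleWithinDistanceGroups(locs, dist, new Random());
    System.out.println(locs); // dn0 stays first; dn1/dn2 come back in random order
  }
}
{code}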
[jira] [Resolved] (HDFS-6815) Verify that alternate access methods work properly with Data at Rest Encryption
[ https://issues.apache.org/jira/browse/HDFS-6815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Wang resolved HDFS-6815. --- Resolution: Done We've gone through the various other HDFS access methods at this point, and have JIRAs filed for anything specific that still needs to be fixed. Resolving this. Verify that alternate access methods work properly with Data at Rest Encryption --- Key: HDFS-6815 URL: https://issues.apache.org/jira/browse/HDFS-6815 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode, security Affects Versions: 3.0.0 Reporter: Charles Lamb Assignee: Charles Lamb Verify that alternative access methods (libhdfs, Httpfs, nfsv3) work properly with Data at Rest Encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6947) Document considerations of HAR and Encryption
[ https://issues.apache.org/jira/browse/HDFS-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Wang updated HDFS-6947: -- Issue Type: Improvement (was: Sub-task) Parent: (was: HDFS-6891) Document considerations of HAR and Encryption - Key: HDFS-6947 URL: https://issues.apache.org/jira/browse/HDFS-6947 Project: Hadoop HDFS Issue Type: Improvement Components: documentation Affects Versions: 2.5.0 Reporter: Andrew Wang Assignee: Charles Lamb Priority: Minor Attachments: HDFS-6947.001.patch Minor changes to the HAR documentation need to be made discussing HAR and encryption. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7003) Add NFS Gateway support for reading and writing to encryption zones
[ https://issues.apache.org/jira/browse/HDFS-7003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139591#comment-14139591 ] Charles Lamb commented on HDFS-7003: I ran the three tests that failed in jenkins and they all passed locally for me. Thanks for the review Andrew. Add NFS Gateway support for reading and writing to encryption zones --- Key: HDFS-7003 URL: https://issues.apache.org/jira/browse/HDFS-7003 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption, nfs Affects Versions: 2.6.0 Reporter: Stephen Chu Assignee: Charles Lamb Fix For: 2.6.0 Attachments: HDFS-7003.001.patch, HDFS-7003.002.patch, HDFS-7003.003.patch Currently, reading and writing within encryption zones does not work through the NFS gateway. For example, we have an encryption zone {{/enc}}. Here's the difference of reading the file from hadoop fs and the NFS gateway:
{code}
[hdfs@schu-enc2 ~]$ hadoop fs -cat /enc/hi
hi
[hdfs@schu-enc2 ~]$ cat /hdfs_nfs/enc/hi
??
{code}
If we write a file using the NFS gateway, we'll see behavior like this:
{code}
[hdfs@schu-enc2 ~]$ echo hello > /hdfs_nfs/enc/hello
[hdfs@schu-enc2 ~]$ cat /hdfs_nfs/enc/hello
hello
[hdfs@schu-enc2 ~]$ hdfs dfs -cat /enc/hello
???tp[hdfs@schu-enc2 ~]$
{code}
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7003) Add NFS Gateway support for reading and writing to encryption zones
[ https://issues.apache.org/jira/browse/HDFS-7003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Wang updated HDFS-7003: -- Resolution: Fixed Fix Version/s: 2.6.0 Status: Resolved (was: Patch Available) I ran these locally and they passed. Committed to trunk and branch-2, thanks Charles. Add NFS Gateway support for reading and writing to encryption zones --- Key: HDFS-7003 URL: https://issues.apache.org/jira/browse/HDFS-7003 Project: Hadoop HDFS Issue Type: Sub-task Components: encryption, nfs Affects Versions: 2.6.0 Reporter: Stephen Chu Assignee: Charles Lamb Fix For: 2.6.0 Attachments: HDFS-7003.001.patch, HDFS-7003.002.patch, HDFS-7003.003.patch Currently, reading and writing within encryption zones does not work through the NFS gateway. For example, we have an encryption zone {{/enc}}. Here's the difference of reading the file from hadoop fs and the NFS gateway:
{code}
[hdfs@schu-enc2 ~]$ hadoop fs -cat /enc/hi
hi
[hdfs@schu-enc2 ~]$ cat /hdfs_nfs/enc/hi
??
{code}
If we write a file using the NFS gateway, we'll see behavior like this:
{code}
[hdfs@schu-enc2 ~]$ echo hello > /hdfs_nfs/enc/hello
[hdfs@schu-enc2 ~]$ cat /hdfs_nfs/enc/hello
hello
[hdfs@schu-enc2 ~]$ hdfs dfs -cat /enc/hello
???tp[hdfs@schu-enc2 ~]$
{code}
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu updated HDFS-6808: Attachment: HDFS-6808.008.combo.patch Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch, HDFS-6808.007.combo.patch, HDFS-6808.007.patch, HDFS-6808.008.combo.patch, HDFS-6808.008.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu updated HDFS-6808: Attachment: HDFS-6808.008.patch Update patch to address [~cmccabe]'s comments. Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch, HDFS-6808.007.combo.patch, HDFS-6808.007.patch, HDFS-6808.008.combo.patch, HDFS-6808.008.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7049) TestByteRangeInputStream.testPropagatedClose fails and throw NPE on branch-2
[ https://issues.apache.org/jira/browse/HDFS-7049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139613#comment-14139613 ] Eric Payne commented on HDFS-7049: -- Hi [~j...@cloudera.com]. Thanks for merging this fix and creating the patch. The patch doesn't apply because it has the 'a/' and 'b/' at the beginning of the filepaths. I downloaded the patch, removed those strings from the filepaths, and I was able to apply the patch cleanly to branch-2. The test also passes cleanly with no NPE. Once you make that change to the patch, it looks good to me. TestByteRangeInputStream.testPropagatedClose fails and throw NPE on branch-2 Key: HDFS-7049 URL: https://issues.apache.org/jira/browse/HDFS-7049 Project: Hadoop HDFS Issue Type: Bug Reporter: Juan Yu Assignee: Juan Yu Priority: Minor Attachments: HDFS-7049-branch-2.patch On branch-2, TestByteRangeInputStream.testPropagatedClose throw NPE when HftpFileSystem$RangeHeaderUrlOpener.connect This is due to fix of HDFS-6143 WebHdfsFileSystem open should throw FileNotFoundException for non-existing paths public ByteRangeInputStream(URLOpener o, URLOpener r) throws IOException { this.originalURL = o; this.resolvedURL = r; getInputStream(); } the getInputStream() will be called in constructor now to verify if file exists. Since we just try to test if ByteRangeInputStream#close is called at proper time, we could mock(ByteRangeInputStream.class, CALLS_REAL_METHODS) for testing to avoid the NPE issue. I believe the trunk version already does this, we just need to merge the test from trunk. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7049) TestByteRangeInputStream.testPropagatedClose fails and throw NPE on branch-2
[ https://issues.apache.org/jira/browse/HDFS-7049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139615#comment-14139615 ] Eric Payne commented on HDFS-7049: -- Sorry, I forgot to mention that in order to avoid the 'a/' and 'b/' prefix problem, you can use {{git diff --no-prefix}} when creating the patch. TestByteRangeInputStream.testPropagatedClose fails and throw NPE on branch-2 Key: HDFS-7049 URL: https://issues.apache.org/jira/browse/HDFS-7049 Project: Hadoop HDFS Issue Type: Bug Reporter: Juan Yu Assignee: Juan Yu Priority: Minor Attachments: HDFS-7049-branch-2.patch On branch-2, TestByteRangeInputStream.testPropagatedClose throw NPE when HftpFileSystem$RangeHeaderUrlOpener.connect This is due to fix of HDFS-6143 WebHdfsFileSystem open should throw FileNotFoundException for non-existing paths public ByteRangeInputStream(URLOpener o, URLOpener r) throws IOException { this.originalURL = o; this.resolvedURL = r; getInputStream(); } the getInputStream() will be called in constructor now to verify if file exists. Since we just try to test if ByteRangeInputStream#close is called at proper time, we could mock(ByteRangeInputStream.class, CALLS_REAL_METHODS) for testing to avoid the NPE issue. I believe the trunk version already does this, we just need to merge the test from trunk. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139628#comment-14139628 ] Colin Patrick McCabe commented on HDFS-6808: Looks good overall.
{code}
// A map of changed property, error message. If error message is present,
// it contains the messages about the error occurred when applies the particular
// change. Otherwise, it indicates that the change has been successfully applied.
private Map<PropertyChange, Optional<String>> status = null;
{code}
This still needs to be JavaDoc'ed. Similar with reconfigThread. ReconfigurationThread needs to call {{setDaemon}} and also set its name for jstack purposes.
{code}
/**
 * Asynchronously reload configuration on disk and apply changes.
 */
void startReconfigure() throws IOException;
{code}
Rename to {{startReconfiguration}}?
{code}
/**
 * Get the status of the previously issued reconfig task.
 * @see {@link org.apache.hadoop.conf.ReconfigurableBase.ReconfigurationTaskStatus}.
 */
ReconfigurableBase.ReconfigurationTaskStatus getReconfigureStatus() throws IOException;
{code}
Can you make {{ReconfigurationTaskStatus}} a top-level class? Normally return values from RPCs are either top-level classes, or static inner classes defined in the interface file itself. {{DFSAdmin.java}}: does this print anything when starting a reconfiguration? It would be nice to print something like "Started reconfiguration on NameNode 127.0.0.1".
{code}
message GetReconfigurationStatusConfigChangeProto {
  required string name = 1;
{code}
How about calling this key to be more consistent with our other config stuff? Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch, HDFS-6808.007.combo.patch, HDFS-6808.007.patch, HDFS-6808.008.combo.patch, HDFS-6808.008.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
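On the {{setDaemon}} and thread-name point above: a daemon thread will not keep the JVM alive during shutdown, and a descriptive name makes the worker easy to spot in a jstack dump. A minimal illustration of what the suggestion amounts to (the lambda body and class name are placeholders; the real patch uses its own ReconfigurationThread class):
{code}
public class DaemonThreadSketch {
  public static void main(String[] args) {
    Thread reconfigThread = new Thread(() -> {
      // reload the configuration and apply the property changes here
    }, "Reconfiguration Task");     // the name shows up in jstack / thread dumps
    reconfigThread.setDaemon(true); // don't keep the JVM alive for background work
    reconfigThread.start();
  }
}
{code}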
[jira] [Commented] (HDFS-6840) Clients are always sent to the same datanode when read is off rack
[ https://issues.apache.org/jira/browse/HDFS-6840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139643#comment-14139643 ] Aaron T. Myers commented on HDFS-6840: -- Latest patch looks good to me, +1. I agree that we can reasonably move the improvements to the tests to make them deterministic to another JIRA. Andrew, could you please go ahead and file that? Clients are always sent to the same datanode when read is off rack -- Key: HDFS-6840 URL: https://issues.apache.org/jira/browse/HDFS-6840 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.5.0 Reporter: Jason Lowe Assignee: Andrew Wang Priority: Critical Attachments: hdfs-6840.001.patch, hdfs-6840.002.patch, hdfs-6840.003.patch After HDFS-6268 the sorting order of block locations is deterministic for a given block and locality level (e.g.: local, rack. off-rack), so off-rack clients all see the same datanode for the same block. This leads to very poor behavior in distributed cache localization and other scenarios where many clients all want the same block data at approximately the same time. The one datanode is crushed by the load while the other replicas only handle local and rack-local requests. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7089) Fix findbugs warnings in the HDFS-6581 branch
[ https://issues.apache.org/jira/browse/HDFS-7089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arpit Agarwal updated HDFS-7089: Summary: Fix findbugs warnings in the HDFS-6581 branch (was: Fix findbugs in the HDFS-6581 branch) Fix findbugs warnings in the HDFS-6581 branch - Key: HDFS-7089 URL: https://issues.apache.org/jira/browse/HDFS-7089 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: HDFS-6581 Reporter: Arpit Agarwal Assignee: Arpit Agarwal The latest Jenkins run flagged some Findbugs and Release Audit warnings. https://builds.apache.org/job/PreCommit-HDFS-Build/8085// -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7089) Fix findbugs in the HDFS-6581 branch
[ https://issues.apache.org/jira/browse/HDFS-7089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arpit Agarwal updated HDFS-7089: Summary: Fix findbugs in the HDFS-6581 branch (was: Fix findbugs and release audit warnings in the HDFS-6581 branch) Fix findbugs in the HDFS-6581 branch Key: HDFS-7089 URL: https://issues.apache.org/jira/browse/HDFS-7089 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: HDFS-6581 Reporter: Arpit Agarwal Assignee: Arpit Agarwal The latest Jenkins run flagged some Findbugs and Release Audit warnings. https://builds.apache.org/job/PreCommit-HDFS-Build/8085// -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7089) Fix findbugs warnings in the HDFS-6581 branch
[ https://issues.apache.org/jira/browse/HDFS-7089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arpit Agarwal updated HDFS-7089: Attachment: HDFS-7089.01.patch Fix findbugs warnings in the HDFS-6581 branch - Key: HDFS-7089 URL: https://issues.apache.org/jira/browse/HDFS-7089 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: HDFS-6581 Reporter: Arpit Agarwal Assignee: Arpit Agarwal Attachments: HDFS-7089.01.patch The latest Jenkins run flagged some Findbugs and Release Audit warnings. https://builds.apache.org/job/PreCommit-HDFS-Build/8085// -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7089) Fix findbugs warnings in the HDFS-6581 branch
[ https://issues.apache.org/jira/browse/HDFS-7089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139662#comment-14139662 ] Arpit Agarwal commented on HDFS-7089: - Patch to fix the findbugs warnings. The Release Audit warning can be ignored. There was a leftover CHANGES file in the merge patch. Fix findbugs warnings in the HDFS-6581 branch - Key: HDFS-7089 URL: https://issues.apache.org/jira/browse/HDFS-7089 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: HDFS-6581 Reporter: Arpit Agarwal Assignee: Arpit Agarwal Attachments: HDFS-7089.01.patch The latest Jenkins run flagged some Findbugs and Release Audit warnings. https://builds.apache.org/job/PreCommit-HDFS-Build/8085// -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6840) Clients are always sent to the same datanode when read is off rack
[ https://issues.apache.org/jira/browse/HDFS-6840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139661#comment-14139661 ] Andrew Wang commented on HDFS-6840: --- Filed HADOOP-11107 as a follow-on for the random issue, thanks for reviewing ATM. Clients are always sent to the same datanode when read is off rack -- Key: HDFS-6840 URL: https://issues.apache.org/jira/browse/HDFS-6840 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.5.0 Reporter: Jason Lowe Assignee: Andrew Wang Priority: Critical Attachments: hdfs-6840.001.patch, hdfs-6840.002.patch, hdfs-6840.003.patch After HDFS-6268 the sorting order of block locations is deterministic for a given block and locality level (e.g.: local, rack. off-rack), so off-rack clients all see the same datanode for the same block. This leads to very poor behavior in distributed cache localization and other scenarios where many clients all want the same block data at approximately the same time. The one datanode is crushed by the load while the other replicas only handle local and rack-local requests. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu updated HDFS-6808: Attachment: HDFS-6808.009.patch Hey [~cmccabe], thanks for your quick response. I've changed the patch based on most of your comments. bq. How about calling this key to be more consistent with our other config stuff? I think {{name}} might be better since it is used in the XML configuration files and is aligned with the existing reconfiguration framework (i.e., in {{PropertyChange}}). Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch, HDFS-6808.007.combo.patch, HDFS-6808.007.patch, HDFS-6808.008.combo.patch, HDFS-6808.008.patch, HDFS-6808.009.combo.patch, HDFS-6808.009.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6808) Add command line option to ask DataNode reload configuration.
[ https://issues.apache.org/jira/browse/HDFS-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu updated HDFS-6808: Attachment: HDFS-6808.009.combo.patch Add command line option to ask DataNode reload configuration. - Key: HDFS-6808 URL: https://issues.apache.org/jira/browse/HDFS-6808 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: 2.5.0 Reporter: Lei (Eddy) Xu Assignee: Lei (Eddy) Xu Attachments: HDFS-6808.000.combo.patch, HDFS-6808.000.patch, HDFS-6808.001.combo.patch, HDFS-6808.001.patch, HDFS-6808.002.combo.patch, HDFS-6808.002.patch, HDFS-6808.003.combo.txt, HDFS-6808.003.patch, HDFS-6808.004.combo.patch, HDFS-6808.004.patch, HDFS-6808.005.combo.patch, HDFS-6808.005.patch, HDFS-6808.006.combo.patch, HDFS-6808.006.patch, HDFS-6808.007.combo.patch, HDFS-6808.007.patch, HDFS-6808.008.combo.patch, HDFS-6808.008.patch, HDFS-6808.009.combo.patch, HDFS-6808.009.patch The workflow of dynamically changing data volumes on DataNode is # Users manually changed {{dfs.datanode.data.dir}} in the configuration file # User use command line to notify DN to reload configuration and updates its volumes. This work adds command line support to notify DN to reload configuration. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-6581) Write to single replica in memory
[ https://issues.apache.org/jira/browse/HDFS-6581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arpit Agarwal updated HDFS-6581: Attachment: HDFS-6581.merge.05.patch Write to single replica in memory - Key: HDFS-6581 URL: https://issues.apache.org/jira/browse/HDFS-6581 Project: Hadoop HDFS Issue Type: Bug Components: datanode Reporter: Arpit Agarwal Assignee: Arpit Agarwal Attachments: HDFS-6581.merge.01.patch, HDFS-6581.merge.02.patch, HDFS-6581.merge.03.patch, HDFS-6581.merge.04.patch, HDFS-6581.merge.05.patch, HDFSWriteableReplicasInMemory.pdf, Test-Plan-for-HDFS-6581-Memory-Storage.pdf Per discussion with the community on HDFS-5851, we will implement writing to a single replica in DN memory via DataTransferProtocol. This avoids some of the issues with short-circuit writes, which we can revisit at a later time. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HDFS-7090) Use unbuffered writes when persisting in-memory replicas
Arpit Agarwal created HDFS-7090: --- Summary: Use unbuffered writes when persisting in-memory replicas Key: HDFS-7090 URL: https://issues.apache.org/jira/browse/HDFS-7090 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: HDFS-6581 Reporter: Arpit Agarwal The LazyWriter thread just uses {{FileUtils.copyFile}} to copy block files to persistent storage. It would be better to use unbuffered writes to avoid churning page cache. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
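For context on why {{FileUtils.copyFile}} is a poor fit here: a plain buffered copy of a large block file pulls the whole file through the OS page cache, evicting hotter data. Truly unbuffered writes need native support (for example O_DIRECT or posix_fadvise via Hadoop's native I/O layer), which plain Java cannot express; the sketch below only shows the milder mitigation of copying in bounded chunks and syncing as it goes, and is an assumption-laden stand-in rather than the eventual fix.
{code}
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardOpenOption;

public class ChunkedSyncCopy {

  /** Copy src to dst in 1 MB chunks, forcing dirty pages to disk as we go. */
  static void copy(Path src, Path dst) throws IOException {
    ByteBuffer buf = ByteBuffer.allocateDirect(1 << 20);
    try (FileChannel in = FileChannel.open(src, StandardOpenOption.READ);
         FileChannel out = FileChannel.open(dst, StandardOpenOption.CREATE,
             StandardOpenOption.WRITE, StandardOpenOption.TRUNCATE_EXISTING)) {
      while (in.read(buf) > 0) {
        buf.flip();
        while (buf.hasRemaining()) {
          out.write(buf);
        }
        buf.clear();
        out.force(false); // limit how many dirty pages accumulate in the page cache
      }
    }
  }

  public static void main(String[] args) throws IOException {
    if (args.length != 2) {
      System.err.println("usage: ChunkedSyncCopy <src> <dst>");
      return;
    }
    copy(Paths.get(args[0]), Paths.get(args[1]));
  }
}
{code}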