[jira] [Commented] (HDFS-7999) FsDatasetImpl#createTemporary sometimes holds the FSDatasetImpl lock for a very long time
[ https://issues.apache.org/jira/browse/HDFS-7999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388080#comment-14388080 ] zhouyingchao commented on HDFS-7999: Thank you for looking into the patch. Here is an explanation of the logic of createTemporary() after the patch is applied:
1. If there is no ReplicaInfo in volumeMap for the passed-in ExtendedBlock b, we create one, insert it into volumeMap, and return from line 1443.
2. If there is a ReplicaInfo in volumeMap and its GS is newer than that of the passed-in ExtendedBlock b, we throw ReplicaAlreadyExistsException from line 1447.
3. If there is a ReplicaInfo in volumeMap but its GS is older than that of the passed-in ExtendedBlock b, this is a new write and the earlier writer should be stopped. We release the FsDatasetImpl lock and try to stop the earlier writer without holding the lock.
4. After the earlier writer is stopped, we need to evict its ReplicaInfo from volumeMap, so we re-acquire the FsDatasetImpl lock. However, since this thread released the lock while stopping the earlier writer, another thread might have come in and changed the ReplicaInfo of this block in volumeMap. This situation is unlikely, but we still have to handle it. The loop in the patch handles exactly this case: after re-acquiring the FsDatasetImpl lock, it checks whether the current ReplicaInfo in volumeMap is still the one we saw before stopping the writer. If so, we can simply evict it, create and insert a new one, and return from line 1443. Otherwise, another thread has slipped in and changed the ReplicaInfo while we were stopping the earlier writer. In that case, we check whether that thread inserted a block with an even newer GS than ours; if so, we throw ReplicaAlreadyExistsException from line 1447. Otherwise we need to stop that thread's writer, just as we stopped the earlier writer in step 3.
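The steps above can be sketched as a small model. This is a simplified, hypothetical illustration of the lock-release/re-check loop, not the actual HDFS-7999 patch: Replica and the plain HashMap stand in for ReplicaInfo and volumeMap, and IllegalStateException stands in for ReplicaAlreadyExistsException.

```java
import java.util.HashMap;
import java.util.Map;

// Simplified model of the patched createTemporary() loop described above.
// Not the real FsDatasetImpl code; it only illustrates the structure of
// releasing the lock to stop a writer and re-checking the map afterwards.
class CreateTemporaryModel {
  static class Replica {
    final long genStamp;
    Replica(long genStamp) { this.genStamp = genStamp; }
    void stopWriter() { /* the real code joins the writer thread here */ }
  }

  private final Map<Long, Replica> volumeMap = new HashMap<>();

  Replica createTemporary(long blockId, long genStamp) {
    Replica lastFound = null;
    while (true) {
      synchronized (this) {                       // the FsDatasetImpl lock
        Replica current = volumeMap.get(blockId);
        if (current == null || current == lastFound) {
          // Step 1, or step 4 when nothing changed while we were unlocked:
          // evict the stopped writer's replica (if any) and install ours.
          Replica fresh = new Replica(genStamp);
          volumeMap.put(blockId, fresh);
          return fresh;
        }
        if (current.genStamp >= genStamp) {
          // Step 2: an equal or newer generation stamp already exists.
          throw new IllegalStateException("replica already exists");
        }
        // Step 3: an older writer exists; remember it, then stop it
        // after releasing the lock.
        lastFound = current;
      }
      // Lock is released here: stopping the writer may block for a long
      // time, so it must not happen under the FsDatasetImpl lock.
      lastFound.stopWriter();
    }
  }
}
```

If another thread changes the map while the lock is dropped, `current == lastFound` fails on the next iteration and the loop falls through to steps 2/3 again, which is the retry behavior described above.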
FsDatasetImpl#createTemporary sometimes holds the FSDatasetImpl lock for a very long time - Key: HDFS-7999 URL: https://issues.apache.org/jira/browse/HDFS-7999 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.6.0 Reporter: zhouyingchao Assignee: zhouyingchao Attachments: HDFS-7999-001.patch I'm using 2.6.0 and noticed that sometimes the DN's heartbeats were delayed for a very long time, say more than 100 seconds. I took the jstack twice, and it looks like they are all blocked (at getStorageReport) on the dataset lock, which is held by a thread that is calling createTemporary, which in turn is blocked waiting for the earlier incarnation of the writer to exit. The heartbeat thread stack: java.lang.Thread.State: BLOCKED (on object monitor) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsVolumeImpl.getDfsUsed(FsVolumeImpl.java:152) - waiting to lock 0x0007b01428c0 (a org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl.getStorageReports(FsDatasetImpl.java:144) - locked 0x0007b0140ed0 (a java.lang.Object) at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.sendHeartBeat(BPServiceActor.java:575) at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.offerService(BPServiceActor.java:680) at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.run(BPServiceActor.java:850) at java.lang.Thread.run(Thread.java:662) The DataXceiver thread holds the dataset lock: DataXceiver for client at X daemon prio=10 tid=0x7f14041e6480 nid=0x52bc in Object.wait() [0x7f11d78f7000] java.lang.Thread.State: TIMED_WAITING (on object monitor) at java.lang.Object.wait(Native Method) at java.lang.Thread.join(Thread.java:1194) locked 0x0007a33b85d8 (a org.apache.hadoop.util.Daemon) at org.apache.hadoop.hdfs.server.datanode.ReplicaInPipeline.stopWriter(ReplicaInPipeline.java:183) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl.createTemporary(FsDatasetImpl.java:1231) locked 0x0007b01428c0 (a
org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl.createTemporary(FsDatasetImpl.java:114) at org.apache.hadoop.hdfs.server.datanode.BlockReceiver.init(BlockReceiver.java:179) at org.apache.hadoop.hdfs.server.datanode.DataXceiver.writeBlock(DataXceiver.java:615) at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.opWriteBlock(Receiver.java:137) at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.processOp(Receiver.java:74) at
[jira] [Commented] (HDFS-7889) Subclass DFSOutputStream to support writing striping layout files
[ https://issues.apache.org/jira/browse/HDFS-7889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388090#comment-14388090 ] Li Bo commented on HDFS-7889: - Hi Zhe, {{stripedBlocks[i]}} is an instance of {{BlockingQueue}}, not {{LocatedBlock}}, and I cannot see any code that would add a non-LocatedBlock object to this queue. Is it necessary to check the type of each element retrieved from the queue? Subclass DFSOutputStream to support writing striping layout files - Key: HDFS-7889 URL: https://issues.apache.org/jira/browse/HDFS-7889 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Li Bo Assignee: Li Bo Attachments: HDFS-7889-001.patch, HDFS-7889-002.patch, HDFS-7889-003.patch, HDFS-7889-004.patch, HDFS-7889-005.patch After HDFS-7888, we can subclass {{DFSOutputStream}} to support writing striping layout files. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
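The point in the comment above can be illustrated with a small sketch. This is hypothetical (LocatedBlockStub stands in for the real LocatedBlock, and the method name is invented): if the queue is declared with a generic element type, the compiler already guarantees what can be offered to it, so a per-element runtime type check on retrieval is redundant.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

// LocatedBlockStub is a stand-in for the real LocatedBlock class.
class LocatedBlockStub {
  final long blockId;
  LocatedBlockStub(long blockId) { this.blockId = blockId; }
}

class StripedQueueExample {
  // With a generically typed queue, only LocatedBlockStub instances can
  // be added, so no instanceof check or cast is needed on retrieval.
  static LocatedBlockStub takeOne(BlockingQueue<LocatedBlockStub> queue) {
    try {
      return queue.take();   // element type is guaranteed by the compiler
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return null;
    }
  }
}
```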
[jira] [Updated] (HDFS-7954) TestBalancer#testBalancerWithPinnedBlocks failed on Windows
[ https://issues.apache.org/jira/browse/HDFS-7954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaoyu Yao updated HDFS-7954: - Assignee: Xiaoyu Yao Status: Patch Available (was: Open) TestBalancer#testBalancerWithPinnedBlocks failed on Windows --- Key: HDFS-7954 URL: https://issues.apache.org/jira/browse/HDFS-7954 Project: Hadoop HDFS Issue Type: Sub-task Components: test Reporter: Xiaoyu Yao Assignee: Xiaoyu Yao Attachments: HDFS-7947.00.patch {code} testBalancerWithPinnedBlocks(org.apache.hadoop.hdfs.server.balancer.TestBalancer) Time elapsed: 22.624 sec FAILURE! java.lang.AssertionError: expected:-3 but was:0 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.hadoop.hdfs.server.balancer.TestBalancer.testBalancerWithPinnedBlocks(TestBalancer.java:353) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7954) TestBalancer#testBalancerWithPinnedBlocks failed on Windows
[ https://issues.apache.org/jira/browse/HDFS-7954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaoyu Yao updated HDFS-7954: - Attachment: HDFS-7947.00.patch Post a patch to skip this test on Windows. TestBalancer#testBalancerWithPinnedBlocks failed on Windows --- Key: HDFS-7954 URL: https://issues.apache.org/jira/browse/HDFS-7954 Project: Hadoop HDFS Issue Type: Sub-task Components: test Reporter: Xiaoyu Yao Attachments: HDFS-7947.00.patch {code} testBalancerWithPinnedBlocks(org.apache.hadoop.hdfs.server.balancer.TestBalancer) Time elapsed: 22.624 sec FAILURE! java.lang.AssertionError: expected:-3 but was:0 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.hadoop.hdfs.server.balancer.TestBalancer.testBalancerWithPinnedBlocks(TestBalancer.java:353) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-8026) Trace FSOutputSummer#writeChecksumChunks rather than DFSOutputStream#writeChunk
[ https://issues.apache.org/jira/browse/HDFS-8026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388100#comment-14388100 ] Hadoop QA commented on HDFS-8026: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708293/HDFS-8026.001.patch against trunk revision 1a495fb. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.TestSetrepIncreasing org.apache.hadoop.tracing.TestTracing Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10122//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10122//console This message is automatically generated. Trace FSOutputSummer#writeChecksumChunks rather than DFSOutputStream#writeChunk --- Key: HDFS-8026 URL: https://issues.apache.org/jira/browse/HDFS-8026 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.7.0 Reporter: Colin Patrick McCabe Assignee: Colin Patrick McCabe Priority: Minor Attachments: HDFS-8026.001.patch We should trace FSOutputSummer#writeChecksumChunks rather than DFSOutputStream#writeChunk. 
When tracing writeChunk, we get a new trace span every 512 bytes; when tracing writeChecksumChunks, we normally get a new trace span only when the FSOutputSummer buffer is full (9x less often). -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7811) Avoid recursive call getStoragePolicyID in INodeFile#computeQuotaUsage
[ https://issues.apache.org/jira/browse/HDFS-7811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388160#comment-14388160 ] Xiaoyu Yao commented on HDFS-7811: -- I can't find an easy way to unit-test that the recursive call does not happen without adding test hooks to production code. Avoid recursive call getStoragePolicyID in INodeFile#computeQuotaUsage -- Key: HDFS-7811 URL: https://issues.apache.org/jira/browse/HDFS-7811 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode, namenode Reporter: Xiaoyu Yao Assignee: Xiaoyu Yao Attachments: HDFS-7811.00.patch, HDFS-7811.01.patch This is a follow-up based on a comment from [~jingzhao] on HDFS-7723. I just noticed that INodeFile#computeQuotaUsage calls getStoragePolicyID to identify the storage policy id of the file. This may not be very efficient (especially when we're computing the quota usage of a directory) because getStoragePolicyID may recursively check the ancestral INode's storage policy. I think an improvement here could be to pass the lowest parent directory's storage policy down while traversing the tree. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
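The improvement described above (passing the ancestor's policy down rather than walking up per file) could look roughly like the following sketch. Inode and its field names are illustrative stand-ins, not the real INode API, and the quota accounting itself is omitted.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: the traversal carries the nearest ancestor's
// effective storage policy downward, so no inode ever needs to walk
// back up through its parents to resolve its policy.
class Inode {
  static final byte UNSPECIFIED = 0;
  byte localPolicy = UNSPECIFIED;   // policy set directly on this inode, if any
  byte effectivePolicy;             // resolved during the traversal
  final List<Inode> children = new ArrayList<>();

  void computeQuotaUsage(byte parentPolicy) {
    // Effective policy: this inode's own policy, else the inherited one.
    effectivePolicy = (localPolicy != UNSPECIFIED) ? localPolicy : parentPolicy;
    for (Inode child : children) {
      child.computeQuotaUsage(effectivePolicy);  // pass it down, no upward walk
    }
  }
}
```

Each inode is visited once and resolves its policy in constant time, instead of the O(depth) upward walk that getStoragePolicyID can incur per file.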
[jira] [Commented] (HDFS-7922) ShortCircuitCache#close is not releasing ScheduledThreadPoolExecutors
[ https://issues.apache.org/jira/browse/HDFS-7922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388203#comment-14388203 ] Hadoop QA commented on HDFS-7922: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708313/004-HDFS-7922.patch against trunk revision cce66ba. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.server.namenode.TestAuditLogs The following test timeouts occurred in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.server.blockmanagement.TestDatanodeManager Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10123//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10123//console This message is automatically generated. ShortCircuitCache#close is not releasing ScheduledThreadPoolExecutors - Key: HDFS-7922 URL: https://issues.apache.org/jira/browse/HDFS-7922 Project: Hadoop HDFS Issue Type: Bug Reporter: Rakesh R Assignee: Rakesh R Attachments: 001-HDFS-7922.patch, 002-HDFS-7922.patch, 003-HDFS-7922.patch, 004-HDFS-7922.patch ShortCircuitCache has the following executors. 
It would be good to shut down these pools during ShortCircuitCache#close to avoid leaks. {code} /** * The executor service that runs the cacheCleaner. */ private final ScheduledThreadPoolExecutor cleanerExecutor = new ScheduledThreadPoolExecutor(1, new ThreadFactoryBuilder(). setDaemon(true).setNameFormat("ShortCircuitCache_Cleaner"). build()); /** * The executor service that runs the slotReleaser. */ private final ScheduledThreadPoolExecutor releaserExecutor = new ScheduledThreadPoolExecutor(1, new ThreadFactoryBuilder(). setDaemon(true).setNameFormat("ShortCircuitCache_SlotReleaser"). build()); {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
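The cleanup being asked for could look something like this minimal sketch. CacheLike is a hypothetical stand-in, not the actual ShortCircuitCache patch, and the 5-second timeout is an assumption; it only shows the usual shutdown/awaitTermination/shutdownNow pattern for releasing the executors in close().

```java
import java.util.concurrent.ScheduledThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

// Hypothetical stand-in for a cache owning two scheduled executors,
// mirroring the fields quoted above. close() shuts both down so their
// daemon threads do not leak.
class CacheLike implements AutoCloseable {
  private final ScheduledThreadPoolExecutor cleanerExecutor =
      new ScheduledThreadPoolExecutor(1);
  private final ScheduledThreadPoolExecutor releaserExecutor =
      new ScheduledThreadPoolExecutor(1);

  @Override
  public void close() {
    cleanerExecutor.shutdown();       // stop accepting new tasks
    releaserExecutor.shutdown();
    try {
      // Give in-flight tasks a bounded chance to finish, then force-stop.
      if (!cleanerExecutor.awaitTermination(5, TimeUnit.SECONDS)) {
        cleanerExecutor.shutdownNow();
      }
      if (!releaserExecutor.awaitTermination(5, TimeUnit.SECONDS)) {
        releaserExecutor.shutdownNow();
      }
    } catch (InterruptedException e) {
      cleanerExecutor.shutdownNow();
      releaserExecutor.shutdownNow();
      Thread.currentThread().interrupt();
    }
  }

  boolean isClosed() {
    return cleanerExecutor.isShutdown() && releaserExecutor.isShutdown();
  }
}
```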
[jira] [Commented] (HDFS-5019) Cleanup imports in HDFS project
[ https://issues.apache.org/jira/browse/HDFS-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388238#comment-14388238 ] Tsuyoshi Ozawa commented on HDFS-5019: -- Hi [~djp], thank you for the update. Could you rebase the patch? Cleanup imports in HDFS project --- Key: HDFS-5019 URL: https://issues.apache.org/jira/browse/HDFS-5019 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.6.0 Reporter: Junping Du Assignee: Junping Du Priority: Minor Attachments: HDFS-5019-v2.patch, HDFS-5019.patch There are some unused imported packages in the current code base which cause unnecessary Java warnings. Also, imports should be in alphabetical order, and import x.x.* is not recommended. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7939) Two fsimage_rollback_* files are created which are not deleted after rollback.
[ https://issues.apache.org/jira/browse/HDFS-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] J.Andreina updated HDFS-7939: - Status: Patch Available (was: Open) Two fsimage_rollback_* files are created which are not deleted after rollback. -- Key: HDFS-7939 URL: https://issues.apache.org/jira/browse/HDFS-7939 Project: Hadoop HDFS Issue Type: Bug Reporter: J.Andreina Assignee: J.Andreina Priority: Critical Attachments: HDFS-7939.1.patch During a checkpoint, if the upload to the remote Namenode fails, then restarting the Namenode with the rollingUpgrade started option creates two fsimage_rollback_* files at the Active Namenode. On rolling upgrade rollback, the initially created fsimage_rollback_* file is not deleted. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7999) FsDatasetImpl#createTemporary sometimes holds the FSDatasetImpl lock for a very long time
[ https://issues.apache.org/jira/browse/HDFS-7999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388273#comment-14388273 ] Xinwei Qin commented on HDFS-7999: --- Yeah, it's a good and necessary idea to keep the createTemporary() method from holding the lock for a long time. FsDatasetImpl#createTemporary sometimes holds the FSDatasetImpl lock for a very long time - Key: HDFS-7999 URL: https://issues.apache.org/jira/browse/HDFS-7999 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.6.0 Reporter: zhouyingchao Assignee: zhouyingchao Attachments: HDFS-7999-001.patch I'm using 2.6.0 and noticed that sometimes the DN's heartbeats were delayed for a very long time, say more than 100 seconds. I took the jstack twice, and it looks like they are all blocked (at getStorageReport) on the dataset lock, which is held by a thread that is calling createTemporary, which in turn is blocked waiting for the earlier incarnation of the writer to exit. The heartbeat thread stack: java.lang.Thread.State: BLOCKED (on object monitor) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsVolumeImpl.getDfsUsed(FsVolumeImpl.java:152) - waiting to lock 0x0007b01428c0 (a org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl.getStorageReports(FsDatasetImpl.java:144) - locked 0x0007b0140ed0 (a java.lang.Object) at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.sendHeartBeat(BPServiceActor.java:575) at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.offerService(BPServiceActor.java:680) at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.run(BPServiceActor.java:850) at java.lang.Thread.run(Thread.java:662) The DataXceiver thread holds the dataset lock: DataXceiver for client at X daemon prio=10 tid=0x7f14041e6480 nid=0x52bc in Object.wait() [0x7f11d78f7000] java.lang.Thread.State: TIMED_WAITING (on object monitor) at java.lang.Object.wait(Native Method) at
java.lang.Thread.join(Thread.java:1194) locked 0x0007a33b85d8 (a org.apache.hadoop.util.Daemon) at org.apache.hadoop.hdfs.server.datanode.ReplicaInPipeline.stopWriter(ReplicaInPipeline.java:183) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl.createTemporary(FsDatasetImpl.java:1231) locked 0x0007b01428c0 (a org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl.createTemporary(FsDatasetImpl.java:114) at org.apache.hadoop.hdfs.server.datanode.BlockReceiver.init(BlockReceiver.java:179) at org.apache.hadoop.hdfs.server.datanode.DataXceiver.writeBlock(DataXceiver.java:615) at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.opWriteBlock(Receiver.java:137) at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.processOp(Receiver.java:74) at org.apache.hadoop.hdfs.server.datanode.DataXceiver.run(DataXceiver.java:235) at java.lang.Thread.run(Thread.java:662) -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7933) fsck should also report decommissioning replicas.
[ https://issues.apache.org/jira/browse/HDFS-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaoyu Yao updated HDFS-7933: - Attachment: (was: HDFS-7933.02.patch) fsck should also report decommissioning replicas. -- Key: HDFS-7933 URL: https://issues.apache.org/jira/browse/HDFS-7933 Project: Hadoop HDFS Issue Type: Bug Components: namenode Reporter: Jitendra Nath Pandey Assignee: Xiaoyu Yao Attachments: HDFS-7933.00.patch, HDFS-7933.01.patch Fsck doesn't count replicas that are on decommissioning nodes. If a block has all replicas on the decommissioning nodes, it will be marked as missing, which is alarming for the admins, although the system will replicate them before nodes are decommissioned. Fsck output should also show decommissioning replicas along with the live replicas. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7933) fsck should also report decommissioning replicas.
[ https://issues.apache.org/jira/browse/HDFS-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaoyu Yao updated HDFS-7933: - Attachment: HDFS-7933.02.patch Thanks [~jnp] for reviewing the patch. I've updated the patch based on your feedback. Summary of changes: 1) Added NumberReplicas#decommissioned and NumberReplicas#decommissioning to track the decommissioned and decommissioning replicas, respectively. 2) Deprecated NumberReplicas#decommissionedReplicas() in favor of NumberReplicas#decommissionedAndDecommissioning() to avoid the misleading name. 3) Display decommissioning and decommissioned replicas separately in NamenodeFsck#check(). fsck should also report decommissioning replicas. -- Key: HDFS-7933 URL: https://issues.apache.org/jira/browse/HDFS-7933 Project: Hadoop HDFS Issue Type: Bug Components: namenode Reporter: Jitendra Nath Pandey Assignee: Xiaoyu Yao Attachments: HDFS-7933.00.patch, HDFS-7933.01.patch, HDFS-7933.02.patch Fsck doesn't count replicas that are on decommissioning nodes. If a block has all replicas on the decommissioning nodes, it will be marked as missing, which is alarming for the admins, although the system will replicate them before nodes are decommissioned. Fsck output should also show decommissioning replicas along with the live replicas. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
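The shape of the NumberReplicas change summarized above might look roughly like this. NumberReplicasLike is a hypothetical stand-in, not the actual HDFS-7933 patch: the two states are counted separately, with a combined accessor taking over from the old misleadingly named decommissionedReplicas().

```java
// Hypothetical sketch of counting decommissioned and decommissioning
// replicas separately while keeping a combined accessor.
class NumberReplicasLike {
  private int decommissioned;
  private int decommissioning;

  void addDecommissioned() { decommissioned++; }
  void addDecommissioning() { decommissioning++; }

  int decommissioned() { return decommissioned; }
  int decommissioning() { return decommissioning; }

  // Combined count, replacing the old decommissionedReplicas().
  int decommissionedAndDecommissioning() {
    return decommissioned + decommissioning;
  }
}
```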
[jira] [Updated] (HDFS-7933) fsck should also report decommissioning replicas.
[ https://issues.apache.org/jira/browse/HDFS-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaoyu Yao updated HDFS-7933: - Attachment: HDFS-7933.02.patch fsck should also report decommissioning replicas. -- Key: HDFS-7933 URL: https://issues.apache.org/jira/browse/HDFS-7933 Project: Hadoop HDFS Issue Type: Bug Components: namenode Reporter: Jitendra Nath Pandey Assignee: Xiaoyu Yao Attachments: HDFS-7933.00.patch, HDFS-7933.01.patch Fsck doesn't count replicas that are on decommissioning nodes. If a block has all replicas on the decommissioning nodes, it will be marked as missing, which is alarming for the admins, although the system will replicate them before nodes are decommissioned. Fsck output should also show decommissioning replicas along with the live replicas. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7933) fsck should also report decommissioning replicas.
[ https://issues.apache.org/jira/browse/HDFS-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaoyu Yao updated HDFS-7933: - Attachment: (was: HDFS-7933.02.patch) fsck should also report decommissioning replicas. -- Key: HDFS-7933 URL: https://issues.apache.org/jira/browse/HDFS-7933 Project: Hadoop HDFS Issue Type: Bug Components: namenode Reporter: Jitendra Nath Pandey Assignee: Xiaoyu Yao Attachments: HDFS-7933.00.patch, HDFS-7933.01.patch Fsck doesn't count replicas that are on decommissioning nodes. If a block has all replicas on the decommissioning nodes, it will be marked as missing, which is alarming for the admins, although the system will replicate them before nodes are decommissioned. Fsck output should also show decommissioning replicas along with the live replicas. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7933) fsck should also report decommissioning replicas.
[ https://issues.apache.org/jira/browse/HDFS-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaoyu Yao updated HDFS-7933: - Attachment: HDFS-7933.02.patch fsck should also report decommissioning replicas. -- Key: HDFS-7933 URL: https://issues.apache.org/jira/browse/HDFS-7933 Project: Hadoop HDFS Issue Type: Bug Components: namenode Reporter: Jitendra Nath Pandey Assignee: Xiaoyu Yao Attachments: HDFS-7933.00.patch, HDFS-7933.01.patch, HDFS-7933.02.patch Fsck doesn't count replicas that are on decommissioning nodes. If a block has all replicas on the decommissioning nodes, it will be marked as missing, which is alarming for the admins, although the system will replicate them before nodes are decommissioned. Fsck output should also show decommissioning replicas along with the live replicas. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7701) Support reporting per storage type quota and usage with hadoop/hdfs shell
[ https://issues.apache.org/jira/browse/HDFS-7701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388235#comment-14388235 ] Peter Shi commented on HDFS-7701: - Thanks for giving such detailed suggestions, I will upload the fixed patch ASAP. Support reporting per storage type quota and usage with hadoop/hdfs shell - Key: HDFS-7701 URL: https://issues.apache.org/jira/browse/HDFS-7701 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode, namenode Reporter: Xiaoyu Yao Assignee: Peter Shi Attachments: HDFS-7701.01.patch, HDFS-7701.02.patch, HDFS-7701.03.patch hadoop fs -count -q or hdfs dfs -count -q currently shows name space/disk space quota and remaining quota information. With HDFS-7584, we want to display per storage type quota and its remaining information as well. The current output format as shown below may not easily accommodate 6 more columns = 3 (existing storage types) * 2 (quota/remaining quota). With new storage types added in future, this will make the output even more crowded. There are also compatibility issues as we don't want to break any existing scripts monitoring hadoop fs -count -q output. $ hadoop fs -count -q -v /test QUOTA REM_QUOTA SPACE_QUOTA REM_SPACE_QUOTA DIR_COUNT FILE_COUNT CONTENT_SIZE PATHNAME none inf 524288000 5242665691 15 21431 /test Propose to add a -t parameter to display ONLY the storage type quota information of the directory separately. This way, existing scripts will work as-is without using the -t parameter. 1) When -t is not followed by a specific storage type, quota and usage information for all storage types will be displayed. $ hadoop fs -count -q -t -h -v /test SSD_QUOTA REM_SSD_QUOTA DISK_QUOTA REM_DISK_QUOTA ARCHIVAL_QUOTA REM_ARCHIVAL_QUOTA PATHNAME 512MB 256MB none inf none inf /test 2) If -t is followed by a storage type, only the quota and remaining quota of the storage type is displayed. 
$ hadoop fs -count -q -t SSD -h -v /test SSD_QUOTA REM_SSD_QUOTA PATHNAME 512 MB 256 MB /test -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Work started] (HDFS-7949) WebImageViewer need support file size calculation with striped blocks
[ https://issues.apache.org/jira/browse/HDFS-7949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HDFS-7949 started by Rakesh R. -- WebImageViewer need support file size calculation with striped blocks - Key: HDFS-7949 URL: https://issues.apache.org/jira/browse/HDFS-7949 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Hui Zheng Assignee: Rakesh R Priority: Minor Attachments: HDFS-7949-001.patch The file size calculation should be changed when the blocks of the file are striped in WebImageViewer. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7701) Support reporting per storage type quota and usage with hadoop/hdfs shell
[ https://issues.apache.org/jira/browse/HDFS-7701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Shi updated HDFS-7701: Attachment: HDFS-7701.04.patch Support reporting per storage type quota and usage with hadoop/hdfs shell - Key: HDFS-7701 URL: https://issues.apache.org/jira/browse/HDFS-7701 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode, namenode Reporter: Xiaoyu Yao Assignee: Peter Shi Attachments: HDFS-7701.01.patch, HDFS-7701.02.patch, HDFS-7701.03.patch, HDFS-7701.04.patch hadoop fs -count -q or hdfs dfs -count -q currently shows name space/disk space quota and remaining quota information. With HDFS-7584, we want to display per storage type quota and its remaining information as well. The current output format as shown below may not easily accommodate 6 more columns = 3 (existing storage types) * 2 (quota/remaining quota). With new storage types added in future, this will make the output even more crowded. There are also compatibility issues as we don't want to break any existing scripts monitoring hadoop fs -count -q output. $ hadoop fs -count -q -v /test QUOTA REM_QUOTA SPACE_QUOTA REM_SPACE_QUOTA DIR_COUNT FILE_COUNT CONTENT_SIZE PATHNAME none inf 524288000 5242665691 15 21431 /test Propose to add a -t parameter to display ONLY the storage type quota information of the directory separately. This way, existing scripts will work as-is without using the -t parameter. 1) When -t is not followed by a specific storage type, quota and usage information for all storage types will be displayed. $ hadoop fs -count -q -t -h -v /test SSD_QUOTA REM_SSD_QUOTA DISK_QUOTA REM_DISK_QUOTA ARCHIVAL_QUOTA REM_ARCHIVAL_QUOTA PATHNAME 512MB 256MB none inf none inf /test 2) If -t is followed by a storage type, only the quota and remaining quota of the storage type is displayed. $ hadoop fs -count -q -t SSD -h -v /test SSD_QUOTA REM_SSD_QUOTA PATHNAME 512 MB 256 MB /test -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8012) Updatable HAR Filesystem
[ https://issues.apache.org/jira/browse/HDFS-8012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Madhan Sundararajan Devaki updated HDFS-8012: - Description: Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files + Remove existing files + Replace existing files This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. was: Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files + Remove existing files + Replace existing files Updatable HAR Filesystem Key: HDFS-8012 URL: https://issues.apache.org/jira/browse/HDFS-8012 Project: Hadoop HDFS Issue Type: Bug Components: datanode, hdfs-client Reporter: Madhan Sundararajan Devaki Priority: Critical Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files + Remove existing files + Replace existing files This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7949) WebImageViewer need support file size calculation with striped blocks
[ https://issues.apache.org/jira/browse/HDFS-7949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rakesh R updated HDFS-7949: --- Attachment: HDFS-7949-001.patch WebImageViewer need support file size calculation with striped blocks - Key: HDFS-7949 URL: https://issues.apache.org/jira/browse/HDFS-7949 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Hui Zheng Assignee: Rakesh R Priority: Minor Attachments: HDFS-7949-001.patch The file size calculation should be changed when the blocks of the file are striped in WebImageViewer. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7716) Erasure Coding: extend BlockInfo to handle EC info
[ https://issues.apache.org/jira/browse/HDFS-7716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinayakumar B updated HDFS-7716: Fix Version/s: HDFS-7285 Erasure Coding: extend BlockInfo to handle EC info -- Key: HDFS-7716 URL: https://issues.apache.org/jira/browse/HDFS-7716 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Jing Zhao Assignee: Jing Zhao Fix For: HDFS-7285 Attachments: HDFS-7716.000.patch, HDFS-7716.001.patch, HDFS-7716.002.patch, HDFS-7716.003.patch The current BlockInfo's implementation only supports the replication mechanism. To use the same blocksMap handling block group and its data/parity blocks, we need to define a new BlockGroupInfo class. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8012) Updatable HAR Filesystem
[ https://issues.apache.org/jira/browse/HDFS-8012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Madhan Sundararajan Devaki updated HDFS-8012: - Description: Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files + Remove existing files + Replace existing files was:Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? Updatable HAR Filesystem Key: HDFS-8012 URL: https://issues.apache.org/jira/browse/HDFS-8012 Project: Hadoop HDFS Issue Type: Bug Components: datanode, hdfs-client Reporter: Madhan Sundararajan Devaki Priority: Critical Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files + Remove existing files + Replace existing files -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8012) Updatable HAR Filesystem
[ https://issues.apache.org/jira/browse/HDFS-8012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Madhan Sundararajan Devaki updated HDFS-8012: - Issue Type: Improvement (was: Bug) Updatable HAR Filesystem Key: HDFS-8012 URL: https://issues.apache.org/jira/browse/HDFS-8012 Project: Hadoop HDFS Issue Type: Improvement Components: datanode, hdfs-client Reporter: Madhan Sundararajan Devaki Priority: Critical Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files + Remove existing files + Replace existing files This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7652) Process block reports for erasure coded blocks
[ https://issues.apache.org/jira/browse/HDFS-7652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinayakumar B updated HDFS-7652: Fix Version/s: HDFS-7285 Process block reports for erasure coded blocks -- Key: HDFS-7652 URL: https://issues.apache.org/jira/browse/HDFS-7652 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Zhe Zhang Assignee: Zhe Zhang Fix For: HDFS-7285 Attachments: HDFS-7652.001.patch, HDFS-7652.002.patch, HDFS-7652.003.patch, HDFS-7652.004.patch, HDFS-7652.005.patch, HDFS-7652.006.patch HDFS-7339 adds support in NameNode for persisting block groups. For memory efficiency, erasure coded blocks under the striping layout are not stored in {{BlockManager#blocksMap}}. Instead, entire block groups are stored in {{BlockGroupManager#blockGroups}}. When a block report arrives from the DataNode, it should be processed under the block group that it belongs to. The following naming protocol is used to calculate the group of a given block: {code} * HDFS-EC introduces a hierarchical protocol to name blocks and groups: * Contiguous: {reserved block IDs | flag | block ID} * Striped: {reserved block IDs | flag | block group ID | index in group} * * Following n bits of reserved block IDs, The (n+1)th bit in an ID * distinguishes contiguous (0) and striped (1) blocks. For a striped block, * bits (n+2) to (64-m) represent the ID of its block group, while the last m * bits represent its index of the group. The value m is determined by the * maximum number of blocks in a group (MAX_BLOCKS_IN_GROUP). {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
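The bit layout quoted above can be sanity-checked with a small stand-alone sketch. Note the index width here (m = 4 bits, i.e. MAX_BLOCKS_IN_GROUP = 16) is an assumed demo value, not the constant the HDFS-7285 branch actually uses; the masking logic only illustrates how a block group ID and in-group index could be recovered from a striped block ID.

```java
public class StripedBlockIdSketch {
    // Assumption for illustration: m = 4 index bits, so at most
    // MAX_BLOCKS_IN_GROUP = 16 blocks per group. The real constant is
    // defined elsewhere in the HDFS-7285 branch.
    static final int INDEX_BITS = 4;
    static final long INDEX_MASK = (1L << INDEX_BITS) - 1;

    // The bits above the low m bits identify the group, so clearing the
    // low m index bits recovers the block group ID.
    static long blockGroupId(long blockId) {
        return blockId & ~INDEX_MASK;
    }

    // The last m bits of a striped block ID are its index within the group.
    static int indexInGroup(long blockId) {
        return (int) (blockId & INDEX_MASK);
    }
}
```

With this layout, all blocks of one group share the same high bits, so a block report entry can be mapped back to its group with a single mask operation.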
[jira] [Commented] (HDFS-8011) standby nn can't started
[ https://issues.apache.org/jira/browse/HDFS-8011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388371#comment-14388371 ] Vinayakumar B commented on HDFS-8011: - Hi [~fujie] Can you attach a little more log around the above-mentioned exceptions? standby nn can't started Key: HDFS-8011 URL: https://issues.apache.org/jira/browse/HDFS-8011 Project: Hadoop HDFS Issue Type: Bug Components: ha Affects Versions: 2.3.0 Environment: CentOS 6.2 64-bit Reporter: fujie We have seen a crash when starting the standby namenode, with fatal errors. Any solutions, workarounds, or ideas would be helpful for us. 1. Here is the context: At the beginning we have 2 namenodes; take A as active and B as standby. For some reasons, namenode A died, so namenode B is working as active. When we try to restart A after a minute, it can't work. During this time a lot of files were put to HDFS, and a lot of files were renamed. Namenode A crashed while awaiting reported blocks in safemode each time. 2. 
We can see error log below: 1)2015-03-30 ERROR org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader: Encountered exception on operation CloseOp [length=0, inodeId=0, path=/xxx/_temporary/xxx/part-r-00074.bz2, replication=3, mtime=1427699913947, atime=1427699081161, blockSize=268435456, blocks=[blk_2103131025_1100889495739], permissions=dm:dm:rw-r--r--, clientName=, clientMachine=, opCode=OP_CLOSE, txid=7632753612] java.lang.NullPointerException at org.apache.hadoop.hdfs.server.blockmanagement.BlockInfoUnderConstruction.setGenerationStampAndVerifyReplicas(BlockInfoUnderConstruction.java:247) at org.apache.hadoop.hdfs.server.blockmanagement.BlockInfoUnderConstruction.commitBlock(BlockInfoUnderConstruction.java:267) at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.forceCompleteBlock(BlockManager.java:639) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.updateBlocks(FSEditLogLoader.java:813) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.applyEditLogOp(FSEditLogLoader.java:383) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadEditRecords(FSEditLogLoader.java:209) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadFSEdits(FSEditLogLoader.java:122) at org.apache.hadoop.hdfs.server.namenode.FSImage.loadEdits(FSImage.java:737) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer.doTailEdits(EditLogTailer.java:227) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread.doWork(EditLogTailer.java:321) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread.access$0(EditLogTailer.java:302) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread$1.run(EditLogTailer.java:296) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:356) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1528) at 
org.apache.hadoop.security.SecurityUtil.doAsLoginUserOrFatal(SecurityUtil.java:413) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread.run(EditLogTailer.java:292) 2)2015-03-30 FATAL org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer: Unknown error encountered while tailing edits. Shutting down standby N N. java.io.IOException: Failed to apply edit log operation AddBlockOp [path=/xxx/_temporary/xxx/part-m-00121, penultimateBlock=blk_2102331803_1100888911441, lastBlock=blk_2102661068_1100889009168, RpcClientId=, RpcCallId=-2]: error null at org.apache.hadoop.hdfs.server.namenode.MetaRecoveryContext.editLogLoaderPrompt(MetaRecoveryContext.java:94) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadEditRecords(FSEditLogLoader.java:215) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadFSEdits(FSEditLogLoader.java:122) at org.apache.hadoop.hdfs.server.namenode.FSImage.loadEdits(FSImage.java:737) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer.doTailEdits(EditLogTailer.java:227) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread.doWork(EditLogTailer.java:321) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread.access$0(EditLogTailer.java:302) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread$1.run(EditLogTailer.java:296) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:356)
[jira] [Commented] (HDFS-7933) fsck should also report decommissioning replicas.
[ https://issues.apache.org/jira/browse/HDFS-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388386#comment-14388386 ] Hadoop QA commented on HDFS-7933: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708365/HDFS-7933.02.patch against trunk revision 85dc3c1. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 3 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.server.namenode.ha.TestRetryCacheWithHA org.apache.hadoop.hdfs.server.namenode.TestDefaultBlockPlacementPolicy Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10126//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10126//console This message is automatically generated. fsck should also report decommissioning replicas. -- Key: HDFS-7933 URL: https://issues.apache.org/jira/browse/HDFS-7933 Project: Hadoop HDFS Issue Type: Bug Components: namenode Reporter: Jitendra Nath Pandey Assignee: Xiaoyu Yao Attachments: HDFS-7933.00.patch, HDFS-7933.01.patch, HDFS-7933.02.patch Fsck doesn't count replicas that are on decommissioning nodes. 
If a block has all replicas on the decommissioning nodes, it will be marked as missing, which is alarming for the admins, although the system will replicate them before nodes are decommissioned. Fsck output should also show decommissioning replicas along with the live replicas. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
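The distinction the report asks for can be sketched as a tiny classification helper. This is purely hypothetical illustration code (the names and report strings are invented, not the actual fsck output format): the point is that a block whose only replicas sit on decommissioning nodes is readable and will be re-replicated, so it should not be lumped in with truly missing blocks.

```java
public class FsckReplicaReport {
    // Hypothetical helper: classify a block given its live and
    // decommissioning replica counts. Strings are illustrative only,
    // not the real fsck output format.
    static String classify(int liveReplicas, int decommissioningReplicas) {
        if (liveReplicas == 0 && decommissioningReplicas == 0) {
            // Genuinely no copies anywhere: this is the alarming case.
            return "MISSING";
        }
        if (liveReplicas == 0) {
            // All replicas on decommissioning nodes: still readable, and
            // re-replication happens before decommission completes.
            return "NO LIVE REPLICAS, " + decommissioningReplicas
                    + " DECOMMISSIONING";
        }
        return "OK (" + liveReplicas + " live, "
                + decommissioningReplicas + " decommissioning)";
    }
}
```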
[jira] [Commented] (HDFS-7999) FsDatasetImpl#createTemporary sometimes holds the FSDatasetImpl lock for a very long time
[ https://issues.apache.org/jira/browse/HDFS-7999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388309#comment-14388309 ] Xinwei Qin commented on HDFS-7999: --- Hi [~cmccabe] Thanks for your comment. {quote} even if we made the heartbeat lockless, there are still many other problems associated with having FsDatasetImpl#createTemporary hold the FSDatasetImpl lock for a very long time. Any thread that needs to read or write from the datanode will be blocked. {quote} Making the heartbeat lockless can avoid DataNodes being declared dead, and I think it is a necessary patch ([https://issues.apache.org/jira/browse/HDFS-7060]). The FSDatasetImpl lock being held for a long time is another problem; maybe the patch of this JIRA can alleviate it. FsDatasetImpl#createTemporary sometimes holds the FSDatasetImpl lock for a very long time - Key: HDFS-7999 URL: https://issues.apache.org/jira/browse/HDFS-7999 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.6.0 Reporter: zhouyingchao Assignee: zhouyingchao Attachments: HDFS-7999-001.patch I'm using 2.6.0 and noticed that sometimes the DN's heartbeats were delayed for a very long time, say more than 100 seconds. I got the jstack twice, and it looks like the threads are all blocked (at getStorageReport) on the dataset lock, which is held by a thread that is calling createTemporary, which in turn is blocked waiting for an earlier incarnation of the writer to exit. 
The heartbeat thread stack: java.lang.Thread.State: BLOCKED (on object monitor) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsVolumeImpl.getDfsUsed(FsVolumeImpl.java:152) - waiting to lock 0x0007b01428c0 (a org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl.getStorageReports(FsDatasetImpl.java:144) - locked 0x0007b0140ed0 (a java.lang.Object) at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.sendHeartBeat(BPServiceActor.java:575) at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.offerService(BPServiceActor.java:680) at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.run(BPServiceActor.java:850) at java.lang.Thread.run(Thread.java:662) The DataXceiver thread holds the dataset lock: DataXceiver for client at X daemon prio=10 tid=0x7f14041e6480 nid=0x52bc in Object.wait() [0x7f11d78f7000] java.lang.Thread.State: TIMED_WAITING (on object monitor) at java.lang.Object.wait(Native Method) at java.lang.Thread.join(Thread.java:1194) locked 0x0007a33b85d8 (a org.apache.hadoop.util.Daemon) at org.apache.hadoop.hdfs.server.datanode.ReplicaInPipeline.stopWriter(ReplicaInPipeline.java:183) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl.createTemporary(FsDatasetImpl.java:1231) locked 0x0007b01428c0 (a org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl) at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl.createTemporary(FsDatasetImpl.java:114) at org.apache.hadoop.hdfs.server.datanode.BlockReceiver.init(BlockReceiver.java:179) at org.apache.hadoop.hdfs.server.datanode.DataXceiver.writeBlock(DataXceiver.java:615) at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.opWriteBlock(Receiver.java:137) at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.processOp(Receiver.java:74) at org.apache.hadoop.hdfs.server.datanode.DataXceiver.run(DataXceiver.java:235) at java.lang.Thread.run(Thread.java:662) 
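The fix direction discussed in this thread (stop the earlier writer without holding the dataset lock, then re-acquire and re-check) can be sketched as a minimal, single-threaded model. Everything here is a simplified stand-in, not the real FsDatasetImpl code: the map holds only generation stamps, stopWriter is a no-op placeholder for joining the old writer thread, and exceptions are modeled as return strings.

```java
import java.util.HashMap;
import java.util.Map;

// Sketch of the check / unlock / stop-writer / relock / re-check loop.
// All names (volumeMap, datasetLock, stopWriter) are simplified stand-ins.
public class CreateTemporarySketch {
    static final Object datasetLock = new Object();
    // blockId -> generation stamp; stand-in for the real volumeMap.
    static final Map<Long, Long> volumeMap = new HashMap<>();

    static void stopWriter(long blockId) {
        // In the real patch this joins the old writer thread WITHOUT
        // holding datasetLock, so heartbeats are not blocked meanwhile.
    }

    static String createTemporary(long blockId, long newGS) {
        while (true) {
            Long seenGS;
            synchronized (datasetLock) {
                seenGS = volumeMap.get(blockId);
                if (seenGS == null) {            // no replica yet: create it
                    volumeMap.put(blockId, newGS);
                    return "created";
                }
                if (seenGS >= newGS) {           // existing GS is newer/equal
                    return "ReplicaAlreadyExistsException";
                }
            }
            stopWriter(blockId);                 // old writer stopped, lock released
            synchronized (datasetLock) {
                Long nowGS = volumeMap.get(blockId);
                if (nowGS != null && nowGS.equals(seenGS)) {
                    // Map unchanged while we were unlocked: evict & replace.
                    volumeMap.put(blockId, newGS);
                    return "created";
                }
                // Another thread slipped in while we were unlocked:
                // loop and re-evaluate against the new replica.
            }
        }
    }
}
```

The key property is that the potentially long stopWriter() join happens outside the lock, so threads like the heartbeat sender (blocked at getStorageReports in the stack above) are no longer starved.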
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8027) Erasure Coding: Update CHANGES-HDFS-7285.txt with branch commits
[ https://issues.apache.org/jira/browse/HDFS-8027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinayakumar B updated HDFS-8027: Attachment: HDFS-8027-01.patch Attaching for reference. Jiras are ordered as per Jira resolution date Erasure Coding: Update CHANGES-HDFS-7285.txt with branch commits Key: HDFS-8027 URL: https://issues.apache.org/jira/browse/HDFS-8027 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Vinayakumar B Assignee: Vinayakumar B Attachments: HDFS-8027-01.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (HDFS-8027) Erasure Coding: Update CHANGES-HDFS-7285.txt with branch commits
[ https://issues.apache.org/jira/browse/HDFS-8027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinayakumar B resolved HDFS-8027. - Resolution: Fixed Fix Version/s: HDFS-7285 Committed to HDFS-7285 branch. Committed directly, as this is only a CHANGES-HDFS-7285.txt update. Erasure Coding: Update CHANGES-HDFS-7285.txt with branch commits Key: HDFS-8027 URL: https://issues.apache.org/jira/browse/HDFS-8027 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Vinayakumar B Assignee: Vinayakumar B Fix For: HDFS-7285 Attachments: HDFS-8027-01.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7285) Erasure Coding Support inside HDFS
[ https://issues.apache.org/jira/browse/HDFS-7285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388329#comment-14388329 ] Vinayakumar B commented on HDFS-7285: - Hi, I think most of the commits to HDFS-7285 were not added to CHANGES-HDFS-EC-7285.txt. Keeping it current will help to update CHANGES.txt at the time of merging to trunk, and hence record the contributions. Very happy to see many new people contributing to this work. For all commits till now I have updated CHANGES-HDFS-EC-7285.txt through HDFS-8027. Please take care of it for further commits. Thanks. Erasure Coding Support inside HDFS -- Key: HDFS-7285 URL: https://issues.apache.org/jira/browse/HDFS-7285 Project: Hadoop HDFS Issue Type: New Feature Reporter: Weihua Jiang Assignee: Zhe Zhang Attachments: ECAnalyzer.py, ECParser.py, HDFS-7285-initial-PoC.patch, HDFSErasureCodingDesign-20141028.pdf, HDFSErasureCodingDesign-20141217.pdf, HDFSErasureCodingDesign-20150204.pdf, HDFSErasureCodingDesign-20150206.pdf, fsimage-analysis-20150105.pdf Erasure Coding (EC) can greatly reduce the storage overhead without sacrificing data reliability, compared to the existing HDFS 3-replica approach. For example, if we use a 10+4 Reed-Solomon coding, we can tolerate the loss of 4 blocks, with a storage overhead of only 40%. This makes EC a quite attractive alternative for big data storage, particularly for cold data. Facebook had a related open source project called HDFS-RAID. It used to be one of the contrib packages in HDFS but has been removed since Hadoop 2.0 for maintenance reasons. The drawbacks are: 1) it is on top of HDFS and depends on MapReduce to do encoding and decoding tasks; 2) it can only be used for cold files that are not intended to be appended anymore; 3) the pure-Java EC coding implementation is extremely slow in practical use. Due to these, it might not be a good idea to just bring HDFS-RAID back. 
We (Intel and Cloudera) are working on a design to build EC into HDFS that gets rid of any external dependencies, making it self-contained and independently maintained. This design layers the EC feature on the storage-type support and considers compatibility with existing HDFS features like caching, snapshots, encryption, high availability, etc. This design will also support different EC coding schemes, implementations, and policies for different deployment scenarios. By utilizing advanced libraries (e.g. the Intel ISA-L library), an implementation can greatly improve the performance of EC encoding/decoding and make the EC solution even more attractive. We will post the design document soon. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
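The storage-overhead arithmetic in the description above is easy to verify: r-way replication stores r copies (overhead r - 1, so 200% for 3 replicas), while an RS(k, p) code stores p parity blocks per k data blocks (overhead p / k, so 40% for 10+4).

```java
// Quick check of the storage-overhead comparison quoted above.
public class EcOverhead {
    // r-way replication keeps r copies: overhead = r - 1 extra copies.
    static double replicationOverhead(int replicas) {
        return replicas - 1.0;
    }

    // RS(k, p) stores p parity blocks per k data blocks: overhead = p / k.
    static double rsOverhead(int dataBlocks, int parityBlocks) {
        return (double) parityBlocks / dataBlocks;
    }
}
```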
[jira] [Updated] (HDFS-8012) Updatable HAR Filesystem
[ https://issues.apache.org/jira/browse/HDFS-8012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Madhan Sundararajan Devaki updated HDFS-8012: - Description: Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files + Remove existing files + Replace existing files (Optional) This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. was: Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files + Remove existing files + Replace existing files This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. Updatable HAR Filesystem Key: HDFS-8012 URL: https://issues.apache.org/jira/browse/HDFS-8012 Project: Hadoop HDFS Issue Type: Improvement Components: datanode, hdfs-client Reporter: Madhan Sundararajan Devaki Priority: Critical Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files + Remove existing files + Replace existing files (Optional) This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. 
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8012) Updatable HAR Filesystem
[ https://issues.apache.org/jira/browse/HDFS-8012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Madhan Sundararajan Devaki updated HDFS-8012: - Description: Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files [ -a filename-uri1 filename-uri2 ... / -a dirname-uri1 dirname-uri2 ...] + Remove existing files [ -d filename-uri1 filename-uri2 ... / -d dirname-uri1 dirname-uri2 ...] + Update/Replace existing files (Optional) [ -u old-filename-uri new-filename-uri] This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. was: Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files [ -a filename-uri1, filename-uri2, ...] + Remove existing files [ -d filename-uri1, filename-uri2, ...] + Update/Replace existing files (Optional) [ -u old-filename-uri new-filename-uri] This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. Updatable HAR Filesystem Key: HDFS-8012 URL: https://issues.apache.org/jira/browse/HDFS-8012 Project: Hadoop HDFS Issue Type: Improvement Components: datanode, hdfs-client Reporter: Madhan Sundararajan Devaki Priority: Critical Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files [ -a filename-uri1 filename-uri2 ... / -a dirname-uri1 dirname-uri2 ...] 
+ Remove existing files [ -d filename-uri1 filename-uri2 ... / -d dirname-uri1 dirname-uri2 ...] + Update/Replace existing files (Optional) [ -u old-filename-uri new-filename-uri] This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HDFS-8027) Update CHANGES-HDFS-7285.txt with branch commits
Vinayakumar B created HDFS-8027: --- Summary: Update CHANGES-HDFS-7285.txt with branch commits Key: HDFS-8027 URL: https://issues.apache.org/jira/browse/HDFS-8027 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Vinayakumar B Assignee: Vinayakumar B -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8027) Erasure Coding: Update CHANGES-HDFS-7285.txt with branch commits
[ https://issues.apache.org/jira/browse/HDFS-8027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinayakumar B updated HDFS-8027: Summary: Erasure Coding: Update CHANGES-HDFS-7285.txt with branch commits (was: Update CHANGES-HDFS-7285.txt with branch commits) Erasure Coding: Update CHANGES-HDFS-7285.txt with branch commits Key: HDFS-8027 URL: https://issues.apache.org/jira/browse/HDFS-8027 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Vinayakumar B Assignee: Vinayakumar B -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8027) Erasure Coding: Update CHANGES-HDFS-7285.txt with branch commits
[ https://issues.apache.org/jira/browse/HDFS-8027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinayakumar B updated HDFS-8027: Description: Latest branch commits are not tracked in CHANGES-HDFS-7285.txt. Erasure Coding: Update CHANGES-HDFS-7285.txt with branch commits Key: HDFS-8027 URL: https://issues.apache.org/jira/browse/HDFS-8027 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Vinayakumar B Assignee: Vinayakumar B Fix For: HDFS-7285 Attachments: HDFS-8027-01.patch Latest branch commits are not tracked in CHANGES-HDFS-7285.txt. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-8011) standby nn can't started
[ https://issues.apache.org/jira/browse/HDFS-8011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388330#comment-14388330 ] fujie commented on HDFS-8011: - HDFS-6825's affected version is 2.5.0, but our Hadoop version is 2.3.0. So are you sure it is the same issue? 1. I am sure that the file was deleted. And I have some new findings. Say we have image-file-1, editlog-file-1 and editlog-file-inprogress when starting the standby namenode A. I observed the following behavior of these files: step-1) SNN will load image-file-1 and editlog-file-1 and generate a new image file; take it as image-file-2. step-2) SNN will copy image-file-2 to the active namenode. step-3) editlog-file-inprogress will be renamed to editlog-file-2 and a new editlog-file-inprogress will be opened. step-4) SNN will load editlog-file-2; at the same time datanodes will report heartbeats to both active and standby. The crash happens at step-4. We printed all the failed files and all of them are in editlog-file-2. We also have statistics: 20,000 out of 500,000 operations failed. Then we parsed editlog-file-2, and the failed records all look alike. In all of them, RPC_CLIENTID is null (blank) and RPC_CALLID is -2:

<RECORD>
  <OPCODE>OP_ADD_BLOCK</OPCODE>
  <DATA>
    <TXID>7660428426</TXID>
    <PATH>/workspace/dm/recommend/VideoQuality/VRII/AppList/data/interactivedata_month/_temporary/1/_temporary/attempt_1427018831005_178665_r_02_0/part-r-2</PATH>
    <BLOCK>
      <BLOCK_ID>2107099231</BLOCK_ID>
      <NUM_BYTES>0</NUM_BYTES>
      <GENSTAMP>1100893452304</GENSTAMP>
    </BLOCK>
    <RPC_CLIENTID></RPC_CLIENTID>
    <RPC_CALLID>-2</RPC_CALLID>
  </DATA>
</RECORD>

2. If we restart SNN A again, editlog-file-2 can be loaded correctly, just like editlog-file-1 in the last restart. It's weird. Do the reported heartbeats impact its behavior? But the load process and report process should be asynchronous, shouldn't they? We are looking forward to your reply. 
standby nn can't started Key: HDFS-8011 URL: https://issues.apache.org/jira/browse/HDFS-8011 Project: Hadoop HDFS Issue Type: Bug Components: ha Affects Versions: 2.3.0 Environment: CentOS 6.2 64-bit Reporter: fujie We have seen a crash when starting the standby namenode, with fatal errors. Any solutions, workarounds, or ideas would be helpful for us. 1. Here is the context: At the beginning we have 2 namenodes; take A as active and B as standby. For some reasons, namenode A died, so namenode B is working as active. When we try to restart A after a minute, it can't work. During this time a lot of files were put to HDFS, and a lot of files were renamed. Namenode A crashed while awaiting reported blocks in safemode each time. 2. We can see error log below: 1)2015-03-30 ERROR org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader: Encountered exception on operation CloseOp [length=0, inodeId=0, path=/xxx/_temporary/xxx/part-r-00074.bz2, replication=3, mtime=1427699913947, atime=1427699081161, blockSize=268435456, blocks=[blk_2103131025_1100889495739], permissions=dm:dm:rw-r--r--, clientName=, clientMachine=, opCode=OP_CLOSE, txid=7632753612] java.lang.NullPointerException at org.apache.hadoop.hdfs.server.blockmanagement.BlockInfoUnderConstruction.setGenerationStampAndVerifyReplicas(BlockInfoUnderConstruction.java:247) at org.apache.hadoop.hdfs.server.blockmanagement.BlockInfoUnderConstruction.commitBlock(BlockInfoUnderConstruction.java:267) at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.forceCompleteBlock(BlockManager.java:639) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.updateBlocks(FSEditLogLoader.java:813) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.applyEditLogOp(FSEditLogLoader.java:383) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadEditRecords(FSEditLogLoader.java:209) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadFSEdits(FSEditLogLoader.java:122) at 
org.apache.hadoop.hdfs.server.namenode.FSImage.loadEdits(FSImage.java:737) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer.doTailEdits(EditLogTailer.java:227) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread.doWork(EditLogTailer.java:321) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread.access$0(EditLogTailer.java:302) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread$1.run(EditLogTailer.java:296) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:356) at
[jira] [Commented] (HDFS-7701) Support reporting per storage type quota and usage with hadoop/hdfs shell
[ https://issues.apache.org/jira/browse/HDFS-7701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388349#comment-14388349 ] Hadoop QA commented on HDFS-7701: - {color:green}+1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708388/HDFS-7701.04.patch against trunk revision b5a22e9. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 2 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 core tests{color}. The patch passed unit tests in hadoop-common-project/hadoop-common. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10128//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10128//console This message is automatically generated. Support reporting per storage type quota and usage with hadoop/hdfs shell - Key: HDFS-7701 URL: https://issues.apache.org/jira/browse/HDFS-7701 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode, namenode Reporter: Xiaoyu Yao Assignee: Peter Shi Attachments: HDFS-7701.01.patch, HDFS-7701.02.patch, HDFS-7701.03.patch, HDFS-7701.04.patch hadoop fs -count -q or hdfs dfs -count -q currently shows name space/disk space quota and remaining quota information. With HDFS-7584, we want to display per storage type quota and its remaining information as well. 
The current output format as shown below may not easily accommodate 6 more columns (3 existing storage types * 2 columns each for quota/remaining quota). With new storage types added in the future, this will make the output even more crowded. There are also compatibility issues, as we don't want to break any existing scripts monitoring hadoop fs -count -q output. $ hadoop fs -count -q -v /test QUOTA REM_QUOTA SPACE_QUOTA REM_SPACE_QUOTA DIR_COUNT FILE_COUNT CONTENT_SIZE PATHNAME none inf 524288000 5242665691 15 21431 /test I propose to add a -t parameter to display ONLY the storage type quota information of the directory separately. This way, existing scripts will work as-is without the -t parameter. 1) When -t is not followed by a specific storage type, quota and usage information for all storage types will be displayed. $ hadoop fs -count -q -t -h -v /test SSD_QUOTA REM_SSD_QUOTA DISK_QUOTA REM_DISK_QUOTA ARCHIVAL_QUOTA REM_ARCHIVAL_QUOTA PATHNAME 512MB 256MB none inf none inf /test 2) If -t is followed by a storage type, only the quota and remaining quota of that storage type are displayed. $ hadoop fs -count -q -t SSD -h -v /test SSD_QUOTA REM_SSD_QUOTA PATHNAME 512 MB 256 MB /test -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8012) Updatable HAR Filesystem
[ https://issues.apache.org/jira/browse/HDFS-8012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Madhan Sundararajan Devaki updated HDFS-8012: - Description: Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files [ -a filename-uri1, filename-uri2, ...] + Remove existing files [ -d filename-uri1, filename-uri2, ...] + Update/Replace existing files (Optional) [ -u old-filename-uri new-filename-uri] This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. was: Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files + Remove existing files + Replace existing files (Optional) This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. Updatable HAR Filesystem Key: HDFS-8012 URL: https://issues.apache.org/jira/browse/HDFS-8012 Project: Hadoop HDFS Issue Type: Improvement Components: datanode, hdfs-client Reporter: Madhan Sundararajan Devaki Priority: Critical Is there a plan to support updatable HAR Filesystem? If so, by when is this expected please? The following operations may be supported. + Add new files [ -a filename-uri1, filename-uri2, ...] + Remove existing files [ -d filename-uri1, filename-uri2, ...] 
+ Update/Replace existing files (Optional) [ -u old-filename-uri new-filename-uri] This is required in cases where data is stored in AVRO format in HDFS and the corresponding .avsc files are used to create Hive external tables. This will lead to the small files (.avsc files in this case) problem when there are a large number of tables that need to be loaded into Hive as external tables. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7888) Change DataStreamer/DFSOutputStream/DFSPacket for convenience of subclassing
[ https://issues.apache.org/jira/browse/HDFS-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388304#comment-14388304 ] Hadoop QA commented on HDFS-7888: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708333/HDFS-7888-trunk-001.patch against trunk revision 85dc3c1. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.server.blockmanagement.TestDatanodeManager Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10124//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10124//console This message is automatically generated. Change DataStreamer/DFSOutputStream/DFSPacket for convenience of subclassing Key: HDFS-7888 URL: https://issues.apache.org/jira/browse/HDFS-7888 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Li Bo Assignee: Li Bo Attachments: HDFS-7888-001.patch, HDFS-7888-trunk-001.patch HDFS-7793 refactors class {{DFSOutputStream}} on trunk which makes {{DFSOutputStream}} a class without any inner classes. 
We want to subclass {{DFSOutputStream}} to support striping layout writing. This JIRA depends upon HDFS-7793 and tries to change DataStreamer/DFSOutputStream/DFSPacket for convenience of subclassing. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7954) TestBalancer#testBalancerWithPinnedBlocks failed on Windows
[ https://issues.apache.org/jira/browse/HDFS-7954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388335#comment-14388335 ] Hadoop QA commented on HDFS-7954: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708345/HDFS-7947.00.patch against trunk revision 85dc3c1. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.server.datanode.TestDataNodeMultipleRegistrations Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10125//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10125//console This message is automatically generated. TestBalancer#testBalancerWithPinnedBlocks failed on Windows --- Key: HDFS-7954 URL: https://issues.apache.org/jira/browse/HDFS-7954 Project: Hadoop HDFS Issue Type: Sub-task Components: test Reporter: Xiaoyu Yao Assignee: Xiaoyu Yao Attachments: HDFS-7947.00.patch {code} testBalancerWithPinnedBlocks(org.apache.hadoop.hdfs.server.balancer.TestBalancer) Time elapsed: 22.624 sec FAILURE! 
java.lang.AssertionError: expected:-3 but was:0 at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.hadoop.hdfs.server.balancer.TestBalancer.testBalancerWithPinnedBlocks(TestBalancer.java:353) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7937) Erasure Coding: INodeFile quota computation unit tests
[ https://issues.apache.org/jira/browse/HDFS-7937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388482#comment-14388482 ] Rakesh R commented on HDFS-7937: Thanks [~kaisasak], that's a good number of unit test cases. I have a few minor comments; please take a look. # In TestINodeFile#testBlockStripedTotalBlockCount, do we need the below logic in this test case? {code} +INodeFile inf = createINodeFile(HdfsConstants.EC_STORAGE_POLICY_ID); +inf.addStripedBlocksFeature(); {code} # Could you please reverse the {{actual}} and {{expected}} arguments in the assertions? I see this kind of usage in many places; please fix all such cases. For example, case-1) {code} assertEquals(inf.getBlocks().length, 1); can be written as : assertEquals(1, inf.getBlocks().length); {code} Case-2) {code} assertEquals(blockInfoStriped.getTotalBlockNum(), 9); can be written as : assertEquals(9, blockInfoStriped.getTotalBlockNum()); {code} Erasure Coding: INodeFile quota computation unit tests -- Key: HDFS-7937 URL: https://issues.apache.org/jira/browse/HDFS-7937 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Kai Sasaki Assignee: Kai Sasaki Priority: Minor Attachments: HDFS-7937.1.patch, HDFS-7937.2.patch Unit test for [HDFS-7826|https://issues.apache.org/jira/browse/HDFS-7826] -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7937) Erasure Coding: INodeFile quota computation unit tests
[ https://issues.apache.org/jira/browse/HDFS-7937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rakesh R updated HDFS-7937: --- Status: Open (was: Patch Available) Erasure Coding: INodeFile quota computation unit tests -- Key: HDFS-7937 URL: https://issues.apache.org/jira/browse/HDFS-7937 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Kai Sasaki Assignee: Kai Sasaki Priority: Minor Attachments: HDFS-7937.1.patch, HDFS-7937.2.patch Unit test for [HDFS-7826|https://issues.apache.org/jira/browse/HDFS-7826] -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-8002) Website refers to /trash directory
[ https://issues.apache.org/jira/browse/HDFS-8002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388414#comment-14388414 ] Hudson commented on HDFS-8002: -- FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #149 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/149/]) HDFS-8002. Website refers to /trash directory. Contributed by Brahma Reddy Battula. (aajisaka: rev e7ea2a8e8f0a7b428ef10552885757b99b59e4dc) * hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt Website refers to /trash directory -- Key: HDFS-8002 URL: https://issues.apache.org/jira/browse/HDFS-8002 Project: Hadoop HDFS Issue Type: Bug Components: documentation Reporter: Mike Drob Assignee: Brahma Reddy Battula Fix For: 2.8.0 Attachments: HDFS-8002.patch, HDFS-8003-002.patch On http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HdfsDesign.html#File_Deletes_and_Undeletes the section on trash refers to files residing in {{/trash}}. I think this is an error, as files actually go to user-specific trash directories like {{/user/hdfs/.Trash}}. Either the site needs to be updated to mention user-specific directories, or if this is a change from previous behaviour then maybe that can be mentioned instead. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-3918) EditLogTailer shouldn't log WARN when other node is in standby mode
[ https://issues.apache.org/jira/browse/HDFS-3918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388415#comment-14388415 ] Hudson commented on HDFS-3918: -- FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #149 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/149/]) HDFS-3918. EditLogTailer shouldn't log WARN when other node is in standby mode. Contributed by Todd Lipcon. (harsh: rev cce66ba3c9ec293e8ba1afd0eb518c7ca0bbc7c9) * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/ha/EditLogTailer.java EditLogTailer shouldn't log WARN when other node is in standby mode --- Key: HDFS-3918 URL: https://issues.apache.org/jira/browse/HDFS-3918 Project: Hadoop HDFS Issue Type: Improvement Components: ha Affects Versions: 2.0.3-alpha Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: 2.8.0 Attachments: hdfs-3918.txt If both nodes are in standby mode, each will be trying to roll the others' logs, which results in errors like: Unable to trigger a roll of the active NN org.apache.hadoop.ipc.StandbyException: Operation category JOURNAL is not supported in state standby We should catch this specific exception and not log it at WARN level, since it's expected behavior. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7261) storageMap is accessed without synchronization in DatanodeDescriptor#updateHeartbeatState()
[ https://issues.apache.org/jira/browse/HDFS-7261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388413#comment-14388413 ] Hudson commented on HDFS-7261: -- FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #149 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/149/]) HDFS-7261. storageMap is accessed without synchronization in DatanodeDescriptor#updateHeartbeatState() (Brahma Reddy Battula via Colin P. McCabe) (cmccabe: rev 1feb9569f366a29ecb43592d71ee21023162c18f) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/DatanodeDescriptor.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt storageMap is accessed without synchronization in DatanodeDescriptor#updateHeartbeatState() --- Key: HDFS-7261 URL: https://issues.apache.org/jira/browse/HDFS-7261 Project: Hadoop HDFS Issue Type: Bug Reporter: Ted Yu Assignee: Brahma Reddy Battula Fix For: 2.8.0 Attachments: HDFS-7261-001.patch, HDFS-7261-002.patch, HDFS-7261.patch Here is the code: {code} failedStorageInfos = new HashSet<DatanodeStorageInfo>( storageMap.values()); {code} In other places, the lock on DatanodeDescriptor.storageMap is held: {code} synchronized (storageMap) { final Collection<DatanodeStorageInfo> storages = storageMap.values(); return storages.toArray(new DatanodeStorageInfo[storages.size()]); } {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
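The fix follows the pattern already used elsewhere in DatanodeDescriptor: take the snapshot of the map's values while holding the map's monitor, then work on the snapshot outside the lock. A minimal self-contained sketch of that pattern (the class and field names below are illustrative, not the actual HDFS code):

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

public class StorageMapDemo {
    private final Map<String, String> storageMap = new HashMap<>();

    // Writers mutate the map under its own monitor.
    public void addStorage(String id, String state) {
        synchronized (storageMap) {
            storageMap.put(id, state);
        }
    }

    // Readers copy the values under the same monitor, so the copy can
    // never observe a HashMap mid-rehash; iteration then happens on the
    // snapshot, outside the lock.
    public Set<String> snapshotValues() {
        synchronized (storageMap) {
            return new HashSet<>(storageMap.values());
        }
    }
}
```

The unsynchronized `new HashSet<>(storageMap.values())` in updateHeartbeatState() was the one copy taken without the monitor, which is exactly what the patch brings in line with the pattern above.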
[jira] [Commented] (HDFS-7944) Minor cleanup of BlockPoolManager#getAllNamenodeThreads
[ https://issues.apache.org/jira/browse/HDFS-7944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388422#comment-14388422 ] Hudson commented on HDFS-7944: -- FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #149 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/149/]) HDFS-7944. Minor cleanup of BlockPoolManager#getAllNamenodeThreads. (Arpit Agarwal) (arp: rev 85dc3c14b2ca4b01a93361bb925c39a22a6fd8db) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestTriggerBlockReport.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDatanodeProtocolRetryPolicy.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDataNodeExit.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestBlockScanner.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestBlockRecovery.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BlockPoolManager.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestIncrementalBlockReports.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDeleteBlockPool.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestRefreshNamenodes.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDataNodeMultipleRegistrations.java Minor cleanup of BlockPoolManager#getAllNamenodeThreads --- Key: HDFS-7944 URL: https://issues.apache.org/jira/browse/HDFS-7944 Project: Hadoop HDFS Issue Type: Improvement Affects Versions: 2.6.0 Reporter: Arpit Agarwal Assignee: Arpit Agarwal 
Priority: Minor Fix For: 2.8.0 Attachments: HDFS-7944.01.patch, HDFS-7944.02.patch {{BlockPoolManager#getAllNamenodeThreads}} can avoid unnecessary list to array conversion and vice versa by returning an unmodifiable list. Since NN addition/removal is relatively rare we can just use a {{CopyOnWriteArrayList}} for concurrency. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
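The cleanup described above (a {{CopyOnWriteArrayList}} plus an unmodifiable view, instead of list-to-array conversions) can be sketched as follows; the class and method names are illustrative, not the actual BlockPoolManager code:

```java
import java.util.Collections;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

public class NamenodeThreads {
    // NN addition/removal is rare, so copy-on-write is cheap enough,
    // and readers never need a lock: each iterator works on an
    // immutable snapshot of the list.
    private final List<String> nnThreads = new CopyOnWriteArrayList<>();

    public void addNamenode(String name) { nnThreads.add(name); }
    public void removeNamenode(String name) { nnThreads.remove(name); }

    // Callers get a read-only view; no array round trip needed, and
    // any attempt to mutate the returned list fails fast.
    public List<String> getAllNamenodeThreads() {
        return Collections.unmodifiableList(nnThreads);
    }
}
```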
[jira] [Created] (HDFS-8028) TestNNHandlesBlockReportPerStorage/TestNNHandlesCombinedBlockReport Failed after patched HDFS-7704
hongyu bi created HDFS-8028: --- Summary: TestNNHandlesBlockReportPerStorage/TestNNHandlesCombinedBlockReport Failed after patched HDFS-7704 Key: HDFS-8028 URL: https://issues.apache.org/jira/browse/HDFS-8028 Project: Hadoop HDFS Issue Type: Bug Components: test Affects Versions: 2.7.0 Reporter: hongyu bi Assignee: hongyu bi Priority: Minor HDFS-7704 makes bad block reporting asynchronous; however, BlockReportTestBase#blockreport_02 doesn't wait after the block report. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8028) TestNNHandlesBlockReportPerStorage/TestNNHandlesCombinedBlockReport Failed after patched HDFS-7704
[ https://issues.apache.org/jira/browse/HDFS-8028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] hongyu bi updated HDFS-8028: Attachment: HDFS-8028-v0.patch TestNNHandlesBlockReportPerStorage/TestNNHandlesCombinedBlockReport Failed after patched HDFS-7704 -- Key: HDFS-8028 URL: https://issues.apache.org/jira/browse/HDFS-8028 Project: Hadoop HDFS Issue Type: Bug Components: test Affects Versions: 2.7.0 Reporter: hongyu bi Assignee: hongyu bi Priority: Minor Attachments: HDFS-8028-v0.patch HDFS-7704 makes bad block reporting asynchronous; however, BlockReportTestBase#blockreport_02 doesn't wait after the block report. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7645) Rolling upgrade is restoring blocks from trash multiple times
[ https://issues.apache.org/jira/browse/HDFS-7645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388416#comment-14388416 ] Hudson commented on HDFS-7645: -- FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #149 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/149/]) HDFS-7645. Rolling upgrade is restoring blocks from trash multiple times (Contributed by Vinayakumar B and Keisuke Ogiwara) (arp: rev 1a495fbb489c9e9a23b341a52696d10e9e272b04) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BlockPoolSliceStorage.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/PBHelper.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/SimulatedFSDataset.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/RollingUpgradeStatus.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDataNodeRollingUpgrade.java * hadoop-hdfs-project/hadoop-hdfs/src/main/webapps/hdfs/dfshealth.html * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/extdataset/ExternalDatasetImpl.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/FsDatasetSpi.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BPOfferService.java * hadoop-hdfs-project/hadoop-hdfs/src/main/proto/hdfs.proto * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BPServiceActor.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/RollingUpgradeInfo.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataStorage.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSNamesystem.java * 
hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/impl/FsDatasetImpl.java Rolling upgrade is restoring blocks from trash multiple times - Key: HDFS-7645 URL: https://issues.apache.org/jira/browse/HDFS-7645 Project: Hadoop HDFS Issue Type: Improvement Components: datanode Affects Versions: 2.6.0 Reporter: Nathan Roberts Assignee: Keisuke Ogiwara Fix For: 2.8.0 Attachments: HDFS-7645.01.patch, HDFS-7645.02.patch, HDFS-7645.03.patch, HDFS-7645.04.patch, HDFS-7645.05.patch, HDFS-7645.06.patch, HDFS-7645.07.patch When performing an HDFS rolling upgrade, the trash directory is getting restored twice when under normal circumstances it shouldn't need to be restored at all. iiuc, the only time these blocks should be restored is if we need to rollback a rolling upgrade. On a busy cluster, this can cause significant and unnecessary block churn both on the datanodes, and more importantly in the namenode. The two times this happens are: 1) restart of DN onto new software {code} private void doTransition(DataNode datanode, StorageDirectory sd, NamespaceInfo nsInfo, StartupOption startOpt) throws IOException { if (startOpt == StartupOption.ROLLBACK && sd.getPreviousDir().exists()) { Preconditions.checkState(!getTrashRootDir(sd).exists(), sd.getPreviousDir() + " and " + getTrashRootDir(sd) + " should not both be present."); doRollback(sd, nsInfo); // rollback if applicable } else { // Restore all the files in the trash. The restored files are retained // during rolling upgrade rollback. They are deleted during rolling // upgrade downgrade. int restored = restoreBlockFilesFromTrash(getTrashRootDir(sd)); LOG.info("Restored " + restored + " block files from trash."); } {code} 2) When heartbeat response no longer indicates a rollingupgrade is in progress {code} /** * Signal the current rolling upgrade status as indicated by the NN. 
* @param inProgress true if a rolling upgrade is in progress */ void signalRollingUpgrade(boolean inProgress) throws IOException { String bpid = getBlockPoolId(); if (inProgress) { dn.getFSDataset().enableTrash(bpid); dn.getFSDataset().setRollingUpgradeMarker(bpid); } else { dn.getFSDataset().restoreTrash(bpid); dn.getFSDataset().clearRollingUpgradeMarker(bpid); } } {code} HDFS-6800 and HDFS-6981 modified this behavior, making it not completely clear whether this is somehow intentional. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7742) favoring decommissioning node for replication can cause a block to stay underreplicated for long periods
[ https://issues.apache.org/jira/browse/HDFS-7742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388412#comment-14388412 ] Hudson commented on HDFS-7742: -- FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #149 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/149/]) HDFS-7742. Favoring decommissioning node for replication can cause a block to stay (kihwal: rev 04ee18ed48ceef34598f954ff40940abc9fde1d2) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/blockmanagement/TestBlockManager.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockManager.java favoring decommissioning node for replication can cause a block to stay underreplicated for long periods Key: HDFS-7742 URL: https://issues.apache.org/jira/browse/HDFS-7742 Project: Hadoop HDFS Issue Type: Bug Components: namenode Affects Versions: 2.6.0 Reporter: Nathan Roberts Assignee: Nathan Roberts Fix For: 2.7.0 Attachments: HDFS-7742-v0.patch When choosing a source node to replicate a block from, a decommissioning node is favored. The reason for the favoritism is that decommissioning nodes aren't servicing any writes so in-theory they are less loaded. However, the same selection algorithm also tries to make sure it doesn't get stuck on any particular node: {noformat} // switch to a different node randomly // this to prevent from deterministically selecting the same node even // if the node failed to replicate the block on previous iterations {noformat} Unfortunately, the decommissioning check is prior to this randomness so the algorithm can get stuck trying to replicate from a decommissioning node. We've seen this in practice where a decommissioning datanode was failing to replicate a block for many days, when other viable replicas of the block were available. 
Given that we limit the number of streams we'll assign to a given node (default soft limit of 2, hard limit of 4), it doesn't seem like favoring a decommissioning node has significant benefit. i.e. when there is significant replication work to do, we'll quickly hit the stream limit of the decommissioning nodes and use other nodes in the cluster anyway; when there isn't significant replication work then in theory we've got plenty of replication bandwidth available so choosing a decommissioning node isn't much of a win. I see two choices: 1) Change the algorithm to still favor decommissioning nodes but with some level of randomness that will avoid always selecting the decommissioning node 2) Remove the favoritism for decommissioning nodes I prefer #2. It simplifies the algorithm, and given the other throttles we have in place, I'm not sure there is a significant benefit to selecting decommissioning nodes. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
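Choice #2 above (drop the favoritism and pick uniformly at random among replicas still under the stream limit) could look roughly like the sketch below. The class, method, and limit names are made up for illustration; this is not the actual BlockManager code:

```java
import java.util.Random;

public class ReplicationSourceChooser {
    static final int MAX_STREAMS = 2; // illustrative soft limit

    // Pick a random source index among replicas below the stream limit,
    // with no preference for decommissioning nodes. The randomness is
    // what prevents deterministically re-picking a node that keeps
    // failing to replicate the block.
    public static int chooseSource(int[] activeStreams, Random rand) {
        int chosen = -1;
        int eligible = 0;
        for (int i = 0; i < activeStreams.length; i++) {
            if (activeStreams[i] >= MAX_STREAMS) {
                continue; // saturated: skip
            }
            eligible++;
            // Reservoir sampling: keep index i with probability 1/eligible,
            // yielding a uniform choice over all eligible replicas.
            if (rand.nextInt(eligible) == 0) {
                chosen = i;
            }
        }
        return chosen; // -1 if every replica is saturated
    }
}
```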
[jira] [Commented] (HDFS-7748) Separate ECN flags from the Status in the DataTransferPipelineAck
[ https://issues.apache.org/jira/browse/HDFS-7748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388418#comment-14388418 ] Hudson commented on HDFS-7748: -- FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #149 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/149/]) HDFS-7748. Separate ECN flags from the Status in the DataTransferPipelineAck. Contributed by Anu Engineer and Haohui Mai. (wheat9: rev b80457158daf0dc712fbe5695625cc17d70d4bb4) * hadoop-hdfs-project/hadoop-hdfs/src/main/proto/datatransfer.proto * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DataStreamer.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BlockReceiver.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDataTransferProtocol.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/datatransfer/PipelineAck.java Addendum for HDFS-7748. (wheat9: rev 0967b1d99d7001cd1d09ebd29b9360f1079410e8) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDataTransferProtocol.java Separate ECN flags from the Status in the DataTransferPipelineAck - Key: HDFS-7748 URL: https://issues.apache.org/jira/browse/HDFS-7748 Project: Hadoop HDFS Issue Type: Bug Reporter: Haohui Mai Assignee: Anu Engineer Priority: Blocker Attachments: HDFS-7748.007-addendum.patch, HDFS-7748.007.patch, hdfs-7748.001.patch, hdfs-7748.002.patch, hdfs-7748.003.patch, hdfs-7748.004.patch, hdfs-7748.005.patch, hdfs-7748.006.patch, hdfs-7748.branch-2.7.006.patch Prior to the discussions on HDFS-7270, the old clients might fail to talk to the newer server when ECN is turned on. This jira proposes to separate the ECN flags in a separate protobuf field to make the ack compatible on both versions. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
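The compatibility hazard motivating the separate protobuf field can be illustrated with a toy encoding (this is NOT the real PipelineAck wire format; the constants and bit position are invented): if a new server ORs an ECN bit into the same integer an old client decodes as a bare Status value, the old client sees an unknown value, whereas a separate field leaves the status untouched.

```java
public class AckCompatDemo {
    static final int STATUS_SUCCESS = 0;
    static final int ECN_SUPPORTED = 1 << 10; // hypothetical flag bit

    // "New" server packing the ECN flag into the status field itself.
    static int encodeCombined(int status, boolean ecn) {
        return ecn ? (status | ECN_SUPPORTED) : status;
    }

    // An "old" client that only understands bare status values: any
    // unexpected bit makes the whole ack unreadable to it.
    static boolean oldClientUnderstands(int wireValue) {
        return wireValue == STATUS_SUCCESS;
    }
}
```

Carrying the ECN bits in their own protobuf field means old clients simply skip the unknown field and still read a valid status, which is the approach this jira took.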
[jira] [Updated] (HDFS-8028) TestNNHandlesBlockReportPerStorage/TestNNHandlesCombinedBlockReport Failed after patched HDFS-7704
[ https://issues.apache.org/jira/browse/HDFS-8028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] hongyu bi updated HDFS-8028: Attachment: (was: HDFS-8028-v0.patch) TestNNHandlesBlockReportPerStorage/TestNNHandlesCombinedBlockReport Failed after patched HDFS-7704 -- Key: HDFS-8028 URL: https://issues.apache.org/jira/browse/HDFS-8028 Project: Hadoop HDFS Issue Type: Bug Components: test Affects Versions: 2.7.0 Reporter: hongyu bi Assignee: hongyu bi Priority: Minor Attachments: HDFS-8028-v0.patch HDFS-7704 made bad-block reporting asynchronous; however, BlockReportTestBase#blockreport_02 doesn't wait long enough after the block report. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7261) storageMap is accessed without synchronization in DatanodeDescriptor#updateHeartbeatState()
[ https://issues.apache.org/jira/browse/HDFS-7261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388433#comment-14388433 ] Hudson commented on HDFS-7261: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #883 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/883/]) HDFS-7261. storageMap is accessed without synchronization in DatanodeDescriptor#updateHeartbeatState() (Brahma Reddy Battula via Colin P. McCabe) (cmccabe: rev 1feb9569f366a29ecb43592d71ee21023162c18f) * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/DatanodeDescriptor.java storageMap is accessed without synchronization in DatanodeDescriptor#updateHeartbeatState() --- Key: HDFS-7261 URL: https://issues.apache.org/jira/browse/HDFS-7261 Project: Hadoop HDFS Issue Type: Bug Reporter: Ted Yu Assignee: Brahma Reddy Battula Fix For: 2.8.0 Attachments: HDFS-7261-001.patch, HDFS-7261-002.patch, HDFS-7261.patch Here is the code: {code} failedStorageInfos = new HashSet<DatanodeStorageInfo>( storageMap.values()); {code} In other places, the lock on DatanodeDescriptor.storageMap is held: {code} synchronized (storageMap) { final Collection<DatanodeStorageInfo> storages = storageMap.values(); return storages.toArray(new DatanodeStorageInfo[storages.size()]); } {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
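The race and the fix pattern described above can be sketched as follows (a minimal illustration, not the actual DatanodeDescriptor code): the copy of storageMap.values() must be taken while holding the storageMap lock, matching the other call sites quoted in the issue.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hedged sketch with an illustrative value type; the real map holds
// DatanodeStorageInfo keyed by storage ID.
public class StorageMapCopy {
    // Guarded by synchronized (storageMap), as in DatanodeDescriptor.
    private final Map<String, String> storageMap = new HashMap<>();

    void addStorage(String id, String info) {
        synchronized (storageMap) {
            storageMap.put(id, info);
        }
    }

    // The fix: take the snapshot while holding the lock, so the HashSet
    // constructor never iterates the values concurrently with a writer
    // (which could otherwise throw ConcurrentModificationException).
    Set<String> failedStorageSnapshot() {
        synchronized (storageMap) {
            return new HashSet<>(storageMap.values());
        }
    }
}
```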
[jira] [Commented] (HDFS-7742) favoring decommissioning node for replication can cause a block to stay underreplicated for long periods
[ https://issues.apache.org/jira/browse/HDFS-7742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388432#comment-14388432 ] Hudson commented on HDFS-7742: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #883 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/883/]) HDFS-7742. Favoring decommissioning node for replication can cause a block to stay (kihwal: rev 04ee18ed48ceef34598f954ff40940abc9fde1d2) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/blockmanagement/TestBlockManager.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockManager.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt favoring decommissioning node for replication can cause a block to stay underreplicated for long periods Key: HDFS-7742 URL: https://issues.apache.org/jira/browse/HDFS-7742 Project: Hadoop HDFS Issue Type: Bug Components: namenode Affects Versions: 2.6.0 Reporter: Nathan Roberts Assignee: Nathan Roberts Fix For: 2.7.0 Attachments: HDFS-7742-v0.patch When choosing a source node to replicate a block from, a decommissioning node is favored. The reason for the favoritism is that decommissioning nodes aren't servicing any writes so in-theory they are less loaded. However, the same selection algorithm also tries to make sure it doesn't get stuck on any particular node: {noformat} // switch to a different node randomly // this to prevent from deterministically selecting the same node even // if the node failed to replicate the block on previous iterations {noformat} Unfortunately, the decommissioning check is prior to this randomness so the algorithm can get stuck trying to replicate from a decommissioning node. We've seen this in practice where a decommissioning datanode was failing to replicate a block for many days, when other viable replicas of the block were available. 
Given that we limit the number of streams we'll assign to a given node (default soft limit of 2, hard limit of 4), it doesn't seem like favoring a decommissioning node has significant benefit. When there is significant replication work to do, we'll quickly hit the stream limit of the decommissioning nodes and use other nodes in the cluster anyway; when there isn't, then in theory we've got plenty of replication bandwidth available, so choosing a decommissioning node isn't much of a win. I see two choices: 1) change the algorithm to still favor decommissioning nodes but with some level of randomness that avoids always selecting the same decommissioning node; 2) remove the favoritism for decommissioning nodes. I prefer #2. It simplifies the algorithm, and given the other throttles we have in place, I'm not sure there is a significant benefit to selecting decommissioning nodes. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
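Choice #1 could look roughly like this (a hypothetical sketch; SourcePicker and its method are illustrative, not the actual BlockManager source-selection code):

```java
import java.util.List;
import java.util.Random;

// Hedged sketch: keep some preference for decommissioning nodes (they
// serve no writes) but pick randomly among candidates, so a single node
// cannot be selected deterministically forever when it keeps failing.
public class SourcePicker {
    private final Random rand = new Random();

    /** Assumes at least one candidate exists across the two lists. */
    String chooseSource(List<String> decommissioning, List<String> others) {
        // Flip a coin before committing to a decommissioning node.
        if (!decommissioning.isEmpty()
                && (others.isEmpty() || rand.nextBoolean())) {
            return decommissioning.get(rand.nextInt(decommissioning.size()));
        }
        return others.get(rand.nextInt(others.size()));
    }
}
```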
[jira] [Commented] (HDFS-3918) EditLogTailer shouldn't log WARN when other node is in standby mode
[ https://issues.apache.org/jira/browse/HDFS-3918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388435#comment-14388435 ] Hudson commented on HDFS-3918: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #883 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/883/]) HDFS-3918. EditLogTailer shouldn't log WARN when other node is in standby mode. Contributed by Todd Lipcon. (harsh: rev cce66ba3c9ec293e8ba1afd0eb518c7ca0bbc7c9) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/ha/EditLogTailer.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt EditLogTailer shouldn't log WARN when other node is in standby mode --- Key: HDFS-3918 URL: https://issues.apache.org/jira/browse/HDFS-3918 Project: Hadoop HDFS Issue Type: Improvement Components: ha Affects Versions: 2.0.3-alpha Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: 2.8.0 Attachments: hdfs-3918.txt If both nodes are in standby mode, each will be trying to roll the others' logs, which results in errors like: Unable to trigger a roll of the active NN org.apache.hadoop.ipc.StandbyException: Operation category JOURNAL is not supported in state standby We should catch this specific exception and not log it at WARN level, since it's expected behavior. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
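The fix pattern described above can be sketched as follows (illustrative names, not the actual EditLogTailer code): catch the specific standby-state refusal, treat it as expected, and keep it out of the WARN log while still warning on anything else.

```java
// Hedged sketch of demoting an expected failure below WARN level.
public class RollTrigger {
    static class StandbyException extends Exception {
        StandbyException(String msg) { super(msg); }
    }

    interface NamenodeProxy {
        void rollEditLog() throws StandbyException;
    }

    // Returns the level the event would be logged at (stand-in for a
    // real logger call).
    static String triggerActiveLogRoll(NamenodeProxy nn) {
        try {
            nn.rollEditLog();
            return "INFO: rolled";
        } catch (StandbyException se) {
            // Expected when the other NN is also standby: log quietly.
            return "DEBUG: skipping roll (" + se.getMessage() + ")";
        } catch (Exception e) {
            // Anything else is still worth a WARN.
            return "WARN: unable to trigger a roll: " + e;
        }
    }
}
```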
[jira] [Commented] (HDFS-8002) Website refers to /trash directory
[ https://issues.apache.org/jira/browse/HDFS-8002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388434#comment-14388434 ] Hudson commented on HDFS-8002: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #883 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/883/]) HDFS-8002. Website refers to /trash directory. Contributed by Brahma Reddy Battula. (aajisaka: rev e7ea2a8e8f0a7b428ef10552885757b99b59e4dc) * hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt Website refers to /trash directory -- Key: HDFS-8002 URL: https://issues.apache.org/jira/browse/HDFS-8002 Project: Hadoop HDFS Issue Type: Bug Components: documentation Reporter: Mike Drob Assignee: Brahma Reddy Battula Fix For: 2.8.0 Attachments: HDFS-8002.patch, HDFS-8003-002.patch On http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HdfsDesign.html#File_Deletes_and_Undeletes the section on trash refers to files residing in {{/trash}}. I think this is an error, as files actually go to user-specific trash directories like {{/user/hdfs/.Trash}}. Either the site needs to be updated to mention user-specific directories, or if this is a change from previous behaviour then maybe that can be mentioned instead. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7645) Rolling upgrade is restoring blocks from trash multiple times
[ https://issues.apache.org/jira/browse/HDFS-7645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388436#comment-14388436 ] Hudson commented on HDFS-7645: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #883 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/883/]) HDFS-7645. Rolling upgrade is restoring blocks from trash multiple times (Contributed by Vinayakumar B and Keisuke Ogiwara) (arp: rev 1a495fbb489c9e9a23b341a52696d10e9e272b04) * hadoop-hdfs-project/hadoop-hdfs/src/main/webapps/hdfs/dfshealth.html * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/impl/FsDatasetImpl.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/RollingUpgradeInfo.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataStorage.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BlockPoolSliceStorage.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/RollingUpgradeStatus.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/extdataset/ExternalDatasetImpl.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSNamesystem.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDataNodeRollingUpgrade.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BPOfferService.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/PBHelper.java * hadoop-hdfs-project/hadoop-hdfs/src/main/proto/hdfs.proto * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BPServiceActor.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/SimulatedFSDataset.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * 
hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/FsDatasetSpi.java Rolling upgrade is restoring blocks from trash multiple times - Key: HDFS-7645 URL: https://issues.apache.org/jira/browse/HDFS-7645 Project: Hadoop HDFS Issue Type: Improvement Components: datanode Affects Versions: 2.6.0 Reporter: Nathan Roberts Assignee: Keisuke Ogiwara Fix For: 2.8.0 Attachments: HDFS-7645.01.patch, HDFS-7645.02.patch, HDFS-7645.03.patch, HDFS-7645.04.patch, HDFS-7645.05.patch, HDFS-7645.06.patch, HDFS-7645.07.patch When performing an HDFS rolling upgrade, the trash directory is getting restored twice when under normal circumstances it shouldn't need to be restored at all. iiuc, the only time these blocks should be restored is if we need to rollback a rolling upgrade. On a busy cluster, this can cause significant and unnecessary block churn both on the datanodes, and more importantly in the namenode. The two times this happens are: 1) restart of DN onto new software {code} private void doTransition(DataNode datanode, StorageDirectory sd, NamespaceInfo nsInfo, StartupOption startOpt) throws IOException { if (startOpt == StartupOption.ROLLBACK && sd.getPreviousDir().exists()) { Preconditions.checkState(!getTrashRootDir(sd).exists(), sd.getPreviousDir() + " and " + getTrashRootDir(sd) + " should not both be present."); doRollback(sd, nsInfo); // rollback if applicable } else { // Restore all the files in the trash. The restored files are retained // during rolling upgrade rollback. They are deleted during rolling // upgrade downgrade. int restored = restoreBlockFilesFromTrash(getTrashRootDir(sd)); LOG.info("Restored " + restored + " block files from trash."); } {code} 2) When heartbeat response no longer indicates a rollingupgrade is in progress {code} /** * Signal the current rolling upgrade status as indicated by the NN. 
* @param inProgress true if a rolling upgrade is in progress */ void signalRollingUpgrade(boolean inProgress) throws IOException { String bpid = getBlockPoolId(); if (inProgress) { dn.getFSDataset().enableTrash(bpid); dn.getFSDataset().setRollingUpgradeMarker(bpid); } else { dn.getFSDataset().restoreTrash(bpid); dn.getFSDataset().clearRollingUpgradeMarker(bpid); } } {code} HDFS-6800 and HDFS-6981 modified this behavior, making it not completely clear whether this is somehow intentional. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
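One way to make the duplicate restore harmless is a one-shot guard around the restore operation; the following is an illustrative sketch only, not the committed HDFS-7645 change:

```java
// Hedged sketch: guard trash restoration with a flag so that the two call
// sites quoted above (DN restart and the heartbeat signal) cannot restore
// the same block files twice.
public class TrashGuard {
    private boolean trashRestored = false;

    // Returns true only for the call that actually performs the restore;
    // any later invocation becomes a no-op.
    synchronized boolean restoreTrashOnce() {
        if (trashRestored) {
            return false;
        }
        trashRestored = true;
        // ... move block files out of the trash directory here ...
        return true;
    }
}
```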
[jira] [Commented] (HDFS-7944) Minor cleanup of BlockPoolManager#getAllNamenodeThreads
[ https://issues.apache.org/jira/browse/HDFS-7944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388442#comment-14388442 ] Hudson commented on HDFS-7944: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #883 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/883/]) HDFS-7944. Minor cleanup of BlockPoolManager#getAllNamenodeThreads. (Arpit Agarwal) (arp: rev 85dc3c14b2ca4b01a93361bb925c39a22a6fd8db) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDataNodeMultipleRegistrations.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestRefreshNamenodes.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDataNodeExit.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestIncrementalBlockReports.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDatanodeProtocolRetryPolicy.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestBlockRecovery.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestTriggerBlockReport.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestBlockScanner.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDeleteBlockPool.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BlockPoolManager.java Minor cleanup of BlockPoolManager#getAllNamenodeThreads --- Key: HDFS-7944 URL: https://issues.apache.org/jira/browse/HDFS-7944 Project: Hadoop HDFS Issue Type: Improvement Affects Versions: 2.6.0 Reporter: Arpit Agarwal Assignee: Arpit Agarwal Priority: Minor Fix 
For: 2.8.0 Attachments: HDFS-7944.01.patch, HDFS-7944.02.patch {{BlockPoolManager#getAllNamenodeThreads}} can avoid unnecessary list to array conversion and vice versa by returning an unmodifiable list. Since NN addition/removal is relatively rare we can just use a {{CopyOnWriteArrayList}} for concurrency. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
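The cleanup described above can be sketched as follows (illustrative element type; the real list holds the BPOfferService actors):

```java
import java.util.Collections;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

// Hedged sketch of the BlockPoolManager cleanup: NN addition/removal is
// rare, so copy-on-write is cheap and iteration is safe without locking.
public class BlockPoolManagerSketch {
    private final List<String> offerServices = new CopyOnWriteArrayList<>();

    void addActor(String actor) {
        offerServices.add(actor);
    }

    // Callers get a read-only view: no list -> array -> list round trips,
    // and callers cannot mutate the manager's internal state.
    List<String> getAllNamenodeThreads() {
        return Collections.unmodifiableList(offerServices);
    }
}
```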
[jira] [Commented] (HDFS-7748) Separate ECN flags from the Status in the DataTransferPipelineAck
[ https://issues.apache.org/jira/browse/HDFS-7748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388438#comment-14388438 ] Hudson commented on HDFS-7748: -- SUCCESS: Integrated in Hadoop-Yarn-trunk #883 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/883/]) HDFS-7748. Separate ECN flags from the Status in the DataTransferPipelineAck. Contributed by Anu Engineer and Haohui Mai. (wheat9: rev b80457158daf0dc712fbe5695625cc17d70d4bb4) * hadoop-hdfs-project/hadoop-hdfs/src/main/proto/datatransfer.proto * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BlockReceiver.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/datatransfer/PipelineAck.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DataStreamer.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDataTransferProtocol.java Addendum for HDFS-7748. (wheat9: rev 0967b1d99d7001cd1d09ebd29b9360f1079410e8) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDataTransferProtocol.java Separate ECN flags from the Status in the DataTransferPipelineAck - Key: HDFS-7748 URL: https://issues.apache.org/jira/browse/HDFS-7748 Project: Hadoop HDFS Issue Type: Bug Reporter: Haohui Mai Assignee: Anu Engineer Priority: Blocker Attachments: HDFS-7748.007-addendum.patch, HDFS-7748.007.patch, hdfs-7748.001.patch, hdfs-7748.002.patch, hdfs-7748.003.patch, hdfs-7748.004.patch, hdfs-7748.005.patch, hdfs-7748.006.patch, hdfs-7748.branch-2.7.006.patch Prior to the discussions on HDFS-7270, the old clients might fail to talk to the newer server when ECN is turned on. This jira proposes to separate the ECN flags in a separate protobuf field to make the ack compatible on both versions. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7939) Two fsimage_rollback_* files are created which are not deleted after rollback.
[ https://issues.apache.org/jira/browse/HDFS-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388445#comment-14388445 ] Hadoop QA commented on HDFS-7939: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708145/HDFS-7939.1.patch against trunk revision 85dc3c1. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.server.namenode.ha.TestRetryCacheWithHA Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10127//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10127//console This message is automatically generated. Two fsimage_rollback_* files are created which are not deleted after rollback. 
-- Key: HDFS-7939 URL: https://issues.apache.org/jira/browse/HDFS-7939 Project: Hadoop HDFS Issue Type: Bug Reporter: J.Andreina Assignee: J.Andreina Priority: Critical Attachments: HDFS-7939.1.patch During checkpoint , if any failure in uploading to the remote Namenode then restarting Namenode with rollingUpgrade started option creates 2 fsimage_rollback_* at Active Namenode . On rolling upgrade rollback , initially created fsimage_rollback_* file is not been deleted. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7939) Two fsimage_rollback_* files are created which are not deleted after rollback.
[ https://issues.apache.org/jira/browse/HDFS-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388459#comment-14388459 ] J.Andreina commented on HDFS-7939: -- Testcase failures are not related to this patch. Please review the patch. Two fsimage_rollback_* files are created which are not deleted after rollback. -- Key: HDFS-7939 URL: https://issues.apache.org/jira/browse/HDFS-7939 Project: Hadoop HDFS Issue Type: Bug Reporter: J.Andreina Assignee: J.Andreina Priority: Critical Attachments: HDFS-7939.1.patch During checkpoint, if there is any failure in uploading to the remote Namenode, then restarting the Namenode with the rollingUpgrade started option creates 2 fsimage_rollback_* files at the Active Namenode. On rolling upgrade rollback, the initially created fsimage_rollback_* file is not deleted. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-8009) Signal congestion on the DataNode
[ https://issues.apache.org/jira/browse/HDFS-8009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389847#comment-14389847 ] Hadoop QA commented on HDFS-8009: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708538/HDFS-8009.000.patch against trunk revision e428fea. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The test build failed in hadoop-hdfs-project/hadoop-hdfs Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10133//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10133//console This message is automatically generated. Signal congestion on the DataNode - Key: HDFS-8009 URL: https://issues.apache.org/jira/browse/HDFS-8009 Project: Hadoop HDFS Issue Type: New Feature Reporter: Haohui Mai Assignee: Haohui Mai Attachments: HDFS-8009.000.patch The DataNode should signal congestion (i.e. I'm too busy) in the PipelineAck using the mechanism introduced in HDFS-7270. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Assigned] (HDFS-8020) Erasure Coding: restore BlockGroup and schema info from stripping coding command
[ https://issues.apache.org/jira/browse/HDFS-8020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kai Sasaki reassigned HDFS-8020: Assignee: Kai Sasaki (was: Kai Zheng) Erasure Coding: restore BlockGroup and schema info from stripping coding command Key: HDFS-8020 URL: https://issues.apache.org/jira/browse/HDFS-8020 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Kai Zheng Assignee: Kai Sasaki As a task of HDFS-7344, to process *stripping* coding commands from NameNode or other scheduler services/tools, we need to first be able to restore BlockGroup and schema information in DataNode, which will be used to construct coding work. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-8035) Move checking replication and get client DN to BM and DM respectively
[ https://issues.apache.org/jira/browse/HDFS-8035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389944#comment-14389944 ] Hadoop QA commented on HDFS-8035: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708569/HDFS-8035.000.patch against trunk revision 2daa478. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. 
The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.TestReplication org.apache.hadoop.hdfs.TestLeaseRecovery org.apache.hadoop.hdfs.server.namenode.snapshot.TestSnapshot org.apache.hadoop.hdfs.TestPread org.apache.hadoop.hdfs.server.namenode.TestFavoredNodesEndToEnd org.apache.hadoop.hdfs.server.namenode.ha.TestHAStateTransitions org.apache.hadoop.hdfs.server.datanode.TestDataNodeVolumeFailureReporting org.apache.hadoop.hdfs.server.datanode.TestBlockHasMultipleReplicasOnSameDN org.apache.hadoop.hdfs.server.namenode.snapshot.TestSnapshotFileLength org.apache.hadoop.hdfs.TestSafeMode org.apache.hadoop.hdfs.server.namenode.TestNamenodeCapacityReport org.apache.hadoop.hdfs.server.namenode.TestNamenodeRetryCache org.apache.hadoop.hdfs.TestParallelShortCircuitRead org.apache.hadoop.hdfs.server.namenode.TestFSEditLogLoader org.apache.hadoop.hdfs.TestFSInputChecker org.apache.hadoop.hdfs.server.blockmanagement.TestUnderReplicatedBlocks org.apache.hadoop.hdfs.tools.TestDebugAdmin org.apache.hadoop.hdfs.TestSetrepIncreasing org.apache.hadoop.hdfs.server.datanode.TestDataNodeMetrics org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.TestLazyPersistFiles org.apache.hadoop.fs.TestEnhancedByteBufferAccess org.apache.hadoop.hdfs.server.balancer.TestBalancerWithNodeGroup org.apache.hadoop.hdfs.TestMultiThreadedHflush org.apache.hadoop.hdfs.TestParallelRead org.apache.hadoop.hdfs.server.namenode.snapshot.TestSetQuotaWithSnapshot org.apache.hadoop.hdfs.server.namenode.snapshot.TestNestedSnapshots org.apache.hadoop.hdfs.TestEncryptionZonesWithKMS org.apache.hadoop.hdfs.tools.TestStoragePolicyCommands org.apache.hadoop.hdfs.TestDFSRemove org.apache.hadoop.hdfs.server.namenode.snapshot.TestOpenFilesWithSnapshot org.apache.hadoop.hdfs.TestHFlush org.apache.hadoop.hdfs.server.namenode.TestHDFSConcat org.apache.hadoop.hdfs.server.namenode.metrics.TestNameNodeMetrics org.apache.hadoop.hdfs.TestSetTimes 
org.apache.hadoop.hdfs.server.namenode.TestAddBlock org.apache.hadoop.hdfs.server.datanode.TestDnRespectsBlockReportSplitThreshold org.apache.hadoop.hdfs.TestMissingBlocksAlert org.apache.hadoop.hdfs.TestParallelShortCircuitReadNoChecksum org.apache.hadoop.hdfs.TestBlocksScheduledCounter org.apache.hadoop.hdfs.TestEncryptedTransfer org.apache.hadoop.hdfs.server.namenode.TestNameEditsConfigs org.apache.hadoop.hdfs.server.mover.TestMover org.apache.hadoop.hdfs.server.namenode.snapshot.TestUpdatePipelineWithSnapshots org.apache.hadoop.hdfs.server.namenode.ha.TestHASafeMode org.apache.hadoop.fs.TestUnbuffer org.apache.hadoop.hdfs.TestParallelShortCircuitLegacyRead org.apache.hadoop.hdfs.TestQuota org.apache.hadoop.hdfs.TestDFSClientFailover
[jira] [Assigned] (HDFS-8037) WebHDFS: CheckAccess silently accepts certain malformed FsActions
[ https://issues.apache.org/jira/browse/HDFS-8037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Walter Su reassigned HDFS-8037: --- Assignee: Walter Su WebHDFS: CheckAccess silently accepts certain malformed FsActions - Key: HDFS-8037 URL: https://issues.apache.org/jira/browse/HDFS-8037 Project: Hadoop HDFS Issue Type: Bug Components: webhdfs Affects Versions: 2.6.0 Reporter: Jake Low Assignee: Walter Su Priority: Minor Labels: easyfix, newbie WebHDFS's {{CHECKACCESS}} operation accepts a parameter called {{fsaction}}, which represents the type(s) of access to check for. According to the documentation, and also the source code, the domain of {{fsaction}} is the set of strings matched by the regex {{[rwx-]{3}}}. This domain is wider than the set of valid {{FsAction}} objects, because it doesn't guarantee sensible ordering of access types. For example, the strings {{rxw}} and {{--r}} are valid {{fsaction}} parameter values, but don't correspond to valid {{FsAction}} instances. The result is that WebHDFS silently accepts {{fsaction}} parameter values which don't match any valid {{FsAction}} instance, but doesn't actually perform any permissions checking in this case. For example, here's a {{CHECKACCESS}} call where we request {{rw-}} access on a file which we only have permission to read and execute. It raises an exception, as it should. 
{code:none} curl -i -X GET "http://localhost:50070/webhdfs/v1/myfile?op=CHECKACCESS&user.name=nobody&fsaction=rw-" HTTP/1.1 403 Forbidden Content-Type: application/json { "RemoteException": { "exception": "AccessControlException", "javaClassName": "org.apache.hadoop.security.AccessControlException", "message": "Permission denied: user=nobody, access=READ_WRITE, inode=\"/myfile\":root:supergroup:drwxr-xr-x" } } {code} But if we instead request {{r-w}} access, the call appears to succeed: {code:none} curl -i -X GET "http://localhost:50070/webhdfs/v1/myfile?op=CHECKACCESS&user.name=nobody&fsaction=r-w" HTTP/1.1 200 OK Content-Length: 0 {code} As I see it, the fix would be to change the regex pattern in {{FsActionParam}} to something like {{[r-][w-][x-]}}. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
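The difference between the two patterns can be checked directly; here is a quick sketch using java.util.regex that mirrors the proposed fix:

```java
import java.util.regex.Pattern;

// The current (too-wide) fsaction domain versus the proposed per-position
// pattern. Constant names are illustrative, not FsActionParam's own.
public class FsActionParamCheck {
    static final Pattern CURRENT = Pattern.compile("[rwx-]{3}");
    static final Pattern PROPOSED = Pattern.compile("[r-][w-][x-]");
}
```

With the proposed pattern, out-of-order strings such as "r-w" are rejected while every well-formed FsAction string ("rw-", "---", "r-x", ...) still matches.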
[jira] [Commented] (HDFS-7937) Erasure Coding: INodeFile quota computation unit tests
[ https://issues.apache.org/jira/browse/HDFS-7937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389967#comment-14389967 ] Rakesh R commented on HDFS-7937: Thanks [~kaisasak]! In the latest patch the {{INodeFile#computeQuotaUsageWithStriped}} related changes are missing; could you please tell me if there is any specific reason for this? Apart from that, the latest patch looks pretty good. Also, one general observation: instead of {{SubmitPatch}}, can we do {{StartProgress}}? This would avoid triggering Jenkins and adding a Hudson QA comment to the jira. As [~zhz] mentioned, Jenkins only works on {{trunk}}. Erasure Coding: INodeFile quota computation unit tests -- Key: HDFS-7937 URL: https://issues.apache.org/jira/browse/HDFS-7937 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Kai Sasaki Assignee: Kai Sasaki Priority: Minor Attachments: HDFS-7937.1.patch, HDFS-7937.2.patch, HDFS-7937.3.patch Unit test for [HDFS-7826|https://issues.apache.org/jira/browse/HDFS-7826] -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7888) Change DataStreamer/DFSOutputStream/DFSPacket for convenience of subclassing
[ https://issues.apache.org/jira/browse/HDFS-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389983#comment-14389983 ] Li Bo commented on HDFS-7888: - That's a very good improvement of the patch. I will also update the patch of HDFS-7889 Change DataStreamer/DFSOutputStream/DFSPacket for convenience of subclassing Key: HDFS-7888 URL: https://issues.apache.org/jira/browse/HDFS-7888 Project: Hadoop HDFS Issue Type: Improvement Reporter: Li Bo Assignee: Li Bo Attachments: HDFS-7888-001.patch, HDFS-7888-trunk-001.patch, HDFS-7888-trunk-002.patch HDFS-7793 refactors class {{DFSOutputStream}} on trunk which makes {{DFSOutputStream}} a class without any inner classes. We want to subclass {{DFSOutputStream}} to support striping layout writing. This JIRA depends upon HDFS-7793 and tries to change DataStreamer/DFSOutputStream/DFSPacket for convenience of subclassing. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7889) Subclass DFSOutputStream to support writing striping layout files
[ https://issues.apache.org/jira/browse/HDFS-7889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Bo updated HDFS-7889: Attachment: HDFS-7889-006.patch Subclass DFSOutputStream to support writing striping layout files - Key: HDFS-7889 URL: https://issues.apache.org/jira/browse/HDFS-7889 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Li Bo Assignee: Li Bo Attachments: HDFS-7889-001.patch, HDFS-7889-002.patch, HDFS-7889-003.patch, HDFS-7889-004.patch, HDFS-7889-005.patch, HDFS-7889-006.patch After HDFS-7888, we can subclass {{DFSOutputStream}} to support writing striping layout files. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-8008) Support client-side back off when the datanodes are congested
[ https://issues.apache.org/jira/browse/HDFS-8008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389808#comment-14389808 ] Hadoop QA commented on HDFS-8008: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708532/HDFS-8008.000.patch against trunk revision e428fea. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:red}-1 findbugs{color}. The patch appears to introduce 1 new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.server.namenode.ha.TestRetryCacheWithHA Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10132//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/10132//artifact/patchprocess/newPatchFindbugsWarningshadoop-hdfs.html Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10132//console This message is automatically generated. Support client-side back off when the datanodes are congested - Key: HDFS-8008 URL: https://issues.apache.org/jira/browse/HDFS-8008 Project: Hadoop HDFS Issue Type: New Feature Reporter: Haohui Mai Assignee: Haohui Mai Attachments: HDFS-8008.000.patch HDFS-7270 introduces the mechanism for DataNode to signal congestions. DFSClient should be able to recognize the signals and back off. 
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-8033) Erasure coding: stateful (non-positional) read from files in striped layout
[ https://issues.apache.org/jira/browse/HDFS-8033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389891#comment-14389891 ] GAO Rui commented on HDFS-8033: --- [~zhz] bq. stateful (non-positional) read means reading the whole file without any position requirement? Erasure coding: stateful (non-positional) read from files in striped layout --- Key: HDFS-8033 URL: https://issues.apache.org/jira/browse/HDFS-8033 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Zhe Zhang Assignee: Zhe Zhang -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-8034) Fix TestDFSClientRetries#testDFSClientConfigurationLocateFollowingBlockInitialDelay for Windows
[ https://issues.apache.org/jira/browse/HDFS-8034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389950#comment-14389950 ] Hadoop QA commented on HDFS-8034: - {color:green}+1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708565/HDFS-8034.00.patch against trunk revision 18a91fe. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 core tests{color}. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10134//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10134//console This message is automatically generated. Fix TestDFSClientRetries#testDFSClientConfigurationLocateFollowingBlockInitialDelay for Windows Key: HDFS-8034 URL: https://issues.apache.org/jira/browse/HDFS-8034 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Xiaoyu Yao Assignee: Xiaoyu Yao Attachments: HDFS-8034.00.patch TestDFSClientRetries#testDFSClientConfigurationLocateFollowingBlockInitialDelay failed subsequent tests on Windows because this test case fails to shutdown the MiniDFS cluster. I will post a patch for it shortly. {code} testRetryOnChecksumFailure(org.apache.hadoop.hdfs.TestDFSClientRetries) Time elapsed: 0.012 sec ERROR! 
java.io.IOException: Could not fully delete D:\w\hbk\hadoop-hdfs-project\hadoop-hdfs\target\test\data\dfs\name1 at org.apache.hadoop.hdfs.MiniDFSCluster.createNameNodesAndSetConf(MiniDFSCluster.java:943) at org.apache.hadoop.hdfs.MiniDFSCluster.initMiniDFSCluster(MiniDFSCluster.java:814) at org.apache.hadoop.hdfs.MiniDFSCluster.init(MiniDFSCluster.java:473) at org.apache.hadoop.hdfs.MiniDFSCluster$Builder.build(MiniDFSCluster.java:432) at org.apache.hadoop.hdfs.TestDFSClientRetries.testRetryOnChecksumFailure(TestDFSClientRetries.java:1091) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8036) Use snapshot path as source when using snapshot diff report in DistCp
[ https://issues.apache.org/jira/browse/HDFS-8036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jing Zhao updated HDFS-8036: Attachment: HDFS-8036.000.patch Initial patch to fix. Use snapshot path as source when using snapshot diff report in DistCp - Key: HDFS-8036 URL: https://issues.apache.org/jira/browse/HDFS-8036 Project: Hadoop HDFS Issue Type: Bug Components: distcp Affects Versions: 2.7.0 Reporter: Jing Zhao Assignee: Jing Zhao Attachments: HDFS-8036.000.patch When using the snapshot diff report for distcp (HDFS-7535), the semantics should be to apply the diff to the target in order to sync the target with source@snapshot2. Therefore, after syncing based on the snapshot diff report, we should append the name of snapshot2 to the original source path and use it as the new source name. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8037) WebHDFS: CheckAccess silently accepts certain malformed FsActions
[ https://issues.apache.org/jira/browse/HDFS-8037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jake Low updated HDFS-8037: --- Description: WebHDFS's {{CHECKACCESS}} operation accepts a parameter called {{fsaction}}, which represents the type(s) of access to check for. According to the documentation, and also the source code, the domain of {{fsaction}} is the set of strings matched by the regex {{\[rwx-\]{3\}}}. This domain is wider than the set of valid {{FsAction}} objects, because it doesn't guarantee a sensible ordering of access types. For example, the strings {{rxw}} and {{--r}} are valid {{fsaction}} parameter values, but don't correspond to valid {{FsAction}} instances. The result is that WebHDFS silently accepts {{fsaction}} parameter values which don't match any valid {{FsAction}} instance, but doesn't actually perform any permissions checking in this case. For example, here's a {{CHECKACCESS}} call where we request {{rw-}} access on a file which we only have permission to read and execute. It raises an exception, as it should. {code:none} curl -i -X GET 'http://localhost:50070/webhdfs/v1/myfile?op=CHECKACCESS&user.name=nobody&fsaction=rw-' HTTP/1.1 403 Forbidden Content-Type: application/json { "RemoteException": { "exception": "AccessControlException", "javaClassName": "org.apache.hadoop.security.AccessControlException", "message": "Permission denied: user=nobody, access=READ_WRITE, inode=\"/myfile\":root:supergroup:drwxr-xr-x" } } {code} But if we instead request {{r-w}} access, the call appears to succeed: {code:none} curl -i -X GET 'http://localhost:50070/webhdfs/v1/myfile?op=CHECKACCESS&user.name=nobody&fsaction=r-w' HTTP/1.1 200 OK Content-Length: 0 {code} As I see it, the fix would be to change the regex pattern in {{FsActionParam}} to something like {{\[r-\]\[w-\]\[x-\]}}. WebHDFS: CheckAccess silently accepts certain malformed FsActions - Key: HDFS-8037 URL: https://issues.apache.org/jira/browse/HDFS-8037 Project: Hadoop HDFS Issue Type: Bug Components: webhdfs Affects Versions: 2.6.0 Reporter: Jake Low Assignee: Walter Su Priority: Minor Labels: easyfix, newbie
[jira] [Commented] (HDFS-7937) Erasure Coding: INodeFile quota computation unit tests
[ https://issues.apache.org/jira/browse/HDFS-7937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389867#comment-14389867 ] Hadoop QA commented on HDFS-7937: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708581/HDFS-7937.3.patch against trunk revision 2daa478. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:red}-1 javac{color}. The patch appears to cause the build to fail. Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10137//console This message is automatically generated. Erasure Coding: INodeFile quota computation unit tests -- Key: HDFS-7937 URL: https://issues.apache.org/jira/browse/HDFS-7937 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Kai Sasaki Assignee: Kai Sasaki Priority: Minor Attachments: HDFS-7937.1.patch, HDFS-7937.2.patch, HDFS-7937.3.patch Unit test for [HDFS-7826|https://issues.apache.org/jira/browse/HDFS-7826] -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-6666) Abort NameNode and DataNode startup if security is enabled but block access token is not enabled.
[ https://issues.apache.org/jira/browse/HDFS-6666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389948#comment-14389948 ] Arpit Agarwal commented on HDFS-6666: - Hi [~vijaysbhat], thank you for volunteering to help with this issue and adding a test case. You will need to enable the Maven startKdc profile for running secure NN tests. The secure NN tests use ApacheDS but unfortunately the download URL is broken. Looks like we'll need to fix the download URL to get startKdc working. Do you want to give it a shot too? {code} $ mvn -q test -PtestKerberos,startKdc -Dtest=TestSecureNameNode [exec] Result: 1 [ERROR] Failed to execute goal org.apache.maven.plugins:maven-antrun-plugin:1.7:run (kdc) on project hadoop-common: An Ant BuildException has occured: Can't get http://newverhost.com/pub//directory/apacheds/unstable/1.5/1.5.7/apacheds-1.5.7.tar.gz to /Users/aagarwal/src/hdp/hadoop-common-project/hadoop-common/target/test-classes/kdc/downloads/apacheds-1.5.7.tar.gz [ERROR] around Ant part ...get dest=/Users/aagarwal/src/hdp/hadoop-common-project/hadoop-common/target/test-classes/kdc/downloads skipexisting=true verbose=true src=http://newverhost.com/pub//directory/apacheds/unstable/1.5/1.5.7/apacheds-1.5.7.tar.gz/.. {code} Abort NameNode and DataNode startup if security is enabled but block access token is not enabled. - Key: HDFS-6666 URL: https://issues.apache.org/jira/browse/HDFS-6666 Project: Hadoop HDFS Issue Type: Bug Components: datanode, namenode, security Affects Versions: 3.0.0, 2.5.0 Reporter: Chris Nauroth Assignee: Vijay Bhat Priority: Minor Currently, if security is enabled by setting hadoop.security.authentication to kerberos, but HDFS block access tokens are disabled by setting dfs.block.access.token.enable to false (which is the default), then the NameNode logs an error and proceeds, and the DataNode proceeds without even logging an error.
This jira proposes that it's invalid to turn on security but not turn on block access tokens, and that it would be better to fail fast and abort the daemons during startup if this happens. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
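The proposed fail-fast behaviour can be sketched as a standalone check. This is a hypothetical illustration, not the actual NameNode/DataNode startup code; only the configuration key names (hadoop.security.authentication, dfs.block.access.token.enable) come from the issue description.

```java
public class StartupCheckDemo {
    // Hypothetical validation mirroring the jira's proposal: abort startup
    // when Kerberos security is on but block access tokens are off.
    static void checkSecurityConfig(boolean kerberosEnabled,
                                    boolean blockTokensEnabled) {
        if (kerberosEnabled && !blockTokensEnabled) {
            throw new IllegalStateException(
                "hadoop.security.authentication is kerberos but "
                + "dfs.block.access.token.enable is false; aborting startup");
        }
    }

    public static void main(String[] args) {
        checkSecurityConfig(false, false); // security off: fine
        checkSecurityConfig(true, true);   // both on: fine
        // checkSecurityConfig(true, false) would throw and abort the daemon.
        System.out.println("configuration checks passed");
    }
}
```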
[jira] [Created] (HDFS-8037) WebHDFS: CheckAccess silently accepts certain malformed FsActions
Jake Low created HDFS-8037: -- Summary: WebHDFS: CheckAccess silently accepts certain malformed FsActions Key: HDFS-8037 URL: https://issues.apache.org/jira/browse/HDFS-8037 Project: Hadoop HDFS Issue Type: Bug Components: webhdfs Affects Versions: 2.6.0 Reporter: Jake Low Priority: Minor WebHDFS's {{CHECKACCESS}} operation accepts a parameter called {{fsaction}}, which represents the type(s) of access to check for. According to the documentation, and also the source code, the domain of {{fsaction}} is the set of strings matched by the regex {{\[rwx-\]{3\}}}. This domain is wider than the set of valid {{FsAction}} objects, because it doesn't guarantee a sensible ordering of access types. For example, the strings {{rxw}} and {{--r}} are valid {{fsaction}} parameter values, but don't correspond to valid {{FsAction}} instances. The result is that WebHDFS silently accepts {{fsaction}} parameter values which don't match any valid {{FsAction}} instance, but doesn't actually perform any permissions checking in this case. For example, here's a {{CHECKACCESS}} call where we request {{rw-}} access on a file which we only have permission to read and execute. It raises an exception, as it should. {code:none} curl -i -X GET 'http://localhost:50070/webhdfs/v1/myfile?op=CHECKACCESS&user.name=nobody&fsaction=rw-' HTTP/1.1 403 Forbidden Content-Type: application/json { "RemoteException": { "exception": "AccessControlException", "javaClassName": "org.apache.hadoop.security.AccessControlException", "message": "Permission denied: user=nobody, access=READ_WRITE, inode=\"/myfile\":root:supergroup:drwxr-xr-x" } } {code} But if we instead request {{r-w}} access, the call appears to succeed: {code:none} curl -i -X GET 'http://localhost:50070/webhdfs/v1/myfile?op=CHECKACCESS&user.name=nobody&fsaction=r-w' HTTP/1.1 200 OK Content-Length: 0 {code} As I see it, the fix would be to change the regex pattern in {{FsActionParam}} to something like {{\[r-\]\[w-\]\[x-\]}}.
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
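For illustration, the behaviour of the current and proposed patterns can be compared with {{java.util.regex}}. This is a hypothetical sketch (the class and helper below are not part of any attached patch); only the two regexes come from the description.

```java
import java.util.regex.Pattern;

public class FsActionParamDemo {
    // Current, too-permissive pattern vs. the proposed replacement.
    static final Pattern CURRENT  = Pattern.compile("[rwx-]{3}");
    static final Pattern PROPOSED = Pattern.compile("[r-][w-][x-]");

    static boolean ok(Pattern p, String s) {
        return p.matcher(s).matches();
    }

    public static void main(String[] args) {
        // Malformed: 'w' in the execute slot is accepted today but
        // rejected by the proposed pattern.
        System.out.println(ok(CURRENT, "rxw"));   // prints true
        System.out.println(ok(PROPOSED, "rxw"));  // prints false
        // Well-formed values still match under the proposed pattern.
        System.out.println(ok(PROPOSED, "r-x"));  // prints true
    }
}
```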
[jira] [Created] (HDFS-8036) Use snapshot path as source when using snapshot diff report in DistCp
Jing Zhao created HDFS-8036: --- Summary: Use snapshot path as source when using snapshot diff report in DistCp Key: HDFS-8036 URL: https://issues.apache.org/jira/browse/HDFS-8036 Project: Hadoop HDFS Issue Type: Bug Components: distcp Affects Versions: 2.7.0 Reporter: Jing Zhao Assignee: Jing Zhao When using the snapshot diff report for distcp (HDFS-7535), the semantics should be to apply the diff to the target in order to sync the target with source@snapshot2. Therefore, after syncing based on the snapshot diff report, we should append the name of snapshot2 to the original source path and use it as the new source name. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7937) Erasure Coding: INodeFile quota computation unit tests
[ https://issues.apache.org/jira/browse/HDFS-7937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kai Sasaki updated HDFS-7937: - Attachment: HDFS-7937.3.patch Erasure Coding: INodeFile quota computation unit tests -- Key: HDFS-7937 URL: https://issues.apache.org/jira/browse/HDFS-7937 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Kai Sasaki Assignee: Kai Sasaki Priority: Minor Attachments: HDFS-7937.1.patch, HDFS-7937.2.patch, HDFS-7937.3.patch Unit test for [HDFS-7826|https://issues.apache.org/jira/browse/HDFS-7826] -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-7937) Erasure Coding: INodeFile quota computation unit tests
[ https://issues.apache.org/jira/browse/HDFS-7937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kai Sasaki updated HDFS-7937: - Status: Patch Available (was: Open) Erasure Coding: INodeFile quota computation unit tests -- Key: HDFS-7937 URL: https://issues.apache.org/jira/browse/HDFS-7937 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Kai Sasaki Assignee: Kai Sasaki Priority: Minor Attachments: HDFS-7937.1.patch, HDFS-7937.2.patch, HDFS-7937.3.patch Unit test for [HDFS-7826|https://issues.apache.org/jira/browse/HDFS-7826] -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7889) Subclass DFSOutputStream to support writing striping layout files
[ https://issues.apache.org/jira/browse/HDFS-7889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389987#comment-14389987 ] Li Bo commented on HDFS-7889: - Patch 006 removes {{getStreamer()}} and switches streamer in {{DFSStripedOutputStream}} Subclass DFSOutputStream to support writing striping layout files - Key: HDFS-7889 URL: https://issues.apache.org/jira/browse/HDFS-7889 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Li Bo Assignee: Li Bo Attachments: HDFS-7889-001.patch, HDFS-7889-002.patch, HDFS-7889-003.patch, HDFS-7889-004.patch, HDFS-7889-005.patch, HDFS-7889-006.patch After HDFS-7888, we can subclass {{DFSOutputStream}} to support writing striping layout files. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8035) Move checking replication and get client DN to BM and DM respectively
[ https://issues.apache.org/jira/browse/HDFS-8035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haohui Mai updated HDFS-8035: - Attachment: HDFS-8035.001.patch Move checking replication and get client DN to BM and DM respectively - Key: HDFS-8035 URL: https://issues.apache.org/jira/browse/HDFS-8035 Project: Hadoop HDFS Issue Type: Improvement Reporter: Haohui Mai Assignee: Haohui Mai Attachments: HDFS-8035.000.patch, HDFS-8035.001.patch There is functionality in {{FSNameSystem}} to check replication and to get a datanode based on the client name. This jira proposes to move this functionality to {{BlockManager}} and {{DatanodeManager}} respectively. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8036) Use snapshot path as source when using snapshot diff report in DistCp
[ https://issues.apache.org/jira/browse/HDFS-8036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jing Zhao updated HDFS-8036: Status: Patch Available (was: Open) Use snapshot path as source when using snapshot diff report in DistCp - Key: HDFS-8036 URL: https://issues.apache.org/jira/browse/HDFS-8036 Project: Hadoop HDFS Issue Type: Bug Components: distcp Affects Versions: 2.7.0 Reporter: Jing Zhao Assignee: Jing Zhao Attachments: HDFS-8036.000.patch When using the snapshot diff report for distcp (HDFS-7535), the semantics should be to apply the diff to the target in order to sync the target with source@snapshot2. Therefore, after syncing based on the snapshot diff report, we should append the name of snapshot2 to the original source path and use it as the new source name. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-8036) Use snapshot path as source when using snapshot diff report in DistCp
[ https://issues.apache.org/jira/browse/HDFS-8036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389855#comment-14389855 ] Hadoop QA commented on HDFS-8036: - {color:green}+1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12708575/HDFS-8036.000.patch against trunk revision 2daa478. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 core tests{color}. The patch passed unit tests in hadoop-tools/hadoop-distcp. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/10136//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/10136//console This message is automatically generated. Use snapshot path as source when using snapshot diff report in DistCp - Key: HDFS-8036 URL: https://issues.apache.org/jira/browse/HDFS-8036 Project: Hadoop HDFS Issue Type: Bug Components: distcp Affects Versions: 2.7.0 Reporter: Jing Zhao Assignee: Jing Zhao Attachments: HDFS-8036.000.patch When using snapshot diff report for distcp (HDFS-7535), the semantic should be apply the diff to the target in order to sync the target with source@snapshot2. Therefore after syncing based on the snapshot diff report, we should append the name of snapshot2 to the original source path and use it as the new source name. 
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HDFS-8004) Use KeyProviderCryptoExtension#warmUpEncryptedKeys when creating an encryption zone
[ https://issues.apache.org/jira/browse/HDFS-8004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Wang updated HDFS-8004: -- Fix Version/s: 2.8.0 thanks Arun, don't forget to set the fix version ;) Use KeyProviderCryptoExtension#warmUpEncryptedKeys when creating an encryption zone --- Key: HDFS-8004 URL: https://issues.apache.org/jira/browse/HDFS-8004 Project: Hadoop HDFS Issue Type: Improvement Components: encryption Affects Versions: 2.6.0 Reporter: Andrew Wang Assignee: Andrew Wang Priority: Trivial Fix For: 2.8.0 Attachments: hdfs-8004.001.patch It'd be slightly better to use the provided warm-up method, even though what we do now (getting and throwing away a key) is functionally the same. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-8011) standby nn can't started
[ https://issues.apache.org/jira/browse/HDFS-8011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388618#comment-14388618 ] Yongjun Zhang commented on HDFS-8011: - Hi [~fujie], if the file was indeed deleted yet still has an OP_CLOSE in the edit log file, then why "If we restart SNN A again, editlog-file-2 could be loaded correctly just like editlog-file-1 in the last restart operation" holds is indeed mysterious, unless OP_CLOSE silently ignores deleted files. Can we dump the edit log with the oev tool and see whether the file involved in the OP_CLOSE operation that throws the NPE was deleted (either it OR its parent has an OP_DELETE) before the OP_CLOSE? What does it mean that 20,000 operations failed out of 500,000 operations? What are the error symptoms? As Vinayakumar requested, can we analyze the stack traces of all failures to see if they have the same exception stack? Since you mentioned one problem was with OP_ADD_BLOCK, it seems that we are adding a block to a deleted file? If it's a deleted file, I think it's very likely related to delayed block removal, which relates to the fact that a datanode reports heartbeats to both the active and standby at the same time. Thanks. standby nn can't started Key: HDFS-8011 URL: https://issues.apache.org/jira/browse/HDFS-8011 Project: Hadoop HDFS Issue Type: Bug Components: ha Affects Versions: 2.3.0 Environment: CentOS 6.2 64bit Reporter: fujie We have seen a crash when starting the standby namenode, with fatal errors. Any solutions, workarounds, or ideas would be helpful for us. 1. Here is the context: At the beginning we had 2 namenodes, take A as active and B as standby. For some reasons, namenode A was dead, so namenode B is working as active. When we try to restart A after a minute, it can't work. During this time a lot of files were put to HDFS, and a lot of files were renamed. Namenode A crashed when awaiting reported blocks in safemode each time. 2.
We can see error log below: 1)2015-03-30 ERROR org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader: Encountered exception on operation CloseOp [length=0, inodeId=0, path=/xxx/_temporary/xxx/part-r-00074.bz2, replication=3, mtime=1427699913947, atime=1427699081161, blockSize=268435456, blocks=[blk_2103131025_1100889495739], permissions=dm:dm:rw-r--r--, clientName=, clientMachine=, opCode=OP_CLOSE, txid=7632753612] java.lang.NullPointerException at org.apache.hadoop.hdfs.server.blockmanagement.BlockInfoUnderConstruction.setGenerationStampAndVerifyReplicas(BlockInfoUnderConstruction.java:247) at org.apache.hadoop.hdfs.server.blockmanagement.BlockInfoUnderConstruction.commitBlock(BlockInfoUnderConstruction.java:267) at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.forceCompleteBlock(BlockManager.java:639) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.updateBlocks(FSEditLogLoader.java:813) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.applyEditLogOp(FSEditLogLoader.java:383) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadEditRecords(FSEditLogLoader.java:209) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadFSEdits(FSEditLogLoader.java:122) at org.apache.hadoop.hdfs.server.namenode.FSImage.loadEdits(FSImage.java:737) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer.doTailEdits(EditLogTailer.java:227) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread.doWork(EditLogTailer.java:321) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread.access$0(EditLogTailer.java:302) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread$1.run(EditLogTailer.java:296) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:356) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1528) at 
org.apache.hadoop.security.SecurityUtil.doAsLoginUserOrFatal(SecurityUtil.java:413) at org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer$EditLogTailerThread.run(EditLogTailer.java:292) 2)2015-03-30 FATAL org.apache.hadoop.hdfs.server.namenode.ha.EditLogTailer: Unknown error encountered while tailing edits. Shutting down standby NN. java.io.IOException: Failed to apply edit log operation AddBlockOp [path=/xxx/_temporary/xxx/part-m-00121, penultimateBlock=blk_2102331803_1100888911441, lastBlock=blk_2102661068_1100889009168, RpcClientId=, RpcCallId=-2]: error null at
[jira] [Commented] (HDFS-8002) Website refers to /trash directory
[ https://issues.apache.org/jira/browse/HDFS-8002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388627#comment-14388627 ] Hudson commented on HDFS-8002: -- FAILURE: Integrated in Hadoop-Hdfs-trunk-Java8 #140 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/140/]) HDFS-8002. Website refers to /trash directory. Contributed by Brahma Reddy Battula. (aajisaka: rev e7ea2a8e8f0a7b428ef10552885757b99b59e4dc) * hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt Website refers to /trash directory -- Key: HDFS-8002 URL: https://issues.apache.org/jira/browse/HDFS-8002 Project: Hadoop HDFS Issue Type: Bug Components: documentation Reporter: Mike Drob Assignee: Brahma Reddy Battula Fix For: 2.8.0 Attachments: HDFS-8002.patch, HDFS-8003-002.patch On http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HdfsDesign.html#File_Deletes_and_Undeletes the section on trash refers to files residing in {{/trash}}. I think this is an error, as files actually go to user-specific trash directories like {{/user/hdfs/.Trash}}. Either the site needs to be updated to mention user-specific directories, or if this is a change from previous behaviour then maybe that can be mentioned instead. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HDFS-8029) NPE during disk usage calculation on snapshot directory, after a sub folder is deleted
kanaka kumar avvaru created HDFS-8029: - Summary: NPE during disk usage calculation on snapshot directory, after a sub folder is deleted Key: HDFS-8029 URL: https://issues.apache.org/jira/browse/HDFS-8029 Project: Hadoop HDFS Issue Type: Bug Reporter: kanaka kumar avvaru Assignee: kanaka kumar avvaru ContentSummary computation is causing a NullPointerException on a snapshot directory if some sub directory is deleted. Following are the steps to reproduce the issue. 1. Create a root directory /test 2. Create a sub dir named /test/sub1 3. Create a sub dir in sub1 as /test/sub1/sub2 4. Create a file at /test/sub1/file1 5. Create a file at /test/sub1/sub2/file1 6. Enable snapshot on sub1 (hadoop dfsadmin -allowSnapshot /test/sub1) 7. Create snapshot1 on /test/sub1 8. Delete directory /test/sub1/sub2 (recursively) 9. Create snapshot2 on /test/sub1 10. Execute the du command on /test (hadoop fs -du /test/) This gives a NullPointerException in the CLI. The NameNode logs the exception as ... java.lang.NullPointerException at org.apache.hadoop.hdfs.server.namenode.ContentSummaryComputationContext.getBlockStoragePolicySuite(ContentSummaryComputationContext.java:122) ... -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HDFS-7671) hdfs user guide should point to the common rack awareness doc
[ https://issues.apache.org/jira/browse/HDFS-7671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388717#comment-14388717 ] Hudson commented on HDFS-7671: -- FAILURE: Integrated in Hadoop-trunk-Commit #7476 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/7476/]) HDFS-7671. hdfs user guide should point to the common rack awareness doc. Contributed by Kai Sasaki. (aajisaka: rev 859cab2f2273f563fd70e3e616758edef91ccf41) * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsUserGuide.md hdfs user guide should point to the common rack awareness doc - Key: HDFS-7671 URL: https://issues.apache.org/jira/browse/HDFS-7671 Project: Hadoop HDFS Issue Type: Improvement Components: documentation Reporter: Allen Wittenauer Assignee: Kai Sasaki Fix For: 2.8.0 Attachments: HDFS-7671.1.patch, HDFS-7671.2.patch, HDFS-7671.3.patch HDFS user guide has a section on rack awareness that should really just be a pointer to the common doc.
[jira] [Commented] (HDFS-8010) Erasure coding: extend UnderReplicatedBlocks to accurately handle striped blocks
[ https://issues.apache.org/jira/browse/HDFS-8010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14388555#comment-14388555 ] Rakesh R commented on HDFS-8010: Minor mistake in my above comments, please read the suggested way as:
{code}
private boolean veryUnderReplicated(int curReplicas, int expectedReplicas,
    boolean isStriped) {
  if (!isStriped) {
    return (curReplicas * 3) < expectedReplicas;
  } else {
    return curReplicas <= HdfsConstants.NUM_DATA_BLOCKS + 2;
  }
}

private boolean highestPriority(int curReplicas, boolean isStriped) {
  if (!isStriped) {
    return curReplicas == 1;
  } else {
    return curReplicas == HdfsConstants.NUM_DATA_BLOCKS;
  }
}
{code}
Erasure coding: extend UnderReplicatedBlocks to accurately handle striped blocks Key: HDFS-8010 URL: https://issues.apache.org/jira/browse/HDFS-8010 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Zhe Zhang Assignee: Zhe Zhang Attachments: HDFS-8010-000.patch This JIRA tracks efforts to accurately assess the _risk level_ of striped block groups with missing blocks, when added to {{UnderReplicatedBlocks}}
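The helpers suggested in the comment above can be exercised standalone. This sketch reads the comparison operators (which JIRA's formatting swallowed) as `<` and `<=`, and hard-codes `NUM_DATA_BLOCKS = 6` on the assumption of the RS(6,3) schema used on the HDFS-7285 branch; `StripedPriority` is a stand-in class, not real HDFS code:

```java
// Stand-in for the priority thresholds proposed in the comment.
class StripedPriority {
    static final int NUM_DATA_BLOCKS = 6; // assumed RS(6,3) data-block count

    // Contiguous block: fewer than a third of the expected replicas remain.
    // Striped group: at most two parity blocks to spare beyond the data blocks.
    static boolean veryUnderReplicated(int curReplicas, int expectedReplicas,
                                       boolean isStriped) {
        if (!isStriped) {
            return (curReplicas * 3) < expectedReplicas;
        }
        return curReplicas <= NUM_DATA_BLOCKS + 2;
    }

    // Highest priority: losing one more block means data loss.
    static boolean highestPriority(int curReplicas, boolean isStriped) {
        return isStriped ? curReplicas == NUM_DATA_BLOCKS : curReplicas == 1;
    }

    public static void main(String[] args) {
        // 2 of 9 expected contiguous replicas: 2*3 < 9, so very under-replicated.
        System.out.println(veryUnderReplicated(2, 9, false)); // true
        // Striped group reduced to exactly its data blocks: highest priority.
        System.out.println(highestPriority(6, true));         // true
    }
}
```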
[jira] [Commented] (HDFS-6634) inotify in HDFS
[ https://issues.apache.org/jira/browse/HDFS-6634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388642#comment-14388642 ] Benoit Perroud commented on HDFS-6634: -- We started to migrate our code to this implementation. It's just awesome. Thanks a lot [~james.thomas] for the work! I still have a quick question: any reason why the transaction id is not embedded in the Event object? inotify in HDFS --- Key: HDFS-6634 URL: https://issues.apache.org/jira/browse/HDFS-6634 Project: Hadoop HDFS Issue Type: New Feature Components: hdfs-client, namenode, qjm Reporter: James Thomas Assignee: James Thomas Fix For: 2.6.0 Attachments: HDFS-6634.2.patch, HDFS-6634.3.patch, HDFS-6634.4.patch, HDFS-6634.5.patch, HDFS-6634.6.patch, HDFS-6634.7.patch, HDFS-6634.8.patch, HDFS-6634.9.patch, HDFS-6634.patch, inotify-design.2.pdf, inotify-design.3.pdf, inotify-design.4.pdf, inotify-design.pdf, inotify-intro.2.pdf, inotify-intro.pdf Design a mechanism for applications like search engines to access the HDFS edit stream.
[jira] [Commented] (HDFS-7748) Separate ECN flags from the Status in the DataTransferPipelineAck
[ https://issues.apache.org/jira/browse/HDFS-7748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388631#comment-14388631 ] Hudson commented on HDFS-7748: -- FAILURE: Integrated in Hadoop-Hdfs-trunk-Java8 #140 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/140/]) HDFS-7748. Separate ECN flags from the Status in the DataTransferPipelineAck. Contributed by Anu Engineer and Haohui Mai. (wheat9: rev b80457158daf0dc712fbe5695625cc17d70d4bb4) * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDataTransferProtocol.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DataStreamer.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/datatransfer/PipelineAck.java * hadoop-hdfs-project/hadoop-hdfs/src/main/proto/datatransfer.proto * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BlockReceiver.java Addendum for HDFS-7748. (wheat9: rev 0967b1d99d7001cd1d09ebd29b9360f1079410e8) * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDataTransferProtocol.java Separate ECN flags from the Status in the DataTransferPipelineAck - Key: HDFS-7748 URL: https://issues.apache.org/jira/browse/HDFS-7748 Project: Hadoop HDFS Issue Type: Bug Reporter: Haohui Mai Assignee: Anu Engineer Priority: Blocker Attachments: HDFS-7748.007-addendum.patch, HDFS-7748.007.patch, hdfs-7748.001.patch, hdfs-7748.002.patch, hdfs-7748.003.patch, hdfs-7748.004.patch, hdfs-7748.005.patch, hdfs-7748.006.patch, hdfs-7748.branch-2.7.006.patch Prior to the discussions on HDFS-7270, the old clients might fail to talk to the newer server when ECN is turned on. This jira proposes to separate the ECN flags in a separate protobuf field to make the ack compatible on both versions.
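The split described above can be sketched as a datatransfer.proto fragment. The field numbers and the shape of the existing fields below are assumptions reconstructed from the description (the ack previously packed ECN bits into the {{Status}} enum), not a verbatim copy of the committed patch:

```proto
// Hypothetical shape of the pipeline ack after the split: ECN no longer
// rides on the Status enum, it travels in its own repeated integer field,
// which an old client can skip as an unknown field.
message PipelineAckProto {
  required sint64 seqno = 1;
  repeated Status reply = 2;
  optional uint64 downstreamAckTimeNanos = 3 [default = 0];
  repeated uint32 flag = 4 [packed = true];  // new: ECN and future flags
}
```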
[jira] [Commented] (HDFS-7944) Minor cleanup of BlockPoolManager#getAllNamenodeThreads
[ https://issues.apache.org/jira/browse/HDFS-7944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388636#comment-14388636 ] Hudson commented on HDFS-7944: -- FAILURE: Integrated in Hadoop-Hdfs-trunk-Java8 #140 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/140/]) HDFS-7944. Minor cleanup of BlockPoolManager#getAllNamenodeThreads. (Arpit Agarwal) (arp: rev 85dc3c14b2ca4b01a93361bb925c39a22a6fd8db) * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDatanodeProtocolRetryPolicy.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestBlockScanner.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestIncrementalBlockReports.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestBlockRecovery.java * hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDeleteBlockPool.java * hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/BlockPoolManager.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestRefreshNamenodes.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDataNodeExit.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestDataNodeMultipleRegistrations.java * hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/TestTriggerBlockReport.java Minor cleanup of BlockPoolManager#getAllNamenodeThreads --- Key: HDFS-7944 URL: https://issues.apache.org/jira/browse/HDFS-7944 Project: Hadoop HDFS Issue Type: Improvement Affects Versions: 2.6.0 Reporter: Arpit Agarwal Assignee: Arpit Agarwal 
Priority: Minor Fix For: 2.8.0 Attachments: HDFS-7944.01.patch, HDFS-7944.02.patch {{BlockPoolManager#getAllNamenodeThreads}} can avoid unnecessary list to array conversion and vice versa by returning an unmodifiable list. Since NN addition/removal is relatively rare we can just use a {{CopyOnWriteArrayList}} for concurrency.
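The pattern described in that last sentence can be sketched outside HDFS. {{PoolManagerSketch}} and the String stand-ins below are hypothetical; the point is only the combination of {{CopyOnWriteArrayList}} (cheap, consistent iteration; rare writes copy the backing array) with an unmodifiable view handed to callers:

```java
import java.util.Collections;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

// Hypothetical stand-in for BlockPoolManager; Strings stand in for the
// per-namenode service threads.
class PoolManagerSketch {
    // Writes (NN add/remove) are rare, so copy-on-write is cheap overall,
    // and readers iterate a stable snapshot without locking.
    private final CopyOnWriteArrayList<String> namenodeThreads =
        new CopyOnWriteArrayList<>();

    void add(String thread) {
        namenodeThreads.add(thread);
    }

    // Callers get a read-only view: no list<->array conversions, and no
    // way for them to mutate the manager's internal state.
    List<String> getAllNamenodeThreads() {
        return Collections.unmodifiableList(namenodeThreads);
    }

    public static void main(String[] args) {
        PoolManagerSketch m = new PoolManagerSketch();
        m.add("bpos-1");
        m.add("bpos-2");
        List<String> view = m.getAllNamenodeThreads();
        System.out.println(view.size()); // 2
        try {
            view.add("bpos-3"); // the view rejects mutation
        } catch (UnsupportedOperationException e) {
            System.out.println("read-only");
        }
    }
}
```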
[jira] [Updated] (HDFS-7941) hsync() not working
[ https://issues.apache.org/jira/browse/HDFS-7941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sverre Bakke updated HDFS-7941: --- Description: When using SequenceFile.Writer and appending+syncing to a file repeatedly, the sync does not appear to work other than: - once after writing headers - when closing. Imagine the following test case: http://pastebin.com/Y9xysCRX This code would append a new record every second and then immediately sync it. One would also imagine that the file would grow for every append; however, this does not happen. After watching the behavior I have noticed that it only syncs the headers at the very beginning (providing a file of 164 bytes) and then never again until it's closed. This is despite hsync() being called after every append. Looking into the debug logs, this also claims the same behavior (executed the provided code example and grepped for sync): SLF4J: Failed to load class org.slf4j.impl.StaticLoggerBinder. SLF4J: Defaulting to no-operation (NOP) logger implementation SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details. 2015-03-17 15:55:14 DEBUG ProtobufRpcEngine:253 - Call: fsync took 11ms This was the only time the code ran fsync throughout the entire execution. This has been tested (with similar results) for the following deployments: - sequencefile with no compression - sequencefile with record compression - sequencefile with block compression - textfile with no compression was: When using SequenceFile.Writer and appending+syncing to file repeatedly, the sync does not appear to work other than: - once after writing headers - when closing. Imagine the following test case: http://pastebin.com/Y9xysCRX This code would append a new record every second and then immediately sync it. One would also imagine that the file would grow for every append, however, this does not happen. After watching the behavior I have noticed that it only syncs the headers at the very beginning (providing a file of 164 bytes) and then never again until its closed. This despite it is asked to hsync() after every append. Looking into the debug logs, this also claims the same behavior (executed the provided code example and grepped for sync): SLF4J: Failed to load class org.slf4j.impl.StaticLoggerBinder. SLF4J: Defaulting to no-operation (NOP) logger implementation SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details. 2015-03-17 15:55:14 DEBUG ProtobufRpcEngine:253 - Call: fsync took 11ms This was the only time the code ran fsync throughout the entire execution. hsync() not working --- Key: HDFS-7941 URL: https://issues.apache.org/jira/browse/HDFS-7941 Project: Hadoop HDFS Issue Type: Bug Components: hdfs-client Affects Versions: 2.6.0 Environment: HDP 2.2 running on Redhat Reporter: Sverre Bakke When using SequenceFile.Writer and appending+syncing to a file repeatedly, the sync does not appear to work other than: - once after writing headers - when closing. Imagine the following test case: http://pastebin.com/Y9xysCRX This code would append a new record every second and then immediately sync it. One would also imagine that the file would grow for every append; however, this does not happen. After watching the behavior I have noticed that it only syncs the headers at the very beginning (providing a file of 164 bytes) and then never again until it's closed. This is despite hsync() being called after every append. Looking into the debug logs, this also claims the same behavior (executed the provided code example and grepped for sync): SLF4J: Failed to load class org.slf4j.impl.StaticLoggerBinder. SLF4J: Defaulting to no-operation (NOP) logger implementation SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details. 2015-03-17 15:55:14 DEBUG ProtobufRpcEngine:253 - Call: fsync took 11ms This was the only time the code ran fsync throughout the entire execution. This has been tested (with similar results) for the following deployments: - sequencefile with no compression - sequencefile with record compression - sequencefile with block compression - textfile with no compression
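One detail worth noting against the report above: a plain hsync() makes the data durable on the DataNodes but does not, by itself, update the file length recorded at the NameNode, so tools that read the NN-reported size can show a file that "never grows". The fragment below is an illustrative sketch, not the reporter's pastebin code; it assumes the stream handed back by HDFS is an HdfsDataOutputStream, whose hsync(EnumSet<SyncFlag>) overload with UPDATE_LENGTH also refreshes the visible length:

```java
import java.io.IOException;
import java.util.EnumSet;

import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.hdfs.client.HdfsDataOutputStream;
import org.apache.hadoop.hdfs.client.HdfsDataOutputStream.SyncFlag;

// Sketch only: needs the hadoop-hdfs client jars on the classpath.
class HsyncSketch {
    static void durableSync(FSDataOutputStream out) throws IOException {
        if (out instanceof HdfsDataOutputStream) {
            // Flush replicas to disk AND update the length at the NameNode,
            // so the file visibly grows after every sync.
            ((HdfsDataOutputStream) out)
                .hsync(EnumSet.of(SyncFlag.UPDATE_LENGTH));
        } else {
            // Plain hsync: data is durable, but the NN-reported length
            // may look stale until the file is closed.
            out.hsync();
        }
    }
}
```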
[jira] [Updated] (HDFS-6945) BlockManager should remove a block from excessReplicateMap and decrement ExcessBlocks metric when the block is removed
[ https://issues.apache.org/jira/browse/HDFS-6945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akira AJISAKA updated HDFS-6945: Attachment: HDFS-6945-005.patch Thanks [~szetszwo] for the comment. Cleaned up the patch. BlockManager should remove a block from excessReplicateMap and decrement ExcessBlocks metric when the block is removed -- Key: HDFS-6945 URL: https://issues.apache.org/jira/browse/HDFS-6945 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.5.0 Reporter: Akira AJISAKA Assignee: Akira AJISAKA Priority: Critical Labels: metrics Attachments: HDFS-6945-003.patch, HDFS-6945-004.patch, HDFS-6945-005.patch, HDFS-6945.2.patch, HDFS-6945.patch I'm seeing the ExcessBlocks metric increase to more than 300K in some clusters; however, there are no over-replicated blocks (confirmed by fsck). After further research, I noticed that when deleting a block, BlockManager does not remove the block from excessReplicateMap or decrement excessBlocksCount. Usually the metric is decremented when processing a block report; however, if the block has been deleted, BlockManager does not remove the block from excessReplicateMap or decrement the metric. That way the metric and excessReplicateMap can grow indefinitely (i.e. a memory leak can occur).
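The bookkeeping fix described above can be sketched with plain collections. Everything here is a simplified stand-in (real HDFS keys the map by datanode UUID and uses its own set types, and the actual patch lives in BlockManager); the sketch only shows the invariant being restored, namely that removing a block also purges it from every excess set and decrements the metric:

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Iterator;
import java.util.Map;
import java.util.Set;

// Simplified stand-in for BlockManager's excess-replica bookkeeping.
class ExcessSketch {
    // datanode -> ids of blocks that node holds in excess
    private final Map<String, Set<Long>> excessReplicateMap = new HashMap<>();
    private long excessBlocksCount = 0; // the ExcessBlocks metric

    void addExcess(String datanode, long blockId) {
        if (excessReplicateMap
                .computeIfAbsent(datanode, k -> new HashSet<>())
                .add(blockId)) {
            excessBlocksCount++;
        }
    }

    // The fix: on block removal, purge the block from every datanode's
    // excess set and decrement the metric, instead of waiting for a block
    // report that will never mention the deleted block.
    void removeBlock(long blockId) {
        Iterator<Map.Entry<String, Set<Long>>> it =
            excessReplicateMap.entrySet().iterator();
        while (it.hasNext()) {
            Map.Entry<String, Set<Long>> e = it.next();
            if (e.getValue().remove(blockId)) {
                excessBlocksCount--;
            }
            if (e.getValue().isEmpty()) {
                it.remove(); // drop empty per-node sets
            }
        }
    }

    long getExcessBlocksCount() {
        return excessBlocksCount;
    }
}
```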
[jira] [Updated] (HDFS-7786) Handle slow writers for DFSStripedOutputStream
[ https://issues.apache.org/jira/browse/HDFS-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Bo updated HDFS-7786: Summary: Handle slow writers for DFSStripedOutputStream (was: Handle slow writers for DFSOutputStream when there're multiple data streamers) Handle slow writers for DFSStripedOutputStream -- Key: HDFS-7786 URL: https://issues.apache.org/jira/browse/HDFS-7786 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Li Bo Assignee: Li Bo Fix For: HDFS-7285 There're multiple data streamers in DFSOutputStream if it is used to write a striping layout file. These streamers may have different write speed, and some may write data very slowly. Some streamers may fail and exit. We need to consider these situations and give reliable handling.
[jira] [Updated] (HDFS-7786) Handle slow writers for DFSStripedOutputStream
[ https://issues.apache.org/jira/browse/HDFS-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Bo updated HDFS-7786: Description: The streamers in DFSStripedOutputStream may have different write speed. We need to consider and handle the situation when one or more writers begin to write slowly. (was: There're multiple data streamers in DFSOutputStream if it is used to write a striping layout file. These streamers may have different write speed, and some may write data very slowly. Some streamers may fail and exit. We need to consider these situations and give reliable handling. ) Handle slow writers for DFSStripedOutputStream -- Key: HDFS-7786 URL: https://issues.apache.org/jira/browse/HDFS-7786 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Li Bo Assignee: Li Bo Fix For: HDFS-7285 The streamers in DFSStripedOutputStream may have different write speed. We need to consider and handle the situation when one or more writers begin to write slowly.
[jira] [Updated] (HDFS-7786) Handle slow writers for DFSStripedOutputStream
[ https://issues.apache.org/jira/browse/HDFS-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Bo updated HDFS-7786: Description: The streamers in DFSStripedOutputStream may have different write speed. We need to consider and handle the situation when one or more streamers begin to write slowly. (was: The streamers in DFSStripedOutputStream may have different write speed. We need to consider and handle the situation when one or more writers begin to write slowly.) Handle slow writers for DFSStripedOutputStream -- Key: HDFS-7786 URL: https://issues.apache.org/jira/browse/HDFS-7786 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Li Bo Assignee: Li Bo Fix For: HDFS-7285 The streamers in DFSStripedOutputStream may have different write speed. We need to consider and handle the situation when one or more streamers begin to write slowly.
[jira] [Comment Edited] (HDFS-7991) Allow users to skip checkpoint when stopping NameNode
[ https://issues.apache.org/jira/browse/HDFS-7991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14388796#comment-14388796 ] Allen Wittenauer edited comment on HDFS-7991 at 3/31/15 4:31 PM: - bq. (since the stop command only waits 5s) This is easily fixed by just increasing the timeout or adding other logic, such as asking if the NN is still alive, etc. But in any case, it occurred to me this morning that the current code just flat out won't work in practice. The problem is that HADOOP_OPTS has the NN's configuration inside it. So, for example, if a user sets the heap size to 64g, then dfsadmin is going to run with a 64g heap as well. Same thing with gc logs and any other custom JVM setting. The code absolutely must shell out another bin/hdfs process to get the proper HADOOP_OPTS setting. I suspect it will actually have to use a subshell plus parameter captures so that the environment is clean due to various {{export}} statements throughout the code and in a lot of users' *-env.sh files. was (Author: aw): bq. (since the stop command only waits 5s) This is easily fixed by just increasing the timeout or adding logic other logic such as asking if the NN is still alive, etc. But in any case, it occurred to me this morning that the current code just flat out won't work in practice. The problem is that HADOOP_OPTS has the NN's configuration inside it. So, for example, if a user sets the heap size to 64g, then dfsadmin is going to run with a 64g heap as well. Same thing with gc logs and any other custom JVM setting. The code absolutely must shell out another bin/hdfs process to get the proper HADOOP_OPTS setting. I suspect it will actually have to use a subshell plus captures parameters so that the environment is clean due to various {{export}} statements throughout the code and in a lot of user's *-env.sh files.
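The subshell point in the comment above can be demonstrated without Hadoop at all. HADOOP_OPTS is the real variable name; the heap values are made up for illustration:

```shell
# Why a fresh process/subshell is needed: the parent's exported HADOOP_OPTS
# (the NN's JVM settings) must not leak into the admin command, and nothing
# the child sets leaks back into the parent environment.
export HADOOP_OPTS="-Xmx64g"                             # pretend the NN env is loaded
child=$(HADOOP_OPTS="-Xmx1g" sh -c 'echo "$HADOOP_OPTS"') # child gets its own value
echo "child saw: $child"          # child saw: -Xmx1g
echo "parent kept: $HADOOP_OPTS"  # parent kept: -Xmx64g
```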
Allow users to skip checkpoint when stopping NameNode - Key: HDFS-7991 URL: https://issues.apache.org/jira/browse/HDFS-7991 Project: Hadoop HDFS Issue Type: Improvement Affects Versions: 3.0.0 Reporter: Jing Zhao Assignee: Jing Zhao Attachments: HDFS-7991.000.patch, HDFS-7991.001.patch, HDFS-7991.002.patch, HDFS-7991.003.patch This is a follow-up jira of HDFS-6353. HDFS-6353 adds the functionality to check if saving namespace is necessary before stopping namenode. As [~kihwal] pointed out in this [comment|https://issues.apache.org/jira/browse/HDFS-6353?focusedCommentId=14380898&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14380898], in a secured cluster this new functionality requires the user to be kinit'ed.