[ 
https://issues.apache.org/jira/browse/HDFS-12993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16316765#comment-16316765
 ] 

Kihwal Lee commented on HDFS-12993:
-----------------------------------

The requirement sounds fine at a high level, but {{getBlocks()}} can already be 
expensive. We've seen multi-millisecond processing times. Even if it is 
acquiring the read lock, a long lock lowers the throughput.  

bq. As of now, dn't have report for locking load(only readlock) when there are 
huge blocks.
{{getBlocks()}} is more expensive when there are many smaller blocks. E.g. more 
blocks to go through to gather 10GB worth.  Doing path exclusion in such 
clusters can have a huge performance impact.

Please make this have zero additional cost for the normal cases.

> Add an option to Balancer for excluding the paths
> -------------------------------------------------
>
>                 Key: HDFS-12993
>                 URL: https://issues.apache.org/jira/browse/HDFS-12993
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: balancer & mover
>            Reporter: Brahma Reddy Battula
>            Assignee: Brahma Reddy Battula
>         Attachments: HDFS-12993.patch
>
>
> *Usecase:* Users/Customers want to keep the important/frequently used 
> blocks(where blocks are not written with favoured nodes and block pinning is 
> not enabled) when balancer is triggered.This can be useful datacentric jobs 
> which will run frequently.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to