[ 
https://issues.apache.org/jira/browse/HDFS-7240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14615318#comment-14615318
 ] 

Jian Fang commented on HDFS-7240:
---------------------------------

Thanks for all your explanations, however, I think you missed my points. Doable 
and performance are two difference concepts. From my own experiences with S3 
and s3 native file system, the most costly operations are listing keys and 
copying data from one bucket to the other one to simulate the rename operation. 
The former one will take a very long time for a bucket with millions of objects 
and the latter one has a double performance penalty, i.e., assume your objects 
are 1TB, you actually almost upload 2TB of data to s3. That is why fast key 
listing and native fast rename operations are two of the most desirable 
features for s3. 

Before you make decision to follow the S3N API, I would suggest you actually 
test the performance of S3N and get to know what are good and what are bad. Why 
do you need to follow the bad ones at all?

It is still not very clear to me how do you guarantee your partitions are 
balanced. HBase used region auto split to achieve that, which is also my 
concern that the code and logic would grow rapidly when your object store 
becomes really mature. In my personal opinion, it is better to build the object 
store on top of HDFS and leave HDFS to be simple.

  

> Object store in HDFS
> --------------------
>
>                 Key: HDFS-7240
>                 URL: https://issues.apache.org/jira/browse/HDFS-7240
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Jitendra Nath Pandey
>            Assignee: Jitendra Nath Pandey
>         Attachments: Ozone-architecture-v1.pdf
>
>
> This jira proposes to add object store capabilities into HDFS. 
> As part of the federation work (HDFS-1052) we separated block storage as a 
> generic storage layer. Using the Block Pool abstraction, new kinds of 
> namespaces can be built on top of the storage layer i.e. datanodes.
> In this jira I will explore building an object store using the datanode 
> storage, but independent of namespace metadata.
> I will soon update with a detailed design document.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to