Sorry to have send it twice, I mixed my email.
Apology for the spam.

Le lun. 20 juil. 2020 à 16:16, Michel Sumbul <michelsum...@yahoo.fr.invalid>
a écrit :

>  Thanks Pifta that really clear!
>
> If you dont mind a last question on data locality, does Ozone support
> short-circuit like HDFS?
> If not is it something on the roadmap? short circuit provide a significant
> performance boost in HDFS world, do you think it will be the same for Ozone?
>
> Thanks,
> Michel
>     Le mardi 14 juillet 2020 à 22:02:45 UTC+1, István Fajth <
> fapi...@gmail.com> a écrit :
>
>  Hi Michel,
>
> at the moment the placement policy is an interesting topic.
> In Ozone placement is considered in terms of containers, and not blocks.
> Block are sub-container structures.
> The container has a lifecycle, when it is open then the pipeline attached
> to it is defining the placement of data. The pipeline placement if there
> are racks and we are talking about replication factor 3 pipelines then it
> places two container replicas into one rack and one into an other rack.
> This is a wired behaviour, and pipelines are balanced between DataNodes. If
> there are no racks defined, or just one rack is defined pipeline placement
> falls back to random placement that considers space available on DataNodes
> and favors nodes with more available space.
>
> When a container gets closed, the replicas are managed by the
> ReplicationManager, which has a configurable policy. There are three
> policies at the moment, random, available space aware random, and rack
> aware policy.
> The closed containers are moved by the ReplicationManager as needed if
> replication violates the policy or replicas are created or removed when
> under or overreplication occurs.
>
> This is because Ozone aims to balance the write I/O by balancing the
> pipelines. Read I/O is balanced by the random placement within the rules
> defined by the policy.
>
> Ozone needs to harmonize the pipeline placement and the container placement
> in the future as we want to add more policies for sure but at the moment
> this is how placement works.
>
> In regards of balancing at the moment we do not have a balancing logic
> built in, and we do not have a balancer tool like HDFS at the moment it is
> part of the roadmap, however you can bet any balancing logic has to
> consider the placement policy configured for closed containers at least.
>
> If you need to have a policy like the one you mentioned, the closed
> container policy is pluggable, so you can write your own or even contribute
> it to the project if you want.
> But at the moment you need to consider the load which will be there if the
> custom policy is violated by the pipeline placement then at container
> closure containers have to be moved to fit with the closed container
> placement policy.
>
> Pifta
>
> Michel Sumbul <michelsum...@gmail.com> ezt írta (időpont: 2020. júl. 14.,
> Ke 15:38):
>
> > Hi Pifta,
> >
> > Thanks for your reply.
> > That's good news! Does Ozone also support other placement policies like
> one
> > replica in 3 different racks? That will be super useful from an
> operational
> > point of view. It will be possible to put in maintenance (for update or
> > other task) an entire rack and be sure that 2 other replicas are in 2
> > different racks still up and running and not losing 2 replicas.
> >
> > Does the placement policy is also enforced during the rebalancing like in
> > HDFS?
> >
> > Thanks,
> > Michel
> >
> > Le jeu. 9 juil. 2020 à 13:05, István Fajth <fapi...@gmail.com> a écrit :
> >
> > > Hi Michel,
> > >
> > > yes, Ozone has topology support (currently 3 levels are supported:
> root,
> > > rack, node) to specify cluster topology similarly as in HDFS. With
> > > replication factor 3 it works similarly as in HDFS and ensures that
> > > container replicas reside in 2 racks, 2 in one rack, and 1 in another
> > rack.
> > > Also the FileSystem APIs (o3fs:// and ofs://) are implementing the
> > methods
> > > required to provide the locality information to the clients similarly
> as
> > in
> > > HDFS, so YARN can take advantage of this information, and can bring
> > compute
> > > to the data as with HDFS.
> > >
> > > It is worth noting that there are not too many clusters currently using
> > > these features, but if any issues arise we are there to react, and
> there
> > > are some plans as well to harden the system further. There are a couple
> > of
> > > items already planned after the soon to be released 0.6.0 you can check
> > > into it in this JIRA (HDDS-3722)
> > > <https://issues.apache.org/jira/browse/HDDS-3722>.
> > >
> > > If you have any questions feel free to ask further :)
> > > Pifta
> > >
> > > Michel Sumbul <michelsum...@gmail.com> ezt írta (időpont: 2020. júl.
> 9.,
> > > Cs, 12:57):
> > >
> > > > Hi guys,
> > > >
> > > > First thanks for your work on this project, it looks really great as
> > the
> > > > next evolution of HDFS (if I can say that :-) )
> > > >
> > > > I saw in multiple slideshows on the web that Ozone will support data
> > > > locality like HDFS.
> > > > What's the status of that? Is it already implemented?
> > > >
> > > > Thanks,
> > > > Michel
> > > >
> > >
> > >
> > > --
> > > Pifta
> > >
> >

Reply via email to