Hi Pifta,

It's true that the technology is moving to a separation of compute/storage
which is not a bad thing depending on the use case and the workload.
However when it comes to processing PB of data, data locality becomes a key
element and the short circuit is part of that for me.
I definitely need to start to look at the code of ozone :)

Thanks for all the answers!
Michel

Le lun. 20 juil. 2020 à 13:23, István Fajth <fapi...@gmail.com> a écrit :

> Hi Michel,
>
> currently Ozone does not support short-circuit reads, it is on the roadmap
> but as we transition to a segregated storage and compute word it is not the
> most important one afaik.
>
> It is a FileSystem related thing and in Object Stores it is not a thing at
> all, as we continue our FS interface developments it certainly becomes
> important though but probably later than sooner.
> On the other hand, as with any placement policy, or any other improvement,
> you should feel free to add this functionality on your own of course and
> the community will be happy to help and review :)
>
> Pifta
>
> Michel Sumbul <michelsum...@gmail.com> ezt írta (időpont: 2020. júl. 20.,
> Hét 13:53):
>
> > Thanks Pifta that's really clear!
> >
> > If you don't mind a last question on data locality, does Ozone support
> > short-circuit like HDFS?
> > If not, is it something on the roadmap? Short circuit provide a
> significant
> > performance boost in the HDFS world, do you think it will be the same for
> > Ozone?
> >
> > Thanks,
> > Michel
> >
> > Le mar. 14 juil. 2020 à 22:02, István Fajth <fapi...@gmail.com> a écrit
> :
> >
> > > Hi Michel,
> > >
> > > at the moment the placement policy is an interesting topic.
> > > In Ozone placement is considered in terms of containers, and not
> blocks.
> > > Block are sub-container structures.
> > > The container has a lifecycle, when it is open then the pipeline
> attached
> > > to it is defining the placement of data. The pipeline placement if
> there
> > > are racks and we are talking about replication factor 3 pipelines then
> it
> > > places two container replicas into one rack and one into an other rack.
> > > This is a wired behaviour, and pipelines are balanced between
> DataNodes.
> > If
> > > there are no racks defined, or just one rack is defined pipeline
> > placement
> > > falls back to random placement that considers space available on
> > DataNodes
> > > and favors nodes with more available space.
> > >
> > > When a container gets closed, the replicas are managed by the
> > > ReplicationManager, which has a configurable policy. There are three
> > > policies at the moment, random, available space aware random, and rack
> > > aware policy.
> > > The closed containers are moved by the ReplicationManager as needed if
> > > replication violates the policy or replicas are created or removed when
> > > under or overreplication occurs.
> > >
> > > This is because Ozone aims to balance the write I/O by balancing the
> > > pipelines. Read I/O is balanced by the random placement within the
> rules
> > > defined by the policy.
> > >
> > > Ozone needs to harmonize the pipeline placement and the container
> > placement
> > > in the future as we want to add more policies for sure but at the
> moment
> > > this is how placement works.
> > >
> > > In regards of balancing at the moment we do not have a balancing logic
> > > built in, and we do not have a balancer tool like HDFS at the moment it
> > is
> > > part of the roadmap, however you can bet any balancing logic has to
> > > consider the placement policy configured for closed containers at
> least.
> > >
> > > If you need to have a policy like the one you mentioned, the closed
> > > container policy is pluggable, so you can write your own or even
> > contribute
> > > it to the project if you want.
> > > But at the moment you need to consider the load which will be there if
> > the
> > > custom policy is violated by the pipeline placement then at container
> > > closure containers have to be moved to fit with the closed container
> > > placement policy.
> > >
> > > Pifta
> > >
> > > Michel Sumbul <michelsum...@gmail.com> ezt írta (időpont: 2020. júl.
> > 14.,
> > > Ke 15:38):
> > >
> > > > Hi Pifta,
> > > >
> > > > Thanks for your reply.
> > > > That's good news! Does Ozone also support other placement policies
> like
> > > one
> > > > replica in 3 different racks? That will be super useful from an
> > > operational
> > > > point of view. It will be possible to put in maintenance (for update
> or
> > > > other task) an entire rack and be sure that 2 other replicas are in 2
> > > > different racks still up and running and not losing 2 replicas.
> > > >
> > > > Does the placement policy is also enforced during the rebalancing
> like
> > in
> > > > HDFS?
> > > >
> > > > Thanks,
> > > > Michel
> > > >
> > > > Le jeu. 9 juil. 2020 à 13:05, István Fajth <fapi...@gmail.com> a
> > écrit :
> > > >
> > > > > Hi Michel,
> > > > >
> > > > > yes, Ozone has topology support (currently 3 levels are supported:
> > > root,
> > > > > rack, node) to specify cluster topology similarly as in HDFS. With
> > > > > replication factor 3 it works similarly as in HDFS and ensures that
> > > > > container replicas reside in 2 racks, 2 in one rack, and 1 in
> another
> > > > rack.
> > > > > Also the FileSystem APIs (o3fs:// and ofs://) are implementing the
> > > > methods
> > > > > required to provide the locality information to the clients
> similarly
> > > as
> > > > in
> > > > > HDFS, so YARN can take advantage of this information, and can bring
> > > > compute
> > > > > to the data as with HDFS.
> > > > >
> > > > > It is worth noting that there are not too many clusters currently
> > using
> > > > > these features, but if any issues arise we are there to react, and
> > > there
> > > > > are some plans as well to harden the system further. There are a
> > couple
> > > > of
> > > > > items already planned after the soon to be released 0.6.0 you can
> > check
> > > > > into it in this JIRA (HDDS-3722)
> > > > > <https://issues.apache.org/jira/browse/HDDS-3722>.
> > > > >
> > > > > If you have any questions feel free to ask further :)
> > > > > Pifta
> > > > >
> > > > > Michel Sumbul <michelsum...@gmail.com> ezt írta (időpont: 2020.
> júl.
> > > 9.,
> > > > > Cs, 12:57):
> > > > >
> > > > > > Hi guys,
> > > > > >
> > > > > > First thanks for your work on this project, it looks really great
> > as
> > > > the
> > > > > > next evolution of HDFS (if I can say that :-) )
> > > > > >
> > > > > > I saw in multiple slideshows on the web that Ozone will support
> > data
> > > > > > locality like HDFS.
> > > > > > What's the status of that? Is it already implemented?
> > > > > >
> > > > > > Thanks,
> > > > > > Michel
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Pifta
> > > > >
> > > >
> > >
> >
>

Reply via email to