Re: Discussion about NameNode Fine-grained locking

Takanobu Asanuma Tue, 05 Mar 2024 19:52:54 -0800

> We're trying tuning gc options and even new gc engine like zgc, but they
are not very helpful.


I'm afraid this is a digression, but could you elaborate on using ZGC for
NameNode? Did you encounter any problems?
I've never heard of using ZGC for NameNode in practice, so I'm curious
about it.

Regards,
- Takanobu


2024年3月6日(水) 12:35 Yuanbo Liu <[email protected]>:

> > a. Snapshot, Symbolic link and reserved feature are not mentioned at the
> design doc, should it be considered
> Yes, I agree. Since the snapshot、symbol link is not popular in HADOOP, we
> can try to use global lock(write lock of root inode?). In our production
> env, we just ignore those features, but in the open source community, these
> should be considered carefully.
>
> > b. For the benchmark result, what Read/Write request ratio? And do you
> meet any GC issues when reaching
> FGL will need more memory as its qps becomes very high. In practice, if the
> percentage of used memory is greater than 90%, GC time will become a major
> problem. We're trying tuning gc options and even new gc engine like zgc,
> but they are not very helpful.
>
>
>
> On Wed, Mar 6, 2024 at 10:51 AM Hui Fei <[email protected]> wrote:
>
> > Thanks for suggestions.
> >
> > Actually Started working on this improvement. And cut the development
> > branch :)
> > From the proposal doc and the current reviewing work, seems that it
> > doesn't touch the existing logic codes too much. It keeps the original
> > logic there.
> >
> > @Yuanbo @Zengqiang XU <[email protected]>  Could you share any
> internal
> > improvement info Xiaoqiao mentioned above?
> >
> > Xiaoqiao He <[email protected]> 于2024年2月26日周一 19:50写道：
> >
> >> Thanks for this meaningful proposal. Some nit comments:
> >> a. Snapshot, Symbolic link and reserved feature are not mentioned at the
> >> design doc, should it be considered
> >> or different to this core design?
> >> b. For the benchmark result, what Read/Write request ratio? And do you
> >> meet
> >> any GC issues when reaching
> >> `108K QPS`? If true, would you mind sharing STW time cost?
> >> c. Is this deployed in your internal cluster now? If true,  any
> >> performance
> >> benefit differences compare to the
> >> benchmark?
> >> d. This is one huge feature IMO, If discussion passes, suggest creating
> a
> >> single branch to develop and follow-up
> >> works.
> >>
> >> Thanks again for this meaningful proposal.
> >>
> >> Best Regards,
> >> - He Xiaoqiao
> >>
> >>
> >> On Tue, Feb 20, 2024 at 5:38 PM Yuanbo Liu <[email protected]>
> wrote:
> >>
> >> > Nice to see this feature brought up. We've implemented this feature
> >> > internally and gained significant performance improvement. I'll be
> glad
> >> to
> >> > work on some jiras if necessary.
> >> >
> >> >
> >> > On Tue, Feb 20, 2024 at 4:41 PM ZanderXu <[email protected]> wrote:
> >> >
> >> > > Thank you everyone for reviewing this ticket.
> >> > >
> >> > > I think if there are no problems with the goal and the overall
> >> solution,
> >> > we
> >> > > are ready to push this ticket forward and I will create some
> detailed
> >> > > sub-tasks for this ticket.
> >> > >
> >> > > I will split this project into three milestones to make this project
> >> > > cleaner for review and merge.
> >> > > Milestone 1: Replacing the current global lock with two locks,
> global
> >> FS
> >> > > lock and global BM lock. End-user can choose which locking mode to
> use
> >> > > through configuration.
> >> > > Milestone 2: Replacing the global FS write lock with directory
> >> tree-based
> >> > > fine-grained lock.
> >> > > Milestone 3: Replacing the global BM lock with directory tree-based
> >> > > fine-grained lock.
> >> > >
> >> > > Each milestone can be merged into the trunk branch in time, which
> has
> >> > > multiple benefits:
> >> > > 1. Phased performance improvements can be quickly used by everyone
> >> > > 2. All developers can better understand the implementation ideas of
> >> the
> >> > > fine-grained locking mechanism as soon as possible
> >> > > 3. Each milestone is developed based on the latest trunk branch to
> >> reduce
> >> > > conflicts
> >> > >
> >> > > If you have any concerns, please feel free to discuss them together.
> >> > > I hope you can join us to push this project forward together,
> thanks.
> >> > >
> >> > >
> >> > > On Mon, 5 Feb 2024 at 11:33, haiyang hu <[email protected]>
> >> wrote:
> >> > >
> >> > > > Thank you for raising the issue of this long-standing bottleneck,
> >> this
> >> > > > will be a very important improvement!
> >> > > >
> >> > > > Hopefully can participate and push forward together.
> >> > > >
> >> > > > Best Regards~
> >> > > >
> >> > > > Brahma Reddy Battula <[email protected]> 于2024年2月3日周六 00:40写道：
> >> > > >
> >> > > >> Thanks for bringing this and considering all the history around
> >> this.
> >> > > >> One of the outstanding bottleneck(global lock) from a long time.
> >> > > >>
> >> > > >> Hopefully we can push forward this time.
> >> > > >>
> >> > > >>
> >> > > >> On Fri, Feb 2, 2024 at 12:23 PM Hui Fei <[email protected]>
> >> > wrote:
> >> > > >>
> >> > > >> > Thanks for driving this. It's very meaningful. The performance
> >> > > >> improvement
> >> > > >> > looks very good.
> >> > > >> >
> >> > > >> > Many users are facing the write performance issue. As far as I
> >> know,
> >> > > >> some
> >> > > >> > companies already implemented the similar idea on their
> internal
> >> > > >> branches.
> >> > > >> > But the internal branch is very different from the community
> >> one. So
> >> > > >> it's
> >> > > >> > very hard to be in sync with the community. If this improvement
> >> can
> >> > be
> >> > > >> > involved in the community, that would be great to both end-user
> >> and
> >> > > the
> >> > > >> > community.
> >> > > >> >
> >> > > >> > It is very worth doing.
> >> > > >> >
> >> > > >> > Zengqiang XU <[email protected]> 于2024年2月2日周五 11:07写道：
> >> > > >> >
> >> > > >> > > Hi everyone
> >> > > >> > >
> >> > > >> > > I have started a discussion about NameNode Fine-grained
> >> Locking to
> >> > > >> > improve
> >> > > >> > > performance of write operations in NameNode.
> >> > > >> > >
> >> > > >> > > I started this discussion again for serval main reasons:
> >> > > >> > > 1. We have implemented it and gained nearly 7x performance
> >> > > >> improvement in
> >> > > >> > > our prod environment
> >> > > >> > > 2. Many other companies made similar improvements based on
> >> their
> >> > > >> internal
> >> > > >> > > branch.
> >> > > >> > > 3. This topic has been discussed for a long time, but still
> >> > without
> >> > > >> any
> >> > > >> > > results.
> >> > > >> > >
> >> > > >> > > I hope we can push this important improvement in the
> community
> >> so
> >> > > that
> >> > > >> > all
> >> > > >> > > end-users can enjoy this significant improvement.
> >> > > >> > >
> >> > > >> > > I'd really appreciate you can join in and work with me to
> push
> >> > this
> >> > > >> > feature
> >> > > >> > > forward.
> >> > > >> > >
> >> > > >> > > Thanks very much.
> >> > > >> > >
> >> > > >> > > Ticket: HDFS-17366 <
> >> > > https://issues.apache.org/jira/browse/HDFS-17366>
> >> > > >> > > Design: NameNode Fine-grained locking based on directory tree
> >> > > >> > > <
> >> > > >> > >
> >> > > >> >
> >> > > >>
> >> > >
> >> >
> >>
> https://docs.google.com/document/d/1bVBQcI4jfzS0UrczB7UhsrQTXmrERGvBV-a9W3HCCjk/edit?usp=sharing
> >> > > >> > > >
> >> > > >> > >
> >> > > >> >
> >> > > >>
> >> > > >
> >> > >
> >> >
> >>
> >
>

Re: Discussion about NameNode Fine-grained locking

Reply via email to