Re: [ovs-dev] [RFC ovn PATCH 0/5] Separate pinctrl to its own process

Mark Michelson Thu, 07 Nov 2019 14:02:44 -0800

Hi Han, I had some time to get back to this. See my comments below.


On 10/21/19 4:01 PM, Han Zhou wrote:

Hi Mark,
Thanks for the patch. We had a brief discussion during the last OVN meeting. Let me put my points inline.
On Fri, Oct 18, 2019 at 1:43 PM Mark Michelson <[email protected]> wrote:
 >
 > This proposes a set of patches to move pinctrl operations out of the
 > ovn-controller process and into its own.
 >
 > The main reasons for doing this are the following:
 > 1) Separating pinctrl makes it so that receiving a packet-in can't wake
 > up ovn-controller.
To avoid waking up ovn-controller, it doesn't have to be in a separate process. A thread with its own OVSDB IDL to the SB DB can achieve the same thing, as this old patch did: https://mail.openvswitch.org/pipermail/ovs-dev/2017-May/332887.html
However, the separate SB connection raised a scalability concern. There were discussions and thoughts about a separate thread without introducing a new SB connection, but once the two threads share the same SB connection, there has to be some synchronization between the threads, which ends up with them waking up or blocking each other whenever pinctrl processing needs to read or write SB data. The current multi-thread implementation from Numan is a trade-off that avoids a new SB connection but syncs with the main thread when SB data is needed. It is perfect for pinctrl handling that doesn't require SB data, and then wakes up ovn-controller for updating SB data.

I followed the discussions that resulted from the patch you sent. It looks like the concerns are that you either have to

1) Have two separate connections to the SB database, resulting in double the connections (this is what my patch does)
2) Have one connection to the SB database but synchronise the efforts of the different concerns of ovn-controller (namely logical flow processing and pinctrl processing).

1 is easy but resource intensive, and 2 is difficult but has the potential for not having the same bottlenecks.

But this has me thinking. The current IDL code assumes that one IDL client == one database connection. I suppose it may be possible to alter the OVSDB IDL code so that a single connection could be shared by multiple IDL clients. In other words, you could create the SB connection, then create a separate thread. Each thread would then create an IDL that makes use of the same connection. The OVSDB client code would need to be altered to be able to notify multiple IDLs about changes. The client would also need to be modified to be thread safe, in the case that multiple threads want to write to the database at once.

This would allow for multithreading, and the controller code wouldn't need to worry about synchronization. Each thread would have its own data it manipulates, and the synchronization would be handled by the IDL itself.
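
To make the shape of that idea concrete, here is a rough sketch. It is written in Python purely for brevity and is not OVS code: the real IDL is C, and every class and method name below (SharedConnection, IdlClient, transact, run) is hypothetical, as is the use of a plain dictionary in place of the database. It only illustrates one connection fanning updates out to several per-thread replicas, with writes serialized by a lock inside the shared layer.

```python
import queue
import threading


class SharedConnection:
    """Hypothetical stand-in for a single SB connection shared by several IDLs."""

    def __init__(self):
        self._lock = threading.Lock()  # serializes writers from different threads
        self._clients = []             # every IDL registered on this connection
        self._db = {}                  # stand-in for the remote database contents

    def register(self, client):
        with self._lock:
            self._clients.append(client)
            # Send the current contents as the client's initial snapshot.
            for key, value in self._db.items():
                client.pending.put((key, value))

    def transact(self, key, value):
        """Commit one write, then queue a notification for every client."""
        with self._lock:
            self._db[key] = value
            for client in self._clients:
                client.pending.put((key, value))


class IdlClient:
    """Hypothetical per-thread IDL: private replica, shared connection."""

    def __init__(self, conn):
        self.replica = {}             # only the owning thread reads or writes this
        self.pending = queue.Queue()  # thread-safe inbox filled by the connection
        self._conn = conn
        conn.register(self)

    def run(self):
        """Apply queued updates; the owning thread calls this from its loop."""
        while True:
            try:
                key, value = self.pending.get_nowait()
            except queue.Empty:
                return
            self.replica[key] = value

    def write(self, key, value):
        self._conn.transact(key, value)
```

In this sketch the controller threads never lock anything themselves: a pinctrl thread would call write(), and the flow-processing thread would see the change the next time it calls run() on its own client. The hard parts of a real implementation (transaction semantics, partial replicas, reconnects) are not modelled here.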

Having said all this, though, it would not be a trivial task to implement this. And so the question becomes, is it worth it?

Today (2.12) there are improvements in both ovn-controller and the OVSDB server that alleviate the scale problems on both sides.
- For ovn-controller, with incremental processing, when there is no input change, it doesn't trigger flow recomputing, even when pinctrl wakes up the main thread. The major concern may be that when the main thread does need a recompute, it could block pinctrl processing for messages that require SB data access, such as ARP handling.
- For SB DB
  - Active-active clustering alleviates the burden on a single server and spreads it across 3 or 5. However, RAFT is not designed for scale. Writes always happen on the leader node, and the cost of cluster sync between leader and followers becomes higher as the number of nodes increases.
  - The fast-resync feature (requiring active-active clustered mode) avoids the slowness of data resync to all clients after a DB restart/failover. However, it doesn't help if ovsdb-server is overloaded by regular updates and notifications during normal operations, given that it is single threaded. Also, there are corner cases where fast-resync doesn't help, e.g. when a DB restart/failover happens just after a compress, when all the transaction history is lost.
I'd suggest reconsidering these scalability concerns, and the pros and cons of a dedicated SB connection for pinctrl, before moving forward with this approach.
 > 2) Separating pinctrl allows for manipulating the southbound database
 > directly while handling packets in, thus minimizing the need for storing
 > local copies of data
This is true, but similar to point 1), it doesn't necessarily need a separate process. The point is whether pinctrl (thread or process) should use a dedicated SB connection.

Yep, you're definitely correct here. I guess my thought here was that if the two threads have no need to share any memory or state, then it makes more sense for them to exist as two processes instead.

However, what I suggested above about multiple IDL clients sharing a connection would require that the code be multithreaded rather than multiprocess.

 > 3) This lays the groundwork for an easier eventual conversion of
 > ovn-controller to DDlog, since the DDlog code would need to only handle
 > flow creation, not packet in handling.
 >
Agree with this point. This is probably the most important benefit of separating pinctrl as a process. Although it is still possible to have pinctrl as a thread sharing the SB connection while converting the flow processing part to DDlog, a separate process does make the conversion cleaner.
In addition, a separate process introduces some operational costs, although this is not a big concern. Tooling like ovn-ctl and the packaging also need to be updated.

If you look at the ovn-northd implementation of DDlog, you'll see that there is still a C process that controls everything. So it definitely could still work that for ovn-controller, the C program would create a separate thread for pinctrl and then perform the DDlog flow generation in the main thread.

If we were to attempt to limit the SB DB connections but also allow for multiple processes, then you'd likely need to create a third process that actually performs database communications, and use IPC to communicate between this third process and the pinctrl and controller processes. I don't think this is any simpler than just having two threads in a single process.


Thanks,
Han

I'm going to back off on this change for the time being. I'm going to take a closer look at the IDL client code to see how feasible it would be to make it thread-safe and allow for multiple clients to share a single connection. No guarantee that I actually come forward with such a patch any time soon though :)

