On 3/8/13 3:19 PM, Rafael Schloming wrote:
On Thu, Mar 7, 2013 at 5:15 AM, Bozo Dragojevic <bo...@digiverse.si> wrote:

    On 3/6/13 3:45 AM, Rafael Schloming wrote:

        Oops, meant to send this to the list.

        ---------- Forwarded message ----------
        From: Rafael Schloming <r...@alum.mit.edu>
        Date: Tue, Mar 5, 2013 at 6:44 PM
        Subject: Re: put vs. send
        To: Bozo Dragojevic <bo...@digiverse.si>

[snip]

Thank you for taking the time to explain, that's very helpful. It makes me wonder if we shouldn't include some kind of pn_messenger_interrupt() to pull you out of blocking calls early.

That'd be great!

A few weeks ago I submitted https://issues.apache.org/jira/browse/PROTON-231
for exactly this feature, and the JIRA has a link to my attempt at an implementation
(based off the 0.3 branch) -- this is what we use now.

I've named it pn_messenger_wakeup(), after pn_driver_wakeup() which it calls.
In addition, pn_messenger_tsync() needed to be adjusted to actually exit early :)
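
To make the intended use concrete, here is a minimal sketch of how we drive it, assuming the pn_messenger_wakeup() from the PROTON-231 patch (the thread setup, error handling and proper atomics are elided):

/* Sketch only: pn_messenger_wakeup() is the call proposed in PROTON-231,
 * not part of stock proton 0.3; error handling and proper atomics elided. */
#include <proton/message.h>
#include <proton/messenger.h>

static pn_messenger_t *messenger;
static volatile int shutting_down = 0;

/* runs on its own thread; blocks in recv until messages arrive or we wake it */
static void *recv_loop(void *arg)
{
    (void)arg;
    while (!shutting_down) {
        pn_messenger_recv(messenger, 10);      /* may block in pn_driver_wait() */
        while (pn_messenger_incoming(messenger)) {
            pn_message_t *msg = pn_message();
            pn_messenger_get(messenger, msg);
            /* ... process msg ... */
            pn_message_free(msg);
        }
    }
    return NULL;
}

/* called from another thread, e.g. on shutdown or when new work is queued */
static void stop(void)
{
    shutting_down = 1;
    pn_messenger_wakeup(messenger);            /* the proposed call: unblocks recv_loop */
}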




    So for me this opens up two new areas (one of the reasons it took
    me a bit to reply):
    - study up on the 'trackers' API,
      IIUC that's the name of what pn_messenger_put() returns
    - figure out how to use them inside our code so that
      the publisher (API user) will not drown itself in messages
      it cannot realistically send.


I'm not sure trackers are what you want here (although they may be, depending on your app). Trackers are used to track the status of a message at the receiver, e.g. a tracker will tell you whether the receiver has accepted or rejected the message.

The fourth paragraph below circles back to trackers :)

If you just care about limiting the rate of your producer to match what you can actually push onto the wire, then you can just look at pn_messenger_outgoing(); it will tell you how many messages are actually backed up waiting to go onto the wire. E.g. before or after doing a put/send/recv/etc, you can check the size of the outgoing queue and, based on that, decide whether you need to throttle your producer.
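
A rough sketch of that kind of throttling (the high-water mark and the decision to block in send() here are purely illustrative):

/* Rough sketch of producer throttling on pn_messenger_outgoing();
 * the high-water mark and the decision to block in send() are arbitrary. */
#include <proton/message.h>
#include <proton/messenger.h>

#define HIGH_WATER 1000

void publish(pn_messenger_t *m, pn_message_t *msg)
{
    pn_messenger_put(m, msg);
    if (pn_messenger_outgoing(m) > HIGH_WATER) {
        /* too much backed up locally: push what we can onto the wire
           before accepting more work from the producer */
        pn_messenger_send(m);
    }
}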

Looking at the total size of the outgoing queue is sufficient for a messenger that is
part of the publisher application and only sends messages to the broker, as it then
has just one peer.

The scenario that is important is this: the broker, too, wants to rate-limit each peer based solely on that peer's capability to receive messages. So if there are two subscribers and one is on a slower link, or is misbehaving in some other way,
then the fast subscriber should not be held hostage by the slow subscriber.
This is even more important if we're talking about two publishers, each with its own set of subscribers.

At the messenger level, the broker ends up with a bunch of AMQP addresses to which
the messages received from the publishers need to be forwarded.
The answer that is sought is essentially pn_messenger_outgoing(address). Now, a tracker does uniquely identify the remote address that is embedded in the message being sent. So it would suffice if there were a way to query the local messenger's
view of the delivery progress of that one message, where one distinct answer would be
"managed to put it on the wire".

A function like that makes it trivial for a messenger user (the broker) to perform
the accounting of 'number of outstanding messages per address' on its own.
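
For now, the closest I can get is to do that accounting around trackers myself. A rough sketch (the fixed-size table is a toy, and PN_STATUS_PENDING lumps together "still queued locally" and "written to the wire but unsettled", which is exactly the distinction missing above):

/* Sketch: the broker keeps one record per forwarded message and derives
 * "outstanding messages per address" by scanning the live records.
 * The fixed-size table is a toy; note PN_STATUS_PENDING does not
 * distinguish "still queued locally" from "on the wire but unsettled". */
#include <string.h>
#include <proton/message.h>
#include <proton/messenger.h>

typedef struct { pn_tracker_t tracker; char address[256]; int live; } entry_t;

static entry_t entries[4096];   /* toy fixed-size table, no overflow handling */
static int     nentries = 0;

void forward(pn_messenger_t *m, pn_message_t *msg, const char *address)
{
    pn_message_set_address(msg, address);
    pn_messenger_put(m, msg);
    entries[nentries].tracker = pn_messenger_outgoing_tracker(m);
    strncpy(entries[nentries].address, address, sizeof entries[nentries].address - 1);
    entries[nentries].live = 1;
    nentries++;
}

/* settle everything that is no longer pending, then count what is left
 * for the given address */
int outstanding_for(pn_messenger_t *m, const char *address)
{
    int count = 0;
    for (int i = 0; i < nentries; i++) {
        if (!entries[i].live) continue;
        if (pn_messenger_status(m, entries[i].tracker) != PN_STATUS_PENDING) {
            pn_messenger_settle(m, entries[i].tracker, 0);
            entries[i].live = 0;
        } else if (strcmp(entries[i].address, address) == 0) {
            count++;
        }
    }
    return count;
}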




    - ability to spread out the load of serialization and deserialization
      over cores,
      and as mentioned above there are two 'layers' of it:
      bytes <=> pn_message <=> API object
      Providing this within the messenger paradigm sounds tricky to me
      -- I definitely don't want messenger to grow its own thread pool :)


This sounds to me like it could possibly be addressed by a smarter pn_message implementation. Right now pn_message_t is implemented in a very basic way that actually does a lot of unnecessary encode/decode work. Not only could that be made more efficient so there is much less load to distribute, but it could also be made to happen on demand. That would give you the flexibility of hitting most of the cost of the bytes <=> pn_message_t portion in whichever thread you choose.

This multi-core business obviously makes the most sense when there is more
than one TCP connection over which data flows in.

The thread that read a bunch of bytes from a socket already has them in that CPU's cache. Doing as much of the work with those bytes on that CPU, without context switches,
will be fast.
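
In other words, if the bytes <=> pn_message_t step could be driven explicitly, it would naturally land on the thread that already owns the bytes. A sketch of what that step looks like with the existing pn_message_decode() (how the raw payload bytes were obtained is left out here):

/* Sketch of the bytes <=> pn_message_t step done explicitly on the thread
 * that already owns the bytes. How the raw AMQP payload was obtained is
 * left out; pn_message_decode() is the existing call. */
#include <stddef.h>
#include <proton/message.h>

pn_message_t *decode_on_this_core(const char *bytes, size_t size)
{
    pn_message_t *msg = pn_message();
    if (pn_message_decode(msg, bytes, size)) {   /* bytes -> pn_message_t */
        pn_message_free(msg);
        return NULL;
    }
    /* pn_message_t -> application object would happen here as well,
       still on the same core, while the data is warm in cache */
    return msg;
}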



    - more control over IO and waiting (as per the description above)
      this alone would still fit into your proposed extension quite nicely
      In the other thread you mention bringing send() and recv() closer
      wrt the meaning of a parameter.
      It'd be a tiny step from there to provide a sync(send_limit, recv_limit)
      which would help me in reasoning about there being exactly one path
      that ends up in pn_driver_wait() -- this would simplify interactions between

    - ability to maybe (ab)use several TCP connections for sending
      messages carrying information with different latency sensitivity
      This one is more a possible solution than a requirement
      What would be sufficient from an interface perspective is the ability
      for messages (potentially to the same destination) to jump the queue,
      so to speak, but this could potentially still fit within the messenger
      paradigm


This is an interesting one. AMQP does provide a priority header on the message, which is a clue that ordering is relaxed. Right now neither messenger nor the engine pays any attention to it, though. As an interface it seems like the obvious thing; the implementation, however, might be interesting.
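
For what it's worth, the priority field is already settable on a message today; nothing acts on it yet. A sketch, assuming messenger/engine eventually honor it:

/* The priority header can already be set on a message; values are 0-255,
 * higher means more urgent, and AMQP's default is 4. Today nothing in
 * messenger or the engine acts on it. */
#include <proton/message.h>
#include <proton/messenger.h>

void put_urgent(pn_messenger_t *m, pn_message_t *msg)
{
    pn_message_set_priority(msg, 9);   /* hint: let this one jump the queue */
    pn_messenger_put(m, msg);
}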




Thanks,
Bozzo
