I'm actually considering using Thrift in a similar way: as a fast, cross-language serialization-and-transport mechanism between a bunch of different apps in a pub/sub architecture.

There are a number of possible message types -- 100? -- and it won't always be possible for each app to know which message types the others support, so I'd like to avoid each app having an RPC service for each possible message type; I'd rather hand them objects and let them just ignore the ones they don't care about.

I see a few different ways to do this:

1. Define only one struct with all possible fields for all possible messages, and a "type" field that lets you figure out what it is. It seems kinda stupid to do this, since one of the major reasons I'm interested in Thrift is type-awareness.

2. Modify the TService layer so that RPC arguments aren't statically typed: make it possible to declare an RPC call that accepts any struct. Feels like (void*), and probably also irritates purists who like the simple no-object-inheritance, no-function-overloading model Thrift uses today.

3. Build something custom at the TProcessor layer and skip TService altogether.

Both #2 and #3 require changing some guts. I think that would go something like this:

* Add a TMessageType (T_STRUCT?) that indicates "I'm sending you data, not calling a function!" Or is that what T_ONEWAY is for?
* Modify TBinaryProtocol so that writeStructBegin() and writeStructEnd() aren't no-ops -- otherwise the receiver doesn't know what he's receiving!
* Implement a TProcessor that can read the struct type, instantiate it, and do some sort of dispatch to the client app (rough sketch below).
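To make that last point concrete, here's a rough, untested Java sketch of the kind of dispatching TProcessor I'm imagining. The handler interface and registry are made-up names, and it assumes each incoming message frames exactly one struct whose type is carried in the message name -- none of this is existing Thrift machinery beyond TProcessor/TProtocol/TBase themselves:

import java.util.HashMap;
import java.util.Map;

import org.apache.thrift.TBase;
import org.apache.thrift.TException;
import org.apache.thrift.TProcessor;
import org.apache.thrift.protocol.TMessage;
import org.apache.thrift.protocol.TProtocol;

// Rough sketch only: dispatches incoming messages to the app by struct type,
// using the Thrift message name as the type key.
public class DispatchingProcessor implements TProcessor {

  // Hypothetical callback the app implements to receive whatever structs it cares about.
  public interface MessageHandler {
    void handle(TBase message);
  }

  // Maps a message name (e.g. "PositionUpdate") to the struct class to instantiate.
  private final Map<String, Class<? extends TBase>> registry =
      new HashMap<String, Class<? extends TBase>>();
  private final MessageHandler handler;

  public DispatchingProcessor(MessageHandler handler) {
    this.handler = handler;
  }

  public void register(String messageName, Class<? extends TBase> structClass) {
    registry.put(messageName, structClass);
  }

  public boolean process(TProtocol in, TProtocol out) throws TException {
    TMessage msg = in.readMessageBegin();              // message name tells us which struct follows
    Class<? extends TBase> structClass = registry.get(msg.name);
    if (structClass == null) {
      // Unknown type: a real implementation would need to skip the payload here.
      in.readMessageEnd();
      return true;
    }
    try {
      TBase struct = structClass.newInstance();        // instantiate the right struct...
      struct.read(in);                                 // ...and let it deserialize itself
      in.readMessageEnd();
      handler.handle(struct);                          // the app ignores types it doesn't care about
    } catch (InstantiationException e) {
      throw new TException("could not instantiate struct for " + msg.name);
    } catch (IllegalAccessException e) {
      throw new TException("could not instantiate struct for " + msg.name);
    }
    return true;
  }
}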

Any thoughts on this?  Has someone else already solved this?

Will


On Apr 3, 2009, at 6:22 PM, Brian Hammond wrote:

That's neat, Joel. However, does this scale? I mean, the underlying assumption here is that clients are using persistent connections to the service, and you [not so simply] are sending messages back to the client over that same connection. Thus, your service now has to handle a potentially large number of client connections. Unless you're using something like libev[ent] I don't see this scaling beyond, say, 20K connections. Two caveats: I could be missing something here, and this level of scalability is probably just fine for *many* types of services (perhaps not for a chat server though)!

I'm curious to hear other people's thoughts on this and how it could be made scalable, since, well, I'm planning on using polling in my project: I'm expecting a potentially very large number of simultaneous users of the service, and my servers can only handle so many connections.

Thanks for sharing this.

Brian

On Apr 3, 2009, at 7:50 PM, Joel Meyer wrote:

On Thu, Apr 2, 2009 at 4:17 PM, Joel Meyer <[email protected]> wrote:

On Tue, Mar 24, 2009 at 5:01 PM, Doug Daniels <[email protected]> wrote:

Ok, I definitely plan on giving the Async RPC methods a try tonight, but I figured I'd just throw out some questions before I get home to start hacking on this stuff.

The one-to-one message-to-RPC-call async solution will let a client send messages of any given type in my defined protocol, but how would a server respond to a client with a message that the client didn't request? For example, say I was trying to write an FPS like Quake and I want the server to send position updates for all clients to all clients: how would I model that as a client RPC request? With the async RPC solutions I could make an RPC call for Map<Integer, Position> getPositionUpdates(). Now say the client needs to request 50 other message types to be notified of. I guess the solution would be to make an async RPC call requesting those updates, respond when I receive the result asynchronously, and then reissue another async RPC call for the next set of updates. It just seems inefficient to make the client actively request data when the server could implicitly know that, when connected on this game protocol, it can just send these messages to the clients without them asking for it. Not to mention you'd have to make sure you don't "miss" sending a client a message if they finished their async call but haven't reestablished a new one.


I think I've done something similar to what you're trying to do, and as long as you can commit to using only async messages it's possible to pull it off without having to start a server on the client to accept RPCs from the server.

When your RPC is marked as async, the server doesn't send a response and the client doesn't try to read one. So, if all your RPC calls from the client to the server are async, you have effectively freed up the inbound half of the socket connection. That means you can use it for receiving async messages from the server - the only catch is that you have to start a new thread to read and dispatch the incoming async RPC calls.
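If it helps, that reader thread on the client ends up looking roughly like this (untested sketch; MyService and the handler are placeholders for whatever service you've defined):

import org.apache.thrift.TException;
import org.apache.thrift.TProcessor;
import org.apache.thrift.protocol.TBinaryProtocol;
import org.apache.thrift.protocol.TProtocol;
import org.apache.thrift.transport.TTransport;

// Runs on the client, reading async RPCs the server pushes down the same socket
// the client uses for its own outgoing async calls.
public class IncomingCallReader implements Runnable {
  private final TProcessor processor;   // e.g. new MyService.Processor(new ClientSideHandler())
  private final TTransport transport;   // the same open transport your MyService.Client writes to

  public IncomingCallReader(TProcessor processor, TTransport transport) {
    this.processor = processor;
    this.transport = transport;
  }

  public void run() {
    TProtocol protocol = new TBinaryProtocol(transport);
    try {
      // process() blocks until a message arrives and dispatches it to the handler;
      // it throws when the connection drops, which is how you find out immediately
      // that the server went away.
      while (processor.process(protocol, protocol)) {
        // keep reading
      }
    } catch (TException e) {
      // connection closed or protocol error; clean up or reconnect as appropriate
    }
  }
}

You'd kick it off with something like new Thread(new IncomingCallReader(new MyService.Processor(handler), transport)).start() once the connection is open.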

In a typical Thrift RPC system you'd create a MyService.Processor on your server and a MyService.Client on your client. To do bidirectional async message sending you'll need to go a step further and create a MyService.Client on your server for each client that connects (this can be accomplished by providing your own TProcessorFactory), and then on each client you create a MyService.Processor. (This assumes that you've gone with a generic MyService definition like you described above that has a bunch of optional messages; another option would be to define separate service definitions for the client and server.) With two clients connected, the objects in existence would look something like this (a sketch of the TProcessorFactory piece follows the listing):

Server:
MyService.Processor mainProcessor - handles incoming async RPCs
MyService.Client clientA - used to send outgoing async RPCs to ClientA
MyService.Client clientB - used to send outgoing async RPCs to ClientB

ClientA:
MyService.Client - used to send messages to Server
MyService.Processor clientProcessor - used (by a separate thread) to process incoming async RPCs

ClientB:
MyService.Client - used to send messages to Server
MyService.Processor clientProcessor - used (by a separate thread) to process incoming async RPCs

Hopefully that explains the concept. If you need example code I can try and pull something together (it will be in Java). The nice thing about this method is that you don't have to establish two connections, so you can get around the firewall issues others have mentioned. I've been using this method on a service in production and haven't had any problems. When you have a separate thread in your client running a Processor you're basically blocking on a read, waiting for a message from the server. The benefit of this is that you're notified immediately when the server shuts down, instead of having to wait until you send a message and then finding out that the TCP connection was reset.

Cheers,
Joel


Thanks for the feedback. I've created a simple example in Java demonstrating this in action:
http://www.joelpm.com/wp-content/uploads/2009/04/bidimessages.tgz

Post with a few details on the implementation:
http://www.joelpm.com/2009/04/03/thrift-bidirectional-async-rpc/

Please add me to the list of people who think there's value in a full async transport that provides (optional?) synchronization at the API level using futures/deferreds/etc.

Cheers,
Joel





The biggest issue is that not all client requests will result in a single response (shooting a bullet, for example, may blow up an entity and damage all players in the area; those events are separate messages sent from the respective entities).

At a game development studio I used to work at, we developed a cross-language IDL network protocol definition (C++, Java) very similar to Protocol Buffers and Thrift, without some of the more mature features like protocol versioning or being transport agnostic (we explicitly built it for binary TCP socket transport). The stream of packets contained, as the first 32 bits, a message ID that keyed into a map of Message classes, each with methods to read that message type in from a byte[] stream.
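The dispatch loop was conceptually something like this (names are made up here, just to show the shape of it):

import java.io.DataInputStream;
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;

// Illustration of that old protocol's dispatch: the leading 32-bit message ID
// selects a reader that knows how to decode the rest of that message type.
public class MessageIdDispatcher {

  public interface MessageReader {
    Object read(DataInputStream in) throws IOException;
  }

  private final Map<Integer, MessageReader> readers = new HashMap<Integer, MessageReader>();

  public void register(int messageId, MessageReader reader) {
    readers.put(messageId, reader);
  }

  public Object readNext(DataInputStream in) throws IOException {
    int messageId = in.readInt();                 // first 32 bits of every packet
    MessageReader reader = readers.get(messageId);
    if (reader == null) {
      throw new IOException("unknown message ID: " + messageId);
    }
    return reader.read(in);                       // type-specific deserialization
  }
}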

Looking through the Thrift code, in TBinaryProtocol's writeMessageBegin it looks like it includes the name of the message being sent and its type (is the concept of a Message in Thrift the same as an RPC call?). If so, what's the corresponding code pathway for the client waiting for an RPC response? If I could just use this message name or type to key into what I need to deserialize off the network on both the client and server end, that would be perfect.



On Tue, Mar 24, 2009 at 1:51 PM, Ted Dunning <[email protected]> wrote:

I really think that using async service methods which are matched one-to-one with the message types that you want to send gives you exactly the semantics that are being requested, with very simple implementation cost.

It is important to not get toooo hung up on what RPC stands for. I use async methods all the time to stream data structures for logging and it works great. Moreover, it provides a really simple way of building extractors and processors for this data, since I have an interface sitting there that will tell me about all of the methods (data types) that I need to handle or explicitly ignore.

So the trick works and works really well.  Give it a try!
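For example, with a (hypothetical) service that declares one async method per message type, every consumer gets an interface listing exactly the data types in play, and handling or ignoring one is just a method body:

// Assuming a hypothetical service along the lines of:
//
//   service GameEvents {
//     async void positionUpdate(1: PositionUpdate update),
//     async void bulletFired(1: BulletFired event),
//     // ... one method per message type
//   }
//
// a consumer that only cares about positions just ignores the rest.
public class PositionLogger implements GameEvents.Iface {

  public void positionUpdate(PositionUpdate update) {
    // the one message type this consumer handles
    System.out.println("position update: " + update);
  }

  public void bulletFired(BulletFired event) {
    // explicitly ignored by this consumer
  }
}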

On Tue, Mar 24, 2009 at 8:23 AM, Bryan Duxbury <[email protected]> wrote:

Optional fields are not serialized onto the wire. There is a slight performance penalty at serialization time if you have a ton of unset fields, but that's it.

Am I overcomplicating things?


Personally, sounds like it to me. Why do you need this streaming behavior or whatnot? Hotwiring the RPC stack to let you send any message you want is going to be a ton of work and not really that much of a functionality improvement.
-Bryan




--
Ted Dunning, CTO
DeepDyve





