Re: [thrift] RPC with completion functions

Philip Fung Thu, 29 May 2008 11:40:58 -0700

Not good enough. I actually don't want a lock object because nothread will be
waiting.

Hm, I assume you're using a ThreadPoolServer so your insert() call isusing a thread already. The wait/notifyAll()

would be on that thread.

It seems like no matter what you'll have to hang on the thread waitingon the notification.

If you don't want the overhead of leaving 100's of threads hanging andyour process is I/O bound (which it seems to be), it may make sense totry a non-blocking server, whichcan handle ten of thousands of idle connections. Unfortunately, Idon't think Thrift currently supports a NonBlockingServer for Java.I am actually in the process of writing this into Thrift; buts thatslike a month or two away...(you can look over http://www.oreilly.com/catalog/9780596002886/chapter/index.html?CMP=ILL-4GV796923290).

I actually think your best bet is to not wait for a status and returnimmediately, but again its up to your system design.


--------------------------
Philip Fung
Engineering
Facebook, Inc.
[EMAIL PROTECTED]


On May 28, 2008, at 3:07 PM, Benjamin Reed wrote:

Not good enough. I actually don't want a lock object because nothread will be

waiting. (I can't have hundreds of threads waiting around for a

notification.) I actually want to send the response back to theclient where

you have the notifyAll().

ben

On Wednesday 28 May 2008 14:53:00 Philip Fung wrote:
> K, in that case try using synchronized statements.
>
> skeleton:
>
> private       Object lock = new       Object();
> static Queue<dataType> dataToInsert = new Queue<dataType>();
> static long TIMEOUT = 30;
>
> // thrift function
> public bool insert(dataType data) {
>
>     long timeBegin = System.currentTimeMillis();
>     bool      timedOut = false;
>     synchronized(lock) {
>
>         dataToInsert.add(data);
>
>         while (dataToInsert.size() <= INSERT_SIZE_THRESHOLD &&
> ((timedOut = ((System.currentTimeMillis() - timeBegin()) >
> this.TIMEOUT)))) {
>            wait(this.TIMEOUT);
>         }
>
>         if (dataToInsert.size() > 0) {
>
>           // clear out dataToInsert and dump to DB HERE
>
>            notifyAll();
>         }
>
>     }
>
>     return (!timedOut && !someDBErrorOccurred) ;
> }
>
>
> --------------------------
> Philip Fung
> Engineering
> Facebook, Inc.
> [EMAIL PROTECTED]
>
> On May 28, 2008, at 1:46 PM, Benjamin Reed wrote:
> > Yes, I really need a success code. (The client is going to be
> > waiting for it.)
> > I want to do exactly what you are proposing except that I want to
> > queue a
> > completion function into dataToInsert as well, so that the
> > insertWorker can
> > send back the response to the rpc to the client when the commit
> > completes.
> >
> > thanx
> > ben
> >
> > On Wednesday 28 May 2008 12:01:49 Philip Fung wrote:
> > > Does your calling function really need to know if the insert was

> > > successful? If you can do without this extra overhead, then Ithink> > > the best way to do this is to place the insert request on aqueue

> >
> > and
> >

> > > free up the RPC request immediately rather than wait for theinserts

> > > to batch up and complete.
> > >
> > > So some skeleton code would be:
> > >
> > > static LinkedBlockingQueue<dataType> dataToInsert = new
> > > LinkedBlockingQueue<dataType>(); //java.util.concurrent
> > > static int INSERT_TIME_THRESHOLD = 3600;
> > > static int INSERT_SIZE_THRESHOLD = 1000;
> > >
> > > // thrift-accessible function
> > > public void insert(dataType data) {
> > >               this.dataToInsert.add(data); //thread-safe
> > > }
> > >
> > > // worker threads
> > > private static class insertWorker extends Thread {
> > >
> > >       public void run() {
> > >
> > >               ArrayList<dataType> dataToInsertWorker = new
> > > ArrayList<dataType>();
> > >               while (true) {
> > >
> > > dataToInsertWorker.add(this.dataToInsert.take()); //blocks
> > >
> > >                        if ((dataToInsertWorker.size() >
> >
> > INSERT_SIZE_THRESHOLD) ||
> >
> > > (dataToInsertWorker.size() > 1 &&
> > >                               (System.currentTimeMillis() -
> > > lastInsertTime) > INSERT_TIME_THRESHOLD)) {
> > >
> > >                                // insert into DB here
> > >
> > >                                lastInsertTime =
> > > System.currentTimeMillis();
> > >                           }
> > >                  }
> > >
> > >           }
> > > }
> > >
> > >
> > >
> > >
> > > --------------------------
> > > Philip Fung
> > > Engineering
> > > Facebook, Inc.
> > > [EMAIL PROTECTED]
> > >
> > > On May 28, 2008, at 11:46 AM, Ben Maurer wrote:
> > > > Hey,
> > > >

> > > > Usually, when writing stuff for thrift, I've found it's bestto

> > > > create an
> > > > object that takes parameters:
> > > >
> > > > FooReturn doFoo(1: FooArgs args);
> > > >
> > > > For the return value, this is pretty critical because thrift
> >
> > doesn't
> >
> > > > allow
> > > > you to return multiple values. For the arguments, I've found
> >
> > that even
> >

> > > > though thrift can support multiple arguments, doing thismakes it

> > > > easier
> > > > (eg, you can serialize args and log it).
> > > >
> > > > So for this kind of API I'd just take the args value insert it
> >
> > into a
> >

> > > > queue. It does require a bit of work for each function,however,

> >
> > you
> >
> > > > can

> > > > also do stuff like validate the request and raise anexception if

> > > > you know
> > > > the insert will fail.
> > > >
> > > > -b
> > > >
> > > > On Wed, 28 May 2008, Benjamin Reed wrote:

> > > >> Could I get a pointer to how to deal with the followingscenario:

> > > >>

> > > >> I have a Java server using thrift. There are potentiallyhundreds

> > > >> of clients
> > > >> sending hundreds of requests at a time. The server receives a
> > > >> request,

> > > >> batches it up with other pending requests, processes abatch at a

> > > >> time, and
> > > >> then generates the responses when the batch finishes.
> > > >>
> > > >> For example, clients A, B, and C, are each sending up records
> >
> > to be
> >
> > > >> inserted
> > > >> into a database. The clients are sending up 1000 requests per
> > > >> second. The

> > > >> server will grab some number, lets say 100 requests at atime,

> > > >> insert them
> > > >> into the database, issue a commit, and send back successful
> > > >> responses. Doing

> > > >> batch commits of 100 requests at a time allows the serverto keep

> > > >> up with the

> > > >> clients. Committing each request individually would be tooslow.

> > > >>

> > > >> So, in my Java server, how do I get an RPC request and thenput

> >
> > it
> >
> > > >> on a

> > > >> completion list so that I can free up the thread for thenext RPC

> > > >> call and
> > > >> complete the RPC when I do the batch processing?
> > > >>
> > > >> thanx
> > > >> ben
> > > >> _______________________________________________
> > > >> thrift mailing list
> > > >> [EMAIL PROTECTED]
> > > >> http://publists.facebook.com/mailman/listinfo/thrift
> > > >
> > > > _______________________________________________
> > > > thrift mailing list
> > > > [EMAIL PROTECTED]
> > > > http://publists.facebook.com/mailman/listinfo/thrift

Re: [thrift] RPC with completion functions

Reply via email to