> Hi Jim,
> I am seeing failed writes to a PostgreSQL database backend remain in
> the write queue on the controller. The duplicate key error message for
> the corresponding write only appears on one of the 3 controllers, but
> the two sister controllers have the same request id 10577 in the
> scheduler queue, along with any other write requests that arrived
> after the 10577 request. Is this normal behavior? How can I clear a
> failed write from the controller's write queue? My three controllers
> basically just start queueing any additional writes after the
> duplicate key write occurs. Any assistance with resolving this issue
> would be greatly appreciated.
From the log you attached, I understand that the query was issued on
the first controller (where it failed) but is still pending on the 2
other controllers. This is why it still shows as 'pending': it has to
wait for the results from the other controllers to decide whether this
was a real failure (all controllers fail) or whether only the local
controller failed (in which case its local backends are disabled and we
continue with the other controllers).
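The resolution rule described above can be sketched roughly as follows. This is a minimal illustration, not Sequoia's actual code; the function and state names are hypothetical:

```python
# Hypothetical sketch of the pending-write resolution rule: a write
# stays 'pending' until every controller has reported a result, then
# the outcome decides whether the failure was global or local.

def resolve_write(results):
    """results: dict mapping controller name -> 'ok', 'failed', or
    None (not yet reported). Returns the action taken."""
    if any(r is None for r in results.values()):
        return "pending"           # still waiting on sister controllers
    if all(r == "failed" for r in results.values()):
        return "real-failure"      # all controllers failed: real error
    if all(r == "ok" for r in results.values()):
        return "success"
    # Mixed outcome: only some controllers failed; their local
    # backends are disabled and the cluster continues with the rest.
    return "disable-failed-backends"
```

For example, `resolve_write({"c1": "failed", "c2": None, "c3": None})` returns `"pending"`, which matches the state you observe: one controller reported the duplicate key error, the two sisters have not yet executed the write.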
> 2nd controller, where no duplicate key error is recorded but the
> request is queued:
>
> ANGe(admin) > dump scheduler queues
> Active transactions: 7
> Transaction id list: 3800 3802 3803 3804 3805 3806 3807
> Pending write requests: 6
> Write request id list: 10586 10593 10591 10581 10587 10577
>
> 3rd controller, where no duplicate key error is recorded but the
> request is queued:
>
> ANGe(admin) > dump scheduler queues
> Active transactions: 8
> Transaction id list: 3703 3800 3802 3803 3804 3805 3806 3807
> Pending write requests: 6
> Write request id list: 10586 10593 10591 10581 10587 10577
>
> Any suggestions on how I can recover when this happens?
What puzzles me is the old transaction 3703 that remains open on the
3rd controller. I have no idea where this could come from, since if it
were a read-only transaction it would have executed on the first
controller (given its id).
Another reason could be a problem with the group communication. Which
one are you using?
Something else to investigate is potential query indeterminism. This
can happen with multiple-table updates or updates with subselects. In
such a case, strict table locking might be needed. Was this duplicate
key exception something you expected?
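To illustrate what indeterminism means here, the following toy sketch (hypothetical names, not Sequoia code) shows how the "same" update can touch different rows on two replicas whose physical row order differs, after which a later insert can hit a duplicate key on one replica but not the other:

```python
# Hypothetical illustration of query indeterminism: an update whose
# affected row depends on physical row order diverges across replicas
# that store the same logical data in a different order.

def apply_update_first_match(rows, predicate, new_value):
    """Update the first row matching predicate; 'first' depends on the
    replica's physical row order, which replication does not fix."""
    for row in rows:
        if predicate(row):
            row["val"] = new_value
            return row["id"]
    return None

# Same logical table, different physical order on each controller.
replica_a = [{"id": 1, "val": 10}, {"id": 2, "val": 10}]
replica_b = [{"id": 2, "val": 10}, {"id": 1, "val": 10}]

updated_a = apply_update_first_match(replica_a, lambda r: r["val"] == 10, 99)
updated_b = apply_update_first_match(replica_b, lambda r: r["val"] == 10, 99)
# updated_a == 1, updated_b == 2: the replicas have silently diverged.
```

Once the replicas diverge like this, a subsequent write can legitimately succeed on some controllers and fail with a duplicate key on another, which is consistent with the symptom in your logs.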
Thanks for your feedback,
Emmanuel
--
Emmanuel Cecchet
Chief Scientific Officer, Continuent
Blog: http://emanux.blogspot.com/
Open source: http://www.continuent.org
Corporate: http://www.continuent.com
Skype: emmanuel_cecchet
Cell: +33 687 342 685
_______________________________________________
Sequoia mailing list
[email protected]
https://forge.continuent.org/mailman/listinfo/sequoia