Jim Davis wrote:

I noticed that the logging information I provided from the test re-run did not have the actual insert for the duplicate key violations which started the whole problem. So, I grep'd out for each transaction on each controller which had the duplicate error (n119 and n123 - controller n116 did not have a duplicate key error message logged).

I truly appreciate all of your efforts,

Jim D.


Emmanuel Cecchet wrote:

Hi Jim,

Sorry for the late reply.

I am using the appia configuration for group communications. I was having issues with jgroups when I starting really loading the system with alot of transactions.

Do you have a significant number of writes or manipulating large amount of data in inserts (like large objects) ?

As I am not a java developer, my knowledge comes from repeated trial and error. Because my developers cannot reproduce this problem in their development environment when they go straight to a postgres data server instead via sequoia, I am left to my own devices to find a working solution if I am to get to keep my sequoia configuration for our new production system. I am willing to try anything to get past this problem. I have DEBUG turned on everywhere... I have my virtualdatabase configuration set to 30 seconds idle timeout and 10 seconds wait.

Setting to DEBUG mode will significantly slowdown the controller and add latency to every single query. This might increase lock contention and induce deadlocks that would not be seen otherwise when accessing directly the database where lock contention would be much less important.

You should also check in the log if you don't see any warning message about table names that are not found in the schema. That could lead to locking conflicts. Stored procedures could also introduce artificial deadlocks. Do you use any?

I would appreciate any suggestions you could send my way for how to proceed.

If would be helpful to dump all controller queues the next time you get a deadlock. You should be able to get a snapshot of running queries and transactions. We could match the ids with what is in your debug log to see if anything wrong happened with any of these queries or transactions.

Keep us posted with your progress,
Emmanuel


2007-01-12 09:26:03,567 DEBUG controller.virtualdatabase.ANGe 
handleMessageSingleThreaded (class 
org.continuent.sequoia.controller.virtualdatabase.protocol.DistributedStatementExecuteUpdate):
 W 1657 569 insert into SubmissionContentBean (contentURL, contentLabel, 
submission_id, id) values (?, ?, ?, 
?)/<!%S|ftp://n140.cluster.larc.nasa.gov/47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|254afd37-d4ac-4c08-9d4f-cfa99d4399c9|!%><!%S|437a636f-0db3-441a-80ac-dc3441baf60b|!%>
2007-01-12 09:26:03,567 DEBUG controller.virtualdatabase.ANGe 
handleMessageSingleThreaded (class 
org.continuent.sequoia.controller.virtualdatabase.protocol.DistributedStatementExecuteUpdate):
 W 1657 569 insert into SubmissionContentBean (contentURL, contentLabel, 
submission_id, id) values (?, ?, ?, 
?)/<!%S|ftp://n140.cluster.larc.nasa.gov/47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|254afd37-d4ac-4c08-9d4f-cfa99d4399c9|!%><!%S|437a636f-0db3-441a-80ac-dc3441baf60b|!%>
2007-01-12 09:26:03,567 DEBUG controller.virtualdatabase.ANGe 
10.10.101.23:25323: Scheduling distributedRequest 1657 from 
Member(address=/10.10.101.16:54230, uid=10.10.101.16:54230)
2007-01-12 09:26:03,567 DEBUG controller.virtualdatabase.ANGe 
10.10.101.23:25323: Scheduling distributedRequest 1657 from 
Member(address=/10.10.101.16:54230, uid=10.10.101.16:54230)
2007-01-12 09:26:03,567 INFO  distributedvirtualdatabase.request.ANGe W 1657 
569 insert into SubmissionContentBean (contentURL, contentLabel, submission_id, 
id) values (?, ?, ?, 
?)/<!%S|ftp://n140.cluster.larc.nasa.gov/47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|254afd37-d4ac-4c08-9d4f-cfa99d4399c9|!%><!%S|437a636f-0db3-441a-80ac-dc3441baf60b|!%>
2007-01-12 09:26:03,567 DEBUG controller.virtualdatabase.ANGe 
handleMessageMultiThreaded (class 
org.continuent.sequoia.controller.virtualdatabase.protocol.DistributedStatementExecuteUpdate):
 W 1657 569 insert into SubmissionContentBean (contentURL, contentLabel, 
submission_id, id) values (?, ?, ?, 
?)/<!%S|ftp://n140.cluster.larc.nasa.gov/47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|254afd37-d4ac-4c08-9d4f-cfa99d4399c9|!%><!%S|437a636f-0db3-441a-80ac-dc3441baf60b|!%>
2007-01-12 09:26:03,567 DEBUG controller.virtualdatabase.ANGe 
handleMessageMultiThreaded (class 
org.continuent.sequoia.controller.virtualdatabase.protocol.DistributedStatementExecuteUpdate):
 W 1657 569 insert into SubmissionContentBean (contentURL, contentLabel, 
submission_id, id) values (?, ?, ?, 
?)/<!%S|ftp://n140.cluster.larc.nasa.gov/47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|254afd37-d4ac-4c08-9d4f-cfa99d4399c9|!%><!%S|437a636f-0db3-441a-80ac-dc3441baf60b|!%>
2007-01-12 09:26:03,568 DEBUG controller.virtualdatabase.ANGe 
10.10.101.23:25323: Executing distributedRequest 1657 from 
Member(address=/10.10.101.16:54230, uid=10.10.101.16:54230)
2007-01-12 09:26:03,568 DEBUG controller.virtualdatabase.ANGe 
10.10.101.23:25323: Executing distributedRequest 1657 from 
Member(address=/10.10.101.16:54230, uid=10.10.101.16:54230)
2007-01-12 09:26:03,568 DEBUG controller.RequestManager.ANGe Scheduling write 
request (1657): insert into SubmissionContentBean (conte...
2007-01-12 09:26:03,568 DEBUG controller.RequestManager.ANGe Scheduling write 
request (1657): insert into SubmissionContentBean (conte...
2007-01-12 09:26:03,569 DEBUG controller.RequestManager.ANGe Load balancing 
write request (1657,569): insert into SubmissionContentBean (conte...
2007-01-12 09:26:03,569 DEBUG controller.RequestManager.ANGe Load balancing 
write request (1657,569): insert into SubmissionContentBean (conte...
2007-01-12 09:26:03,606 DEBUG sequoia.controller.loadbalancer Request 1657 
execute on 2 backends
2007-01-12 09:26:03,606 DEBUG sequoia.controller.loadbalancer Request 1657 
execute on 2 backends
2007-01-12 09:26:03,617 DEBUG sequoia.controller.recoverylog INSERT INTO 
RECOVERY VALUES(1580,user,insert into SubmissionContentBean (contentURL, 
contentLabel, submission_id, id) values (?, ?, ?, 
?),<!%S|ftp://n140.cluster.larc.nasa.gov/47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|254afd37-d4ac-4c08-9d4f-cfa99d4399c9|!%><!%S|437a636f-0db3-441a-80ac-dc3441baf60b|!%>,T,569,1657,E,38,0)
2007-01-12 09:26:03,617 DEBUG sequoia.controller.recoverylog INSERT INTO 
RECOVERY VALUES(1580,user,insert into SubmissionContentBean (contentURL, 
contentLabel, submission_id, id) values (?, ?, ?, 
?),<!%S|ftp://n140.cluster.larc.nasa.gov/47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|47754ed2-707b-4fb8-8820-13ee92876143_LATIS1HOUR_Step_00000005_Sub_00000000_00000000|!%><!%S|254afd37-d4ac-4c08-9d4f-cfa99d4399c9|!%><!%S|437a636f-0db3-441a-80ac-dc3441baf60b|!%>,T,569,1657,E,38,0)
2007-01-12 09:26:03,617 ERROR controller.loadbalancer.RAIDb1 write request 1657 
failed:
2007-01-12 09:26:03,617 ERROR controller.loadbalancer.RAIDb1 write request 1657 
failed:
2007-01-12 09:26:03,620 WARN  controller.RequestManager.ANGe SQLException while 
executing distributed write request write request 1657 failed:
java.sql.SQLException: write request 1657 failed:
2007-01-12 09:26:03,620 WARN  controller.RequestManager.ANGe SQLException while 
executing distributed write request write request 1657 failed:
java.sql.SQLException: write request 1657 failed:
2007-01-12 09:26:04,783 DEBUG controller.virtualdatabase.ANGe 
handleMessageSingleThreaded (class 
org.continuent.sequoia.controller.virtualdatabase.protocol.DistributedStatementExecuteUpdate):
 W 1659 571 insert into SubmissionContentBean (contentURL, contentLabel, 
submission_id, id) values (?, ?, ?, 
?)/<!%S|ftp://n140.cluster.larc.nasa.gov/b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b1d65046-1de4-4cf0-b5e1-db9fc46bed9a|!%><!%S|7ca3f9ea-5ab4-4c90-806a-5d6b55fe1e00|!%>
2007-01-12 09:26:04,783 DEBUG controller.virtualdatabase.ANGe 
handleMessageSingleThreaded (class 
org.continuent.sequoia.controller.virtualdatabase.protocol.DistributedStatementExecuteUpdate):
 W 1659 571 insert into SubmissionContentBean (contentURL, contentLabel, 
submission_id, id) values (?, ?, ?, 
?)/<!%S|ftp://n140.cluster.larc.nasa.gov/b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b1d65046-1de4-4cf0-b5e1-db9fc46bed9a|!%><!%S|7ca3f9ea-5ab4-4c90-806a-5d6b55fe1e00|!%>
2007-01-12 09:26:04,783 DEBUG controller.virtualdatabase.ANGe 
10.10.101.19:25323: Scheduling distributedRequest 1659 from 
Member(address=/10.10.101.16:54230, uid=10.10.101.16:54230)
2007-01-12 09:26:04,783 DEBUG controller.virtualdatabase.ANGe 
10.10.101.19:25323: Scheduling distributedRequest 1659 from 
Member(address=/10.10.101.16:54230, uid=10.10.101.16:54230)
2007-01-12 09:26:04,783 INFO  distributedvirtualdatabase.request.ANGe W 1659 
571 insert into SubmissionContentBean (contentURL, contentLabel, submission_id, 
id) values (?, ?, ?, 
?)/<!%S|ftp://n140.cluster.larc.nasa.gov/b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b1d65046-1de4-4cf0-b5e1-db9fc46bed9a|!%><!%S|7ca3f9ea-5ab4-4c90-806a-5d6b55fe1e00|!%>
2007-01-12 09:26:04,783 DEBUG controller.virtualdatabase.ANGe 
handleMessageMultiThreaded (class 
org.continuent.sequoia.controller.virtualdatabase.protocol.DistributedStatementExecuteUpdate):
 W 1659 571 insert into SubmissionContentBean (contentURL, contentLabel, 
submission_id, id) values (?, ?, ?, 
?)/<!%S|ftp://n140.cluster.larc.nasa.gov/b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b1d65046-1de4-4cf0-b5e1-db9fc46bed9a|!%><!%S|7ca3f9ea-5ab4-4c90-806a-5d6b55fe1e00|!%>
2007-01-12 09:26:04,783 DEBUG controller.virtualdatabase.ANGe 
handleMessageMultiThreaded (class 
org.continuent.sequoia.controller.virtualdatabase.protocol.DistributedStatementExecuteUpdate):
 W 1659 571 insert into SubmissionContentBean (contentURL, contentLabel, 
submission_id, id) values (?, ?, ?, 
?)/<!%S|ftp://n140.cluster.larc.nasa.gov/b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b1d65046-1de4-4cf0-b5e1-db9fc46bed9a|!%><!%S|7ca3f9ea-5ab4-4c90-806a-5d6b55fe1e00|!%>
2007-01-12 09:26:04,783 DEBUG controller.virtualdatabase.ANGe 
10.10.101.19:25323: Executing distributedRequest 1659 from 
Member(address=/10.10.101.16:54230, uid=10.10.101.16:54230)
2007-01-12 09:26:04,783 DEBUG controller.virtualdatabase.ANGe 
10.10.101.19:25323: Executing distributedRequest 1659 from 
Member(address=/10.10.101.16:54230, uid=10.10.101.16:54230)
2007-01-12 09:26:04,784 DEBUG controller.RequestManager.ANGe Scheduling write 
request (1659): insert into SubmissionContentBean (conte...
2007-01-12 09:26:04,784 DEBUG controller.RequestManager.ANGe Scheduling write 
request (1659): insert into SubmissionContentBean (conte...
2007-01-12 09:26:04,785 DEBUG controller.RequestManager.ANGe Load balancing 
write request (1659,571): insert into SubmissionContentBean (conte...
2007-01-12 09:26:04,785 DEBUG controller.RequestManager.ANGe Load balancing 
write request (1659,571): insert into SubmissionContentBean (conte...
2007-01-12 09:26:04,785 DEBUG sequoia.controller.loadbalancer Request 1659 
execute on 2 backends
2007-01-12 09:26:04,785 DEBUG sequoia.controller.loadbalancer Request 1659 
execute on 2 backends
2007-01-12 09:26:04,804 DEBUG sequoia.controller.recoverylog INSERT INTO 
RECOVERY VALUES(1584,user,insert into SubmissionContentBean (contentURL, 
contentLabel, submission_id, id) values (?, ?, ?, 
?),<!%S|ftp://n140.cluster.larc.nasa.gov/b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b1d65046-1de4-4cf0-b5e1-db9fc46bed9a|!%><!%S|7ca3f9ea-5ab4-4c90-806a-5d6b55fe1e00|!%>,T,571,1659,E,2,0)
2007-01-12 09:26:04,804 DEBUG sequoia.controller.recoverylog INSERT INTO 
RECOVERY VALUES(1584,user,insert into SubmissionContentBean (contentURL, 
contentLabel, submission_id, id) values (?, ?, ?, 
?),<!%S|ftp://n140.cluster.larc.nasa.gov/b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b8b5c1c3-4430-4465-8853-f1a5f882154d.APES|!%><!%S|b1d65046-1de4-4cf0-b5e1-db9fc46bed9a|!%><!%S|7ca3f9ea-5ab4-4c90-806a-5d6b55fe1e00|!%>,T,571,1659,E,2,0)
2007-01-12 09:26:04,809 ERROR controller.loadbalancer.RAIDb1 write request 1659 
failed:
2007-01-12 09:26:04,809 ERROR controller.loadbalancer.RAIDb1 write request 1659 
failed:
2007-01-12 09:26:04,812 WARN  controller.RequestManager.ANGe SQLException while 
executing distributed write request write request 1659 failed:
java.sql.SQLException: write request 1659 failed:
2007-01-12 09:26:04,812 WARN  controller.RequestManager.ANGe SQLException while 
executing distributed write request write request 1659 failed:
java.sql.SQLException: write request 1659 failed:
begin:vcard
fn:Jim Davis
n:Davis;Jim
org:Atmospheric Sciences Data Center;NASA Langley Research Center
adr;dom:;;;Hampton;VA
email;internet:[EMAIL PROTECTED]
title:SSAI, Systems Engineer
tel;work:757-864-7525
version:2.1
end:vcard

_______________________________________________
Sequoia mailing list
[email protected]
https://forge.continuent.org/mailman/listinfo/sequoia

Reply via email to