Problem:
Sequoia seems to run fine in a clustered environment for a couple of hours, then one of the backends gets dropped with the following error.


Logs:
-----
2007-06-26 20:10:20,968 ERROR controller.loadbalancer.RAIDb1 write request 562949953522065 failed: Backend Cardinal2Log - BackendWorkerThread for backend 'melodrama' with RAIDb level:1 failed (An I/O error occured while sending to the backend.)

2007-06-26 20:10:20,968 WARN controller.RequestManager.Cardinal2Log SQLException while executing distributed write request write request 562949953522065 failed: Backend Cardinal2Log - BackendWorkerThread for backend 'melodrama' with RAIDb level:1 failed (An I/O error occured while sending to the backend.)

java.sql.SQLException: write request 562949953522065 failed:
Backend Cardinal2Log - BackendWorkerThread for backend 'melodrama' with RAIDb level:1 failed (An I/O error occured while sending to the backend.)

    at org.continuent.sequoia.common.exceptions.SQLExceptionFactory.getSQLException(SQLExceptionFactory.java:88)
    at org.continuent.sequoia.controller.loadbalancer.raidb1.RAIDb1.checkTaskCompletion(RAIDb1.java:939)
    at org.continuent.sequoia.controller.loadbalancer.raidb1.RAIDb1.execWriteRequest(RAIDb1.java:343)
    at org.continuent.sequoia.controller.loadbalancer.raidb1.RAIDb1.statementExecuteUpdate(RAIDb1.java:249)
    at org.continuent.sequoia.controller.requestmanager.RequestManager.loadBalanceStatementExecuteUpdate(RequestManager.java:925)
    at org.continuent.sequoia.controller.virtualdatabase.protocol.DistributedStatementExecuteUpdate.executeScheduledRequest(DistributedStatementExecuteUpdate.java:87)
    at org.continuent.sequoia.controller.virtualdatabase.protocol.DistributedRequest.handleMessageMultiThreaded(DistributedRequest.java:157)
    at org.continuent.sequoia.controller.virtualdatabase.DistributedVirtualDatabase.handleMessageMultiThreaded(DistributedVirtualDatabase.java:364)
    at org.continuent.hedera.adapters.MulticastRequestAdapterThread.run(MulticastRequestAdapterThread.java:102)

2007-06-26 20:10:21,015 WARN controller.RequestManager.Cardinal2Log Request 562949953522065 failed on controller Member(address=melodrama/192.168.129.75:7800, uid=Cardinal2Log) (java.sql.SQLException: write request 562949953522065 failed: Backend Cardinal2Log - BackendWorkerThread for backend 'melodrama' with RAIDb level:1 failed (An I/O error occured while sending to the backend.)
)
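
To line these entries up against the other controller's log and the PostgreSQL server log, it may help to pull the timestamp, request ID and backend name out of each failure line. The following is only a rough sketch that assumes the line format shown above; the class and method names (FailedWriteParser, FailedWrite, parse) are made up for illustration and are not part of Sequoia.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Sequoia: extracts the timestamp, request ID
// and backend name from a "write request ... failed" controller log line.
class FailedWriteParser {

    // Simple value holder for the three fields of interest.
    static final class FailedWrite {
        final String timestamp;
        final String requestId;
        final String backend;

        FailedWrite(String timestamp, String requestId, String backend) {
            this.timestamp = timestamp;
            this.requestId = requestId;
            this.backend = backend;
        }
    }

    // Assumed format: "<date> <time>,<millis> ... write request <id> failed: ... backend '<name>'"
    private static final Pattern FAILED_WRITE = Pattern.compile(
        "^(\\d{4}-\\d{2}-\\d{2} \\d{2}:\\d{2}:\\d{2},\\d{3})\\s.*?"
        + "write request (\\d+) failed:.*?backend '([^']+)'");

    // Returns the parsed fields, or null if the line is not a failed-write entry.
    static FailedWrite parse(String line) {
        Matcher m = FAILED_WRITE.matcher(line);
        return m.find() ? new FailedWrite(m.group(1), m.group(2), m.group(3)) : null;
    }

    public static void main(String[] args) {
        String line = "2007-06-26 20:10:20,968 ERROR controller.loadbalancer.RAIDb1 "
            + "write request 562949953522065 failed: Backend Cardinal2Log - "
            + "BackendWorkerThread for backend 'melodrama' with RAIDb level:1 failed "
            + "(An I/O error occured while sending to the backend.)";
        FailedWrite fw = parse(line);
        System.out.println(fw.timestamp + " request=" + fw.requestId + " backend=" + fw.backend);
    }
}
```

Searching both controllers' logs for the extracted request ID, and the PostgreSQL log around the extracted timestamp, should show whether the I/O error lines up with something on the database side.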



Configuration:
--------------
2x Sequoia 2.10.8 Controllers
2x PostgreSQL databases
RAIDb-1
JGroups with TCP connectivity



Does anyone have any ideas as to what could be causing this?

Thanks.

------------------------------------------------------------------------

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE SEQUOIA PUBLIC "-//Continuent//DTD SEQUOIA 2.10.6//EN"
"http://sequoia.continuent.org/dtds/sequoia-2.10.6.dtd">

<SEQUOIA>
 <VirtualDatabase name="Cardinal2Log">
   <Distribution hederaPropertiesFile="/hedera_jgroups.properties">
     <MessageTimeouts/>
   </Distribution>

   <Backup>
     <Backuper backuperName="pgdump"
       className="org.continuent.sequoia.controller.backup.backupers.PostgreSQLTarBackuper"
       options="bindir=C:\Program Files\PostgreSQL\8.2\bin"/>
   </Backup>

   <AuthenticationManager>
     <Admin>
       <User username="admin" password="******"/>
     </Admin>
     <VirtualUsers>
       <VirtualLogin vLogin="user" vPassword="******"/>
     </VirtualUsers>
   </AuthenticationManager>

   <DatabaseBackend name="melodrama" driver="org.postgresql.Driver"
     url="jdbc:postgresql://127.0.0.1:5432/Cardinal2Log"
     connectionTestStatement="select now()">
     <DatabaseSchema dynamicPrecision="all"/>
     <ConnectionManager vLogin="user" rLogin="postgres" rPassword="******">
       <VariablePoolConnectionManager initPoolSize="10" minPoolSize="5" maxPoolSize="50"
         idleTimeout="30" waitTimeout="10"/>
     </ConnectionManager>
   </DatabaseBackend>

   <RequestManager
     caseSensitiveParsing="true">
     <RequestScheduler>
       <RAIDb-1Scheduler level="passThrough"/>
     </RequestScheduler>

     <RequestCache>
        <MetadataCache/>
        <ParsingCache/>
        <ResultCache granularity="table"/>
     </RequestCache>

     <LoadBalancer>
       <RAIDb-1>
         <WaitForCompletion policy="first"/>
         <RAIDb-1-LeastPendingRequestsFirst/>
       </RAIDb-1>
     </LoadBalancer>



     <RecoveryLog driver="org.postgresql.Driver"
       url="jdbc:postgresql://127.0.0.1:5432/Cardinal2LogRecover"
       login="postgres" password="******">

       <RecoveryLogTable tableName="recovery"
         logIdColumnType="BIGINT NOT NULL"
         vloginColumnType="VARCHAR NOT NULL"
         sqlColumnType="VARCHAR NOT NULL"
         extraStatementDefinition=",PRIMARY KEY (log_id)"/>

       <CheckpointTable tableName="checkpoint"
         checkpointNameColumnType="VARCHAR NOT NULL"/>

       <BackendTable tableName="backend"
         databaseNameColumnType="VARCHAR NOT NULL"
         backendNameColumnType="VARCHAR NOT NULL"
         checkpointNameColumnType="VARCHAR NOT NULL"/>

       <DumpTable tableName="dump"
         dumpNameColumnType="VARCHAR NOT NULL"
         dumpDateColumnType="TIMESTAMP"
         dumpPathColumnType="VARCHAR NOT NULL"
         dumpFormatColumnType="VARCHAR NOT NULL"
         checkpointNameColumnType="VARCHAR NOT NULL"
         backendNameColumnType="VARCHAR NOT NULL"
         tablesColumnType="VARCHAR NOT NULL"/>

     </RecoveryLog>
   </RequestManager>
 </VirtualDatabase>
</SEQUOIA>


