The branch, master has been updated
via 22712c577f64ec84851b4addcf4a46c7e99e0662 (commit)
via 3b0d44497800a16400d05a30bdaf6e6c285d4b36 (commit)
from 9560f8b7fe0f7ee0386a87c2653333071050fe4b (commit)
http://gitweb.samba.org/?p=sahlberg/ctdb.git;a=shortlog;h=master
- Log -----------------------------------------------------------------
commit 22712c577f64ec84851b4addcf4a46c7e99e0662
Author: Ronnie Sahlberg <[email protected]>
Date: Thu Oct 29 10:32:28 2009 +1100
update the manpage for "update" to indicate the "time since last" indicates
the time since the last recovery OR failover
commit 3b0d44497800a16400d05a30bdaf6e6c285d4b36
Author: Ronnie Sahlberg <[email protected]>
Date: Thu Oct 29 10:37:10 2009 +1100
update the "uptime" command to indicate the "time since last" is the time
since the last recovery OR failover.
-----------------------------------------------------------------------
Summary of changes:
client/ctdb_client.c | 48 +-------------
config/events.d/61.nfstickle | 2 +-
doc/ctdb.1 | 14 ++--
doc/ctdb.1.html | 146 +++++++++++++++++++++---------------------
doc/ctdb.1.xml | 10 ++--
include/ctdb_private.h | 3 -
server/ctdb_control.c | 4 -
server/ctdb_persistent.c | 20 ------
tools/ctdb.c | 10 ++--
9 files changed, 94 insertions(+), 163 deletions(-)
Changeset truncated at 500 lines:
diff --git a/client/ctdb_client.c b/client/ctdb_client.c
index d47f771..d4130cd 100644
--- a/client/ctdb_client.c
+++ b/client/ctdb_client.c
@@ -3141,42 +3141,12 @@ int ctdb_ctrl_getcapabilities(struct ctdb_context
*ctdb, struct timeval timeout,
return ret;
}
-/**
- * check whether a transaction is active on a given db on a given node
- */
-static int32_t ctdb_ctrl_transaction_active(struct ctdb_context *ctdb,
- uint32_t destnode,
- uint32_t db_id)
-{
- int32_t status;
- int ret;
- TDB_DATA indata;
-
- indata.dptr = (uint8_t *)&db_id;
- indata.dsize = sizeof(db_id);
-
- ret = ctdb_control(ctdb, destnode, 0,
- CTDB_CONTROL_TRANS2_ACTIVE,
- 0, indata, NULL, NULL, &status,
- NULL, NULL);
-
- if (ret != 0) {
- DEBUG(DEBUG_ERR, (__location__ " ctdb control for
transaction_active failed\n"));
- return -1;
- }
-
- return status;
-}
-
-
struct ctdb_transaction_handle {
struct ctdb_db_context *ctdb_db;
bool in_replay;
- /*
- * we store the reads and writes done under a transaction:
- * - one list stores both reads and writes (m_all),
- * - the other just writes (m_write)
- */
+ /* we store the reads and writes done under a transaction one
+ list stores both reads and writes, the other just writes
+ */
struct ctdb_marshall_buffer *m_all;
struct ctdb_marshall_buffer *m_write;
};
@@ -3200,7 +3170,6 @@ static int ctdb_transaction_fetch_start(struct
ctdb_transaction_handle *h)
int ret;
struct ctdb_db_context *ctdb_db = h->ctdb_db;
pid_t pid;
- int32_t status;
key.dptr = discard_const(keyname);
key.dsize = strlen(keyname);
@@ -3211,17 +3180,6 @@ static int ctdb_transaction_fetch_start(struct
ctdb_transaction_handle *h)
}
again:
- status = ctdb_ctrl_transaction_active(ctdb_db->ctdb,
- CTDB_CURRENT_NODE,
- ctdb_db->db_id);
- if (status == 1) {
- DEBUG(DEBUG_NOTICE, (__location__ " transaction is active "
- "on db_id[%u]. waiting for 1 second\n",
- ctdb_db->db_id));
- sleep(1);
- goto again;
- }
-
tmp_ctx = talloc_new(h);
rh = ctdb_fetch_lock(ctdb_db, tmp_ctx, key, NULL);
diff --git a/config/events.d/61.nfstickle b/config/events.d/61.nfstickle
index 332d006..3e557e4 100755
--- a/config/events.d/61.nfstickle
+++ b/config/events.d/61.nfstickle
@@ -56,7 +56,7 @@ case $cmd in
;;
monitor)
- # always create these direcotries since NFS might be enabled at runtime
+# always create these direcotries since NFS might be enabled at runtime
# and we dont want to restart ctdbd
mkdir -p $CTDB_BASE/state/nfstickle
mkdir -p $NFS_TICKLE_SHARED_DIRECTORY/`hostname`
diff --git a/doc/ctdb.1 b/doc/ctdb.1
index 0b59cf6..fa9482f 100644
--- a/doc/ctdb.1
+++ b/doc/ctdb.1
@@ -1,11 +1,11 @@
.\" Title: ctdb
.\" Author:
.\" Generator: DocBook XSL Stylesheets v1.73.2 <http://docbook.sf.net/>
-.\" Date: 08/13/2009
+.\" Date: 10/29/2009
.\" Manual:
.\" Source:
.\"
-.TH "CTDB" "1" "08/13/2009" "" ""
+.TH "CTDB" "1" "10/29/2009" "" ""
.\" disable hyphenation
.nh
.\" disable justification (adjust text to left margin only)
@@ -178,7 +178,7 @@ Recovery master:0
This command shows the pnn of the node which is currently the recmaster\.
.SS "uptime"
.PP
-This command shows the uptime for the ctdb daemon\. When the last recovery
completed and how long the last recovery took\. If the "duration" is shown as a
negative number, this indicates that there is a recovery in progress and it
started that many seconds ago\.
+This command shows the uptime for the ctdb daemon\. When the last recovery or
ip\-failover completed and how long it took\. If the "duration" is shown as a
negative number, this indicates that there is a recovery/failover in progress
and it started that many seconds ago\.
.PP
Example: ctdb uptime
.PP
@@ -186,10 +186,10 @@ Example output:
.sp
.RS 4
.nf
-Current time of node : Tue Mar 24 18:27:54 2009
-Ctdbd start time : (000 00:00:05) Tue Mar 24 18:27:49 2009
-Time of last recovery : (000 00:00:05) Tue Mar 24 18:27:49 2009
-Duration of last recovery : 0\.000000 seconds
+Current time of node : Thu Oct 29 10:38:54 2009
+Ctdbd start time : (000 16:54:28) Wed Oct 28 17:44:26 2009
+Time of last recovery/failover: (000 16:53:31) Wed Oct 28 17:45:23 2009
+Duration of last recovery/failover: 2\.248552 seconds
.fi
.RE
diff --git a/doc/ctdb.1.html b/doc/ctdb.1.html
index e4c58dd..46de1b6 100644
--- a/doc/ctdb.1.html
+++ b/doc/ctdb.1.html
@@ -1,6 +1,6 @@
-<html><head><meta http-equiv="Content-Type" content="text/html;
charset=ISO-8859-1"><title>ctdb</title><meta name="generator" content="DocBook
XSL Stylesheets V1.73.2"></head><body bgcolor="white" text="black"
link="#0000FF" vlink="#840084" alink="#0000FF"><div class="refentry"
lang="en"><a name="ctdb.1"></a><div class="titlepage"></div><div
class="refnamediv"><h2>Name</h2><p>ctdb — clustered tdb database
management utility</p></div><div class="refsynopsisdiv"><h2>Synopsis</h2><div
class="cmdsynopsis"><p><code class="command">ctdb [ OPTIONS ] COMMAND
...</code> </p></div><div class="cmdsynopsis"><p><code
class="command">ctdb</code> [-n <node>] [-Y] [-t <timeout>] [-T
<timelimit>] [-? --help] [--usage] [-d --debug=<INTEGER>]
[--socket=<filename>]</p></div></div><div class="refsect1" lang="en"><a
name="id2478257"></a><h2>DESCRIPTION</h2><p>
+<html><head><meta http-equiv="Content-Type" content="text/html;
charset=ISO-8859-1"><title>ctdb</title><meta name="generator" content="DocBook
XSL Stylesheets V1.73.2"></head><body bgcolor="white" text="black"
link="#0000FF" vlink="#840084" alink="#0000FF"><div class="refentry"
lang="en"><a name="ctdb.1"></a><div class="titlepage"></div><div
class="refnamediv"><h2>Name</h2><p>ctdb — clustered tdb database
management utility</p></div><div class="refsynopsisdiv"><h2>Synopsis</h2><div
class="cmdsynopsis"><p><code class="command">ctdb [ OPTIONS ] COMMAND
...</code> </p></div><div class="cmdsynopsis"><p><code
class="command">ctdb</code> [-n <node>] [-Y] [-t <timeout>] [-T
<timelimit>] [-? --help] [--usage] [-d --debug=<INTEGER>]
[--socket=<filename>]</p></div></div><div class="refsect1" lang="en"><a
name="id2478395"></a><h2>DESCRIPTION</h2><p>
ctdb is a utility to view and manage a ctdb cluster.
- </p></div><div class="refsect1" lang="en"><a
name="id2478267"></a><h2>OPTIONS</h2><div class="variablelist"><dl><dt><span
class="term">-n <pnn></span></dt><dd><p>
+ </p></div><div class="refsect1" lang="en"><a
name="id2478405"></a><h2>OPTIONS</h2><div class="variablelist"><dl><dt><span
class="term">-n <pnn></span></dt><dd><p>
This specifies the physical node number on which to execute the
command. Default is to run the command on the deamon running on
the local host.
@@ -28,13 +28,13 @@
You only need to specify this parameter if you run multiple ctdb
daemons on the same physical host and thus can not use the default
name for the domain socket.
- </p></dd></dl></div></div><div class="refsect1" lang="en"><a
name="id2478404"></a><h2>Administrative Commands</h2><p>
+ </p></dd></dl></div></div><div class="refsect1" lang="en"><a
name="id2479674"></a><h2>Administrative Commands</h2><p>
These are commands used to monitor and administrate a CTDB cluster.
- </p><div class="refsect2" lang="en"><a name="id2478413"></a><h3>pnn</h3><p>
+ </p><div class="refsect2" lang="en"><a name="id2479683"></a><h3>pnn</h3><p>
This command displays the pnn of the current node.
- </p></div><div class="refsect2" lang="en"><a
name="id2478423"></a><h3>status</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2479693"></a><h3>status</h3><p>
This command shows the current status of the ctdb node.
- </p><div class="refsect3" lang="en"><a name="id2478621"></a><h4>node
status</h4><p>
+ </p><div class="refsect3" lang="en"><a name="id2479702"></a><h4>node
status</h4><p>
Node status reflects the current status of the node. There are five
possible states:
</p><p>
OK - This node is fully functional.
@@ -52,7 +52,7 @@
RECMASTER or NATGW.
This node does not perticipate in the CTDB cluster but can still be
communicated with. I.e. ctdb commands can be sent to it.
- </p></div><div class="refsect3" lang="en"><a
name="id2478682"></a><h4>generation</h4><p>
+ </p></div><div class="refsect3" lang="en"><a
name="id2479763"></a><h4>generation</h4><p>
The generation id is a number that indicates the current generation
of a cluster instance. Each time a cluster goes through a
reconfiguration or a recovery its generation id will be changed.
@@ -73,10 +73,10 @@
All nodes start with generation "INVALID" and are not assigned a real
generation id until they have successfully been merged with a cluster
through a recovery.
- </p></div><div class="refsect3" lang="en"><a
name="id2478717"></a><h4>VNNMAP</h4><p>
+ </p></div><div class="refsect3" lang="en"><a
name="id2479798"></a><h4>VNNMAP</h4><p>
The list of Virtual Node Numbers. This is a list of all nodes that
actively participates in the cluster and that share the workload of hosting the
Clustered TDB database records.
Only nodes that are participating in the vnnmap can become lmaster
or dmaster for a database record.
- </p></div><div class="refsect3" lang="en"><a
name="id2478731"></a><h4>Recovery mode</h4><p>
+ </p></div><div class="refsect3" lang="en"><a
name="id2479812"></a><h4>Recovery mode</h4><p>
This is the current recovery mode of the cluster. There are two
possible modes:
</p><p>
NORMAL - The cluster is fully operational.
@@ -96,7 +96,7 @@
have been recovered, the node mode will change into NORMAL mode
and the databases will be "thawed", allowing samba to access the
databases again.
- </p></div><div class="refsect3" lang="en"><a
name="id2478773"></a><h4>Recovery master</h4><p>
+ </p></div><div class="refsect3" lang="en"><a
name="id2528501"></a><h4>Recovery master</h4><p>
This is the cluster node that is currently designated as the
recovery master. This node is responsible of monitoring the consistency of the
cluster and to perform the actual recovery process when reqired.
</p><p>
Only one node at a time can be the designated recovery master. Which
@@ -118,18 +118,18 @@ hash:2 lmaster:2
hash:3 lmaster:3
Recovery mode:NORMAL (0)
Recovery master:0
- </pre></div><div class="refsect2" lang="en"><a
name="id2478810"></a><h3>recmaster</h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528538"></a><h3>recmaster</h3><p>
This command shows the pnn of the node which is currently the
recmaster.
- </p></div><div class="refsect2" lang="en"><a
name="id2478821"></a><h3>uptime</h3><p>
- This command shows the uptime for the ctdb daemon. When the last
recovery completed and how long the last recovery took. If the "duration" is
shown as a negative number, this indicates that there is a recovery in progress
and it started that many seconds ago.
+ </p></div><div class="refsect2" lang="en"><a
name="id2528549"></a><h3>uptime</h3><p>
+ This command shows the uptime for the ctdb daemon. When the last
recovery or ip-failover completed and how long it took. If the "duration" is
shown as a negative number, this indicates that there is a recovery/failover in
progress and it started that many seconds ago.
</p><p>
Example: ctdb uptime
</p><p>Example output:</p><pre class="screen">
-Current time of node : Tue Mar 24 18:27:54 2009
-Ctdbd start time : (000 00:00:05) Tue Mar 24 18:27:49 2009
-Time of last recovery : (000 00:00:05) Tue Mar 24 18:27:49 2009
-Duration of last recovery : 0.000000 seconds
- </pre></div><div class="refsect2" lang="en"><a
name="id2478851"></a><h3>listnodes</h3><p>
+Current time of node : Thu Oct 29 10:38:54 2009
+Ctdbd start time : (000 16:54:28) Wed Oct 28 17:44:26 2009
+Time of last recovery/failover: (000 16:53:31) Wed Oct 28 17:45:23 2009
+Duration of last recovery/failover: 2.248552 seconds
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528580"></a><h3>listnodes</h3><p>
This command shows lists the ip addresses of all the nodes in the
cluster.
</p><p>
Example: ctdb listnodes
@@ -138,7 +138,7 @@ Duration of last recovery : 0.000000 seconds
10.0.0.72
10.0.0.73
10.0.0.74
- </pre></div><div class="refsect2" lang="en"><a
name="id2478874"></a><h3>ping</h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528604"></a><h3>ping</h3><p>
This command will "ping" all CTDB daemons in the cluster to verify
that they are processing commands correctly.
</p><p>
Example: ctdb ping
@@ -149,7 +149,7 @@ response from 0 time=0.000054 sec (3 clients)
response from 1 time=0.000144 sec (2 clients)
response from 2 time=0.000105 sec (2 clients)
response from 3 time=0.000114 sec (2 clients)
- </pre></div><div class="refsect2" lang="en"><a
name="id2528491"></a><h3>ip</h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528630"></a><h3>ip</h3><p>
This command will display the list of public addresses that are
provided by the cluster and which physical node is currently serving this ip.
By default this command will ONLY show those public addresses that are known to
the node itself. To see the full list of all public ips across the cluster you
must use "ctdb ip -n all".
</p><p>
Example: ctdb ip
@@ -161,7 +161,7 @@ Number of addresses:4
12.1.1.2 1
12.1.1.3 2
12.1.1.4 3
- </pre></div><div class="refsect2" lang="en"><a
name="id2528519"></a><h3>scriptstatus</h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528658"></a><h3>scriptstatus</h3><p>
This command displays which scripts where run in the previous
monitoring cycle and the result of each script. If a script failed with an
error, causing the node to become unhealthy, the output from that script is
also shown.
</p><p>
Example: ctdb scriptstatus
@@ -178,15 +178,15 @@ Number of addresses:4
41.httpd Status:OK Duration:0.039 Tue Mar 24 18:56:57 2009
50.samba Status:ERROR Duration:0.082 Tue Mar 24 18:56:57 2009
OUTPUT:ERROR: Samba tcp port 445 is not responding
- </pre></div><div class="refsect2" lang="en"><a
name="id2528555"></a><h3>disablescript <script></h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528694"></a><h3>disablescript <script></h3><p>
This command is used to disable an eventscript.
</p><p>
This will take effect the next time the eventscripts are being executed
so it can take a short while until this is reflected in 'scriptstatus'.
- </p></div><div class="refsect2" lang="en"><a
name="id2528572"></a><h3>enablescript <script></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2528711"></a><h3>enablescript <script></h3><p>
This command is used to enable an eventscript.
</p><p>
This will take effect the next time the eventscripts are being executed
so it can take a short while until this is reflected in 'scriptstatus'.
- </p></div><div class="refsect2" lang="en"><a
name="id2528588"></a><h3>getvar <name></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2528727"></a><h3>getvar <name></h3><p>
Get the runtime value of a tuneable variable.
</p><p>
Example: ctdb getvar MaxRedirectCount
@@ -194,11 +194,11 @@ Number of addresses:4
Example output:
</p><pre class="screen">
MaxRedirectCount = 3
- </pre></div><div class="refsect2" lang="en"><a
name="id2528611"></a><h3>setvar <name> <value></h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528750"></a><h3>setvar <name> <value></h3><p>
Set the runtime value of a tuneable variable.
</p><p>
Example: ctdb setvar MaxRedirectCount 5
- </p></div><div class="refsect2" lang="en"><a
name="id2528626"></a><h3>listvars</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2528765"></a><h3>listvars</h3><p>
List all tuneable variables.
</p><p>
Example: ctdb listvars
@@ -220,7 +220,7 @@ MonitorInterval = 15
EventScriptTimeout = 20
RecoveryGracePeriod = 60
RecoveryBanPeriod = 300
- </pre></div><div class="refsect2" lang="en"><a
name="id2528654"></a><h3>lvsmaster</h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528793"></a><h3>lvsmaster</h3><p>
This command shows which node is currently the LVSMASTER. The
LVSMASTER is the node in the cluster which drives the LVS system and
which receives all incoming traffic from clients.
@@ -231,7 +231,7 @@ RecoveryBanPeriod = 300
evenly onto the other nodes in the cluster. This is an alternative to
using
public ip addresses. See the manpage for ctdbd for more information
about LVS.
- </p></div><div class="refsect2" lang="en"><a
name="id2528676"></a><h3>lvs</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2528815"></a><h3>lvs</h3><p>
This command shows which nodes in the cluster are currently active in the
LVS configuration. I.e. which nodes we are currently loadbalancing
the single ip address across.
@@ -246,7 +246,7 @@ RecoveryBanPeriod = 300
</p><pre class="screen">
2:10.0.0.13
3:10.0.0.14
- </pre></div><div class="refsect2" lang="en"><a
name="id2528706"></a><h3>getcapabilities</h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528845"></a><h3>getcapabilities</h3><p>
This command shows the capabilities of the current node.
Please see manpage for ctdbd for a full list of all capabilities and
more detailed description.
@@ -265,7 +265,7 @@ RecoveryBanPeriod = 300
RECMASTER: YES
LMASTER: YES
LVS: NO
- </pre></div><div class="refsect2" lang="en"><a
name="id2528742"></a><h3>statistics</h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528881"></a><h3>statistics</h3><p>
Collect statistics from the CTDB daemon about how many calls it has
served.
</p><p>
Example: ctdb statistics
@@ -307,23 +307,23 @@ CTDB version 1
max_hop_count 0
max_call_latency 4.948321 sec
max_lockwait_latency 0.000000 sec
- </pre></div><div class="refsect2" lang="en"><a
name="id2528786"></a><h3>statisticsreset</h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528925"></a><h3>statisticsreset</h3><p>
This command is used to clear all statistics counters in a node.
</p><p>
Example: ctdb statisticsreset
- </p></div><div class="refsect2" lang="en"><a
name="id2528800"></a><h3>getreclock</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2528939"></a><h3>getreclock</h3><p>
This command is used to show the filename of the reclock file that is
used.
</p><p>
Example output:
</p><pre class="screen">
Reclock file:/gpfs/.ctdb/shared
- </pre></div><div class="refsect2" lang="en"><a
name="id2528820"></a><h3>setreclock [filename]</h3><p>
+ </pre></div><div class="refsect2" lang="en"><a
name="id2528959"></a><h3>setreclock [filename]</h3><p>
This command is used to modify, or clear, the file that is used as the
reclock file at runtime. When this command is used, the reclock file checks are
disabled. To re-enable the checks the administrator needs to activate the
"VerifyRecoveryLock" tunable using "ctdb setvar".
</p><p>
If run with no parameter this will remove the reclock file completely.
If run with a parameter the parameter specifies the new filename to use for the
recovery lock.
</p><p>
This command only affects the runtime settings of a ctdb node and will
be lost when ctdb is restarted. For persistent changes to the reclock file
setting you must edit /etc/sysconfig/ctdb.
- </p></div><div class="refsect2" lang="en"><a
name="id2528846"></a><h3>getdebug</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2528985"></a><h3>getdebug</h3><p>
Get the current debug level for the node. the debug level controls
what information is written to the log file.
</p><p>
The debug levels are mapped to the corresponding syslog levels.
@@ -333,42 +333,42 @@ Reclock file:/gpfs/.ctdb/shared
The list of debug levels from highest to lowest are :
</p><p>
EMERG ALERT CRIT ERR WARNING NOTICE INFO DEBUG
- </p></div><div class="refsect2" lang="en"><a
name="id2528872"></a><h3>setdebug <debuglevel></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529011"></a><h3>setdebug <debuglevel></h3><p>
Set the debug level of a node. This controls what information will be
logged.
</p><p>
The debuglevel is one of EMERG ALERT CRIT ERR WARNING NOTICE INFO DEBUG
- </p></div><div class="refsect2" lang="en"><a
name="id2528888"></a><h3>getpid</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529027"></a><h3>getpid</h3><p>
This command will return the process id of the ctdb daemon.
- </p></div><div class="refsect2" lang="en"><a
name="id2528898"></a><h3>disable</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529037"></a><h3>disable</h3><p>
This command is used to administratively disable a node in the cluster.
A disabled node will still participate in the cluster and host
clustered TDB records but its public ip address has been taken over by
a different node and it no longer hosts any services.
- </p></div><div class="refsect2" lang="en"><a
name="id2528912"></a><h3>enable</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529051"></a><h3>enable</h3><p>
Re-enable a node that has been administratively disabled.
- </p></div><div class="refsect2" lang="en"><a
name="id2528922"></a><h3>stop</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529061"></a><h3>stop</h3><p>
This command is used to administratively STOP a node in the cluster.
A STOPPED node is connected to the cluster but will not host any
public ip addresse, nor does it participate in the VNNMAP.
The difference between a DISABLED node and a STOPPED node is that
a STOPPED node does not host any parts of the database which means
that a recovery is required to stop/continue nodes.
- </p></div><div class="refsect2" lang="en"><a
name="id2528937"></a><h3>continue</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529076"></a><h3>continue</h3><p>
Re-start a node that has been administratively stopped.
- </p></div><div class="refsect2" lang="en"><a
name="id2528948"></a><h3>addip <public_ip/mask> <iface></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529087"></a><h3>addip <public_ip/mask> <iface></h3><p>
This command is used to add a new public ip to a node during runtime.
This allows public addresses to be added to a cluster without having
to restart the ctdb daemons.
</p><p>
Note that this only updates the runtime instance of ctdb. Any changes
will be lost next time ctdb is restarted and the public addresses file is
re-read.
If you want this change to be permanent you must also update the public
addresses file manually.
- </p></div><div class="refsect2" lang="en"><a
name="id2528968"></a><h3>delip <public_ip></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529107"></a><h3>delip <public_ip></h3><p>
This command is used to remove a public ip from a node during runtime.
If this public ip is currently hosted by the node it being removed
from, the ip will first be failed over to another node, if possible, before it
is removed.
</p><p>
Note that this only updates the runtime instance of ctdb. Any changes
will be lost next time ctdb is restarted and the public addresses file is
re-read.
If you want this change to be permanent you must also update the public
addresses file manually.
- </p></div><div class="refsect2" lang="en"><a
name="id2528989"></a><h3>moveip <public_ip> <node></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529132"></a><h3>moveip <public_ip> <node></h3><p>
This command can be used to manually fail a public ip address to a
specific node.
</p><p>
@@ -379,14 +379,14 @@ Reclock file:/gpfs/.ctdb/shared
DeterministicIPs = 0
</p><p>
NoIPFailback = 1
- </p></div><div class="refsect2" lang="en"><a
name="id2529014"></a><h3>shutdown</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529157"></a><h3>shutdown</h3><p>
This command will shutdown a specific CTDB daemon.
- </p></div><div class="refsect2" lang="en"><a
name="id2529024"></a><h3>recover</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529167"></a><h3>recover</h3><p>
This command will trigger the recovery daemon to do a cluster
recovery.
- </p></div><div class="refsect2" lang="en"><a
name="id2529034"></a><h3>ipreallocate</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529178"></a><h3>ipreallocate</h3><p>
This command will force the recovery master to perform a full ip
reallocation process and redistribute all ip addresses. This is useful to
"reset" the allocations back to its default state if they have been changed
using the "moveip" command. While a "recover" will also perform this
reallocation, a recovery is much more hevyweight since it will also rebuild all
the databases.
- </p></div><div class="refsect2" lang="en"><a
name="id2529050"></a><h3>setlmasterrole <on|off></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529193"></a><h3>setlmasterrole <on|off></h3><p>
This command is used ot enable/disable the LMASTER capability for a
node at runtime. This capability determines whether or not a node can be used
as an LMASTER for records in the database. A node that does not have the
LMASTER capability will not show up in the vnnmap.
</p><p>
Nodes will by default have this capability, but it can be stripped off
nodes by the setting in the sysconfig file or by using this command.
@@ -394,21 +394,21 @@ Reclock file:/gpfs/.ctdb/shared
Once this setting has been enabled/disabled, you need to perform a
recovery for it to take effect.
</p><p>
See also "ctdb getcapabilities"
- </p></div><div class="refsect2" lang="en"><a
name="id2529079"></a><h3>setrecmasterrole <on|off></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529222"></a><h3>setrecmasterrole <on|off></h3><p>
This command is used ot enable/disable the RECMASTER capability for a
node at runtime. This capability determines whether or not a node can be used
as an RECMASTER for the cluster. A node that does not have the RECMASTER
capability can not win a recmaster election. A node that already is the
recmaster for the cluster when the capability is stripped off the node will
remain the recmaster until the next cluster election.
</p><p>
Nodes will by default have this capability, but it can be stripped off
nodes by the setting in the sysconfig file or by using this command.
</p><p>
See also "ctdb getcapabilities"
- </p></div><div class="refsect2" lang="en"><a
name="id2529105"></a><h3>killtcp <srcip:port> <dstip:port></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529249"></a><h3>killtcp <srcip:port> <dstip:port></h3><p>
This command will kill the specified TCP connection by issuing a
TCP RST to the srcip:port endpoint. This is a command used by the
ctdb eventscripts.
- </p></div><div class="refsect2" lang="en"><a
name="id2529118"></a><h3>gratiousarp <ip> <interface></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529261"></a><h3>gratiousarp <ip> <interface></h3><p>
This command will send out a gratious arp for the specified interface
through the specified interface. This command is mainly used by the
ctdb eventscripts.
- </p></div><div class="refsect2" lang="en"><a
name="id2529130"></a><h3>reloadnodes</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529274"></a><h3>reloadnodes</h3><p>
This command is used when adding new nodes, or removing existing nodes
from an existing cluster.
</p><p>
Procedure to add a node:
@@ -442,7 +442,7 @@ Reclock file:/gpfs/.ctdb/shared
</p><p>
5, Use 'ctdb status' on all nodes and verify that the deleted node no
longer shows up in the list..
</p><p>
- </p></div><div class="refsect2" lang="en"><a
name="id2529214"></a><h3>tickle <srcip:port> <dstip:port></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529357"></a><h3>tickle <srcip:port> <dstip:port></h3><p>
This command will will send a TCP tickle to the source host for the
specified TCP connection.
A TCP tickle is a TCP ACK packet with an invalid sequence and
@@ -454,10 +454,10 @@ Reclock file:/gpfs/.ctdb/shared
TCP connection has been disrupted and that the client will need
to reestablish. This greatly speeds up the time it takes for a client
to detect and reestablish after an IP failover in the ctdb cluster.
- </p></div><div class="refsect2" lang="en"><a
name="id2529239"></a><h3>gettickles <ip></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529382"></a><h3>gettickles <ip></h3><p>
This command is used to show which TCP connections are registered with
CTDB to be "tickled" if there is a failover.
- </p></div><div class="refsect2" lang="en"><a
name="id2529250"></a><h3>repack [max_freelist]</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529394"></a><h3>repack [max_freelist]</h3><p>
Over time, when records are created and deleted in a TDB, the TDB list
of free space will become fragmented. This can lead to a slowdown in accessing
TDB records.
This command is used to defragment a TDB database and pruning the
freelist.
</p><p>
@@ -472,7 +472,7 @@ Reclock file:/gpfs/.ctdb/shared
Example: ctdb repack 1000
</p><p>
By default, this operation is issued from the 00.ctdb event script
every 5 minutes.
- </p></div><div class="refsect2" lang="en"><a
name="id2529296"></a><h3>vacuum [max_records]</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529440"></a><h3>vacuum [max_records]</h3><p>
Over time CTDB databases will fill up with empty deleted records which
will lead to a progressive slow down of CTDB database access.
This command is used to prune all databases and delete all empty
records from the cluster.
</p><p>
@@ -488,17 +488,17 @@ Reclock file:/gpfs/.ctdb/shared
Example: ctdb vacuum
</p><p>
By default, this operation is issued from the 00.ctdb event script
every 5 minutes.
- </p></div><div class="refsect2" lang="en"><a
name="id2529332"></a><h3>backupdb <database> <file></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529476"></a><h3>backupdb <database> <file></h3><p>
This command can be used to copy the entire content of a database out
to a file. This file can later be read back into ctdb using the restoredb
command.
This is mainly useful for backing up persistent databases such as secrets.tdb
and similar.
- </p></div><div class="refsect2" lang="en"><a
name="id2529346"></a><h3>restoredb <file></h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529490"></a><h3>restoredb <file></h3><p>
This command restores a persistent database that was previously backed
up using backupdb.
- </p></div></div><div class="refsect1" lang="en"><a
name="id2529358"></a><h2>Debugging Commands</h2><p>
+ </p></div></div><div class="refsect1" lang="en"><a
name="id2529502"></a><h2>Debugging Commands</h2><p>
These commands are primarily used for CTDB development and testing and
should not be used for normal administration.
- </p><div class="refsect2" lang="en"><a
name="id2529368"></a><h3>process-exists <pid></h3><p>
+ </p><div class="refsect2" lang="en"><a
name="id2529512"></a><h3>process-exists <pid></h3><p>
This command checks if a specific process exists on the CTDB host.
This is mainly used by Samba to check if remote instances of samba are still
running or not.
- </p></div><div class="refsect2" lang="en"><a
name="id2529381"></a><h3>getdbmap</h3><p>
+ </p></div><div class="refsect2" lang="en"><a
name="id2529524"></a><h3>getdbmap</h3><p>
This command lists all clustered TDB databases that the CTDB daemon
has attached to. Some databases are flagged as PERSISTENT, this means that the
database stores data persistently and the data will remain across reboots. One
example of such a database is secrets.tdb where information about how the
cluster was joined to the domain is stored.
</p><p>
Most databases are not persistent and only store the state information
that the currently running samba daemons need. These databases are always wiped
when ctdb/samba starts and when a node is rebooted.
@@ -517,25 +517,25 @@ dbid:0x2672a57f name:idmap2.tdb
path:/var/ctdb/persistent/idmap2.tdb.0 PERSISTEN
dbid:0xb775fff6 name:secrets.tdb path:/var/ctdb/persistent/secrets.tdb.0
PERSISTENT
dbid:0xe98e08b6 name:group_mapping.tdb
path:/var/ctdb/persistent/group_mapping.tdb.0 PERSISTENT
dbid:0x7bbbd26c name:passdb.tdb path:/var/ctdb/persistent/passdb.tdb.0
PERSISTENT
- </pre></div><div class="refsect2" lang="en"><a
name="id2529427"></a><h3>catdb <dbname></h3><p>
--
CTDB repository