Re: [GENERAL] pg_standby replication problem

2014-06-10 Thread Adrian Klaver

On 06/09/2014 10:02 PM, Khangelani Gama wrote:

-Original Message-
From: Adrian Klaver [mailto:adrian.kla...@aklaver.com]
Sent: Tuesday, June 10, 2014 1:42 AM
To: Khangelani Gama; pgsql-general@postgresql.org
Subject: Re: [GENERAL] pg_standby replication problem

On 06/09/2014 11:15 AM, Khangelani Gama wrote:

This is the standby replication setting with archive_command sending
the WALs from master to standby.






Thanks  for feedback from everyone, I will try and remove the correct old
walfiles.




What are the conf settings on the standby server?



Standby server config settings are as follows:



My mistake, I should have been more specific.

What is in your recovery.conf file on the standby?

Is the standby something you really want to try to rescue at this point 
or would it be feasible just to start over?


If you do decide to start over a little time spent on what you want to 
happen would help out.


Options to look at:

1) Streaming replication. WAL files are streamed from master to standby.

2) Hot standby. The standby can process read only queries while in 
standby mode.


3) Archiving. WAL files are archived in a location for possible use by 
standby. Needs some mechanism to prune files or you could fill a disk again.


All 3 of the above can be combined if desired, which is where the 
thinking time part comes in.


Places to start looking for information:

http://www.postgresql.org/docs/9.3/interactive/high-availability.html

https://wiki.postgresql.org/wiki/Binary_Replication_Tutorial



--
Adrian Klaver
adrian.kla...@aklaver.com


--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


Re: [GENERAL] pg_standby replication problem

2014-06-10 Thread Khangelani Gama
Thank You, I will have a look.

-Original Message-
From: Adrian Klaver [mailto:adrian.kla...@aklaver.com]
Sent: Tuesday, June 10, 2014 3:45 PM
To: Khangelani Gama; pgsql-general@postgresql.org
Subject: Re: [GENERAL] pg_standby replication problem

On 06/09/2014 10:02 PM, Khangelani Gama wrote:
 -Original Message-
 From: Adrian Klaver [mailto:adrian.kla...@aklaver.com]
 Sent: Tuesday, June 10, 2014 1:42 AM
 To: Khangelani Gama; pgsql-general@postgresql.org
 Subject: Re: [GENERAL] pg_standby replication problem

 On 06/09/2014 11:15 AM, Khangelani Gama wrote:
 This is the standby replication setting with archive_command sending
 the WALs from master to standby.





 Thanks  for feedback from everyone, I will try and remove the correct
 old walfiles.




 What are the conf settings on the standby server?



 Standby server config settings are as follows:


My mistake, I should have been more specific.

What is in your recovery.conf file on the standby?

Is the standby something you really want to try to rescue at this point or
would it be feasible just to start over?

If you do decide to start over a little time spent on what you want to
happen would help out.

Options to look at:

1) Streaming replication. WAL files are streamed from master to standby.

2) Hot standby. The standby can process read only queries while in standby
mode.

3) Archiving. WAL files are archived in a location for possible use by
standby. Needs some mechanism to prune files or you could fill a disk again.

All 3 of the above can be combined if desired, which is where the thinking
time part comes in.

Places to start looking for information:

http://www.postgresql.org/docs/9.3/interactive/high-availability.html

https://wiki.postgresql.org/wiki/Binary_Replication_Tutorial



--
Adrian Klaver
adrian.kla...@aklaver.com


CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


[GENERAL] pg_standby replication problem

2014-06-09 Thread Khangelani Gama
Please help me with this, my secondary server shows a replication problem.
It stopped at the file called *00054BAF00AF** …*then from here
primary server kept on sending walfiles, until the walfiles used up the
disc space in the data directory. How do I fix this problem. It’s postgres
9.1.2.

*Postgres log file Postgres-2014-06-08_00.log file **has the following
details :*

 2014-06-08 00:15:54 SAST LOG:  restored log file *00054BAF00AF
from* archive

Trigger file: /tmp/recovery.pgsql.trigger.5432

Waiting for WAL file: 00054BAF00B0

WAL file path:/pgsql2/walfiles/00054BAF00B0

Restoring to: pg_xlog/RECOVERYXLOG

Sleep interval:   2 seconds

Max wait interval:0 forever

*Command for restore:  cp /pgsql2/walfiles/00054BAF00B0
pg_xlog/RECOVERYXLOG*

Keep archive history: 00054BAE00F7 and later

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...



CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Khangelani Gama
FilesystemSize  Used Avail Use% Mounted on

/dev/sda3  57G   15G   39G  28% /

/dev/mapper/vg0-pgsql2

  5.4T  5.3T 0 100% /pgsql2

/dev/sda1  99M   12M   83M  13% /boot

tmpfs  30G 0   30G   0% /dev/shm





*Disc space Breakdown:*





4.0K./backup

12K ./copy

4.9T./data

204K./test

16K ./lost+found

361G./walfiles

5.3T.











*From:* Khangelani Gama [mailto:kg...@argility.com]
*Sent:* Monday, June 09, 2014 1:42 PM
*To:* pgsql-general@postgresql.org
*Subject:* pg_standby replication problem





Please help me with this, my secondary server shows a replication problem.
It stopped at the file called *00054BAF00AF …*then from here
primary server kept on sending walfiles, until the walfiles used up the
disc space in the data directory. How do I fix this problem. It’s postgres
9.1.2.

*Postgres log file Postgres-2014-06-08_00.log file **has the following
details :*

 2014-06-08 00:15:54 SAST LOG:  restored log file *00054BAF00AF
from* archive

Trigger file: /tmp/recovery.pgsql.trigger.5432

Waiting for WAL file: 00054BAF00B0

WAL file path:/pgsql2/walfiles/00054BAF00B0

Restoring to: pg_xlog/RECOVERYXLOG

Sleep interval:   2 seconds

Max wait interval:0 forever

*Command for restore:  cp /pgsql2/walfiles/00054BAF00B0
pg_xlog/RECOVERYXLOG*

Keep archive history: 00054BAE00F7 and later

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...



CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Khangelani Gama
The big question we can’t answer is that when the replication was at this
point (*Command for restore:  cp
/pgsql2/walfiles/00054BAF00B0 pg_xlog/RECOVERYXLOG*

) , it then started to say WAL file not present yet. We can’t find
this *00054BAF00B0
*file any where* . *



*Command for restore:  cp /pgsql2/walfiles/00054BAF00B0
pg_xlog/RECOVERYXLOG*

Keep archive history: 00054BAE00F7 and later

WAL file not present yet. Checking for trigger file...





-rw--- 1 postgres postgres 16M Jun  7 21:42 00054BAE00F7















*From:* Khangelani Gama [mailto:kg...@argility.com]
*Sent:* Monday, June 09, 2014 1:45 PM
*To:* pgsql-general@postgresql.org
*Subject:* RE: pg_standby replication problem







FilesystemSize  Used Avail Use% Mounted on

/dev/sda3  57G   15G   39G  28% /

/dev/mapper/vg0-pgsql2

  5.4T  5.3T 0 100% /pgsql2

/dev/sda1  99M   12M   83M  13% /boot

tmpfs  30G 0   30G   0% /dev/shm





*Disc space Breakdown:*





4.0K./backup

12K ./copy

4.9T./data

204K./test

16K ./lost+found

361G./walfiles

5.3T.











*From:* Khangelani Gama [mailto:kg...@argility.com kg...@argility.com]
*Sent:* Monday, June 09, 2014 1:42 PM
*To:* pgsql-general@postgresql.org
*Subject:* pg_standby replication problem





Please help me with this, my secondary server shows a replication problem.
It stopped at the file called *00054BAF00AF …*then from here
primary server kept on sending walfiles, until the walfiles used up the
disc space in the data directory. How do I fix this problem. It’s postgres
9.1.2.

*Postgres log file Postgres-2014-06-08_00.log file **has the following
details :*

 2014-06-08 00:15:54 SAST LOG:  restored log file *00054BAF00AF
from* archive

Trigger file: /tmp/recovery.pgsql.trigger.5432

Waiting for WAL file: 00054BAF00B0

WAL file path:/pgsql2/walfiles/00054BAF00B0

Restoring to: pg_xlog/RECOVERYXLOG

Sleep interval:   2 seconds

Max wait interval:0 forever

*Command for restore:  cp /pgsql2/walfiles/00054BAF00B0
pg_xlog/RECOVERYXLOG*

Keep archive history: 00054BAE00F7 and later

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...



CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Khangelani Gama
Please please help



*From:* Khangelani Gama [mailto:kg...@argility.com]
*Sent:* Monday, June 09, 2014 1:42 PM
*To:* pgsql-general@postgresql.org
*Subject:* pg_standby replication problem





Please help me with this, my secondary server shows a replication problem.
It stopped at the file called *00054BAF00AF …*then from here
primary server kept on sending walfiles, until the walfiles used up the
disc space in the data directory. How do I fix this problem. It’s postgres
9.1.2.

*Postgres log file Postgres-2014-06-08_00.log file **has the following
details :*

 2014-06-08 00:15:54 SAST LOG:  restored log file *00054BAF00AF
from* archive

Trigger file: /tmp/recovery.pgsql.trigger.5432

Waiting for WAL file: 00054BAF00B0

WAL file path:/pgsql2/walfiles/00054BAF00B0

Restoring to: pg_xlog/RECOVERYXLOG

Sleep interval:   2 seconds

Max wait interval:0 forever

*Command for restore:  cp /pgsql2/walfiles/00054BAF00B0
pg_xlog/RECOVERYXLOG*

Keep archive history: 00054BAE00F7 and later

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...



CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Adrian Klaver

On 06/09/2014 07:28 AM, Khangelani Gama wrote:

Please please help


Before anyone can help you will need to provide more information on what 
your archiving, replication setup is. To begin:


1)Are you doing both archiving and streaming replication?

2) What are the settings in the configuration files for those operations?

3) What is the layout for archiving, in other words do the archived 
files get copied remotely to a third site or some other arrangement?


4) What caused the trigger file to be set?




*From:*Khangelani Gama [mailto:kg...@argility.com
mailto:kg...@argility.com]
*Sent:* Monday, June 09, 2014 1:42 PM
*To:* pgsql-general@postgresql.org mailto:pgsql-general@postgresql.org
*Subject:* pg_standby replication problem

Please help me with this, my secondary server shows a replication
problem. It stopped at the file called *00054BAF00AF …*then
from here primary server kept on sending walfiles, until the walfiles
used up the disc space in the data directory. How do I fix this problem.
It’s postgres 9.1.2.

*_Postgres log file Postgres-2014-06-08_00.log file _*_has the
following details :_

  2014-06-08 00:15:54 SAST LOG:  restored log file
*00054BAF00AF from*archive

Trigger file: /tmp/recovery.pgsql.trigger.5432

Waiting for WAL file: 00054BAF00B0

WAL file path:/pgsql2/walfiles/00054BAF00B0

Restoring to: pg_xlog/RECOVERYXLOG

Sleep interval:   2 seconds

Max wait interval:0 forever

*Command for restore:  cp /pgsql2/walfiles/00054BAF00B0
pg_xlog/RECOVERYXLOG*

Keep archive history: 00054BAE00F7 and later

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...

WAL file not present yet. Checking for trigger file...


CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.






--
Adrian Klaver
adrian.kla...@aklaver.com


--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Alan Hodgson
On Monday, June 09, 2014 04:28:53 PM Khangelani Gama wrote:
 Please help me with this, my secondary server shows a replication problem.
 It stopped at the file called *00054BAF00AF …*then from here
 primary server kept on sending walfiles, until the walfiles used up the
 disc space in the data directory. How do I fix this problem. It’s postgres
 9.1.2.
 

It looks to me like your archive_command is probably failing on the primary 
server. If that fails, the logs will build up and fill up your disk as 
described. And they wouldn't be available to the slave to find.



-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Khangelani Gama
-Original Message-
From: pgsql-general-ow...@postgresql.org
[mailto:pgsql-general-ow...@postgresql.org] On Behalf Of Alan Hodgson
Sent: Monday, June 09, 2014 4:51 PM
To: pgsql-general@postgresql.org
Subject: Re: [GENERAL] pg_standby replication problem

On Monday, June 09, 2014 04:28:53 PM Khangelani Gama wrote:
 Please help me with this, my secondary server shows a replication problem.
 It stopped at the file called *00054BAF00AF …*then from
 here primary server kept on sending walfiles, until the walfiles used
 up the disc space in the data directory. How do I fix this problem.
 It’s postgres 9.1.2.


It looks to me like your archive_command is probably failing on the primary
server. If that fails, the logs will build up and fill up your disk as
described. And they wouldn't be available to the slave to find.


I am sorry, I am still trying to understand all the settings, the person who
set up the servers left the company.

In primary server, postgresql.conf shows the following:

# WRITE AHEAD LOG
#--

# - Settings -

wal_level = archive
# - Checkpoints -

checkpoint_segments = 128
checkpoint_timeout = 15min
checkpoint_warning = 885s
# - Archiving -

archive_mode = on
#archive_mode = off # allows archiving to be done
archive_command = '/home/cdbs/bin/run_replication.sh %p %f'

# REPLICATION
#--

# - Master Server -

# These settings are ignored on a standby server

max_wal_senders = 3



The setting archive_command points to a script being run and the variable %p
and %f being passed.




There is replication script running in the primary server  has the
following:


while [ $test = false ]
do
rsync -a /pgsql2/data/${src}
postgres@10.58.101.10:/pgsql2/walfiles/${dest} 
/tmp/run_replication.sh.out 2 /tmp/run_replication.sh.out
test=`ssh AB_CDS3 if [ -f /pgsql2/walfiles/${dest} ];then echo
'true' ;else echo 'false';fi`
if [ ${test} = false ]
then
echo Test is false for CDS3, sleeping 10 
/tmp/run_replication.sh.out
sleep 10
cnt=$(( $cnt + 1 ))
if [ ${cnt} -ge 60 ]
then
message=Replication ERROR: Unable to send WAL
file(${desc}) from CDS to CDS3
echo `date` : ${message} 
/tmp/run_replication.sh.out
sendsms
fi
fi
done





--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org) To make
changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Khangelani Gama
I just saw got  this from the primary server (/tmp/run_replication.sh.out),
secondary server's IP 10.58.101.10.


replication started: Sun Jun  8 00:05:26 SAST 2014 source:
pg_xlog/00054BAF00AF, dest: 00054BAF00AF
replication finished: Sun Jun  8 00:05:33 SAST 2014
replication started: Sun Jun  8 00:05:33 SAST 2014 source:
pg_xlog/00054BAF00B0, dest: 00054BAF00B0
ssh: connect to host 10.58.101.10 port 22: Connection timed out^M
rsync: connection unexpectedly closed (0 bytes received so far) [sender]
rsync error: unexplained error (code 255) at io.c(600) [sender=3.0.6]
replication finished: Sun Jun  8 00:07:41 SAST 2014
replication started: Sun Jun  8 00:07:41 SAST 2014 source:
pg_xlog/00054BAF00B1, dest: 00054BAF00B1
replication finished: Sun Jun  8 00:07:53 SAST 2014
replication started: Sun Jun  8 00:07:53 SAST 2014 source:
pg_xlog/00054BAF00B2, dest: 00054BAF00B2
replication finished: Sun Jun  8 00:07:57 SAST 2014
replication started: Sun Jun  8 00:07:58 SAST 2014 source:
pg_xlog/00054BAF00B3, dest: 00054BAF00B3
replication finished: Sun Jun  8 00:08:06 SAST 2014
replication started: Sun Jun  8 00:08:06 SAST 2014 source:
pg_xlog/00054BAF00B4, dest: 00054BAF00B4
replication finished: Sun Jun  8 00:08:11 SAST 2014
replication started: Sun Jun  8 00:08:11 SAST 2014 source:
pg_xlog/00054BAF00B5, dest: 00054BAF00B5
replication finished: Sun Jun  8 00:08:16 SAST 2014
replication started: Sun Jun  8 00:08:16 SAST 2014 source:
pg_xlog/00054BAF00B6, dest: 00054BAF00B6
replication finished: Sun Jun  8 00:08:22 SAST 2014


CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Khangelani Gama
-Original Message-
From: Khangelani Gama [mailto:kg...@argility.com]
Sent: Monday, June 09, 2014 5:26 PM
To: 'Alan Hodgson'; 'pgsql-general@postgresql.org'
Subject: RE: [GENERAL] pg_standby replication problem

I just saw got  this from the primary server (/tmp/run_replication.sh.out),
secondary server's IP 10.58.101.10.


replication started: Sun Jun  8 00:05:26 SAST 2014 source:
pg_xlog/00054BAF00AF, dest: 00054BAF00AF replication
finished: Sun Jun  8 00:05:33 SAST 2014 replication started: Sun Jun  8
00:05:33 SAST 2014 source: pg_xlog/00054BAF00B0, dest:
00054BAF00B0
ssh: connect to host 10.58.101.10 port 22: Connection timed out^M
rsync: connection unexpectedly closed (0 bytes received so far) [sender]
rsync error: unexplained error (code 255) at io.c(600) [sender=3.0.6]
replication finished: Sun Jun  8 00:07:41 SAST 2014 replication started: Sun
Jun  8 00:07:41 SAST 2014 source: pg_xlog/00054BAF00B1, dest:
00054BAF00B1 replication finished: Sun Jun  8 00:07:53 SAST 2014
replication started: Sun Jun  8 00:07:53 SAST 2014 source:
pg_xlog/00054BAF00B2, dest: 00054BAF00B2 replication
finished: Sun Jun  8 00:07:57 SAST 2014 replication started: Sun Jun  8
00:07:58 SAST 2014 source: pg_xlog/00054BAF00B3, dest:
00054BAF00B3 replication finished: Sun Jun  8 00:08:06 SAST 2014
replication started: Sun Jun  8 00:08:06 SAST 2014 source:
pg_xlog/00054BAF00B4, dest: 00054BAF00B4 replication
finished: Sun Jun  8 00:08:11 SAST 2014 replication started: Sun Jun  8
00:08:11 SAST 2014 source: pg_xlog/00054BAF00B5, dest:
00054BAF00B5 replication finished: Sun Jun  8 00:08:16 SAST 2014
replication started: Sun Jun  8 00:08:16 SAST 2014 source:
pg_xlog/00054BAF00B6, dest: 00054BAF00B6 replication
finished: Sun Jun  8 00:08:22 SAST 2014




Since there was a Connection time out Problem in the primary server, how can
I make disc space in the secondary server for the replication to continue
from where it stopped. Do I remove waltfiles from the secondary server?

Disc space Breakdown:


4.0K./backup
12K ./copy
4.9T./data
204K./test
16K ./lost+found
361G./walfiles
5.3T.


CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


[GENERAL] pg_standby replication problem

2014-06-09 Thread Khangelani Gama
Hi All

I would like to re-post the problem we have. The secondary server ran out
the disc space due the replication problem (Connection Time out).Since there
was a Connection time out Problem in the primary server, how can I make disc
space in the secondary server for the replication to continue from where it
stopped. Do I remove walfiles from the secondary server?


Below it's the details of the log file from the primary server:

replication started: Sun Jun  8 00:05:26 SAST 2014 source:
pg_xlog/00054BAF00AF, dest: 00054BAF00AF replication
finished: Sun Jun  8 00:05:33 SAST 2014 replication started: Sun Jun  8
00:05:33 SAST 2014 source: pg_xlog/00054BAF00B0, dest:
00054BAF00B0
ssh: connect to host 10.58.101.10 port 22: Connection timed out^M
rsync: connection unexpectedly closed (0 bytes received so far) [sender]
rsync error: unexplained error (code 255) at io.c(600) [sender=3.0.6]
replication finished: Sun Jun  8 00:07:41 SAST 2014 replication started: Sun
Jun  8 00:07:41 SAST 2014 source: pg_xlog/00054BAF00B1, dest:
00054BAF00B1 replication finished: Sun Jun  8 00:07:53 SAST 2014
replication started: Sun Jun  8 00:07:53 SAST 2014 source:
pg_xlog/00054BAF00B2, dest: 00054BAF00B2 replication
finished: Sun Jun  8 00:07:57 SAST 2014 replication started: Sun Jun  8
00:07:58 SAST 2014 source: pg_xlog/00054BAF00B3, dest:
00054BAF00B3 replication finished: Sun Jun  8 00:08:06 SAST 2014
replication started: Sun Jun  8 00:08:06 SAST 2014 source:
pg_xlog/00054BAF00B4, dest: 00054BAF00B4 replication
finished: Sun Jun  8 00:08:11 SAST 2014 replication started: Sun Jun  8
00:08:11 SAST 2014 source: pg_xlog/00054BAF00B5, dest:
00054BAF00B5 replication finished: Sun Jun  8 00:08:16 SAST 2014
replication started: Sun Jun  8 00:08:16 SAST 2014 source:
pg_xlog/00054BAF00B6, dest: 00054BAF00B6 replication
finished: Sun Jun  8 00:08:22 SAST 2014




Disc space Breakdown:


4.0K./backup
12K ./copy
4.9T./data
204K./test
16K ./lost+found
361G./walfiles
5.3T.



FilesystemSize  Used Avail Use% Mounted on
/dev/sda3  57G   15G   39G  28% /
/dev/mapper/vg0-pgsql2
  5.4T  5.3T 0 100% /pgsql2
/dev/sda1  99M   12M   83M  13% /boot
tmpfs  30G 0   30G   0% /dev/shm


CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Khangelani Gama
This is the standby replication setting with archive_command sending the
WALs from master to standby.



-Original Message-
From: Khangelani Gama [mailto:kg...@argility.com]
Sent: Monday, June 09, 2014 8:06 PM
To: pgsql-general@postgresql.org
Subject: pg_standby replication problem

Hi All

I would like to re-post the problem we have. The secondary server ran out
the disc space due the replication problem (Connection Time out).Since there
was a Connection time out Problem in the primary server, how can I make disc
space in the secondary server for the replication to continue from where it
stopped. Do I remove walfiles from the secondary server?


Below it's the details of the log file from the primary server:

replication started: Sun Jun  8 00:05:26 SAST 2014 source:
pg_xlog/00054BAF00AF, dest: 00054BAF00AF replication
finished: Sun Jun  8 00:05:33 SAST 2014 replication started: Sun Jun  8
00:05:33 SAST 2014 source: pg_xlog/00054BAF00B0, dest:
00054BAF00B0
ssh: connect to host 10.58.101.10 port 22: Connection timed out^M
rsync: connection unexpectedly closed (0 bytes received so far) [sender]
rsync error: unexplained error (code 255) at io.c(600) [sender=3.0.6]
replication finished: Sun Jun  8 00:07:41 SAST 2014 replication started: Sun
Jun  8 00:07:41 SAST 2014 source: pg_xlog/00054BAF00B1, dest:
00054BAF00B1 replication finished: Sun Jun  8 00:07:53 SAST 2014
replication started: Sun Jun  8 00:07:53 SAST 2014 source:
pg_xlog/00054BAF00B2, dest: 00054BAF00B2 replication
finished: Sun Jun  8 00:07:57 SAST 2014 replication started: Sun Jun  8
00:07:58 SAST 2014 source: pg_xlog/00054BAF00B3, dest:
00054BAF00B3 replication finished: Sun Jun  8 00:08:06 SAST 2014
replication started: Sun Jun  8 00:08:06 SAST 2014 source:
pg_xlog/00054BAF00B4, dest: 00054BAF00B4 replication
finished: Sun Jun  8 00:08:11 SAST 2014 replication started: Sun Jun  8
00:08:11 SAST 2014 source: pg_xlog/00054BAF00B5, dest:
00054BAF00B5 replication finished: Sun Jun  8 00:08:16 SAST 2014
replication started: Sun Jun  8 00:08:16 SAST 2014 source:
pg_xlog/00054BAF00B6, dest: 00054BAF00B6 replication
finished: Sun Jun  8 00:08:22 SAST 2014




Disc space Breakdown:


4.0K./backup
12K ./copy
4.9T./data
204K./test
16K ./lost+found
361G./walfiles
5.3T.



FilesystemSize  Used Avail Use% Mounted on
/dev/sda3  57G   15G   39G  28% /
/dev/mapper/vg0-pgsql2
  5.4T  5.3T 0 100% /pgsql2
/dev/sda1  99M   12M   83M  13% /boot
tmpfs  30G 0   30G   0% /dev/shm


CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Alan Hodgson
On Monday, June 09, 2014 08:05:41 PM Khangelani Gama wrote:
 Hi All
 
 I would like to re-post the problem we have. The secondary server ran out
 the disc space due the replication problem (Connection Time out).

The secondary server would not (could not) run out of drive space due to a 
problem on the primary. You probably need to figure out why that server is out 
of drive space and fix it, and then I expect your replication problem will fix 
itself. If you do not have a process cleaning up old archived WAL files, and 
those are stored on the secondary, that could be the source of your problem.

If you also have a separate networking issue (for the connection timeout), 
then you might need to fix that, too.




-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Adrian Klaver

On 06/09/2014 11:15 AM, Khangelani Gama wrote:

This is the standby replication setting with archive_command sending the
WALs from master to standby.




What are the conf settings on the standby server?


--
Adrian Klaver
adrian.kla...@aklaver.com


--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general


Re: [GENERAL] pg_standby replication problem

2014-06-09 Thread Khangelani Gama
-Original Message-
From: Adrian Klaver [mailto:adrian.kla...@aklaver.com]
Sent: Tuesday, June 10, 2014 1:42 AM
To: Khangelani Gama; pgsql-general@postgresql.org
Subject: Re: [GENERAL] pg_standby replication problem

On 06/09/2014 11:15 AM, Khangelani Gama wrote:
 This is the standby replication setting with archive_command sending
 the WALs from master to standby.





Thanks  for feedback from everyone, I will try and remove the correct old
walfiles.




What are the conf settings on the standby server?



Standby server config settings are as follows:


# WRITE AHEAD LOG
#--

# - Settings -

wal_level = archive
#wal_level = minimal# minimal, archive, or hot_standby
# (change requires restart)
#fsync = on # turns forced synchronization on or
off
synchronous_commit = off# synchronization level; on, off, or
local
#wal_sync_method = fsync




# - Checkpoints -

checkpoint_segments = 128
checkpoint_timeout = 15min
checkpoint_warning = 885s
#checkpoint_segments = 3

# - Archiving -

archive_mode = on
#archive_mode = off # allows archiving to be done
# (change requires restart)

# REPLICATION
#--

# - Master Server -

# These settings are ignored on a standby server

max_wal_senders = 3 # max number of walsender processes
# (change requires restart)



# This is used when logging to stderr:
logging_collector = on
#logging_collector = off







--
Adrian Klaver
adrian.kla...@aklaver.com


CONFIDENTIALITY NOTICE
The contents of and attachments to this e-mail are intended for the addressee 
only, and may contain the confidential
information of Argility (Proprietary) Limited and/or its subsidiaries. Any 
review, use or dissemination thereof by anyone
other than the intended addressee is prohibited.If you are not the intended 
addressee please notify the writer immediately
and destroy the e-mail. Argility (Proprietary) Limited and its subsidiaries 
distance themselves from and accept no liability
for unauthorised use of their e-mail facilities or e-mails sent other than 
strictly for business purposes.



-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general