[PATCHES] Docs for PITR and full_page_writes interaction

Simon Riggs Sun, 02 Oct 2005 15:34:50 -0700

Some additional doc changes based around compression of page images in
WAL and the interaction of the new full_page_writes parameter with PITR.


The too-small WAL first sect1 has been merged with the one following
sect1 for clarity. 

Some minor comments have been made in the WAL config section also.

Passes SGML make and proofread for typos.
Files changed:
patching file doc/src/sgml/backup.sgml
patching file doc/src/sgml/config.sgml
patching file doc/src/sgml/wal.sgml

Best Regards, Simon Riggs

Index: doc/src/sgml/backup.sgml
===================================================================
RCS file: /projects/cvsroot/pgsql/doc/src/sgml/backup.sgml,v
retrieving revision 2.69
diff -c -c -r2.69 backup.sgml
*** doc/src/sgml/backup.sgml	25 Jun 2005 22:47:28 -0000	2.69
--- doc/src/sgml/backup.sgml	2 Oct 2005 22:27:47 -0000
***************
*** 1147,1159 ****
     </para>
  
     <para>
!     It should also be noted that the present <acronym>WAL</acronym>
!     format is extremely bulky since it includes many disk page
!     snapshots.  This is appropriate for crash recovery purposes,
      since we may need to fix partially-written disk pages.  It is not
!     necessary to store so many page copies for PITR operations, however.
      An area for future development is to compress archived WAL data by
!     removing unnecessary page copies.  In the meantime, administrators
      may wish to reduce the number of page snapshots included in WAL by
      increasing the checkpoint interval parameters as much as feasible.
     </para>
--- 1147,1168 ----
     </para>
  
     <para>
!     It should also be noted that the default <acronym>WAL</acronym>
!     format is fairly bulky since it includes many disk page snapshots. The pages
!     are partially compressed, using the simple expedient of removing the
!     empty space (if any) within each block. You can significantly reduce
!     the total volume of archived logs by turning off page snapshots 
!     using the <xref linkend="guc-full-page-writes"> parameter, 
!     though you should read the notes and warnings in 
!     <xref linkend="reliability"> before you do so. 
!     These page snapshots are designed to allow crash recovery,
      since we may need to fix partially-written disk pages.  It is not
!     necessary to store these page copies for PITR operations, however.
!     If you turn off <xref linkend="guc-full-page-writes">, your PITR
!     backup and recovery operations will continue to work successfully.
      An area for future development is to compress archived WAL data by
!     removing unnecessary page copies when <xref linkend="guc-full-page-writes">
!     is turned on.  In the meantime, administrators
      may wish to reduce the number of page snapshots included in WAL by
      increasing the checkpoint interval parameters as much as feasible.
     </para>
Index: doc/src/sgml/config.sgml
===================================================================
RCS file: /projects/cvsroot/pgsql/doc/src/sgml/config.sgml,v
retrieving revision 1.23
diff -c -c -r1.23 config.sgml
*** doc/src/sgml/config.sgml	24 Sep 2005 23:25:31 -0000	1.23
--- doc/src/sgml/config.sgml	2 Oct 2005 22:27:49 -0000
***************
*** 1360,1366 ****
         <para>
          When this option is on, the <productname>PostgreSQL</> server
          writes full pages to WAL when they are first modified after a
!         checkpoint so full recovery is possible. Turning this option off
          might lead to a corrupt system after an operating system crash
          or power failure because uncorrected partial pages might contain
          inconsistent or corrupt data. The risks are less but similar to
--- 1360,1366 ----
         <para>
          When this option is on, the <productname>PostgreSQL</> server
          writes full pages to WAL when they are first modified after a
!         checkpoint so crash recovery is possible. Turning this option off
          might lead to a corrupt system after an operating system crash
          or power failure because uncorrected partial pages might contain
          inconsistent or corrupt data. The risks are less but similar to
Index: doc/src/sgml/wal.sgml
===================================================================
RCS file: /projects/cvsroot/pgsql/doc/src/sgml/wal.sgml,v
retrieving revision 1.35
diff -c -c -r1.35 wal.sgml
*** doc/src/sgml/wal.sgml	1 Oct 2005 01:42:43 -0000	1.35
--- doc/src/sgml/wal.sgml	2 Oct 2005 22:27:50 -0000
***************
*** 12,18 ****
     failure (unrelated to the non-volatile area itself). To accomplish
     this, <productname>PostgreSQL</> uses the magnetic platters of modern
     disk drives for permanent storage that is immune to the failures
!    listed above. In fact, a computer can be completely destroyed, but if
     the disk drives survive they can be moved to another computer with
     similar hardware and all committed transactions will remain intact.
    </para>
--- 12,18 ----
     failure (unrelated to the non-volatile area itself). To accomplish
     this, <productname>PostgreSQL</> uses the magnetic platters of modern
     disk drives for permanent storage that is immune to the failures
!    listed above. In fact, even if a computer is fatally damaged, if
     the disk drives survive they can be moved to another computer with
     similar hardware and all committed transactions will remain intact.
    </para>
***************
*** 68,78 ****
     these partially written cases. To guard against that,
     <productname>PostgreSQL</> periodically writes full page images to
     permanent storage <emphasis>before</> modifying the actual page on
!    disk. By doing this, during recovery <productname>PostgreSQL</> can
     restore partially-written pages. If you have a battery-backed disk
!    controller that prevents partial page writes, you can turn off this
!    page imaging by using the <xref linkend="guc-full-page-writes">
!    parameter.
    </para>
   
    <para>
--- 68,80 ----
     these partially written cases. To guard against that,
     <productname>PostgreSQL</> periodically writes full page images to
     permanent storage <emphasis>before</> modifying the actual page on
!    disk. By doing this, during crash recovery <productname>PostgreSQL</> can
     restore partially-written pages. If you have a battery-backed disk
!    controller or filesystem (e.g. Reiser4) that prevents partial page writes, 
!    you can turn off this page imaging by using the 
!    <xref linkend="guc-full-page-writes"> parameter. This parameter has no 
!    effect on the successful use of Point in Time Recovery (PITR), 
!    described in <xref linkend="backup-online">.
    </para>
   
    <para>
***************
*** 107,120 ****
      the data pages can be redone from the log records.  (This is
      roll-forward recovery, also known as REDO.)
     </para>
-   </sect1>
- 
-   <sect1 id="wal-benefits">
-    <title>Benefits of Write-Ahead Logging</title>
  
!    <indexterm zone="wal-benefits">
!     <primary>fsync</primary>
!    </indexterm>
  
     <para>
      The first major benefit of using <acronym>WAL</acronym> is a
--- 109,118 ----
      the data pages can be redone from the log records.  (This is
      roll-forward recovery, also known as REDO.)
     </para>
  
!    <para>
!     WAL brings three major benefits:
!    </para>
  
     <para>
      The first major benefit of using <acronym>WAL</acronym> is a
***************
*** 131,141 ****
     </para>
  
     <para>
!     The next benefit is consistency of the data pages. The truth is
!     that, before <acronym>WAL</acronym>,
      <productname>PostgreSQL</productname> was never able to guarantee
!     consistency in the case of a crash.  Before
!     <acronym>WAL</acronym>, any crash during writing could result in:
  
      <orderedlist>
       <listitem>
--- 129,139 ----
     </para>
  
     <para>
!     The next benefit is crash recovery protection. The truth is
!     that, before <acronym>WAL</acronym> was introduced back in release 7.1,
      <productname>PostgreSQL</productname> was never able to guarantee
!     consistency in the case of a crash.  Now, 
!     <acronym>WAL</acronym> protects fully against the following problems:
  
      <orderedlist>
       <listitem>
***************
*** 151,163 ****
        of partially written data pages</simpara>
       </listitem>
      </orderedlist>
- 
-     Problems with indexes (problems 1 and 2) could possibly have been
-     fixed by additional <function>fsync</function> calls, but it is
-     not obvious how to handle the last case without
-     <acronym>WAL</acronym>.  <acronym>WAL</acronym> saves the entire data
-     page content in the log if that is required to ensure page
-     consistency for after-crash recovery.
     </para>
  
     <para>
--- 149,154 ----
***************
*** 214,225 ****
     <varname>checkpoint_timeout</varname> causes checkpoints to be done
     more often. This allows faster after-crash recovery (since less work
     will need to be redone). However, one must balance this against the
!    increased cost of flushing dirty data pages more often. In addition,
!    to ensure data page consistency, the first modification of a data
!    page after each checkpoint results in logging the entire page
!    content. Thus a smaller checkpoint interval increases the volume of
!    output to the WAL log, partially negating the goal of using a smaller
!    interval, and in any case causing more disk I/O.
    </para>
  
    <para>
--- 205,218 ----
     <varname>checkpoint_timeout</varname> causes checkpoints to be done
     more often. This allows faster after-crash recovery (since less work
     will need to be redone). However, one must balance this against the
!    increased cost of flushing dirty data pages more often. If 
!    <xref linkend="guc-full-page-writes"> is set (the default), there is 
!    another factor to consider. To ensure data page consistency, 
!    the first modification of a data page after each checkpoint results in 
!    logging the entire page content. In that case,
!    a smaller checkpoint interval increases the volume of output to the WAL log,
!    partially negating the goal of using a smaller interval, 
!    and in any case causing more disk I/O.
    </para>
  
    <para>
***************
*** 234,240 ****
     a message will be output to the server log recommending increasing 
     <varname>checkpoint_segments</varname>.  Occasional appearance of such
     a message is not cause for alarm, but if it appears often then the
!    checkpoint control parameters should be increased.
    </para>
  
    <para>
--- 227,235 ----
     a message will be output to the server log recommending increasing 
     <varname>checkpoint_segments</varname>.  Occasional appearance of such
     a message is not cause for alarm, but if it appears often then the
!    checkpoint control parameters should be increased. Bulk operations such
!    as a COPY, INSERT SELECT etc. may cause a number of such warnings if you
!    do not set <xref linkend="guc-checkpoint-segments"> high enough.
    </para>
  
    <para>
***************
*** 252,258 ****
    </para>
  
    <para>
!    There are two commonly used <acronym>WAL</acronym> functions:
     <function>LogInsert</function> and <function>LogFlush</function>.
     <function>LogInsert</function> is used to place a new record into
     the <acronym>WAL</acronym> buffers in shared memory. If there is no
--- 247,253 ----
    </para>
  
    <para>
!    There are two commonly used internal <acronym>WAL</acronym> functions:
     <function>LogInsert</function> and <function>LogFlush</function>.
     <function>LogInsert</function> is used to place a new record into
     the <acronym>WAL</acronym> buffers in shared memory. If there is no
***************
*** 275,283 ****
     modifying the configuration parameter <xref
     linkend="guc-wal-buffers">.  The default number of <acronym>WAL</acronym>
     buffers is 8.  Increasing this value will
!    correspondingly increase shared memory usage.  (It should be noted
!    that there is presently little evidence to suggest that increasing
!    <varname>wal_buffers</> beyond the default is worthwhile.)
    </para>
  
    <para>
--- 270,280 ----
     modifying the configuration parameter <xref
     linkend="guc-wal-buffers">.  The default number of <acronym>WAL</acronym>
     buffers is 8.  Increasing this value will
!    correspondingly increase shared memory usage.  When 
!    <xref linkend="guc-full-page-writes"> is set and the system is very busy, 
!    setting this value higher will help smooth response times during the 
!    period immediately following each checkpoint.  As a guide, a setting of 1024 
!    would be considered to be high.
    </para>
  
    <para>
***************
*** 313,319 ****
     (provided that <productname>PostgreSQL</productname> has been
     compiled with support for it) will result in each
     <function>LogInsert</function> and <function>LogFlush</function>
!    <acronym>WAL</acronym> call being logged to the server log. This
     option may be replaced by a more general mechanism in the future.
    </para>
   </sect1>
--- 310,317 ----
     (provided that <productname>PostgreSQL</productname> has been
     compiled with support for it) will result in each
     <function>LogInsert</function> and <function>LogFlush</function>
!    <acronym>WAL</acronym> call being logged to the server log. The output
!    is too verbose for use as a guide to performance tuning. This
     option may be replaced by a more general mechanism in the future.
    </para>
   </sect1>

---------------------------(end of broadcast)---------------------------
TIP 3: Have you checked our extensive FAQ?

               http://www.postgresql.org/docs/faq

[PATCHES] Docs for PITR and full_page_writes interaction

Reply via email to