date:20130109

On Tuesday, January 08, 2013 8:57 PM Andres Freund wrote:
> On 2013-01-08 20:33:28 +0530, Amit Kapila wrote:
>> On Tuesday, January 08, 2013 8:01 PM Andres Freund wrote:
>>> On 2013-01-08 19:51:39 +0530, Amit Kapila wrote:
>>>> On Monday, January 07, 2013 7:15 PM Andres Freund wrote:
>>>>> On 2013-01-07 19:03:35 +0530, Amit Kapila wrote:
>>>>>> On Monday, January 07, 2013 6:30 PM Simon Riggs wrote:
>>>>>>> On 7 January 2013 12:39, Amit Kapila amit.kap...@huawei.com wrote:
>>>>>
>>>>> The information that no transactions are currently running allows
>>>>> you to build a recovery snapshot, without that information the
>>>>> standby won't start answering queries. Now that doesn't matter if
>>>>> all standbys already have built a snapshot, but the primary cannot
>>>>> know that.
>>>>
>>>> Can't we make sure that checkpoint operation doesn't happen for
>>>> below conds.
>>>> a. nothing has happened during or after last checkpoint
>>>> OR
>>>> b. nothing except snapshotstanby WAL has happened
>>>>
>>>> Currently it is done for point a.
>>>>
>>>>> Having to issue a checkpoint while ensuring transactions are
>>>>> running just to get a standby up doesn't seem like a good idea to
>>>>> me :)
>>>>
>>>> Simon:
>>>>> If you make the correct test, I'd be more inclined to accept the
>>>>> premise.
>>>>
>>>> Not sure, what exact you are expecting from test?
>>>> The test is do any one operation on system and then keep the system
>>>> idle.
>>>> Now at each checkpoint interval, it logs WAL for SnapshotStandby.
>>>
>>> I can't really follow what you want to do here. The snapshot is only
>>> logged if a checkpoint is performed anyway?  As recovery starts at
>>> (the logical) checkpoint's location we need to log a snapshot exactly
>>> there. If you want to avoid activity when the system is idle you need
>>> to prevent checkpoints from occurring itself.
>>
>> Even if the checkpoint is scheduled, it doesn't perform actual
>> operation if there's nothing logged between current and previous
>> checkpoint due to below check in CreateCheckPoint() function.
>>
>>     if (curInsert == ControlFile->checkPoint +
>>         MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)) &&
>>         ControlFile->checkPoint == ControlFile->checkPointCopy.redo)
>>
>> But if we set the wal_level as hot_standby, it will log snapshot, now
>> next time again when function CreateCheckPoint() will get called due
>> to scheduled checkpoint, the above check will fail and it will again
>> log snapshot, so this will continue, even if the system is totally
>> idle.
>> I understand that it doesn't cause any problem, but I think it is
>> better if the repeated log of snapshot in this scenario can be avoided.
>
> ISTM in that case you just need a way to cope with the additionally
> logged record in the above piece of code. Not logging seems to be the
> entirely wrong way to go at this.
I think one of the ways the code can be modified is as below:

+    /* size of running transactions log when there is no active transaction */
+    if (!shutdown && XLogStandbyInfoActive())
+    {
+        runningXactXLog = MAXALIGN(MinSizeOfXactRunningXacts) + SizeOfXLogRecord;
+    }

!    if (curInsert == ControlFile->checkPoint +
!        MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)) &&
!        ControlFile->checkPoint == ControlFile->checkPointCopy.redo)

!    if (curInsert == ControlFile->checkPoint +
!        MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)) &&
!        ControlFile->checkPoint == ControlFile->checkPointCopy.redo + runningXactXLog)

The second condition compares the last checkpoint's redo position with the
position of its checkpoint record.
ControlFile->checkPointCopy.redo holds the position before the running-xacts
WAL record was inserted, and ControlFile->checkPoint holds the position after
it was inserted. So if no WAL was inserted apart from the running-xacts and
checkpoint records, this condition will be true.
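
To make the arithmetic concrete, here is a minimal sketch of the two predicates using plain integers for WAL positions. The record sizes and the names `skip_current`, `skip_proposed`, `REC_CHECKPOINT` and `REC_RUNNING_XACTS` are made up for illustration; they only stand in for MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)) and runningXactXLog and are not the real values.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical record sizes; the real ones come from the WAL structs. */
#define REC_CHECKPOINT    96u	/* stands in for the checkpoint record size */
#define REC_RUNNING_XACTS 56u	/* stands in for runningXactXLog */

/* Existing test: nothing was logged since the last checkpoint record. */
static bool
skip_current(uint64_t cur_insert, uint64_t checkpoint, uint64_t redo)
{
	return cur_insert == checkpoint + REC_CHECKPOINT &&
		checkpoint == redo;
}

/*
 * Proposed test: additionally tolerate one running-xacts record between
 * the redo position and the checkpoint record.  standby_info_active
 * stands in for (!shutdown && XLogStandbyInfoActive()).
 */
static bool
skip_proposed(uint64_t cur_insert, uint64_t checkpoint, uint64_t redo,
			  bool standby_info_active)
{
	uint64_t	running_xact_len = standby_info_active ? REC_RUNNING_XACTS : 0;

	assert(checkpoint >= redo);
	return cur_insert == checkpoint + REC_CHECKPOINT &&
		checkpoint == redo + running_xact_len;
}
```

With a snapshot record in front of the checkpoint record, `skip_current` is false on every cycle, while `skip_proposed` is true as long as nothing else was written.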


> Not logging seems to be the entirely wrong way to go at this.

True.

> I admit its not totally simple, but making HS less predictable seems
> like a cure *far* worse than the disease.

Right, that's why I am trying to figure out if there can be a way to handle
this without any compromise on HS.

With Regards,
Amit Kapila.



-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] proposal: Set effective_cache_size to greater of .conf value, shared_buffers

2013-01-09 Thread Benedikt Grundmann

On Wed, Jan 9, 2013 at 2:01 AM, Josh Berkus j...@agliodbs.com wrote:

> All,
>
>> Well, the problem of find out the box's physical RAM is doubtless
>> solvable if we're willing to put enough sweat and tears into it, but
>> I'm dubious that it's worth the trouble.  The harder part is how to know
>> if the box is supposed to be dedicated to the database.  Bear in mind
>> that the starting point of this debate was the idea that we're talking
>> about an inexperienced DBA who doesn't know about any configuration knob
>> we might provide for the purpose.

For what it is worth, even if it is a dedicated database box, 75% might be
way too high. I remember investigating bad performance on our biggest
database server that in the end turned out to be caused by too high a
setting of effective_cache_size. From reading the code back then, my
rationale for it being too high was that the code that makes use of
effective_cache_size tries very hard to account for what the current query
would do to the cache, but doesn't take into account how many queries (on
separate datasets!) are currently being executed (and competing for the
same cache).  On that box we often have 100+ active connections, many of
them looking at different big datasets.

Cheers,

bene

[HACKERS] inconsistent behave of boolean based domains in XML functions

2013-01-09 Thread Pavel Stehule

Hello

An issue with boolean-based domains and XML functions was reported on
the Czech pg mailing list.

There may be more issues, but this result in particular is a little
strange and unexpected:

postgres=# CREATE DOMAIN booldomain as bool;
CREATE DOMAIN

-- fully expected behavior
postgres=# select true, true::booldomain;
 bool | booldomain
------+------------
 t    | t
(1 row)

postgres=# select true::text, true::booldomain::text;
 text | text
------+------
 true | true
(1 row)

-- unexpected behavior
postgres=# select xmlforest(true as bool, true::booldomain as booldomain);
                  xmlforest
---------------------------------------------
 <bool>true</bool><booldomain>t</booldomain>
(1 row)
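
One plausible reading of this output, which the report itself does not confirm, is that the XML value mapping dispatches on the declared type and never resolves a domain to its base type, so the domain falls through to the type's ordinary text output. The sketch below is a toy model of that guess; `TypeId`, `base_type_of`, `xml_value_naive` and `xml_value_resolved` are hypothetical names, not PostgreSQL functions.

```c
/* Hypothetical type identifiers; real PostgreSQL type OIDs differ. */
typedef enum { TYPE_BOOL, TYPE_BOOLDOMAIN } TypeId;

/* Assumed helper: resolve a domain to its underlying base type. */
static TypeId
base_type_of(TypeId t)
{
	return t == TYPE_BOOLDOMAIN ? TYPE_BOOL : t;
}

/* The type's ordinary text output, as a plain SELECT displays it. */
static const char *
native_output(int value)
{
	return value ? "t" : "f";
}

/* Dispatch on the declared type only: a domain misses the bool case. */
static const char *
xml_value_naive(TypeId t, int value)
{
	if (t == TYPE_BOOL)
		return value ? "true" : "false";
	return native_output(value);
}

/* Dispatch after resolving the domain: both spellings agree. */
static const char *
xml_value_resolved(TypeId t, int value)
{
	return xml_value_naive(base_type_of(t), value);
}
```

In this model `xml_value_naive(TYPE_BOOLDOMAIN, 1)` yields "t" while `xml_value_resolved` yields "true", which matches the difference between the two elements shown above.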

Best regards

Pavel Stehule



Re: [HACKERS] Extra XLOG in Checkpoint for StandbySnapshot

On 2013-01-09 14:04:32 +0530, Amit Kapila wrote:
> I think one of the ways the code can be modified is as below:
>
> +    /* size of running transactions log when there is no active transaction */
> +    if (!shutdown && XLogStandbyInfoActive())
> +    {
> +        runningXactXLog = MAXALIGN(MinSizeOfXactRunningXacts) + SizeOfXLogRecord;
> +    }
>
> !    if (curInsert == ControlFile->checkPoint +
> !        MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)) &&
> !        ControlFile->checkPoint == ControlFile->checkPointCopy.redo)
>
> !    if (curInsert == ControlFile->checkPoint +
> !        MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)) &&
> !        ControlFile->checkPoint == ControlFile->checkPointCopy.redo + runningXactXLog)
>
> The second condition compares the last checkpoint's redo position with the
> position of its checkpoint record.
> ControlFile->checkPointCopy.redo holds the position before the running-xacts
> WAL record was inserted, and ControlFile->checkPoint holds the position after
> it was inserted. So if no WAL was inserted apart from the running-xacts and
> checkpoint records, this condition will be true.


I don't think that's safe: there could have been another record inserted
that happens to be MinSizeOfXactRunningXacts big, and we would still skip
the checkpoint.
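
The objection can be pictured with a small model in which each record between the redo position and the checkpoint record carries a kind and a length. Everything here (`RecKind`, `Rec`, `RUNNING_XACTS_LEN`, `idle_by_size`, `idle_by_kind`) is hypothetical, and the kind-aware test is included only as a contrast, not as something proposed in the thread.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical record kinds and length; not PostgreSQL's definitions. */
typedef enum { REC_RUNNING_XACTS, REC_OTHER } RecKind;

#define RUNNING_XACTS_LEN 56u

typedef struct
{
	RecKind		kind;
	uint32_t	len;
} Rec;

/* Size-only test: do the bytes add up to exactly one snapshot record? */
static bool
idle_by_size(const Rec *recs, size_t n)
{
	uint64_t	total = 0;

	for (size_t i = 0; i < n; i++)
		total += recs[i].len;
	return total == RUNNING_XACTS_LEN;
}

/* Kind-aware test: exactly one record, and it is the snapshot record. */
static bool
idle_by_kind(const Rec *recs, size_t n)
{
	assert(recs != NULL || n == 0);
	return n == 1 && recs[0].kind == REC_RUNNING_XACTS;
}
```

A single `REC_OTHER` record of length `RUNNING_XACTS_LEN` passes `idle_by_size` but not `idle_by_kind`, which is the case in which a needed checkpoint would be skipped.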

Greetings,

Andres Freund

--
 Andres Freund http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services



Re: [HACKERS] Cascading replication: should we detect/prevent cycles?

On 9 January 2013 01:51, Josh Berkus j...@agliodbs.com wrote:

> Anyway, I'm not saying we solve this now.  I'm saying, put it on the
> TODO list in case someone has time/an itch to scratch.

I think it's reasonable to ask whether a usability feature needs to
exist whenever a problem is encountered. That shouldn't need to
translate to a new feature/TODO every time we ask the question though.

IMHO, in this case, we should document this as an issue that can
happen and we should caution that careful testing is required.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services



Re: [HACKERS] Extra XLOG in Checkpoint for StandbySnapshot

On Wednesday, January 09, 2013 2:28 PM Andres Freund wrote:
> On 2013-01-09 14:04:32 +0530, Amit Kapila wrote:
>> I think one of the ways the code can be modified is as below:
>>
>> +    /* size of running transactions log when there is no active transaction */
>> +    if (!shutdown && XLogStandbyInfoActive())
>> +    {
>> +        runningXactXLog = MAXALIGN(MinSizeOfXactRunningXacts) + SizeOfXLogRecord;
>> +    }
>>
>> !    if (curInsert == ControlFile->checkPoint +
>> !        MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)) &&
>> !        ControlFile->checkPoint == ControlFile->checkPointCopy.redo)
>>
>> !    if (curInsert == ControlFile->checkPoint +
>> !        MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)) &&
>> !        ControlFile->checkPoint == ControlFile->checkPointCopy.redo + runningXactXLog)
>>
>> The second condition compares the last checkpoint's redo position with the
>> position of its checkpoint record.
>> ControlFile->checkPointCopy.redo holds the position before the running-xacts
>> WAL record was inserted, and ControlFile->checkPoint holds the position after
>> it was inserted. So if no WAL was inserted apart from the running-xacts and
>> checkpoint records, this condition will be true.
>
> I don't think thats safe, there could have been another record inserted
> that happens to be MinSizeOfXactRunningXacts big and we would still
> skip the checkpoint.

I think that can happen only the first time a checkpoint is triggered,
and even then the first part of the check (curInsert ==
ControlFile->checkPoint + MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)))
will fail.

runningXactXLog is assigned a value only if wal_level is hot_standby.
In that case, if a checkpoint is scheduled for the second or a subsequent
time, it will include the WAL for running xacts along with the WAL for any
other data.

Re: [HACKERS] PL/perl should fail on configure, not make

2013-01-09 Thread Christoph Berg

Re: Tom Lane 2013-01-09 9802.1357702...@sss.pgh.pa.us
> Item: there is not a test for perl.h, as such, in configure.  There
> probably should be, just because we have comparable tests for tcl.h
> and Python.h.  However, adding one won't fix your problem on
> Debian-based distros, because for some wacko reason they put the
> headers and the shlib .so symlink in different packages, cf
> http://packages.debian.org/squeeze/amd64/perl/filelist
> http://packages.debian.org/squeeze/amd64/libperl-dev/filelist
> I am unfamiliar with a good reason for doing that.  (I can certainly
> see segregating the .a static library, or even not shipping it at
> all, but what's it save to leave out the .so symlink?)

Because the .so symlink is only needed at build time. At runtime, you
need the .so.5.14 file. Hence .so.* -> $pkg, .h .a .so -> $pkg-dev.

Christoph
-- 
c...@df7cb.de | http://www.df7cb.de/



Re: [HACKERS] Re: Proposal: Store timestamptz of database creation on pg_database

2013-01-09 Thread Hannu Krosing

One thing I'd really like to be in this common object info catalog is the DDL
which created or altered the referenced object.
If we additionally could make it possible to have ordinary triggers on this
catalog, it would solve most logical DDL replication problems.

Hannu

Sent from Samsung Galaxy Note

Peter Eisentraut pete...@gmx.net wrote:
> On Tue, 2013-01-08 at 17:17 -0500, Stephen Frost wrote:
>> Seriously tho, the argument for not putting these things into the
>> various individual catalogs is that they'd create bloat and these
>> items don't need to be performant.  I would think that the kind of
>> timestamps that we're talking about fall into the same data category
>> as comments on tables.
>>
>> If there isn't a good reason for comments on objects to be off in a
>> generic "this is for any kind of object" table, then perhaps we should
>> move them into the appropriate catalog tables?
>
> I think basic refactoring logic would support taking common things out
> of the individual catalogs and keeping them in a common structure,
> especially when they are for amusement only and not needed in any
> critical paths.  All the ALTER command refactoring and so on that's been
> going on is also moving into the direction that for data definition
> management, there should be mainly one kind of object with a few
> variants here and there.

Re: [HACKERS] Re: patch submission: truncate trailing nulls from heap rows to reduce the size of the null bitmap [Review]

On 24 December 2012 16:57, Amit Kapila amit.kap...@huawei.com wrote:

> Performance: Average of 3 runs of pgbench in tps
> 9.3devel  |  with trailing null patch
> ----------+--------------------------
> 578.9872  |   573.4980

On balance, it would seem optimizing for this special case would
affect everybody negatively; not much, but enough. Which means we
should reject this patch.

Do you have a reason why a different view might be taken?

-- 
 Simon Riggs   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services



Re: [HACKERS] Performance Improvement by reducing WAL for Update Operation

On 9 January 2013 08:05, Amit kapila amit.kap...@huawei.com wrote:

> Update patch contains handling of below Comments

Thanks

> Test results with modified pgbench (1800 record size) on the latest patch:
>
> -Patch-            -tps@-c1-   -WAL@-c1-   -tps@-c2-   -WAL@-c2-
> Head                  831       4.17 GB      1416       7.13 GB
> WAL modification      846       2.36 GB      1712       3.31 GB
>
> -Patch-            -tps@-c4-   -WAL@-c4-   -tps@-c8-   -WAL@-c8-
> Head                 2196      11.01 GB      2758      13.88 GB
> WAL modification     3295       5.87 GB      5472       9.02 GB

And test results on normal pgbench?

-- 
 Simon Riggs   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services



[HACKERS] [PATCH 2/2] use pg_malloc instead of an unchecked malloc in pg_resetxlog

---
 src/bin/pg_resetxlog/pg_resetxlog.c | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/src/bin/pg_resetxlog/pg_resetxlog.c b/src/bin/pg_resetxlog/pg_resetxlog.c
index 8734f2c..60fb30c 100644
--- a/src/bin/pg_resetxlog/pg_resetxlog.c
+++ b/src/bin/pg_resetxlog/pg_resetxlog.c
@@ -54,6 +54,7 @@
 #include "access/xlog_internal.h"
 #include "catalog/catversion.h"
 #include "catalog/pg_control.h"
+#include "port/palloc.h"
 
 extern int	optind;
 extern char *optarg;
@@ -390,7 +391,7 @@ ReadControlFile(void)
 	}
 
 	/* Use malloc to ensure we have a maxaligned buffer */
-	buffer = (char *) malloc(PG_CONTROL_SIZE);
+	buffer = (char *) pg_malloc(PG_CONTROL_SIZE);
 
 	len = read(fd, buffer, PG_CONTROL_SIZE);
 	if (len < 0)
@@ -904,7 +905,7 @@ WriteEmptyXLOG(void)
 	int			nbytes;
 
 	/* Use malloc() to ensure buffer is MAXALIGNED */
-	buffer = (char *) malloc(XLOG_BLCKSZ);
+	buffer = (char *) pg_malloc(XLOG_BLCKSZ);
 	page = (XLogPageHeader) buffer;
 	memset(buffer, 0, XLOG_BLCKSZ);
 


[HACKERS] [PATCH] unified frontend support for pg_malloc et al and palloc/pfree mulation (was xlogreader-v4)

Hi,

As promised here's a patch to provide palloc emulation for frontend-ish
environments.

The patch:
- makes palloc() into a real function so CurrentMemoryContext doesn't
  need to be provided
- provides common pg_(malloc,malloc0, realloc, strdup, free) wrappers
  and removes various versions of those across different utilities
- removes ugly palloc redefinery for frontend use of backend code (dirmod.c)

Controversial/Unclear things:
- palloc[0] are currently copies of the MemoryContextAlloc[Zero]
  functions to preclude performance regressions, imo the level of
  duplication is ok though
- the common memory management is implemented in [pg]port/palloc.[ch], I
  am not too happy with the name and location
- pgport/palloc.c is only built in the backend, not sure if there is a
  nicer way to do this from a make POV
- the different versions of pg_malloc et al used different error
  signaling methods, I've settled on
fprintf(stderr, _("out of memory\n"));
exit(EXIT_FAILURE);

Results in a nice net removal of code:
 37 files changed, 218 insertions(+), 621 deletions(-)
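
For readers without the patch at hand, the kind of shared wrapper being described can be sketched roughly as follows. This is not the patch's palloc.c: it only mirrors the malloc(0) guard seen in the duplicated per-utility copies and the fprintf/exit error convention named above, it leaves out the `_()` translation macro, and `out_of_memory` is a hypothetical helper name.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Report allocation failure on stderr and exit (hypothetical helper). */
static void
out_of_memory(void)
{
	fprintf(stderr, "out of memory\n");
	exit(EXIT_FAILURE);
}

/* malloc that never returns NULL. */
void *
pg_malloc(size_t size)
{
	void	   *ptr;

	/* Avoid unportable behavior of malloc(0) */
	if (size == 0)
		size = 1;
	ptr = malloc(size);
	if (ptr == NULL)
		out_of_memory();
	return ptr;
}

/* Like pg_malloc, but the returned memory is zero-filled. */
void *
pg_malloc0(size_t size)
{
	void	   *ptr = pg_malloc(size);

	memset(ptr, 0, size);
	return ptr;
}

/* Copy a string into freshly allocated memory. */
char *
pg_strdup(const char *str)
{
	size_t		len = strlen(str) + 1;
	char	   *copy = pg_malloc(len);

	memcpy(copy, str, len);
	return copy;
}
```

Callers can then drop their own NULL checks, which is where most of the removed lines in the diffstat come from.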




[HACKERS] information schema parameter_default implementation

2013-01-09 Thread Peter Eisentraut

Here is an implementation of the
information_schema.parameters.parameter_default column.

I ended up writing a C function to decode the whole thing from the
system catalogs, because it was too complicated in SQL, so I abandoned
the approach discussed in [0].
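
The core of the C function in the patch below is an index calculation: defaults belong to the last input arguments, so the position of an argument has to be translated into an index into the defaults list. The following standalone sketch reproduces that arithmetic under simplified assumptions; `ArgMode` and `default_index` are hypothetical names, and the explicit guard for OUT arguments is an addition of this sketch rather than something the patch contains.

```c
#include <stddef.h>

/* Hypothetical mode codes for IN / OUT / INOUT / VARIADIC arguments. */
typedef enum { MODE_IN, MODE_OUT, MODE_INOUT, MODE_VARIADIC } ArgMode;

/*
 * Return the zero-based index into the defaults list for the 1-based
 * argument position argn, or -1 when that argument has no default.
 * modes may be NULL, meaning every argument is an input argument.
 */
static int
default_index(const ArgMode *modes, int numargs, int argn,
			  int n_input_args, int n_defaults)
{
	int			inputargn = 0;
	int			nth;

	if (argn < 1 || argn > numargs)
		return -1;

	/* Guard added in this sketch: OUT arguments never have a default. */
	if (modes != NULL && modes[argn - 1] == MODE_OUT)
		return -1;

	/* Count the input arguments up to and including position argn. */
	for (int i = 0; i < argn; i++)
	{
		if (modes == NULL || modes[i] != MODE_OUT)
			inputargn++;
	}

	/* Defaults attach to the last n_defaults input arguments. */
	nth = inputargn - 1 - (n_input_args - n_defaults);
	if (nth < 0 || nth >= n_defaults)
		return -1;
	return nth;
}
```

For a function with three input arguments of which the last two have defaults, positions 1, 2 and 3 map to -1, 0 and 1 respectively.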


[0]: 
http://archives.postgresql.org/message-id/1356092400.25658.6.ca...@vanquo.pezone.net
diff --git a/doc/src/sgml/information_schema.sgml b/doc/src/sgml/information_schema.sgml
index ddbc56c..4fa4ab8 100644
--- a/doc/src/sgml/information_schema.sgml
+++ b/doc/src/sgml/information_schema.sgml
@@ -3323,6 +3323,15 @@ <title><literal>parameters</literal> Columns</title>
        in future versions.)
       </entry>
      </row>
+
+     <row>
+      <entry><literal>parameter_default</literal></entry>
+      <entry><type>character_data</type></entry>
+      <entry>
+       The default expression of the parameter, or null if none or if the
+       function is not owned by a currently enabled role.
+      </entry>
+     </row>
     </tbody>
    </tgroup>
   </table>
diff --git a/src/backend/catalog/information_schema.sql b/src/backend/catalog/information_schema.sql
index 2307586..82d686a 100644
--- a/src/backend/catalog/information_schema.sql
+++ b/src/backend/catalog/information_schema.sql
@@ -1132,10 +1132,15 @@ CREATE VIEW parameters AS
CAST(null AS sql_identifier) AS scope_schema,
CAST(null AS sql_identifier) AS scope_name,
CAST(null AS cardinal_number) AS maximum_cardinality,
-   CAST((ss.x).n AS sql_identifier) AS dtd_identifier
+   CAST((ss.x).n AS sql_identifier) AS dtd_identifier,
+   CAST(
+ CASE WHEN pg_has_role(proowner, 'USAGE')
+  THEN pg_get_function_arg_default(p_oid, (ss.x).n)
+  ELSE NULL END
+ AS character_data) AS parameter_default
 
 FROM pg_type t, pg_namespace nt,
- (SELECT n.nspname AS n_nspname, p.proname, p.oid AS p_oid,
+ (SELECT n.nspname AS n_nspname, p.proname, p.oid AS p_oid, p.proowner,
  p.proargnames, p.proargmodes,
  _pg_expandarray(coalesce(p.proallargtypes, p.proargtypes::oid[])) AS x
   FROM pg_namespace n, pg_proc p
diff --git a/src/backend/utils/adt/ruleutils.c b/src/backend/utils/adt/ruleutils.c
index 266cec5..b9ebb78 100644
--- a/src/backend/utils/adt/ruleutils.c
+++ b/src/backend/utils/adt/ruleutils.c
@@ -2248,6 +2248,76 @@ static char *generate_function_name(Oid funcid, int nargs, List *argnames,
 	return argsprinted;
 }
 
+Datum
+pg_get_function_arg_default(PG_FUNCTION_ARGS)
+{
+	Oid			funcid = PG_GETARG_OID(0);
+	int32		argn = PG_GETARG_INT32(1);
+	HeapTuple	proctup;
+	Form_pg_proc proc;
+	int			numargs;
+	Oid		   *argtypes;
+	char	  **argnames;
+	char	   *argmodes;
+	int			i;
+	List	   *argdefaults;
+	Node	   *node;
+	char	   *str;
+	int			inputargn;
+	Datum		proargdefaults;
+	bool		isnull;
+	int			nth;
+
+	proctup = SearchSysCache1(PROCOID, ObjectIdGetDatum(funcid));
+	if (!HeapTupleIsValid(proctup))
+		elog(ERROR, "cache lookup failed for function %u", funcid);
+
+	numargs = get_func_arg_info(proctup, &argtypes, &argnames, &argmodes);
+	if (argn > numargs)
+	{
+		ReleaseSysCache(proctup);
+		PG_RETURN_NULL();
+	}
+
+	inputargn = 0;
+
+	for (i = 0; i < argn; i++)
+	{
+		if (!argmodes || argmodes[i] == PROARGMODE_IN || argmodes[i] == PROARGMODE_INOUT || argmodes[i] == PROARGMODE_VARIADIC)
+			inputargn++;
+	}
+
+	proargdefaults = SysCacheGetAttr(PROCOID, proctup,
+	 Anum_pg_proc_proargdefaults,
+	 &isnull);
+
+	if (isnull)
+	{
+		ReleaseSysCache(proctup);
+		PG_RETURN_NULL();
+	}
+
+	str = TextDatumGetCString(proargdefaults);
+	argdefaults = (List *) stringToNode(str);
+	Assert(IsA(argdefaults, List));
+	pfree(str);
+
+	proc = (Form_pg_proc) GETSTRUCT(proctup);
+
+	nth = inputargn - 1 - (proc->pronargs - proc->pronargdefaults);
+	if (nth < 0 || nth >= list_length(argdefaults))
+	{
+		ReleaseSysCache(proctup);
+		PG_RETURN_NULL();
+	}
+	node = list_nth(argdefaults, nth);
+	str = deparse_expression_pretty(node, NIL, false, false, 0, 0);
+
+	ReleaseSysCache(proctup);
+
+	PG_RETURN_TEXT_P(string_to_text(str));
+}
+
 
 /*
  * deparse_expression			- General utility for deparsing expressions
diff --git a/src/include/catalog/catversion.h b/src/include/catalog/catversion.h
index 1e235c6..dc38532 100644
--- a/src/include/catalog/catversion.h
+++ b/src/include/catalog/catversion.h
@@ -53,6 +53,6 @@
  */
 
 /*			mmddN */
-#define CATALOG_VERSION_NO	201212081
+#define CATALOG_VERSION_NO	201212261
 
 #endif
diff --git a/src/include/catalog/pg_proc.h b/src/include/catalog/pg_proc.h
index 010605d..64fbe7e 100644
--- a/src/include/catalog/pg_proc.h
+++ b/src/include/catalog/pg_proc.h
@@ -1964,6 +1964,8 @@ DATA(insert OID = 2232 (  pg_get_function_identity_arguments	   PGNSP PGUID 12 1
 DESCR("identity argument list of a function");
 DATA(insert OID = 2165 (  pg_get_function_result	   PGNSP PGUID 12 1 0 0 0 f f f f t f s 1 0 25 26 _null_ _null_ _null_ _null_ pg_get_function_result

[HACKERS] [PATCH 1/2] Provide a common malloc wrappers and palloc et al. emulation for frontend'ish environs

---
 contrib/oid2name/oid2name.c|  52 +
 contrib/pg_upgrade/pg_upgrade.h|   5 +-
 contrib/pg_upgrade/util.c  |  49 -
 contrib/pgbench/pgbench.c  |  54 +-
 src/backend/utils/mmgr/mcxt.c  |  78 +++-
 src/bin/initdb/initdb.c|  40 +-
 src/bin/pg_basebackup/pg_basebackup.c  |   2 +-
 src/bin/pg_basebackup/pg_receivexlog.c |   1 +
 src/bin/pg_basebackup/receivelog.c |   1 +
 src/bin/pg_basebackup/streamutil.c |  38 +-
 src/bin/pg_basebackup/streamutil.h |   4 -
 src/bin/pg_ctl/pg_ctl.c|  39 +-
 src/bin/pg_dump/Makefile   |   6 +-
 src/bin/pg_dump/common.c   |   1 -
 src/bin/pg_dump/compress_io.c  |   1 -
 src/bin/pg_dump/dumpmem.c  |  76 ---
 src/bin/pg_dump/dumpmem.h  |  22 --
 src/bin/pg_dump/dumputils.h|   1 +
 src/bin/pg_dump/pg_backup_archiver.c   |   1 -
 src/bin/pg_dump/pg_backup_custom.c |   2 +-
 src/bin/pg_dump/pg_backup_db.c |   1 -
 src/bin/pg_dump/pg_backup_directory.c  |   1 -
 src/bin/pg_dump/pg_backup_null.c   |   1 -
 src/bin/pg_dump/pg_backup_tar.c|   1 -
 src/bin/pg_dump/pg_dump.c  |   1 -
 src/bin/pg_dump/pg_dump_sort.c |   1 -
 src/bin/pg_dump/pg_dumpall.c   |   1 -
 src/bin/pg_dump/pg_restore.c   |   1 -
 src/bin/psql/common.c  |  50 -
 src/bin/psql/common.h  |  10 +--
 src/bin/scripts/common.c   |  49 -
 src/bin/scripts/common.h   |   5 +-
 src/include/port/palloc.h  |  19 +
 src/include/utils/palloc.h |  12 +--
 src/port/Makefile  |   8 +-
 src/port/dirmod.c  |  75 +--
 src/port/palloc.c  | 130 +
 37 files changed, 218 insertions(+), 621 deletions(-)
 delete mode 100644 src/bin/pg_dump/dumpmem.c
 delete mode 100644 src/bin/pg_dump/dumpmem.h
 create mode 100644 src/include/port/palloc.h
 create mode 100644 src/port/palloc.c

diff --git a/contrib/oid2name/oid2name.c b/contrib/oid2name/oid2name.c
index a666731..dfd8105 100644
--- a/contrib/oid2name/oid2name.c
+++ b/contrib/oid2name/oid2name.c
@@ -9,6 +9,8 @@
  */
 #include "postgres_fe.h"
 
+#include "port/palloc.h"
+
 #include <unistd.h>
 #ifdef HAVE_GETOPT_H
 #include <getopt.h>
@@ -50,9 +52,6 @@ struct options
 /* function prototypes */
 static void help(const char *progname);
 void		get_opts(int, char **, struct options *);
-void	   *pg_malloc(size_t size);
-void	   *pg_realloc(void *ptr, size_t size);
-char	   *pg_strdup(const char *str);
 void		add_one_elt(char *eltname, eary *eary);
 char	   *get_comma_elts(eary *eary);
 PGconn	   *sql_conn(struct options *);
@@ -201,53 +200,6 @@ help(const char *progname)
 		   progname, progname);
 }
 
-void *
-pg_malloc(size_t size)
-{
-	void	   *ptr;
-
-	/* Avoid unportable behavior of malloc(0) */
-	if (size == 0)
-		size = 1;
-	ptr = malloc(size);
-	if (!ptr)
-	{
-		fprintf(stderr, "out of memory\n");
-		exit(1);
-	}
-	return ptr;
-}
-
-void *
-pg_realloc(void *ptr, size_t size)
-{
-	void	   *result;
-
-	/* Avoid unportable behavior of realloc(NULL, 0) */
-	if (ptr == NULL && size == 0)
-		size = 1;
-	result = realloc(ptr, size);
-	if (!result)
-	{
-		fprintf(stderr, "out of memory\n");
-		exit(1);
-	}
-	return result;
-}
-
-char *
-pg_strdup(const char *str)
-{
-	char	   *result = strdup(str);
-
-	if (!result)
-	{
-		fprintf(stderr, "out of memory\n");
-		exit(1);
-	}
-	return result;
-}
-
 /*
  * add_one_elt
  *
diff --git a/contrib/pg_upgrade/pg_upgrade.h b/contrib/pg_upgrade/pg_upgrade.h
index c1a2f53..3324918 100644
--- a/contrib/pg_upgrade/pg_upgrade.h
+++ b/contrib/pg_upgrade/pg_upgrade.h
@@ -11,6 +11,7 @@
 #include <sys/time.h>
 
 #include "libpq-fe.h"
+#include "port/palloc.h"
 
 /* Use port in the private/dynamic port number range */
 #define DEF_PGUPORT			50432
@@ -438,10 +439,6 @@ void
 prep_status(const char *fmt,...)
 __attribute__((format(PG_PRINTF_ATTRIBUTE, 1, 2)));
 void		check_ok(void);
-char	   *pg_strdup(const char *s);
-void	   *pg_malloc(size_t size);
-void	   *pg_realloc(void *ptr, size_t size);
-void		pg_free(void *ptr);
 const char *getErrorText(int errNum);
 unsigned int str2uint(const char *str);
 void		pg_putenv(const char *var, const char *val);
diff --git a/contrib/pg_upgrade/util.c b/contrib/pg_upgrade/util.c
index c91003a..80d0733 100644
--- a/contrib/pg_upgrade/util.c
+++ b/contrib/pg_upgrade/util.c
@@ -213,55 +213,6 @@ get_user_info(char **user_name)
 }
 
 
-void *
-pg_malloc(size_t size)
-{
-	void	   *p;
-
-	/* Avoid unportable behavior of malloc(0) */
-	if (size == 0)
-		size = 1;
-	p = malloc(size);
-	if (p == NULL)
-		pg_log(PG_FATAL, "%s: out of memory\n", os_info.progname);
-	return p;
-}
-
-void *
-pg_realloc(void *ptr, size_t size)
-{
-	void	   *p;
-

Re: [HACKERS] [PATCH] unified frontend support for pg_malloc et al and palloc/pfree emulation (was xlogreader-v4)

2013-01-09 Thread Heikki Linnakangas


On 09.01.2013 13:27, Andres Freund wrote:

> - makes palloc() into a real function so CurrentMemoryContext doesn't
>   need to be provided


I don't understand the need for this change. Can't you just:

#define palloc(s) pg_malloc(s)

in the frontend context?

- Heikki


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Further pg_upgrade analysis for many tables

On 23 November 2012 22:34, Jeff Janes jeff.ja...@gmail.com wrote:

 I got rid of need_eoxact_work entirely and replaced it with a short
 list that fulfills the functions of indicating that work is needed,
 and suggesting which rels might need that work.  There is no attempt
 to prevent duplicates, nor to remove invalidated entries from the
 list.   Invalid entries are skipped when the hash entry is not found,
 and processing is idempotent so duplicates are not a problem.

 Formally speaking, if MAX_EOXACT_LIST were 0, so that the list
 overflowed the first time it was accessed, then it would be identical
 to the current behavior of having only a flag.  So formally all I did
 was increase the max from 0 to 10.

...

 It is not obvious what value to set the MAX list size to.

A few questions, that may help you...

Why did you pick 10, when your create temp table example needs 110?

Why does the list not grow as needed?
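
For readers following along, the bounded list described in the quoted text can be sketched roughly as below. This is only an illustration under stated assumptions, not the patch itself: the type and function names (EOXactList, eoxact_list_add and friends) are hypothetical, and only MAX_EOXACT_LIST and its value of 10 come from the quoted description.

```c
#include <stdbool.h>
#include <stddef.h>

#define MAX_EOXACT_LIST 10      /* value taken from the quoted description */

typedef unsigned int Oid;       /* stand-in for the backend's Oid type */

/* Hypothetical container: a short fixed-size list plus an overflow flag. */
typedef struct
{
    Oid  rels[MAX_EOXACT_LIST]; /* rels that may need end-of-xact work */
    int  len;                   /* number of slots in use */
    bool overflowed;            /* true once an add did not fit */
} EOXactList;

/* Remember a rel. Duplicates are allowed because processing is idempotent. */
static void
eoxact_list_add(EOXactList *list, Oid relid)
{
    if (list->len < MAX_EOXACT_LIST)
        list->rels[list->len++] = relid;
    else
        list->overflowed = true;    /* caller falls back to a full scan */
}

/* True if any end-of-xact work was recorded at all. */
static bool
eoxact_has_work(const EOXactList *list)
{
    return list->len > 0 || list->overflowed;
}

/* True if the list cannot be trusted and every entry must be visited. */
static bool
eoxact_needs_full_scan(const EOXactList *list)
{
    return list->overflowed;
}

/* Clear the list once end-of-xact processing is done. */
static void
eoxact_list_reset(EOXactList *list)
{
    list->len = 0;
    list->overflowed = false;
}
```

With MAX_EOXACT_LIST set to 0 every add would set the overflow flag, which is the "only a flag" behaviour mentioned in the quoted text. A list that grows on demand would replace the fixed array with a reallocated one and drop the flag.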

-- 
 Simon Riggs   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services



Re: [HACKERS] [PATCH] unified frontend support for pg_malloc et al and palloc/pfree emulation (was xlogreader-v4)

On 2013-01-09 13:46:53 +0200, Heikki Linnakangas wrote:
 On 09.01.2013 13:27, Andres Freund wrote:
 - makes palloc() into a real function so CurrentMemoryContext doesn't
need to be provided

 I don't understand the need for this change. Can't you just:

 #define palloc(s) pg_malloc(s)

 in the frontend context?

Yes, that would be possible, but IMO it's the inferior solution:
* it precludes ever sharing code without compiling twice
* removing allows us to get rid of the following ugliness in dirmod.c:
-#ifndef FRONTEND
-
-/*
- * On Windows, call non-macro versions of palloc; we can't reference
- * CurrentMemoryContext in this file because of PGDLLIMPORT conflict.
- */
-#if defined(WIN32) || defined(__CYGWIN__)
-#undef palloc
-#undef pstrdup
-#define palloc(sz) pgport_palloc(sz)
-#define pstrdup(str)   pgport_pstrdup(str)
-#endif
-#else  /* FRONTEND */
-
* it opens the window for moving more stuff from utils/palloc.h to memutils.h
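
To make the two options concrete, here is a minimal sketch of what a frontend-side emulation could look like when palloc() is a real function rather than a macro. It is an illustration only, assuming a plain malloc-backed implementation; it is not the code from the patch.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* malloc wrapper that exits on failure, modelled on the removed copies. */
void *
pg_malloc(size_t size)
{
    void *ptr;

    /* Avoid unportable behavior of malloc(0) */
    if (size == 0)
        size = 1;
    ptr = malloc(size);
    if (!ptr)
    {
        fprintf(stderr, "out of memory\n");
        exit(1);
    }
    return ptr;
}

/*
 * Macro variant, as suggested upthread:
 *     #define palloc(s) pg_malloc(s)
 * Code using it has to be compiled separately for frontend and backend.
 *
 * Function variant: callers link against whichever palloc() the
 * environment provides and need no memory context in scope.
 */
void *
palloc(size_t size)
{
    return pg_malloc(size);
}

char *
pstrdup(const char *str)
{
    size_t len = strlen(str) + 1;
    char  *copy = palloc(len);

    memcpy(copy, str, len);
    return copy;
}

void
pfree(void *ptr)
{
    free(ptr);
}
```

The function variant costs one extra call per allocation, which is the trade-off against being able to share object code between frontend and backend.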

Greetings,

Andres Freund

--
 Andres Freund http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services



Re: [HACKERS] Further pg_upgrade analysis for many tables

On 9 November 2012 18:50, Jeff Janes jeff.ja...@gmail.com wrote:

 quadratic behavior in the resource owner/lock table

I didn't want to let that particular phrase go by without asking
exactly what behaviour that is, so we can discuss fixing that also.

This may be something I already know about, but it's worth asking about.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services



Re: [HACKERS] Re: patch submission: truncate trailing nulls from heap rows to reduce the size of the null bitmap [Review]

On Wednesday, January 09, 2013 4:52 PM Simon Riggs wrote:
 On 24 December 2012 16:57, Amit Kapila amit.kap...@huawei.com wrote:
 
  Performance: Average of 3 runs of pgbench in tps
  9.3devel  |  with trailing null patch
  ----------+--------------------------
  578.9872  |   573.4980
 
 On balance, it would seem optimizing for this special case would
 affect everybody negatively; not much, but enough. Which means we
 should reject this patch.
 
 Do you have a reason why a different view might be taken?

I have tried to dig into why this gap appears. I have observed that there is
very little change in the normal path.
I wanted to give it some more time to find out exactly whether something can
be done to avoid the performance dip in normal execution.

Right now I am busy with other work, but in the coming week or so I will
definitely set aside time to work on it again.

With Regards,
Amit Kapila.




Re: [HACKERS] Performance Improvement by reducing WAL for Update Operation

On Wednesday, January 09, 2013 4:57 PM Simon Riggs wrote:
 On 9 January 2013 08:05, Amit kapila amit.kap...@huawei.com wrote:
 
  Update patch contains handling of below Comments
 
 Thanks
 
 
  Test results with modified pgbench (1800 record size) on the latest patch:

  -Patch-             -tps@-c1-  -WAL@-c1-  -tps@-c2-  -WAL@-c2-
  Head                     831    4.17 GB       1416    7.13 GB
  WAL modification         846    2.36 GB       1712    3.31 GB

  -Patch-             -tps@-c4-  -WAL@-c4-  -tps@-c8-  -WAL@-c8-
  Head                    2196   11.01 GB       2758   13.88 GB
  WAL modification        3295    5.87 GB       5472    9.02 GB
 
 And test results on normal pgbench?

As the performance readings showed no gain for the original pgbench, I
thought it was not mandatory.
However, I will run normal pgbench as well, to confirm that the patch does not
lead to any dip there.
Thanks for pointing this out.

With Regards,
Amit Kapila.






Re: [HACKERS] Re: patch submission: truncate trailing nulls from heap rows to reduce the size of the null bitmap [Review]

On 9 January 2013 12:06, Amit Kapila amit.kap...@huawei.com wrote:
 On Wednesday, January 09, 2013 4:52 PM Simon Riggs wrote:
 On 24 December 2012 16:57, Amit Kapila amit.kap...@huawei.com wrote:

  Performance: Average of 3 runs of pgbench in tps
  9.3devel  |  with trailing null patch
  ----------+--------------------------
  578.9872  |   573.4980

 On balance, it would seem optimizing for this special case would
 affect everybody negatively; not much, but enough. Which means we
 should reject this patch.

 Do you have a reason why a different view might be taken?

 I have tried to dig why this gap is coming. I have observed that there is
 very less change in normal path.
 I wanted to give it some more time to exactly find if something can be done
 to avoid performance dip in normal execution.

 Right now I am busy in certain other work. But definitely in coming week or
 so, I shall spare time to work on it again.

Perhaps. Not every idea produces useful outcomes. Even after your
excellent research, it appears we haven't made this work yet. It's a
shame. Should we invest more time? It's considered rude to advise
others how to spend their time, but let me say this: we simply don't
have enough time to do everything and we need to be selective,
prioritising our time on to the things that look to give the best
benefit.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services



Re: [HACKERS] Extra XLOG in Checkpoint for StandbySnapshot

On 2013-01-09 15:06:04 +0530, Amit Kapila wrote:
 On Wednesday, January 09, 2013 2:28 PM Andres Freund wrote:
  On 2013-01-09 14:04:32 +0530, Amit Kapila wrote:
   On Tuesday, January 08, 2013 8:57 PM Andres Freund wrote:
On 2013-01-08 20:33:28 +0530, Amit Kapila wrote:
 On Tuesday, January 08, 2013 8:01 PM Andres Freund wrote:
  On 2013-01-08 19:51:39 +0530, Amit Kapila wrote:
   On Monday, January 07, 2013 7:15 PM Andres Freund wrote:
On 2013-01-07 19:03:35 +0530, Amit Kapila wrote:
 On Monday, January 07, 2013 6:30 PM Simon Riggs wrote:
  On 7 January 2013 12:39, Amit Kapila
amit.kap...@huawei.com
wrote:
 
   
The information that no transactions are currently running
allows
  you
to
build a recovery snapshot, without that information the
  standby
  won't
start answering queries. Now that doesn't matter if all
standbys
already
have built a snapshot, but the primary cannot know that.
  
   Can't we make sure that checkpoint operation doesn't happen
  for
below
  conds.
   a. nothing has happened during or after last checkpoint
   OR
   b. nothing except snapshotstanby WAL has happened
  
   Currently it is done for point a.
  
Having to issue a checkpoint while ensuring transactions
  are
  running
just to get a standby up doesn't seem like a good idea to
  me :)
  
   Simon:
If you make the correct test, I'd be more inclined to
  accept
the
  premise.
  
   Not sure, what exact you are expecting from test?
   The test is do any one operation on system and then keep the
system
  idle.
   Now at each checkpoint interval, it logs WAL for
  SnapshotStandby.
 
  I can't really follow what you want to do here. The snapshot is
only
  logged if a checkpoint is performed anyway?  As recovery starts
  at
(the
  logical) checkpoint's location we need to log a snapshot
  exactly
  there. If you want to avoid activity when the system is idle
  you
need
  to
  prevent checkpoints from occurring itself.

 Even if the checkpoint is scheduled, it doesn't perform actual
operation if
 there's nothing logged between
 current and previous checkpoint due to below check in
CreateCheckPoint()
 function.
 if (curInsert == ControlFile->checkPoint +
 MAXALIGN(SizeOfXLogRecord +
sizeof(CheckPoint)) &&
 ControlFile->checkPoint ==
 ControlFile->checkPointCopy.redo)

 But if we set the wal_level as hot_standby, it will log snapshot,
  now
next
 time again when function CreateCheckPoint()
 will get called due to scheduled checkpoint, the above check will
fail and
 it will again log snapshot, so this will continue, even if the
  system
is
 totally idle.
 I understand that it doesn't cause any problem, but I think it is
better if
 the repeated log of snapshot in this scenario can be avoided.
   
ISTM in that case you just need a way to cope with the
  additionally
logged record in the above piece of code. Not logging seems to be
  the
entirely wrong way to go at this.
  
   I think one of the ways code can be modified is as below:
  
   + /* size of running transactions log when there is no
   active transaction */
   +if (!shutdown && XLogStandbyInfoActive())
   +{
   +runningXactXLog =
   MAXALIGN(MinSizeOfXactRunningXacts) + SizeOfXLogRecord;
   +}
  
   !if (curInsert == ControlFile->checkPoint +
   !MAXALIGN(SizeOfXLogRecord +
  sizeof(CheckPoint)) &&
   !ControlFile->checkPoint ==
   ControlFile->checkPointCopy.redo)
  
   !if (curInsert == ControlFile->checkPoint +
   !MAXALIGN(SizeOfXLogRecord +
  sizeof(CheckPoint)) &&
   !ControlFile->checkPoint ==
   ControlFile->checkPointCopy.redo + runningXactXLog)
  
   Second condition is checking the last checkpoint WAL position with
  the
   current one.
   Since  ControlFile->checkPointCopy.redo holds the value before
  running
   Xact WAL was inserted
   and ControlFile->checkPoint holds the value after running Xact WAL
  got
   inserted, so if no new WAL was inserted apart from running Xacts
  and
   Checkpoint WAL, then this condition will be true.
  
  
  I don't think that's safe, there could have been another record inserted
  that happens to be MinSizeOfXactRunningXacts big and we would still
  skip the checkpoint.
 
 I think that can happen only when a checkpoint is triggered for the first
 time, and even then the first part of the check (curInsert ==
 ControlFile->checkPoint + MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)))
 will fail.
 
 Value to runningXactXLog will be
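
The skip check discussed in this thread can be modelled in isolation as follows. This is a simplified sketch, not xlog.c code: ControlState, WalPos and both size constants are hypothetical stand-ins, and the byte counts are made up for illustration.

```c
#include <stdbool.h>
#include <stdint.h>

typedef uint64_t WalPos;        /* simplified WAL position, in bytes */

/* Illustrative sizes only; the real values depend on struct layouts. */
#define CHECKPOINT_REC_SIZE 96  /* ~ MAXALIGN(SizeOfXLogRecord + sizeof(CheckPoint)) */
#define RUNNING_XACTS_SIZE  56  /* ~ MAXALIGN(MinSizeOfXactRunningXacts) + SizeOfXLogRecord */

typedef struct
{
    WalPos checkPoint;          /* start of the last checkpoint record */
    WalPos redo;                /* redo pointer of the last checkpoint */
} ControlState;

/* Existing check: only the checkpoint record itself was written. */
static bool
skip_original(WalPos curInsert, const ControlState *c)
{
    return curInsert == c->checkPoint + CHECKPOINT_REC_SIZE &&
           c->checkPoint == c->redo;
}

/*
 * Proposed check: additionally tolerate one minimum-size running-xacts
 * record between the redo pointer and the checkpoint record.  It compares
 * positions only, so it cannot tell what kind of record filled that gap,
 * which is the objection raised in the reply.
 */
static bool
skip_proposed(WalPos curInsert, const ControlState *c, bool standby_info)
{
    WalPos extra = standby_info ? RUNNING_XACTS_SIZE : 0;

    return curInsert == c->checkPoint + CHECKPOINT_REC_SIZE &&
           c->checkPoint == c->redo + extra;
}
```

On an idle system with standby information enabled, the checkpoint record starts RUNNING_XACTS_SIZE bytes after the redo pointer, so skip_original() returns false on every scheduled checkpoint while skip_proposed() returns true.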

Re: [HACKERS] [PATCH 1/2] Provide a common malloc wrappers and palloc et al. emulation for frontend'ish environs

2013-01-09 Thread Magnus Hagander

Am I the only one who finds this way of posting patches really annoying?

Here is a patch with no description other than a list of changed
files. And discussion happens in a completely different email.

What's wrong with just posting the patch as a regular attachment(s) to
a regular thread, like other people do?

Yes, I'm well aware that some mailers thread them right. Our
archives code does. But many MUAs don't. But even when using a MUA
that threads it correctly, I find it quite annoying that you have to
open one email to read the comments and a different email to view the
patch.

It may be just me. But it may be others as well, so I figured I should
raise the issue :)

//Magnus


On Wed, Jan 9, 2013 at 12:27 PM, Andres Freund and...@2ndquadrant.com wrote:
 ---
  contrib/oid2name/oid2name.c|  52 +
  contrib/pg_upgrade/pg_upgrade.h|   5 +-
  contrib/pg_upgrade/util.c  |  49 -
  contrib/pgbench/pgbench.c  |  54 +-
  src/backend/utils/mmgr/mcxt.c  |  78 +++-
  src/bin/initdb/initdb.c|  40 +-
  src/bin/pg_basebackup/pg_basebackup.c  |   2 +-
  src/bin/pg_basebackup/pg_receivexlog.c |   1 +
  src/bin/pg_basebackup/receivelog.c |   1 +
  src/bin/pg_basebackup/streamutil.c |  38 +-
  src/bin/pg_basebackup/streamutil.h |   4 -
  src/bin/pg_ctl/pg_ctl.c|  39 +-
  src/bin/pg_dump/Makefile   |   6 +-
  src/bin/pg_dump/common.c   |   1 -
  src/bin/pg_dump/compress_io.c  |   1 -
  src/bin/pg_dump/dumpmem.c  |  76 ---
  src/bin/pg_dump/dumpmem.h  |  22 --
  src/bin/pg_dump/dumputils.h|   1 +
  src/bin/pg_dump/pg_backup_archiver.c   |   1 -
  src/bin/pg_dump/pg_backup_custom.c |   2 +-
  src/bin/pg_dump/pg_backup_db.c |   1 -
  src/bin/pg_dump/pg_backup_directory.c  |   1 -
  src/bin/pg_dump/pg_backup_null.c   |   1 -
  src/bin/pg_dump/pg_backup_tar.c|   1 -
  src/bin/pg_dump/pg_dump.c  |   1 -
  src/bin/pg_dump/pg_dump_sort.c |   1 -
  src/bin/pg_dump/pg_dumpall.c   |   1 -
  src/bin/pg_dump/pg_restore.c   |   1 -
  src/bin/psql/common.c  |  50 -
  src/bin/psql/common.h  |  10 +--
  src/bin/scripts/common.c   |  49 -
  src/bin/scripts/common.h   |   5 +-
  src/include/port/palloc.h  |  19 +
  src/include/utils/palloc.h |  12 +--
  src/port/Makefile  |   8 +-
  src/port/dirmod.c  |  75 +--
  src/port/palloc.c  | 130 
 +
  37 files changed, 218 insertions(+), 621 deletions(-)
  delete mode 100644 src/bin/pg_dump/dumpmem.c
  delete mode 100644 src/bin/pg_dump/dumpmem.h
  create mode 100644 src/include/port/palloc.h
  create mode 100644 src/port/palloc.c







-- 
 Magnus Hagander
 Me: http://www.hagander.net/
 Work: http://www.redpill-linpro.com/



[HACKERS] Commitfest Topics

I've set up commitfest topics for CFJan15.

This will allow people to move across any patches from earlier commitfests.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training  Services



Re: [HACKERS] [PATCH 1/2] Provide a common malloc wrappers and palloc et al. emulation for frontend'ish environs

On 2013-01-09 13:34:12 +0100, Magnus Hagander wrote:
> Am I the only one who finds this way of posting patches really annoying?

Well, I unsurprisingly don't ;)

> Here is a patch with no description other than a list of changed
> files. And discussion happens in a completely different email.

They contain the commit message - which in most of the cases is more
informative than the one just posted, which was definitely rather
short. It should look like e.g.

http://archives.postgresql.org/message-id/1352942234-3953-11-git-send-email-andres%402ndquadrant.com

> What's wrong with just posting the patch as a regular attachment(s) to
> a regular thread, like other people do?

Two issues:
- If you have a bigger series of patches (like the whole logical
  decoding thing) posting all patches in a single mail makes the
  following thread even harder to follow than is currently the
  case. Note how even in this, far smaller, case the discussion actually
  happened in the appropriate subthreads. I find it much easier to
  reread an old thread that way to reassure myself what was
  discussed.
- mhonarc does really strange things if you attach two git created
  patches (splits them into multiple mails)

> It may be just me. But it may be others as well, so I figured I should
> raise the issue :)

I am happy to comply with whatever others prefer.

Greetings,

Andres Freund

--
Andres Freund http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services


Re: [HACKERS] [PATCH 1/2] Provide a common malloc wrappers and palloc et al. emulation for frontend'ish environs

2013-01-09 Thread Magnus Hagander

On Wed, Jan 9, 2013 at 1:47 PM, Andres Freund and...@2ndquadrant.com wrote:
> On 2013-01-09 13:34:12 +0100, Magnus Hagander wrote:
>> Am I the only one who finds this way of posting patches really annoying?
>
> Well, I unsurprisingly don't ;)

Yeah, that's not surprising :)

>> Here is a patch with no description other than a list of changed
>> files. And discussion happens in a completely different email.
>
> They contain the commit message - which in most of the cases is more
> informative than the one just posted, which was definitely rather
> short. It should look like e.g.
>
> http://archives.postgresql.org/message-id/1352942234-3953-11-git-send-email-andres%402ndquadrant.com

They are really two different issues - the posting a patch without a
description, and the separation of threads. It's when they are
combined together that it becomes *really* annoying :) When it's posted
as a separate email *with* a better commit message it's at least
easier to start a discussion off it. But I still find it much more
annoying than just posting the patch in-thread.

>> What's wrong with just posting the patch as a regular attachment(s) to
>> a regular thread, like other people do?

Yes. So one thread per patch. That's what you already have. That's not
a factor of how the patches are posted, that's just a factor of how
many threads you break it up in. I can agree that posting 20 different
patches in the same thread is even worse :)

> - mhonarc does really strange things if you attach two git created
> patches (splits them into multiple mails)

mhonarc does a lot of strange things. But this part is actually not
mhonarc's fault - it's majordomo that writes them into an mbox file in
a format that you can't see the difference between the patch and the
different message. Heck, it quite often gets it wrong even if you just
post *one* patch when it's generated by git.

This is handled better by the new archives code.

>> It may be just me. But it may be others as well, so I figured I should
>> raise the issue :)
>
> I am happy to comply with whatever others prefer.

Yeah, so far it's also just my opinion in the other direction :)
Hopefully, some others will have thoughts about it too.

--
Magnus Hagander
Me: http://www.hagander.net/
Work: http://www.redpill-linpro.com/


[HACKERS] askpass program for libpq

2013-01-09 Thread Peter Eisentraut

I would like to have something like ssh-askpass for libpq.  The main
reason is that I don't want to have passwords in plain text on disk,
even if .pgpass is read protected.  By getting the password from an
external program, I can integrate libpq tools with the host system's key
chain or wallet thing, which stores passwords encrypted.

I'm thinking about adding a new connection option askpass with
environment variable PGASKPASS.  One thing I haven't quite figured out
is how to make this ask for passwords only if needed.  Maybe it needs
two connection options, one to say which program to use and one to say
whether to use it.

Ideas?
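
As a starting point for discussion, obtaining a password from an external helper could look roughly like the sketch below. This is not libpq code: read_password_from_program() is a hypothetical helper, PGASKPASS is only the name proposed above, and the convention that the helper prints the password as its first output line is an assumption.

```c
#define _POSIX_C_SOURCE 200809L /* for popen() and pclose() */
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/*
 * Run `command` through the shell and return the first line it prints,
 * without the trailing newline.  Returns NULL if the program cannot be
 * started, prints nothing, or exits with a nonzero status.  The caller
 * frees the result.
 */
char *
read_password_from_program(const char *command)
{
    char  buf[256];
    char *result = NULL;
    FILE *fp = popen(command, "r");

    if (fp == NULL)
        return NULL;

    if (fgets(buf, sizeof(buf), fp) != NULL)
    {
        buf[strcspn(buf, "\r\n")] = '\0';   /* strip the line ending */
        result = malloc(strlen(buf) + 1);
        if (result != NULL)
            strcpy(result, buf);
    }

    /* Treat a failing helper as "no password available". */
    if (pclose(fp) != 0)
    {
        free(result);
        return NULL;
    }
    return result;
}
```

A caller might pass getenv("PGASKPASS") as the command when that variable is set, and fall back to the normal prompt when the function returns NULL.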



Re: [HACKERS] [PATCH 1/2] Provide a common malloc wrappers and palloc et al. emulation for frontend'ish environs

2013-01-09 Thread Michael Paquier

On Wed, Jan 9, 2013 at 9:54 PM, Magnus Hagander mag...@hagander.net wrote:

 On Wed, Jan 9, 2013 at 1:47 PM, Andres Freund and...@2ndquadrant.com
 wrote:

Yeah, so far it's also just my opinion in the other direction :)
 Hopefully, some others will have thoughts about it too.

Just giving my 2c here...

Instead of posting 5~7 patches at the same time, why not limit
the number of patches published at once to a lower number (max
2~3)? The logical replication implementation can surely be broken down into
many more pieces that could be reviewed carefully one by one, and in a way
that would make the implementation steps clearer than they are now for all the
people on this ML.
OK, this would make the review process longer, but the good point is that
some hackers who specialize only in some areas of the PG code would be
able to give valuable feedback.
-- 
Michael Paquier
http://michael.otacoo.com

Re: [HACKERS] Extra XLOG in Checkpoint for StandbySnapshot

On Wednesday, January 09, 2013 5:49 PM Andres Freund wrote:
 On 2013-01-09 15:06:04 +0530, Amit Kapila wrote:
  On Wednesday, January 09, 2013 2:28 PM Andres Freund wrote:
   On 2013-01-09 14:04:32 +0530, Amit Kapila wrote:
On Tuesday, January 08, 2013 8:57 PM Andres Freund wrote:
 On 2013-01-08 20:33:28 +0530, Amit Kapila wrote:
  On Tuesday, January 08, 2013 8:01 PM Andres Freund wrote:
   On 2013-01-08 19:51:39 +0530, Amit Kapila wrote:
On Monday, January 07, 2013 7:15 PM Andres Freund wrote:
 On 2013-01-07 19:03:35 +0530, Amit Kapila wrote:
  On Monday, January 07, 2013 6:30 PM Simon Riggs
 wrote:
   On 7 January 2013 12:39, Amit Kapila
 amit.kap...@huawei.com
 wrote:
  

 The information that no transactions are currently
 running
 allows
   you
 to
 build a recovery snapshot, without that information the
   standby
   won't
 start answering queries. Now that doesn't matter if all
 standbys
 already
 have built a snapshot, but the primary cannot know
 that.
   
Can't we make sure that checkpoint operation doesn't
 happen
   for
 below
   conds.
a. nothing has happened during or after last checkpoint
OR
b. nothing except snapshotstanby WAL has happened
   
Currently it is done for point a.
   
 Having to issue a checkpoint while ensuring
 transactions
   are
   running
 just to get a standby up doesn't seem like a good idea
 to
   me :)
   
Simon:
 If you make the correct test, I'd be more inclined to
   accept
 the
   premise.
   
Not sure, what exact you are expecting from test?
The test is do any one operation on system and then keep
 the
 system
   idle.
Now at each checkpoint interval, it logs WAL for
   SnapshotStandby.
  
   I can't really follow what you want to do here. The
 snapshot is
 only
   logged if a checkpoint is performed anyway?  As recovery
 starts
   at
 (the
   logical) checkpoint's location we need to log a snapshot
   exactly
   there. If you want to avoid activity when the system is
 idle
   you
 need
   to
   prevent checkpoints from occurring itself.
 
  Even if the checkpoint is scheduled, it doesn't perform
 actual
 operation if
  there's nothing logged between
  current and previous checkpoint due to below check in
 CreateCheckPoint()
  function.
  if (curInsert == ControlFile->checkPoint +
  MAXALIGN(SizeOfXLogRecord +
 sizeof(CheckPoint)) &&
  ControlFile->checkPoint ==
  ControlFile->checkPointCopy.redo)
 
  But if we set the wal_level as hot_standby, it will log
 snapshot,
   now
 next
  time again when function CreateCheckPoint()
  will get called due to scheduled checkpoint, the above check
 will
 fail and
  it will again log snapshot, so this will continue, even if
 the
   system
 is
  totally idle.
  I understand that it doesn't cause any problem, but I think
 it is
 better if
  the repeated log of snapshot in this scenario can be avoided.

 ISTM in that case you just need a way to cope with the
   additionally
 logged record in the above piece of code. Not logging seems to
 be
   the
 entirely wrong way to go at this.
   
I think one of the ways code can be modified is as below:
   
+   /* size of running transactions log when there is
 no
active transaction */
+if (!shutdown && XLogStandbyInfoActive())
+{
+runningXactXLog =
MAXALIGN(MinSizeOfXactRunningXacts) + SizeOfXLogRecord;
+}
   
!if (curInsert == ControlFile->checkPoint +
!MAXALIGN(SizeOfXLogRecord +
   sizeof(CheckPoint)) &&
!ControlFile->checkPoint ==
ControlFile->checkPointCopy.redo)
   
!if (curInsert == ControlFile->checkPoint +
!MAXALIGN(SizeOfXLogRecord +
   sizeof(CheckPoint)) &&
!ControlFile->checkPoint ==
ControlFile->checkPointCopy.redo + runningXactXLog)
   
Second condition is checking the last checkpoint WAL position
 with
   the
current one.
Since  ControlFile->checkPointCopy.redo holds the value before
   running
Xact WAL was inserted
and ControlFile->checkPoint holds the value after running Xact
 WAL
   got
inserted, so if no new WAL was inserted apart from running
 Xacts
   and
Checkpoint WAL, then this condition will be true.
   
  
   I don't think that's safe, there could have been another record
 inserted
   that happens to be MinSizeOfXactRunningXacts big and we would still
   skip the checkpoint.
 
  I think such can happen

Re: [HACKERS] [PATCH 1/2] Provide a common malloc wrappers and palloc et al. emulation for frontend'ish environs

On 2013-01-09 22:23:25 +0900, Michael Paquier wrote:
 On Wed, Jan 9, 2013 at 9:54 PM, Magnus Hagander mag...@hagander.net wrote:
 
  On Wed, Jan 9, 2013 at 1:47 PM, Andres Freund and...@2ndquadrant.com
  wrote:
 
 Yeah, so far it's also just my opinion in the other direction :)
  Hopefully, some others will have thoughts about it too.
 
 Just giving my 2c here...
 
 Instead of posting multiple 5~7 patches at the same time, why not limiting
 the number of patches published at the same time to a lower number (max
 2~3)?

I tried to do this. ilist, binaryheap and this thread... ;)

 The logical replication implementation can be surely broken down into
 many more pieces that could be reviewed carefully one by one, and in a way
 that would make the implementation steps clearer than it is now for all the
 people of this ML.

I don't really see any additional useful splits from the ones made last
round. The relfilenode stuff could be submitted separately but that's
about it and it's seemingly not all that useful itself (I personally want
the (tablespace, relfilenode) => reloid mapping function independently,
but I seem to be alone). It's hard to get useful review for patches which
don't have a patch using the facility nearby in my experience.

Where do you see a useful split?

Greetings,

Andres Freund

-- 
 Andres Freund http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services



Re: [HACKERS] pg_upgrade with parallel tablespace copying

2013-01-09 Thread Bruce Momjian


Slightly modified patch applied.  This is my last planned pg_upgrade
change for 9.3.

---

On Mon, Jan  7, 2013 at 10:51:21PM -0500, Bruce Momjian wrote:
 Pg_upgrade by default (without --link) copies heap/index files from the
 old to new cluster.  This patch implements parallel heap/index file
 copying in pg_upgrade using the --jobs option.  It uses the same
 infrastructure used for pg_upgrade parallel dump/restore.  Here are the
 performance results:
 
            --- seconds ---
   GB        git    patched
    2      62.09      63.75
    4      95.93     107.22
    8     194.96     195.29
   16     494.38     348.93
   32     983.28     644.23
   64    2227.73    1244.08
  128    4735.83    2547.09
 
 Because of the kernel cache, you only see a big win when the amount of
 copy data exceeds the kernel cache.  For testing, I used a 24GB, 16-core
 machine with two magnetic disks with one tablespace on each.  Using more
 tablespaces would yield larger improvements.  My test script is
 attached.  
 
 I consider this patch ready for application.  This is the last
 pg_upgrade performance improvement idea I am considering.
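
The general idea of running one copy job per tablespace under a --jobs style limit can be sketched as below. This is not pg_upgrade's implementation: run_tasks_parallel() and the two demo task functions are hypothetical, and a fork-based worker model on a POSIX system is assumed.

```c
#define _POSIX_C_SOURCE 200809L
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Reap one child: 1 if it failed, 0 if it succeeded, -1 if none was reaped. */
static int
reap_one(void)
{
    int status;

    if (wait(&status) < 0)
        return -1;
    return (WIFEXITED(status) && WEXITSTATUS(status) == 0) ? 0 : 1;
}

/*
 * Run task(0) .. task(ntasks - 1), each in its own child process, with at
 * most max_jobs children alive at once.  Returns the number of tasks that
 * failed or could not be started.
 */
int
run_tasks_parallel(int ntasks, int max_jobs, int (*task) (int))
{
    int running = 0;
    int failures = 0;

    if (max_jobs < 1)
        max_jobs = 1;

    for (int i = 0; i < ntasks; i++)
    {
        /* At the job limit: wait for one worker to finish first. */
        if (running >= max_jobs)
        {
            int r = reap_one();

            if (r >= 0)
            {
                running--;
                failures += r;
            }
        }

        pid_t pid = fork();

        if (pid == 0)
            _exit(task(i) == 0 ? 0 : 1);    /* child: do the work, then exit */
        if (pid < 0)
            failures++;                     /* could not start a worker */
        else
            running++;
    }

    /* Collect the remaining workers. */
    while (running > 0)
    {
        int r = reap_one();

        if (r < 0)
            break;
        running--;
        failures += r;
    }
    return failures;
}

/* Demo tasks standing in for "copy the files of tablespace tblnum". */
static int
demo_copy_ok(int tblnum)
{
    (void) tblnum;
    return 0;
}

static int
demo_copy_fail_on_odd(int tblnum)
{
    return (tblnum % 2) ? -1 : 0;
}
```

With only two tablespaces on two disks, as in the test setup described above, at most two workers do useful work no matter how large the job limit is.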
 
 -- 
   Bruce Momjian  br...@momjian.us        http://momjian.us
   EnterpriseDB http://enterprisedb.com
 
   + It's impossible for everything to be true. +

 diff --git a/contrib/pg_upgrade/check.c b/contrib/pg_upgrade/check.c
 new file mode 100644
 index 59f8fd0..1780788
 *** a/contrib/pg_upgrade/check.c
 --- b/contrib/pg_upgrade/check.c
 *** create_script_for_old_cluster_deletion(c
 *** 606,612 
   fprintf(script, RMDIR_CMD " %s\n", 
 fix_path_separator(old_cluster.pgdata));
   
   /* delete old cluster's alternate tablespaces */
 ! for (tblnum = 0; tblnum < os_info.num_tablespaces; tblnum++)
   {
   /*
* Do the old cluster's per-database directories share a 
 directory
 --- 606,612 
   fprintf(script, RMDIR_CMD " %s\n", 
 fix_path_separator(old_cluster.pgdata));
   
   /* delete old cluster's alternate tablespaces */
 ! for (tblnum = 0; tblnum < os_info.num_old_tablespaces; tblnum++)
   {
   /*
* Do the old cluster's per-database directories share a 
 directory
 *** create_script_for_old_cluster_deletion(c
 *** 621,634 
   /* remove PG_VERSION? */
   if (GET_MAJOR_VERSION(old_cluster.major_version) <= 804)
   fprintf(script, RM_CMD " %s%s%cPG_VERSION\n",
 ! 
 fix_path_separator(os_info.tablespaces[tblnum]), 
   
 fix_path_separator(old_cluster.tablespace_suffix),
   PATH_SEPARATOR);
   
   for (dbnum = 0; dbnum < old_cluster.dbarr.ndbs; dbnum++)
   {
   fprintf(script, RMDIR_CMD " %s%s%c%d\n",
 ! 
 fix_path_separator(os_info.tablespaces[tblnum]),
   
 fix_path_separator(old_cluster.tablespace_suffix),
   PATH_SEPARATOR, 
 old_cluster.dbarr.dbs[dbnum].db_oid);
   }
 --- 621,634 
   /* remove PG_VERSION? */
   if (GET_MAJOR_VERSION(old_cluster.major_version) <= 804)
   fprintf(script, RM_CMD " %s%s%cPG_VERSION\n",
 ! 
 fix_path_separator(os_info.old_tablespaces[tblnum]), 
   
 fix_path_separator(old_cluster.tablespace_suffix),
   PATH_SEPARATOR);
   
   for (dbnum = 0; dbnum < old_cluster.dbarr.ndbs; dbnum++)
   {
   fprintf(script, RMDIR_CMD " %s%s%c%d\n",
 ! 
 fix_path_separator(os_info.old_tablespaces[tblnum]),
   
 fix_path_separator(old_cluster.tablespace_suffix),
   PATH_SEPARATOR, 
 old_cluster.dbarr.dbs[dbnum].db_oid);
   }
 *** create_script_for_old_cluster_deletion(c
 *** 640,646 
* or a version-specific subdirectory.
*/
   fprintf(script, RMDIR_CMD " %s%s\n",
 ! 
 fix_path_separator(os_info.tablespaces[tblnum]), 
   
 fix_path_separator(old_cluster.tablespace_suffix));
   }
   
 --- 640,646 
* or a version-specific subdirectory.
*/
   fprintf(script, RMDIR_CMD

Re: [HACKERS] [PATCH 1/2] Provide a common malloc wrappers and palloc et al. emulation for frontend'ish environs

2013-01-09 Thread Alvaro Herrera


How hard is the backend hit by palloc being now an additional function
call?  Would it be a good idea to make it (and friends) STATIC_IF_INLINE?
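
For reference, a minimal sketch of the STATIC_IF_INLINE pattern being asked about, assuming the c.h convention where the macro expands to `static inline` when PG_USE_INLINE is defined. The function `sketch_alloc` and its exit-on-failure behaviour are hypothetical stand-ins and are not the palloc from the patch; the non-inline fallback, where the definition would live in a single .c file, is left out.

```c
#include <stdio.h>
#include <stdlib.h>

/* Assumption: defined locally so the sketch compiles on its own. */
#define PG_USE_INLINE 1

#ifdef PG_USE_INLINE
#define STATIC_IF_INLINE static inline
#else
#define STATIC_IF_INLINE
#endif

/*
 * Hypothetical palloc-like wrapper. Because it is "static inline" in the
 * header, callers can have the body inlined instead of paying for an
 * extra function call.
 */
STATIC_IF_INLINE void *
sketch_alloc(size_t size)
{
	/* Ask for at least one byte so a zero-size request still gets a pointer. */
	void	   *p = malloc(size ? size : 1);

	if (p == NULL)
	{
		fprintf(stderr, "out of memory\n");
		exit(EXIT_FAILURE);
	}
	return p;
}
```

Whether inlining actually pays off for the backend would still have to be measured; the sketch only shows the mechanics.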

 diff --git a/src/include/port/palloc.h b/src/include/port/palloc.h
 new file mode 100644
 index 000..a7900bf
 --- /dev/null
 +++ b/src/include/port/palloc.h
 @@ -0,0 +1,19 @@
 +/*
 + *   common.h
 + *   Common support routines for bin/scripts/
 + *
 + *   Copyright (c) 2003-2013, PostgreSQL Global Development Group
 + *
 + *   src/bin/scripts/common.h
 + */

You forgot to update the above comment.


-- 
Álvaro Herrera                http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] askpass program for libpq

2013-01-09 Thread Magnus Hagander

On Wed, Jan 9, 2013 at 2:17 PM, Peter Eisentraut pete...@gmx.net wrote:
 I would like to have something like ssh-askpass for libpq.  The main
 reason is that I don't want to have passwords in plain text on disk,
 even if .pgpass is read protected.  By getting the password from an
 external program, I can integrate libpq tools with the host system's key
 chain or wallet thing, which stores passwords encrypted.

Sounds very useful.


 I'm thinking about adding a new connection option askpass with
 environment variable PGASKPASS.  One thing I haven't quite figured out
 is how to make this ask for passwords only if needed.  Maybe it needs
 two connection options, one to say which program to use and one to say
 whether to use it.

 Ideas?

You could call it basically where conn->password_needed is set today.
So instead of dropping it directly back to the user, call the
callback, try again, and drop back to the user only if it doesn't
work.

That means it gets called only after the connection to the server is
established, but that seems reasonable given that that's the only case
when you can get a password prompt as well... You don't know the
server is going to ask for a password until it gets that far.
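
A rough sketch of that flow, assuming a callback that returns a password or NULL. Every name here is hypothetical and none of it is libpq API; `sketch_try_connect` is a stub that stands in for a real connection attempt against a server expecting the password "secret".

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical callback type: returns a password, or NULL if none is available. */
typedef const char *(*sketch_askpass_cb) (void *arg);

typedef enum
{
	SKETCH_OK,
	SKETCH_NEEDS_PASSWORD,
	SKETCH_FAILED
} sketch_status;

/* Stub for one connection attempt; the pretend server wants "secret". */
static sketch_status
sketch_try_connect(const char *password)
{
	if (password == NULL)
		return SKETCH_NEEDS_PASSWORD;	/* the point where password_needed is set */
	return strcmp(password, "secret") == 0 ? SKETCH_OK : SKETCH_FAILED;
}

/*
 * Try without a password first. Only if the server asks for one, call the
 * callback and retry once. Any result other than SKETCH_OK is handed back
 * to the caller, who can then prompt the user as today.
 */
static sketch_status
sketch_connect(sketch_askpass_cb cb, void *arg)
{
	sketch_status st = sketch_try_connect(NULL);

	if (st == SKETCH_NEEDS_PASSWORD && cb != NULL)
	{
		const char *pw = cb(arg);

		if (pw != NULL)
			st = sketch_try_connect(pw);
	}
	return st;
}

/* Trivial callback for illustration: hands back whatever string it was given. */
static const char *
sketch_fixed_password(void *arg)
{
	return (const char *) arg;
}
```

With no callback registered the result is SKETCH_NEEDS_PASSWORD, which matches today's behaviour of dropping straight back to the user.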

In fact, might it be interesting to allow libpq to do a simple
callback for the password *as well*, to implement a password prompt
directly in the application instead of having to make multiple
connections? So not just as an external command, but also available as
a direct callback.
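
The external-command side could be one particular callback built on POSIX popen(). A minimal sketch under that assumption follows; `sketch_run_askpass` is a hypothetical helper, not an existing libpq function, and it treats a non-zero exit status from the program as "no password".

```c
#define _POSIX_C_SOURCE 200809L

#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/*
 * Run "command" through the shell and copy the first line of its output
 * into buf, without the trailing newline. Returns 0 on success and -1 if
 * the program could not be run, printed nothing, or exited non-zero.
 */
static int
sketch_run_askpass(const char *command, char *buf, size_t buflen)
{
	FILE	   *fp;
	int			ok;

	if (command == NULL || buf == NULL || buflen == 0)
		return -1;

	fp = popen(command, "r");
	if (fp == NULL)
		return -1;

	ok = (fgets(buf, (int) buflen, fp) != NULL);
	if (pclose(fp) != 0)
		ok = 0;

	if (!ok)
	{
		/* Do not leave a partial password behind on failure. */
		memset(buf, 0, buflen);
		return -1;
	}

	buf[strcspn(buf, "\r\n")] = '\0';
	return 0;
}
```

An application would pass in the program named by the proposed askpass option or PGASKPASS variable, for example `sketch_run_askpass(getenv("PGASKPASS"), buf, sizeof buf)`, and hand the result to whatever password hook libpq ends up offering.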


--
 Magnus Hagander
 Me: http://www.hagander.net/
 Work: http://www.redpill-linpro.com/



[HACKERS] Re: [PATCH 1/2] Provide a common malloc wrappers and palloc et al. emulation for frontend'ish environs