[HACKERS] Re: Vacuum of newly activated 8.3.12 standby receives warnings "page xxx is uninitialized" --- fixing

2010-12-30 Thread Mark Kirkwood

Well, it is none of the things I considered.

The problem seems to be due to use of --delete in the base backup 
rsync (see diff attached).  In fact I can now reproduce the 
uninitialized pages using the bare bones method:


primary:
$ grep archive_command postgresql.conf
 archive_command = 'rsync %p standby:/var/lib/postgresql/archive'
$ pgbench -c 4 -t 20 bench
(wait for approx 1 transactions)

standby:
$ psql -h primary -c "SELECT pg_start_backup('backup');"
$ rsync --exclude pg_xlog/\* --exclude postmaster.pid --delete 
--exclude=backup_label \

primary:/var/lib/postgresql/8.3/main/* \
/var/lib/postgresql/8.3/main
$ psql -h primary -c "SELECT pg_stop_backup();"

$ grep restore_command recovery.conf
restore_command = '/usr/lib/postgresql/8.3/bin/pg_standby -t 
/tmp/trigger.5432 /var/lib/postgresql/archive %f %p %r'

$ /etc/init.d/postgresql-8.3 start
(wait for approx 14 transactions)
$ touch /tmp/trigger.5432

Removing the offending

--delete --exclude=backup_label

options from the base backup step makes everything work properly again.

I'd be interested to know if the other folks getting these warnings were 
using unusual rsync options either during backup or for archiving.


regards

Mark

On 30/12/10 13:32, Mark Kirkwood wrote:


I'm frankly puzzled about what Pitrtools is doing that is different - 
I only noticed it using rsync compression (-z) and doing rsync backups 
via pulling from the standby rather than pushing from the primary (I'm 
in the process of trying these variations out in the bare bones case). 
Just as I'm writing this I see Pitrtools rsyncs pg_xlog - I wonder if 
there are timing issues which mean that recovery might use some 
(corrupted) logs from there before the (clean) archived ones arrive 
(will check).




*** cmd_standby.orig	Tue Dec 28 21:10:31 2010
--- cmd_standby	Thu Dec 30 05:20:04 2010
***************
*** 175,181 ****
  
    if debug == "on":
       ssh_flags = "-vvv -o ConnectTimeout=%s -o StrictHostKeyChecking=no" % (str(ssh_timeout))
!      rsync_flags = "-avzl --delete --stats --exclude=backup_label"
       pg_standby_flags = "-s5 -w0 -d -c"
       if pgversion == '8.2':
         pg_standby_args = "%%f %%p -k%s" % (float(numarchives))
--- 175,181 ----
  
    if debug == "on":
       ssh_flags = "-vvv -o ConnectTimeout=%s -o StrictHostKeyChecking=no" % (str(ssh_timeout))
!      rsync_flags = "-a"
       pg_standby_flags = "-s5 -w0 -d -c"
       if pgversion == '8.2':
         pg_standby_args = "%%f %%p -k%s" % (float(numarchives))
***************
*** 184,190 ****
  
    else:
       ssh_flags = "-o ConnectTimeout=%s -o StrictHostKeyChecking=no" % (str(ssh_timeout))
!      rsync_flags = "-azl --delete --exclude=backup_label"
       pg_standby_flags = "-s5 -w0 -c"
       if pgversion == '8.2':
         pg_standby_args = "%%f %%p -k%s" % (float(numarchives))
--- 184,190 ----
  
    else:
       ssh_flags = "-o ConnectTimeout=%s -o StrictHostKeyChecking=no" % (str(ssh_timeout))
!      rsync_flags = "-azl"
       pg_standby_flags = "-s5 -w0 -c"
       if pgversion == '8.2':
         pg_standby_args = "%%f %%p -k%s" % (float(numarchives))

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] pg_streamrecv for 9.1?

2010-12-30 Thread Magnus Hagander
On Wed, Dec 29, 2010 at 22:30, Dimitri Fontaine dimi...@2ndquadrant.fr wrote:
 Magnus Hagander mag...@hagander.net writes:
 Would people be interested in putting pg_streamrecv
 (http://github.com/mhagander/pg_streamrecv) in bin/ or contrib/ for
 9.1? I think it would make sense to do so.

 +1 for having that in core, only available for the roles WITH
 REPLICATION I suppose?

Yes.

Well, anybody who wants can run it, but they need those permissions on
the server to make it work. pg_streamrecv is entirely a client app.


 I think that the base backup feature is more important than simply streaming
 chunks of the WAL (SR already does this). Talking about the base backup over
 libpq, it is something we should implement to satisfy the people asking for
 an easy replication setup.

 Yes, definitely. But that also needs server side support.

 Yeah, but it's already in core for 9.1, we have pg_read_binary_file()
 there. We could propose a contrib module for previous version
 implementing the function in C, that should be pretty easy to code.

Oh. I didn't actually think about that one. So yeah, we could use that
- making it easy to code. However, I wonder how much less efficient it
would be than being able to stream the base backup. It's going to be a
*lot* more roundtrips across the network, and we're also going to
open/close the files a lot more.

Also, I haven't tested it, but a quick look at the code makes me
wonder how it will actually work with tablespaces - it seems to only
allow files under PGDATA? That could of course be changed..


  The only reason I didn't do that for pg_basebackup is that I wanted a
  self-contained python script, so that offering a public git repo is
  all I needed as far as distributing the tool goes.

Right, there's an advantage with that when it comes to being able to
work on old versions.


 Yeah, the WIP patch Heikki posted is similar, except it uses tar
 format and is implemented natively in the backend with no need for
 pl/pythonu to be installed.

 As of HEAD the dependency on pl/whatever is easily removed.

 The included C tool would need to have a parallel option from the get-go
 if at all possible, but if you look at the pg_basebackup prototype, it
 would be good to drop the wrong pg_xlog support in there and rely on a
 proper archiving setup on the master.

 Do you want to work on internal archive and restore commands over libpq
 in the same effort too?  I think this tool should be either a one time
 client or a daemon with support for:

Definitely a one-time client. If you want it to be a daemon, you write
a small wrapper that makes it one :)


  - running a base backup when receiving a signal
  - continuous WAL streaming from a master

Yes.

  - accepting standby connections and streaming to them

I see that as a separate tool, I think. But still a useful one, sure.

  - one-time libpq streaming of a WAL file, either way

Hmm. That might be interesting, yes.


 Maybe we don't need to daemonize the tool from the get-go, but if you're
 going parallel for the base-backup case you're almost there, aren't you?
 Also having internal commands for archive and restore commands that rely
 on this daemon running would be great too.

I don't want anything *relying* on this tool. I want to keep the
current way where you can choose whatever you prefer - I just want us
to ship a good default tool.


 I'd offer more help if it wasn't for finishing the extension patches,

:-) Yeah, focus on that, please - don't want to get it stalled.

-- 
 Magnus Hagander
 Me: http://www.hagander.net/
 Work: http://www.redpill-linpro.com/



Re: [HACKERS] pg_streamrecv for 9.1?

2010-12-30 Thread Magnus Hagander
On Wed, Dec 29, 2010 at 20:19, Gurjeet Singh singh.gurj...@gmail.com wrote:
 On Wed, Dec 29, 2010 at 1:42 PM, Robert Haas robertmh...@gmail.com wrote:

 On Dec 29, 2010, at 1:01 PM, Tom Lane t...@sss.pgh.pa.us wrote:
  Is it really stable enough for bin/?  My impression of the state of
  affairs is that there is nothing whatsoever about replication that
  is really stable yet.

 Well, that's not stopping us from shipping a core feature called
 replication.  I'll defer to others on how mature pg_streamrecv is, but if
 it's no worse than replication in general I think putting it in bin/ is the
 right thing to do.

 As the README says, it is not self-contained (through no fault of its own), and
 one should typically set archive_command to guarantee zero WAL loss.

Yes. Though you can combine it fine with wal_keep_segments if you
think that's safe - but archive_command is push and this tool is pull,
so if your backup server goes down for a while, pg_streamrecv will get
a gap and fail. Whereas if you configure an archive_command, it will
queue up the log on the master if it stops working, up to the point of
shutting it down because of out-of-disk. Which you *want*, if you want
to be really sure about the backups.


 <quote>
 TODO: Document some ways of setting up an archive_command that works well
 together with pg_streamrecv.
 </quote>

     I think implementing just that TODO might make it a candidate.

Well, yes, that's obviously a requirement.

     I have neither used it nor read the code, but if it works as advertised
 then it is definitely a +1 from me; no preference of bin/ or contrib/, since
 the community will have to maintain it anyway.

It's not that much code, but some more eyes on it would always be good!


-- 
 Magnus Hagander
 Me: http://www.hagander.net/
 Work: http://www.redpill-linpro.com/



Re: [HACKERS] pg_streamrecv for 9.1?

2010-12-30 Thread Magnus Hagander
On Wed, Dec 29, 2010 at 19:42, Robert Haas robertmh...@gmail.com wrote:
 On Dec 29, 2010, at 1:01 PM, Tom Lane t...@sss.pgh.pa.us wrote:
 Is it really stable enough for bin/?  My impression of the state of
 affairs is that there is nothing whatsoever about replication that
 is really stable yet.

 Well, that's not stopping us from shipping a core feature called 
 replication.  I'll defer to others on how mature pg_streamrecv is, but if 
 it's no worse than replication in general I think putting it in bin/ is the 
 right thing to do.

It has had fewer eyes on it, which puts it worse off than general
replication. OTOH, it's a lot simpler code, which puts it better off.

Either way, as long as it gets those eyes before release if we put it
in, it shouldn't be worse off than general replication.


-- 
 Magnus Hagander
 Me: http://www.hagander.net/
 Work: http://www.redpill-linpro.com/



Re: [HACKERS] Re: new patch of MERGE (merge_204) a question about duplicated ctid

2010-12-30 Thread Marko Tiikkaja

On 2010-12-30 9:02 AM +0200, Greg Smith wrote:

Marko Tiikkaja wrote:

I have no idea why it worked in the past, but the patch was never
designed to work for UPSERT.  This has been discussed in the past and
some people thought that that's not a huge deal.


It takes an excessively large lock when doing UPSERT, which means its
performance under a heavy concurrent load can't be good.  The idea is
that if the syntax and general implementation issues can get sorted out,
fixing the locking can be a separate performance improvement to be
implemented later.  Using MERGE for UPSERT is the #1 use case for this
feature by a gigantic margin.  If that doesn't do what's expected, the
whole implementation doesn't provide the community anything really worth
talking about.  That's why I keep hammering on this particular area in
all my testing.


I'm confused.  Are you saying that the patch is supposed to lock the 
table against concurrent INSERT/UPDATE/DELETE/MERGE?  Because I don't 
see it in the patch, and the symptoms you're having are a clear 
indication of the fact that it's not happening.  I also seem to recall 
that people thought locking the table would be excessive.



Regards,
Marko Tiikkaja



Re: [HACKERS] Streaming replication as a separate permissions

2010-12-30 Thread Magnus Hagander
On Wed, Dec 29, 2010 at 20:12, Alvaro Herrera
alvhe...@commandprompt.com wrote:
 Excerpts from Magnus Hagander's message of Wed Dec 29 11:40:34 -0300 2010:
 On Wed, Dec 29, 2010 at 15:05, Gurjeet Singh singh.gurj...@gmail.com wrote:

  Any specific reason NOREPLICATION_P and REPLICATION_P use the _P suffix?

 Um, I just copied it off a similar entry elsewhere. I saw no comment
 about what _P actually means, and I can't say I know. I know very
 little about the bison files :-)

 Some lexer keywords have a _P suffix because otherwise they'd collide
 with some symbol in Windows header files or something like that.  It's
 old stuff, but I think you, Magnus, were around at that time.

Heh. That doesn't mean I *remember* it :-)

But yes, I see in commit 12c942383296bd626131241c012c2ab81b081738 the
comment convert some keywords.c symbols to KEYWORD_P to prevent
conflict.

Based on that, I should probably change it back, right? I just tried a
patch for it and it compiles and checks just fine with the _P parts
removed.

-- 
 Magnus Hagander
 Me: http://www.hagander.net/
 Work: http://www.redpill-linpro.com/



Re: [HACKERS] pg_streamrecv for 9.1?

2010-12-30 Thread Aidan Van Dyk
On Thu, Dec 30, 2010 at 6:41 AM, Magnus Hagander mag...@hagander.net wrote:

 As the README says, it is not self-contained (through no fault of its own), and
 one should typically set archive_command to guarantee zero WAL loss.

 Yes. Though you can combine it fine with wal_keep_segments if you
 think that's safe - but archive_command is push and this tool is pull,
 so if your backup server goes down for a while, pg_streamrecv will get
 a gap and fail. Whereas if you configure an archive_command, it will
 queue up the log on the master if it stops working, up to the point of
 shutting it down because of out-of-disk. Which you *want*, if you want
 to be really sure about the backups.

I was thinking I'd like to use pg_streamrecv to make my archive, and
the archive script on the master would just verify the archive has
that complete segment.

This gets you an archive synced as it's made (as long as streamrecv
is running), and my verifyarchive command would make sure that if
for some reason the backup archive went down, the wal segments
would be blocked on the master until it's up again and current.

a.



-- 
Aidan Van Dyk                                             Create like a god,
ai...@highrise.ca                                       command like a king,
http://www.highrise.ca/                                   work like a slave.



[HACKERS] Snapshot synchronization, again...

2010-12-30 Thread Joachim Wieland
The snapshot synchronization discussion from the parallel pg_dump
patch somehow ended without a clear way to go forward.

Let me sum up what has been brought up and propose a short- and
longterm solution.

Summary:

Passing snapshot sync information can be done either:

a) by returning complete snapshot information from the backend to the
client so that the client can pass it along to a different backend
b) or by returning only a unique identifier to the client and storing
the actual snapshot data somewhere on the server side

Advantage of a: no memory is used in the backend and no memory needs
to get cleaned up, it is also theoretically possible that we could
forward that data to a hot standby server and do e.g. a dump partially
on the master server and partially on the hot standby server or among
several hot standby servers.
Disadvantage of a: The snapshot must be validated to make sure that
its information is still current, it might be difficult to cover all
cases of this validation. A client can not only access exactly a
published snapshot, but just about any snapshot that fits and passes
the validation checks (this is more a disadvantage than an advantage
because it allows to see a database state that never existed in
reality).

Advantage of b: No validation necessary, as soon as the transaction
that publishes the snapshot loses that snapshot, it will also revoke
the snapshot information (either by removing a temp file or deleting
it from shared memory)
Disadvantage of b: It doesn't allow a snapshot to be installed on a
different server. It requires a serializable open transaction to hold
the snapshot.

What I am proposing now is the following:

We return snapshot information as a chunk of data to the client. At
the same time however, we set a checksum in shared memory to protect
against modification of the snapshot. A publishing backend can revoke
its snapshot by deleting the checksum and a backend that is asked to
install a snapshot can verify that the snapshot is correct and current
by calculating the checksum and comparing it with the one in shared
memory.

This only costs us a few bytes for the checksum times max_connections in
shared memory, and apart from resetting the checksum it does not have
cleanup or verification issues. Note that we are also free to change
the internal format of the chunk of data we return whenever we like,
so we are free to enhance this feature in the future, transparently to
the client.
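A minimal Python sketch of the scheme above (purely illustrative: a plain dict
stands in for the per-backend checksum slots in shared memory, SHA-256 is an
arbitrary checksum choice, and the snapshot is treated as an opaque byte string):

```python
import hashlib

# Stand-in for the shared-memory checksum slots (one per backend,
# i.e. max_connections entries in the real proposal).
shared_checksums = {}

def publish_snapshot(backend_id, snapshot_bytes):
    """Record the snapshot's checksum and hand the chunk to the client."""
    shared_checksums[backend_id] = hashlib.sha256(snapshot_bytes).hexdigest()
    return snapshot_bytes  # opaque chunk returned to the client

def revoke_snapshot(backend_id):
    """Publishing backend loses the snapshot: delete the checksum."""
    shared_checksums.pop(backend_id, None)

def install_snapshot(snapshot_bytes):
    """A backend asked to adopt a snapshot recomputes the checksum and
    accepts it only if some publishing backend still advertises it."""
    digest = hashlib.sha256(snapshot_bytes).hexdigest()
    return digest in shared_checksums.values()
```

Both a modified snapshot chunk and a revoked snapshot then fail the
install check, without any semantic validation of the snapshot contents.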


Thoughts?


Joachim



Re: [HACKERS] pg_streamrecv for 9.1?

2010-12-30 Thread Magnus Hagander
On Thu, Dec 30, 2010 at 13:30, Aidan Van Dyk ai...@highrise.ca wrote:
 On Thu, Dec 30, 2010 at 6:41 AM, Magnus Hagander mag...@hagander.net wrote:

 As the README says, it is not self-contained (through no fault of its own), and
 one should typically set archive_command to guarantee zero WAL loss.

 Yes. Though you can combine it fine with wal_keep_segments if you
 think that's safe - but archive_command is push and this tool is pull,
 so if your backup server goes down for a while, pg_streamrecv will get
 a gap and fail. Whereas if you configure an archive_command, it will
 queue up the log on the master if it stops working, up to the point of
 shutting it down because of out-of-disk. Which you *want*, if you want
 to be really sure about the backups.

 I was thinking I'd like to use pg_streamrecv to make my archive, and
 the archive script on the master would just verify the archive has
 that complete segment.

 This gets you an archive synced as it's made (as long as streamrecv
 is running), and my verifyarchive command would make sure that if
 for some reason the backup archive went down, the wal segments
 would be blocked on the master until it's up again and current.

That's exactly the method I was envisioning, and in fact what I am
using in a couple of cases - just haven't documented it properly :)

Since pg_streamrecv only moves a segment into the correct archive
location once it's complete, the archive_command only needs to check
whether the file *exists* - if it does, it has been transferred; if not,
the command returns an error so the WAL segments don't get cleaned out.
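A sketch of that existence check, relying on pg_streamrecv's behaviour of
only renaming completed segments into the archive directory (the function
name and archive path are illustrative, not part of any shipped tool):

```python
import os

def segment_archived(archive_dir, segment_name):
    """archive_command check: treat the segment as safely archived only
    if pg_streamrecv has already moved the completed file into place.
    Returning False maps to a non-zero exit status, which makes the
    master keep the segment until the archive catches up."""
    return os.path.exists(os.path.join(archive_dir, segment_name))
```

Wrapped in a tiny script, this could be wired up as something like
archive_command = 'check_archived.py %f' (script name hypothetical),
where a non-zero exit keeps the segment queued on the master.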

-- 
 Magnus Hagander
 Me: http://www.hagander.net/
 Work: http://www.redpill-linpro.com/



Re: [HACKERS] Re: Vacuum of newly activated 8.3.12 standby receives warnings "page xxx is uninitialized" --- fixing

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 3:55 AM, Mark Kirkwood
mark.kirkw...@catalyst.net.nz wrote:
 Well, it is none of the things I considered.

 The problem seems to be due to use of --delete in the base backup rsync
 (see diff attached).  In fact I can now reproduce the uninitialized pages
 using the bare bones method:

Any time a relation is extended, we end up with a page of all zeros at
the end until the updated page is written out, which often doesn't
happen until the next checkpoint.  So it doesn't seem too mysterious
that you could end up with all zeroes pages on the standby initially,
but WAL replay ought to fix that.  I suppose the reason it isn't is
because you've excluded the backup label, so recovery will begin from
the wrong place.  Unless I'm missing something, that seems like a
really bad idea.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company



[HACKERS] Old git repo

2010-12-30 Thread Magnus Hagander
Hi!

Are we ready to drop the old git mirror? The one that's still around
(as postgresql-old.git) from before we migrated the main repository to
git, and thus has the old hashes around.

-- 
 Magnus Hagander
 Me: http://www.hagander.net/
 Work: http://www.redpill-linpro.com/



Re: [HACKERS] Streaming replication as a separate permissions

2010-12-30 Thread Alvaro Herrera
Excerpts from Magnus Hagander's message of Thu Dec 30 08:57:09 -0300 2010:
 On Wed, Dec 29, 2010 at 20:12, Alvaro Herrera
 alvhe...@commandprompt.com wrote:

  Some lexer keywords have a _P suffix because otherwise they'd collide
  with some symbol in Windows header files or something like that.  It's
  old stuff, but I think you, Magnus, were around at that time.
 
 Heh. That doesn't mean I *remember* it :-)

:-)

 But yes, I see in commit 12c942383296bd626131241c012c2ab81b081738 the
 comment "convert some keywords.c symbols to KEYWORD_P to prevent
 conflict".

Wow, what a mess of a patch ... nowadays this would be like 10 commits
(or so I hope) ... hey, did Bruce sabotage the qnx4 port surreptitiously?

 Based on that, I should probably change it back, right? I just tried a
 patch for it and it compiles and checks just fine with the _P parts
 removed.

Hmm, I wouldn't bother really.  It's not that important anyway, IMHO.

-- 
Álvaro Herrera alvhe...@commandprompt.com
The PostgreSQL Company - Command Prompt, Inc.
PostgreSQL Replication, Consulting, Custom Development, 24x7 support



Re: [HACKERS] Function for dealing with xlog data

2010-12-30 Thread Magnus Hagander
On Tue, Dec 28, 2010 at 16:30, Tom Lane t...@sss.pgh.pa.us wrote:
 Alvaro Herrera alvhe...@commandprompt.com writes:
 Excerpts from Magnus Hagander's message of Tue Dec 28 10:46:31 -0300 2010:
 Well, yeah, that was obvious ;) The question is, how much do we prefer
 the more elegant method? ;)

 If we go the new type route, do we need it to have an implicit cast to
 text, for backwards compatibility?

 I'd argue not.  Probably all existing uses are just selecting the
 function value.  What comes back to the client will just be the text
 form anyway.

That's certainly the only thing I've seen.


 I'm of the opinion that a new type isn't worth the work, myself,
 but it would mostly be up to whoever was doing the work.

Fair enough - at least enough people have said it won't be rejected
because it's done as a function rather than a datatype - so that seems
like the easiest way to proceed.

-- 
 Magnus Hagander
 Me: http://www.hagander.net/
 Work: http://www.redpill-linpro.com/



Re: [HACKERS] Old git repo

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 8:31 AM, Magnus Hagander mag...@hagander.net wrote:
 Are we ready to drop the old git mirror? The one that's still around
 (as postgresql-old.git) from before we migrated the main repository to
 git, and thus has the old hashes around.

I see no reason to drop that ever, or at least not any time soon.
What is it costing us?

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company



Re: [HACKERS] Old git repo

2010-12-30 Thread Magnus Hagander
On Thu, Dec 30, 2010 at 15:28, Robert Haas robertmh...@gmail.com wrote:
 On Thu, Dec 30, 2010 at 8:31 AM, Magnus Hagander mag...@hagander.net wrote:
 Are we ready to drop the old git mirror? The one that's still around
 (as postgresql-old.git) from before we migrated the main repository to
 git, and thus has the old hashes around.

 I see no reason to drop that ever, or at least not any time soon.
 What is it costing us?

Some disk space, so almost nothing. And the potential that people grab
it by mistake - it adds a bit to confusion.

Looking at it from the other side, what's the use-case for keeping it?
If you want to diff against it or something like that, you can just
do that against your local clone (that you already had - if you
didn't, you shouldn't be using it at all)...


-- 
 Magnus Hagander
 Me: http://www.hagander.net/
 Work: http://www.redpill-linpro.com/



Re: [HACKERS] Re: new patch of MERGE (merge_204) a question about duplicated ctid

2010-12-30 Thread Andrew Dunstan



On 12/30/2010 02:02 AM, Greg Smith wrote:

Marko Tiikkaja wrote:
I have no idea why it worked in the past, but the patch was never 
designed to work for UPSERT.  This has been discussed in the past and 
some people thought that that's not a huge deal.


It takes an excessively large lock when doing UPSERT, which means its 
performance under a heavy concurrent load can't be good.  The idea is 
that if the syntax and general implementation issues can get sorted 
out, fixing the locking can be a separate performance improvement to 
be implemented later.  Using MERGE for UPSERT is the #1 use case for 
this feature by a gigantic margin.  If that doesn't do what's 
expected, the whole implementation doesn't provide the community 
anything really worth talking about.  That's why I keep hammering on 
this particular area in all my testing.


One of the reflexive "I can't switch to PostgreSQL easily" stopping 
points for MySQL users is "I can't convert my ON DUPLICATE KEY UPDATE 
code."  Every other use for MERGE is a helpful side-effect of adding 
the implementation in my mind, but not the primary driver of why this 
is important.  My hints in this direction before didn't get adopted, 
so I'm saying it outright now:  this patch must have an UPSERT 
implementation in its regression tests.  And the first thing I'm going 
to do every time a new rev comes in is try and break it with the 
pgbench test I attached.  If Boxuan can start doing that as part of 
his own testing, I think development here might start moving forward 
faster.  I don't care so much about the rate at which concurrent 
UPSERT-style MERGE happens, so long as it doesn't crash.  But that's 
where this has been stuck at for a while now.


I strongly agree. It *is* a huge deal.

cheers

andrew



Re: [HACKERS] Snapshot synchronization, again...

2010-12-30 Thread Alvaro Herrera
Excerpts from Joachim Wieland's message of Thu Dec 30 09:31:47 -0300 2010:

 Advantage of b: No validation necessary, as soon as the transaction
 that publishes the snapshot loses that snapshot, it will also revoke
 the snapshot information (either by removing a temp file or deleting
 it from shared memory)
 Disadvantage of b: It doesn't allow a snapshot to be installed on a
 different server. It requires a serializable open transaction to hold
 the snapshot.

Why does it require a serializable transaction?  You could simply
register the snapshot in any transaction.  (Of course, the net effect
would be pretty similar to a serializable transaction).

 We return snapshot information as a chunk of data to the client. At
 the same time however, we set a checksum in shared memory to protect
 against modification of the snapshot. A publishing backend can revoke
 its snapshot by deleting the checksum and a backend that is asked to
 install a snapshot can verify that the snapshot is correct and current
 by calculating the checksum and comparing it with the one in shared
 memory.
 
 This only costs us a few bytes for the checksum * max_connection in
 shared memory and apart from resetting the checksum it does not have
 cleanup and verification issues.

So one registered snapshot per transaction?  Sounds a reasonable
limitation (I doubt there's a use case for more than that, anyway).

-- 
Álvaro Herrera alvhe...@commandprompt.com
The PostgreSQL Company - Command Prompt, Inc.
PostgreSQL Replication, Consulting, Custom Development, 24x7 support



Re: [HACKERS] estimating # of distinct values

2010-12-30 Thread Florian Pflug
On Dec27, 2010, at 23:49 , Kevin Grittner wrote:
 Robert Haas robertmh...@gmail.com wrote:
 
 With respect to (b), I think I'd need to see a much more detailed
 design for how you intend to make this work.  Off the top of my
 head there seems to be some pretty serious feasibility problems.
 
 I had one random thought on that -- it seemed like a large concern
 was that there would need to be at least an occasional scan of the
 entire table to rebuild the distinct value information.

I believe we could actually avoid that.

First, the paper "An Optimal Algorithm for the Distinct Elements Problem"
actually contains an algorithm which *does* handle deletes - it's called
the L_0 estimate there.

Second, as Tomas pointed out, the stream-based estimator is essentially a
simplified version of a bloom filter. It starts out with a field of
N zero bits, and sets K of them to 1 for each value v in the stream.
Which bits are set to 1 depends on some hash function(s) H_i(v). It's
then easy to compute how many 1-bits you'd expect to find in the bit
field after seeing D distinct values, and by reversing that you can
estimate D from the number of 1-bits in the bit field.

To avoid having to rescan large tables, instead of storing one such
bit field, we'd store one per B pages of data. We'd then only need
to scan a range of B pages around every updated or deleted tuple,
and could afterwards compute a new global estimate of D by combining
the individual bit fields with bitwise OR.

Since the need to regularly VACUUM tables hit by updates or deletes
won't go away any time soon, we could piggy-back the bit field
rebuilding onto VACUUM to avoid a second scan.

A good value for B would probably be around
1000 * (size of bit field / page size). If the bit field needs ~100kB,
that'd make B ~= 12000 pages ~= 100MB.
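A toy Python sketch of the per-range bit-field idea, with K=1 hash function
(i.e. linear counting); combining two ranges' fields with bitwise OR yields
the field for their union. All parameters here (m=4096, SHA-1 as the hash)
are illustrative, not a proposed on-disk format:

```python
import hashlib
import math

def bitfield(values, m=4096):
    """One bit field per range of pages: set one bit per value
    (K=1 hash function for simplicity); duplicates hit the same bit."""
    bits = [0] * m
    for v in values:
        h = int(hashlib.sha1(repr(v).encode()).hexdigest(), 16) % m
        bits[h] = 1
    return bits

def estimate_distinct(bits):
    """Invert the expected fill: after D distinct values the zero-bit
    fraction tends to exp(-D/m), so D is approximately -m * ln(zeros/m)."""
    m = len(bits)
    zeros = bits.count(0)
    return float("inf") if zeros == 0 else -m * math.log(zeros / m)

def combine(a, b):
    """The bit field for the union of two page ranges is the bitwise OR
    of their individual bit fields."""
    return [x | y for x, y in zip(a, b)]
```

With m=4096 bits, the estimate for ~1000 distinct values is typically
within a couple of percent, which matches the error bounds quoted for
linear counting at this load factor.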

best regards,
Florian Pflug




Re: [HACKERS] Snapshot synchronization, again...

2010-12-30 Thread Florian Pflug
On Dec30, 2010, at 13:31 , Joachim Wieland wrote:
 We return snapshot information as a chunk of data to the client. At
 the same time however, we set a checksum in shared memory to protect
 against modification of the snapshot. A publishing backend can revoke
 its snapshot by deleting the checksum and a backend that is asked to
 install a snapshot can verify that the snapshot is correct and current
 by calculating the checksum and comparing it with the one in shared
 memory.

We'd still have to stream these checksums to the standbys though,
or would they be exempt from the checksum checks?

I still wonder whether these checks are worth the complexity. I
believe we'd only allow snapshot modifications for read-only queries
anyway, so what point is there in preventing clients from setting
broken snapshots?
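The publish/revoke/install flow quoted above can be modelled in miniature (a hypothetical sketch: the dict standing in for shared memory, the keying by backend id, and the checksum choice are all assumptions, not the proposed implementation):

```python
import hashlib
import hmac

shared_memory = {}   # stands in for the shared-memory slot: backend id -> checksum

def publish_snapshot(backend_id, snapshot_bytes):
    """The publishing backend hands the snapshot to its client and records
    a checksum of it in shared memory."""
    shared_memory[backend_id] = hashlib.sha256(snapshot_bytes).hexdigest()
    return snapshot_bytes

def revoke_snapshot(backend_id):
    """Revoking simply deletes the checksum, invalidating the snapshot."""
    shared_memory.pop(backend_id, None)

def install_snapshot(backend_id, snapshot_bytes):
    """A backend asked to install a snapshot recomputes the checksum and
    compares it with the one in shared memory."""
    expected = shared_memory.get(backend_id)
    actual = hashlib.sha256(snapshot_bytes).hexdigest()
    if expected is None or not hmac.compare_digest(expected, actual):
        raise ValueError("snapshot is stale, revoked, or was modified")

snap = publish_snapshot(42, b"xmin=100 xmax=200 xip=[150,160]")
install_snapshot(42, snap)                 # succeeds: checksum matches
revoke_snapshot(42)
try:
    install_snapshot(42, snap)             # fails: checksum was deleted
except ValueError as err:
    print(err)
```

The open question above is exactly whether this verification step buys anything once snapshot installation is restricted to read-only use.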

best regards,
Florian Pflug



-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Old git repo

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 9:30 AM, Magnus Hagander mag...@hagander.net wrote:
 On Thu, Dec 30, 2010 at 15:28, Robert Haas robertmh...@gmail.com wrote:
 On Thu, Dec 30, 2010 at 8:31 AM, Magnus Hagander mag...@hagander.net wrote:
 Are we ready to drop the old git mirror? The one that's still around
 (as postgresql-old.git) from before we migrated the main repository to
 git, and thus has the old hashes around.

 I see no reason to drop that ever, or at least not any time soon.
 What is it costing us?

 Some disk space, so almost nothing. And the potential that people grab
 it by mistake - it adds a bit to confusion.

Well, if it's clearly labeled "old" I don't think it should confuse
anyone much.  You could even tack one more commit on there adding a
README file with a big ol' warning.

 Looking at it from the other side, what's the use-case for keeping it?
 If you want to diff against it or something like that, you can just
 do that against your local clone (that you already had - if you
 didn't, you shouldn't be using it at all)...

I realize it's not as official as the CVS repository was, but I
still think we ought to hold onto it for a year or two.  Maybe no one
will ever look at it again, but I'm not prepared to bet on that.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Streaming replication as a separate permissions

2010-12-30 Thread Peter Eisentraut
On Thu, 2010-12-23 at 17:29 -0500, Tom Lane wrote:
 Josh Berkus j...@agliodbs.com writes:
  On 12/23/10 2:21 PM, Tom Lane wrote:
  Well, that's one laudable goal here, but secure by default is another
  one that ought to be taken into consideration.
 
  I don't see how *not* granting the superuser replication permissions
  makes things more secure.  The superuser can grant replication
  permissions to itself, so why is suspending them by default beneficial?
   I'm not following your logic here.
 
 Well, the reverse of that is just as true: if we ship it without
 replication permissions on the postgres user, people can change that if
 they'd rather not create a separate role for replication.  But I think
 we should encourage people to NOT do it that way.  Setting it up that
 way by default hardly encourages use of a more secure arrangement.

I think this argument is a bit inconsistent in the extreme.  You might
as well argue that a superuser shouldn't have any permissions by
default, to discourage users from using it.  They can always grant
permissions back to it.  I don't see why this particular one is so
different.

If we go down this road, we'll end up with a mess of permissions that a
superuser has and doesn't have.


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Streaming replication as a separate permissions

2010-12-30 Thread Peter Eisentraut
On Wed, 2010-12-29 at 11:09 +0100, Magnus Hagander wrote:
 I've applied this version (with some minor typo-fixes).

This page is now somewhat invalidated:

http://developer.postgresql.org/pgdocs/postgres/role-attributes.html

First, it doesn't mention the replication privilege, and second it
continues to claim that superuser status bypasses all permission checks.


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Streaming replication as a separate permissions

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 9:54 AM, Peter Eisentraut pete...@gmx.net wrote:
 On Thu, 2010-12-23 at 17:29 -0500, Tom Lane wrote:
 Josh Berkus j...@agliodbs.com writes:
  On 12/23/10 2:21 PM, Tom Lane wrote:
  Well, that's one laudable goal here, but secure by default is another
  one that ought to be taken into consideration.

  I don't see how *not* granting the superuser replication permissions
  makes things more secure.  The superuser can grant replication
  permissions to itself, so why is suspending them by default beneficial?
   I'm not following your logic here.

 Well, the reverse of that is just as true: if we ship it without
 replication permissions on the postgres user, people can change that if
 they'd rather not create a separate role for replication.  But I think
 we should encourage people to NOT do it that way.  Setting it up that
 way by default hardly encourages use of a more secure arrangement.

 I think this argument is a bit inconsistent in the extreme.  You might
 as well argue that a superuser shouldn't have any permissions by
 default, to discourage users from using it.  They can always grant
 permissions back to it.  I don't see why this particular one is so
 different.

 If we go down this road, we'll end up with a mess of permissions that a
 superuser has and doesn't have.

+1.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] SLRU API tweak

2010-12-30 Thread Alvaro Herrera
Excerpts from Kevin Grittner's message of Wed Dec 29 20:46:55 -0300 2010:
 Attached is a small patch to avoid putting an opaque structure into
 the slru.h file and using it in an external function call where
 external callers must always specify NULL.

Thanks, committed.

-- 
Álvaro Herrera alvhe...@commandprompt.com
The PostgreSQL Company - Command Prompt, Inc.
PostgreSQL Replication, Consulting, Custom Development, 24x7 support

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


RIGHT/FULL OUTER hash joins (was Re: [HACKERS] small table left outer join big table)

2010-12-30 Thread Tom Lane
I had an epiphany about this topic, or actually two of them.

1. Whether or not you think there's a significant performance reason
to support hash right joins, there's a functionality reason.  The
infrastructure for right join could just as easily do full joins.
And AFAICS, a hash full join would only require one hashable join
clause --- the other FULL JOIN ON conditions could be anything at all.
This is unlike the situation for merge join, where all the JOIN ON
conditions have to be mergeable or it doesn't work right.  So we could
greatly reduce the scope of the dreaded "FULL JOIN is only supported
with merge-joinable join conditions" error.  (Well, okay, it's not
*that* dreaded, but people complain about it occasionally.)

2. The obvious way to implement this would involve adding an extra bool
field to struct HashJoinTupleData.  The difficulty with that, and the
reason I'd been resistant to the whole idea, is that it'd eat up a full
word per hashtable entry because of alignment considerations.  (On
64-bit machines it'd be free because of alignment considerations, but
that's cold comfort when 32-bit machines are the ones pressed for
address space.)  But we only need one bit, so what about commandeering
an infomask bit in the tuple itself?  For the initial implementation
I'd be inclined to take one of the free bits in t_infomask2.  We could
actually get away with overlaying the flag bit with one of the tuple
visibility bits, since it will only be used in tuples that are in the
in-memory hash table, which don't need visibility info anymore.  But
that seems like a kluge that could wait until we really need the flag
space.
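Both points can be illustrated with a toy hash full join that tracks a per-entry matched flag, the analogue of the proposed infomask bit (everything here is an illustrative model, not the executor code):

```python
from collections import defaultdict

def hash_full_join(inner_rows, outer_rows, key):
    """FULL JOIN via a hash table built on the inner side.  Each entry carries
    a matched flag so a final pass can emit inner rows that never found a
    partner; only the key equality needs to be hashable."""
    table = defaultdict(list)
    for row in inner_rows:
        table[row[key]].append([row, False])        # [tuple, matched-bit]

    results = []
    for row in outer_rows:
        entries = table.get(row[key])
        if entries:
            for entry in entries:
                results.append((row, entry[0]))
                entry[1] = True                     # set the matched bit
        else:
            results.append((row, None))             # outer row, no partner

    for entries in table.values():                  # second pass over the table
        for row, matched in entries:
            if not matched:
                results.append((None, row))         # inner row, no partner
    return results

inner = [{"id": 1, "v": "a"}, {"id": 2, "v": "b"}]
outer = [{"id": 2, "w": "x"}, {"id": 3, "w": "y"}]
for pair in hash_full_join(inner, outer, "id"):
    print(pair)
```

The space question in point 2 is where that per-entry flag lives: a separate bool per hash entry costs a padded word, whereas a spare tuple infomask bit costs nothing.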

Comments?

regards, tom lane

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Robert Haas
On Wed, Dec 29, 2010 at 5:14 PM, David Fetter da...@fetter.org wrote:
 On Wed, Dec 29, 2010 at 04:53:47PM -0500, Robert Haas wrote:
 On Wed, Dec 29, 2010 at 4:09 AM, Heikki Linnakangas
 heikki.linnakan...@enterprisedb.com wrote:
  On 29.12.2010 06:54, Robert Haas wrote:
 
   With the patch:
 
  rhaas=# cluster v;
  ERROR:  views do not support CLUSTER
 
  "do not support" sounds like a missing feature, rather than a nonsensical
  command. How about something like "CLUSTER cannot be used on views"?

 In the latest version of this patch, I created four translatable
 strings per object type:

 "blats do not support %s" (where %s is an SQL command)
 "blats do not support constraints"
 "blats do not support rules"
 "blats do not support triggers"

 It's reasonable enough to write "CLUSTER cannot be used on views", but
 "constraints cannot be used on views" seems more awkward to me.
 Or do we think that's OK?

 That particular one looks good insofar as it describes reality.  With
 predicate locks, etc., it may become untrue later, though :)

After further thought, I think it makes sense to change this around a
bit and create a family of functions that can be invoked like this:

void check_relation_for_FEATURE_support(Relation rel);

...where FEATURE is constraint, trigger, rule, index, etc.  The
function will be defined to throw an error if the relation isn't of a
type that can support the named feature.  The error message will be of
the form:

constraints can only be used on tables
triggers can be used only on tables and views
etc.

This avoids the need to define a separate error message for each
unsupported relkind, and I think it's just as informative as, e.g.,
"constraints cannot be used on" whatever object type you tried to
invoke it on.  We can adopt the same language for commands, e.g.:
"CLUSTER can only be used on tables."
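A hypothetical model of such a function family (the function name, the feature table, and the message wording here are illustrative, not the actual patch; relkind letters follow the pg_class convention):

```python
RELKIND_NAMES = {"r": "table", "v": "view", "S": "sequence", "i": "index"}

# which relkinds each feature supports (an assumed mapping for illustration)
SUPPORTED = {
    "constraints": {"r"},
    "triggers": {"r", "v"},
    "rules": {"r", "v"},
}

def check_relation_for_feature_support(feature, relkind):
    """Raise with a single per-feature message when the relkind doesn't
    support the feature, instead of one message per unsupported relkind."""
    allowed = SUPPORTED[feature]
    if relkind not in allowed:
        names = " and ".join(RELKIND_NAMES[k] + "s" for k in sorted(allowed))
        raise ValueError(f"{feature} can only be used on {names}")

check_relation_for_feature_support("triggers", "r")      # OK: tables have triggers
try:
    check_relation_for_feature_support("constraints", "v")
except ValueError as e:
    print(e)   # constraints can only be used on tables
```

The translation cost is then one string per feature rather than one per feature-relkind pair.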

Comments?

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: RIGHT/FULL OUTER hash joins (was Re: [HACKERS] small table left outer join big table)

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 10:45 AM, Tom Lane t...@sss.pgh.pa.us wrote:
 I had an epiphany about this topic, or actually two of them.

 1. Whether or not you think there's a significant performance reason
 to support hash right joins, there's a functionality reason.  The
 infrastructure for right join could just as easily do full joins.
 And AFAICS, a hash full join would only require one hashable join
 clause --- the other FULL JOIN ON conditions could be anything at all.
 This is unlike the situation for merge join, where all the JOIN ON
 conditions have to be mergeable or it doesn't work right.  So we could
 greatly reduce the scope of the dreaded "FULL JOIN is only supported
 with merge-joinable join conditions" error.  (Well, okay, it's not
 *that* dreaded, but people complain about it occasionally.)

Yeah, that would be neat.  It might be a lot faster in some cases, too.

 2. The obvious way to implement this would involve adding an extra bool
 field to struct HashJoinTupleData.  The difficulty with that, and the
 reason I'd been resistant to the whole idea, is that it'd eat up a full
 word per hashtable entry because of alignment considerations.  (On
 64-bit machines it'd be free because of alignment considerations, but
 that's cold comfort when 32-bit machines are the ones pressed for
 address space.)  But we only need one bit, so what about commandeering
 an infomask bit in the tuple itself?  For the initial implementation
 I'd be inclined to take one of the free bits in t_infomask2.  We could
 actually get away with overlaying the flag bit with one of the tuple
 visibility bits, since it will only be used in tuples that are in the
 in-memory hash table, which don't need visibility info anymore.  But
 that seems like a kluge that could wait until we really need the flag
 space.

I think that's a reasonable approach, although I might be inclined to
do the overlay sooner rather than later if it doesn't add too much
complexity.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Streaming replication as a separate permissions

2010-12-30 Thread Tom Lane
Magnus Hagander mag...@hagander.net writes:
 But yes, I see in commit 12c942383296bd626131241c012c2ab81b081738 the
 comment convert some keywords.c symbols to KEYWORD_P to prevent
 conflict.

 Based on that, I should probably change it back, right? I just tried a
 patch for it and it compiles and checks just fine with the _P parts
 removed.

I'd leave it be, it's fine as-is.

regards, tom lane

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Alvaro Herrera
Excerpts from Robert Haas's message of Thu Dec 30 12:47:42 -0300 2010:

 After further thought, I think it makes sense to change this around a
 bit and create a family of functions that can be invoked like this:
 
 void check_relation_for_FEATURE_support(Relation rel);
 
 ...where FEATURE is constraint, trigger, rule, index, etc.  The
 function will be defined to throw an error if the relation isn't of a
 type that can support the named feature.  The error message will be of
 the form:
 
 constraints can only be used on tables
 triggers can be used only on tables and views
 etc.

So this will create a combinatorial explosion of strings to translate?
I liked the other idea because the number of translatable strings was
kept within reasonable bounds.

-- 
Álvaro Herrera alvhe...@commandprompt.com
The PostgreSQL Company - Command Prompt, Inc.
PostgreSQL Replication, Consulting, Custom Development, 24x7 support

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Old git repo

2010-12-30 Thread Tom Lane
Robert Haas robertmh...@gmail.com writes:
 On Thu, Dec 30, 2010 at 9:30 AM, Magnus Hagander mag...@hagander.net wrote:
 On Thu, Dec 30, 2010 at 15:28, Robert Haas robertmh...@gmail.com wrote:
 I see no reason to drop that ever, or at least not any time soon.
 What is it costing us?

 Some disk space, so almost nothing. And the potential that people grab
 it by mistake - it adds a bit to confusion.

 I realize it's not as official as the CVS repository was, but I
 still think we ought to hold onto it for a year or two.  Maybe no one
 will ever look at it again, but I'm not prepared to bet on that.

I'm with Magnus on this: the risk of confusion seems to greatly
outweigh any possible benefit from keeping it.  There is no reason for
anyone to use that old repo unless they are still working with a local
clone of it, and even if they do have a local clone, such a clone is
self-sufficient.  And more to the point, it seems quite unlikely that
anyone is still working with such a clone rather than having rebased
by now.

We should wait a week or so to see if anyone does pipe up and say they
still use that repo; but in the absence of such feedback, it should go.

regards, tom lane

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 11:00 AM, Alvaro Herrera
alvhe...@commandprompt.com wrote:
 Excerpts from Robert Haas's message of Thu Dec 30 12:47:42 -0300 2010:

 After further thought, I think it makes sense to change this around a
 bit and create a family of functions that can be invoked like this:

 void check_relation_for_FEATURE_support(Relation rel);

 ...where FEATURE is constraint, trigger, rule, index, etc.  The
 function will be defined to throw an error if the relation isn't of a
 type that can support the named feature.  The error message will be of
 the form:

 constraints can only be used on tables
 triggers can be used only on tables and views
 etc.

 So this will create a combinatorial explosion of strings to translate?
 I liked the other idea because the number of translatable strings was
 kept within reasonable bounds.

No, quite the opposite.  With the other approach, you needed:

constraints cannot be used on views
constraints cannot be used on composite types
constraints cannot be used on TOAST tables
constraints cannot be used on indexes
constraints cannot be used on foreign tables

With this, you just need:

constraints can only be used on tables

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Old git repo

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 11:02 AM, Tom Lane t...@sss.pgh.pa.us wrote:
 Robert Haas robertmh...@gmail.com writes:
 On Thu, Dec 30, 2010 at 9:30 AM, Magnus Hagander mag...@hagander.net wrote:
 On Thu, Dec 30, 2010 at 15:28, Robert Haas robertmh...@gmail.com wrote:
 I see no reason to drop that ever, or at least not any time soon.
 What is it costing us?

 Some disk space, so almost nothing. And the potential that people grab
 it by mistake - it adds a bit to confusion.

 I realize it's not as official as the CVS repository was, but I
 still think we ought to hold onto it for a year or two.  Maybe no one
 will ever look at it again, but I'm not prepared to bet on that.

 I'm with Magnus on this: the risk of confusion seems to greatly
 outweigh any possible benefit from keeping it.  There is no reason for
 anyone to use that old repo unless they are still working with a local
 clone of it, and even if they do have a local clone, such a clone is
 self-sufficient.  And more to the point, it seems quite unlikely that
 anyone is still working with such a clone rather than having rebased
 by now.

 We should wait a week or so to see if anyone does pipe up and say they
 still use that repo; but in the absence of such feedback, it should go.

Well, I still have at least one repo against the old repository, which
is why I mentioned it.  Maybe there's nothing valuable in there and
maybe I don't need the origin anyway, but I haven't bothered to check
it over carefully yet because, well, there's no rush to clean up my
old repositories, and there is a rush to finish 9.1 development real
soon now.  I can, of course, carve out time to deal with it, but I
think that it's a poor use of time and that the risk of confusion that
you and Magnus are postulating is mostly hypothetical.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: RIGHT/FULL OUTER hash joins (was Re: [HACKERS] small table left outer join big table)

2010-12-30 Thread Tom Lane
Robert Haas robertmh...@gmail.com writes:
 On Thu, Dec 30, 2010 at 10:45 AM, Tom Lane t...@sss.pgh.pa.us wrote:
 ... But we only need one bit, so what about commandeering
 an infomask bit in the tuple itself?  For the initial implementation
 I'd be inclined to take one of the free bits in t_infomask2.  We could
 actually get away with overlaying the flag bit with one of the tuple
 visibility bits, since it will only be used in tuples that are in the
 in-memory hash table, which don't need visibility info anymore.  But
 that seems like a kluge that could wait until we really need the flag
 space.

 I think that's a reasonable approach, although I might be inclined to
 do the overlay sooner rather than later if it doesn't add too much
 complexity.

Well, there's no coding complexity involved; it's just a question of
which bit we define the macro as.  Any complexity is conceptual.

regards, tom lane

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Tom Lane
Robert Haas robertmh...@gmail.com writes:
 After further thought, I think it makes sense to change this around a
 bit and create a family of functions that can be invoked like this:
 void check_relation_for_FEATURE_support(Relation rel);

That seems like a reasonable idea, but ...

 ... The error message will be of the form:

 constraints can only be used on tables
 triggers can be used only on tables and views
 etc.

 This avoids the need to define a separate error message for each
 unsupported relkind, and I think it's just as informative as, e.g.,
 "constraints cannot be used on" whatever object type you tried to
 invoke it on.  We can adopt the same language for commands, e.g.:
 "CLUSTER can only be used on tables."

ISTM there are four things we might potentially want to state in the
error message: the feature/operation you tried to apply, the name of the
object you tried to apply it to, the type of that object, and the set of
object types that the feature/operation will actually work for.  Our
current wording ("foo" is not a table or view) covers the second and
fourth of these, though the fourth is stated rather awkwardly.  Your
proposal above covers the first and fourth.  I'm not happy about leaving
out the object name, because there are going to be cases where people
get this type of error out of a long sequence or nest of operations and
it's not clear what it's talking about.  It'd probably be okay to leave
out the actual object type as long as you include its name, though.

One possibility is to break it down like this:

ERROR: "foo" is a sequence
DETAIL: Triggers can only be used on tables and views.

This could still be emitted by a function such as you suggest, and
indeed that would be the most practical way from both a consistency
and code-size standpoint.

regards, tom lane

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: RIGHT/FULL OUTER hash joins (was Re: [HACKERS] small table left outer join big table)

2010-12-30 Thread Jie Li
On Thu, Dec 30, 2010 at 11:50 PM, Robert Haas robertmh...@gmail.com wrote:

 On Thu, Dec 30, 2010 at 10:45 AM, Tom Lane t...@sss.pgh.pa.us wrote:
  I had an epiphany about this topic, or actually two of them.
 
  1. Whether or not you think there's a significant performance reason
  to support hash right joins, there's a functionality reason.  The
  infrastructure for right join could just as easily do full joins.
  And AFAICS, a hash full join would only require one hashable join
  clause --- the other FULL JOIN ON conditions could be anything at all.
  This is unlike the situation for merge join, where all the JOIN ON
  conditions have to be mergeable or it doesn't work right.  So we could
  greatly reduce the scope of the dreaded "FULL JOIN is only supported
  with merge-joinable join conditions" error.  (Well, okay, it's not
  *that* dreaded, but people complain about it occasionally.)

 Yeah, that would be neat.  It might be a lot faster in some cases, too.


Yeah, PostgreSQL should have this great feature.

Actually Oracle 10g already has the right hash join,
http://dbcrusade.blogspot.com/2008/01/oracle-hash-join-right-outer.html

 And Oracle 11g has the full hash join.
http://www.dba-oracle.com/oracle11g/oracle_11g_full_hash_join.htm

Haven't checked whether other DBMS have this feature.

Thanks,
Li Jie


[HACKERS] pl/python do not delete function arguments

2010-12-30 Thread Jan Urbański
(continuing the flurry of patches)

Here's a patch that stops PL/Python from removing the function's
arguments from its globals dict after calling it. It's
an incremental patch on top of the plpython-refactor patch sent in
http://archives.postgresql.org/message-id/4d135170.3080...@wulczer.org.

Git branch for this patch:
https://github.com/wulczer/postgres/tree/dont-remove-arguments

Apart from being useless, as the whole dict is unreffed and thus freed
in PLy_procedure_delete, removing args actively breaks things for
recursive invocation of the same function. The recursive callee after
returning will remove the args from globals, and subsequent access to
the arguments in the caller will cause a NameError (see new regression
test in patch).
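The failure mode is easy to reproduce outside PostgreSQL with an ordinary shared dict (a toy model of the old behavior, loosely mimicking what PLy_function_delete_args did, not the actual PL/Python code):

```python
# All invocations of one PL/Python function share a single globals dict;
# the pre-patch code deleted the arguments from it on every return.
func_globals = {}

def call_with_arg_cleanup(fn, n):
    """Run one invocation: install the argument, call the body, then delete
    the argument again on the way out."""
    func_globals["n"] = n
    try:
        return fn()
    finally:
        func_globals.pop("n", None)

def factorial_body():
    n = func_globals["n"]
    if n <= 1:
        return 1
    sub = call_with_arg_cleanup(factorial_body, n - 1)
    # Back in the caller: the recursive callee just deleted "n" from the
    # shared dict, so this lookup fails (a NameError in real PL/Python).
    return func_globals["n"] * sub

try:
    call_with_arg_cleanup(factorial_body, 3)
except KeyError:
    print("argument was clobbered by the recursive call")
```

Dropping the cleanup entirely, as the patch does, is safe because the whole dict is freed when the procedure is deleted.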

Cheers,
Jan
diff --git a/src/pl/plpython/expected/plpython_spi.out b/src/pl/plpython/expected/plpython_spi.out
index 7f4ae5c..cb11f60 100644
*** a/src/pl/plpython/expected/plpython_spi.out
--- b/src/pl/plpython/expected/plpython_spi.out
*** CONTEXT:  PL/Python function result_nro
*** 133,135 
--- 133,163 
   2
  (1 row)
  
+ --
+ -- check recursion with same argument does not clobber globals
+ --
+ CREATE FUNCTION recursion_test(n integer) RETURNS integer
+ AS $$
+ if n in (0, 1):
+     return 1
+ 
+ return n * plpy.execute("select recursion_test(%d) as result" % (n - 1))[0]["result"]
+ $$ LANGUAGE plpythonu;
+ SELECT recursion_test(5);
+  recursion_test 
+ 
+ 120
+ (1 row)
+ 
+ SELECT recursion_test(4);
+  recursion_test 
+ 
+  24
+ (1 row)
+ 
+ SELECT recursion_test(1);
+  recursion_test 
+ 
+   1
+ (1 row)
+ 
diff --git a/src/pl/plpython/plpython.c b/src/pl/plpython/plpython.c
index 67eb0f3..1827fc9 100644
*** a/src/pl/plpython/plpython.c
--- b/src/pl/plpython/plpython.c
*** static Datum PLy_function_handler(Functi
*** 307,313 
  static HeapTuple PLy_trigger_handler(FunctionCallInfo fcinfo, PLyProcedure *);
  
  static PyObject *PLy_function_build_args(FunctionCallInfo fcinfo, PLyProcedure *);
- static void PLy_function_delete_args(PLyProcedure *);
  static PyObject *PLy_trigger_build_args(FunctionCallInfo fcinfo, PLyProcedure *,
  	   HeapTuple *);
  static HeapTuple PLy_modify_tuple(PLyProcedure *, PyObject *,
--- 307,312 
*** PLy_function_handler(FunctionCallInfo fc
*** 988,1001 
  			 */
  			plargs = PLy_function_build_args(fcinfo, proc);
  			plrv = PLy_procedure_call(proc, "args", plargs);
- 			if (!proc->is_setof)
- 			{
- /*
-  * SETOF function parameters will be deleted when last row is
-  * returned
-  */
- PLy_function_delete_args(proc);
- 			}
  			Assert(plrv != NULL);
  		}
  
--- 987,992 
*** PLy_function_handler(FunctionCallInfo fc
*** 1053,1060 
  Py_XDECREF(plargs);
  Py_XDECREF(plrv);
  
- PLy_function_delete_args(proc);
- 
  if (has_error)
  	ereport(ERROR,
  			(errcode(ERRCODE_DATA_EXCEPTION),
--- 1044,1049 
*** PLy_function_build_args(FunctionCallInfo
*** 1267,1287 
  	return args;
  }
  
- 
- static void
- PLy_function_delete_args(PLyProcedure *proc)
- {
- 	int			i;
- 
- 	if (!proc->argnames)
- 		return;
- 
- 	for (i = 0; i < proc->nargs; i++)
- 		if (proc->argnames[i])
- 			PyDict_DelItemString(proc->globals, proc->argnames[i]);
- }
- 
- 
  /* Decide if a cached PLyProcedure struct is still valid */
  static bool
  PLy_procedure_valid(PLyProcedure *proc, HeapTuple procTup)
--- 1256,1261 
diff --git a/src/pl/plpython/sql/plpython_spi.sql b/src/pl/plpython/sql/plpython_spi.sql
index 7f8f6a3..3b65f95 100644
*** a/src/pl/plpython/sql/plpython_spi.sql
--- b/src/pl/plpython/sql/plpython_spi.sql
*** else:
*** 105,107 
--- 105,123 
  $$ LANGUAGE plpythonu;
  
  SELECT result_nrows_test();
+ 
+ 
+ --
+ -- check recursion with same argument does not clobber globals
+ --
+ CREATE FUNCTION recursion_test(n integer) RETURNS integer
+ AS $$
+ if n in (0, 1):
+     return 1
+ 
+ return n * plpy.execute("select recursion_test(%d) as result" % (n - 1))[0]["result"]
+ $$ LANGUAGE plpythonu;
+ 
+ SELECT recursion_test(5);
+ SELECT recursion_test(4);
+ SELECT recursion_test(1);

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Alvaro Herrera
Excerpts from Tom Lane's message of Thu Dec 30 13:49:20 -0300 2010:

 One possibility is to break it down like this:
 
 ERROR: "foo" is a sequence
 DETAIL: Triggers can only be used on tables and views.
 
 This could still be emitted by a function such as you suggest, and
 indeed that would be the most practical way from both a consistency
 and code-size standpoint.

This seems good to me.  There will only be as many messages as relkinds
we have, plus as many as features there are.

-- 
Álvaro Herrera alvhe...@commandprompt.com
The PostgreSQL Company - Command Prompt, Inc.
PostgreSQL Replication, Consulting, Custom Development, 24x7 support

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 11:49 AM, Tom Lane t...@sss.pgh.pa.us wrote:
 One possibility is to break it down like this:

        ERROR: "foo" is a sequence
        DETAIL: Triggers can only be used on tables and views.

 This could still be emitted by a function such as you suggest, and
 indeed that would be the most practical way from both a consistency
 and code-size standpoint.

Great idea.  I should have thought of that.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Stefan Kaltenbrunner

On 12/30/2010 06:26 PM, Simon Riggs wrote:


I've mulled over the design for sync rep for awhile now, and have come
up with a feature set that includes the final detailed feedback from
Fujii Masao, Aidan Van Dyk, Josh Berkus and others.

The design also draws from MySQL concepts to make the two interfaces as
similar and as simple as possible. It should be noted that the design
presented here has many features that the MySQL design does not.

I am currently finishing up my patch to offer these features, so it's
time to begin final discussions.

As an interim step, I enclose a PDF version of relevant excerpts from
the doc patch. The patch will follow on a later post in the near future.

I would like to separate discussions on user interface from that of
internal design, to make it easier for more people to get involved.
Please read the following and post your comments. Thank you.


it would help if this would just be a simple text-only description of 
the design that people can actually comment on inline. I don't think 
sending technical design proposals as a PDF (which seems to be written
in doc-style as well) is a good idea to encourage discussion on -hackers :(



Stefan

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


[HACKERS] Problems with autovacuum and vacuum

2010-12-30 Thread JotaComm
Hello,

Last week I had a serious problem with my PostgreSQL database. My autovacuum
is OFF, but in September it started to prevent the transaction wraparoud;
however last week the following message appeared continuously in my log:

WARNING: database "production" must be vacuumed within 4827083 transactions
HINT: To avoid a database shutdown, execute a full-database VACUUM in
"production".

This message appeared for five to six hours; after that, the message
disappeared from log. Any idea about what could have happened?

Every day the vacuum is executed on some tables; and on Sundays it's
executed on all tables. But since autovacuum has been running since September,
and it runs for a long time, the manual vacuum was blocked because autovacuum
was already processing the same table. How should I proceed in this case?

The table where the autovacuum is running and where the vacuum was blocked
has billions of rows.

I'm using PostgreSQL 8.3.8.

The vacuum parameter configuration is:

vacuum_cost_limit = 200
vacuum_cost_delay = 0
vacuum_freeze_min_age = 1
autovacuum = off
autovacuum_freeze_max_age = 2

Regards,

João Paulo

-- 
JotaComm
http://jotacomm.wordpress.com


Re: [HACKERS] Avoiding rewrite in ALTER TABLE ALTER TYPE

2010-12-30 Thread Jim Nasby
On Dec 29, 2010, at 10:14 PM, Robert Haas wrote:
 +1 for trying to optimize these cases (but maybe after we optimize the
 varchar -> text and varchar(less) -> varchar(more) cases to skip the
 scan altogether).

+1 on getting the obvious cases of varchar and numeric done first; we run into 
those a lot at work and would be willing to sponsor work on a patch that makes 
those operations as fast as just adding a new column.
--
Jim C. Nasby, Database Architect   j...@nasby.net
512.569.9461 (cell) http://jim.nasby.net





Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 18:42 +0100, Stefan Kaltenbrunner wrote:

 it would help if this would just be a simple text-only description of 
 the design that people can actually comment on inline. I don't think 
 sending technical design proposals as a pdf (which seems to be written 
 in doc-style as well) is a good idea to encourage discussion on -hackers :(

25.2.6. Synchronous Replication
Streaming replication is by default asynchronous. Transactions on the
primary server write commit records to WAL, yet do not know whether or
when a standby has received and processed those changes. So with
asynchronous replication, if the primary crashes, transactions committed
on the primary might not have been received by any standby. As a result,
failover from primary to standby could cause data loss because
transaction completions are absent, relative to the primary. The amount
of data loss is proportional to the replication delay at the time of
failover. 

Synchronous replication offers the ability to guarantee that all changes
made by a transaction have been transferred to at least one remote
standby server. This is an extension to the standard level of durability
offered by a transaction commit. This is referred to as semi-synchronous
replication. 

When synchronous replication is requested, the commit of a write
transaction will wait until confirmation that the commit record has been
transferred successfully to at least one standby server. Waiting for
confirmation increases the user's confidence that the changes will not
be lost in the event of server crashes but it also necessarily increases
the response time for the requesting transaction. The minimum wait time
is the roundtrip time from primary to standby. 

Read only transactions and transaction rollbacks need not wait for
replies from standby servers. Subtransaction commits do not wait for
responses from standby servers, only final top-level commits. Long
running actions such as data loading or index building do not wait until
the very final commit message. 


25.2.6.1. Basic Configuration
Synchronous replication must be enabled on both the primary and at least
one standby server. If synchronous replication is disabled on the
master, or enabled on the primary but not enabled on any slaves, the
primary will use asynchronous replication by default. 

We use a single parameter to enable synchronous replication, set in
postgresql.conf on both primary and standby servers: 

synchronous_replication = off (default) | on

On the primary, synchronous_replication can be set for particular users
or databases, or dynamically by applications programs. 

If more than one standby server specifies synchronous_replication, then
whichever standby replies first will release waiting commits. 

Turning this setting off for a standby allows the administrator to
exclude certain standby servers from releasing waiting transactions.
This is useful if not all standby servers are designated as potential
future primary servers. On the standby, this parameter only takes effect
at server start. 
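For illustration, a minimal configuration for the setup described above might
look like this (a sketch based only on the parameter named in this proposal;
the final syntax could differ):

```
# postgresql.conf, on both the primary and the standby
synchronous_replication = on

# On the primary, per-user or per-database settings would use the usual
# GUC mechanisms, e.g. (hypothetical):
#   ALTER ROLE critical_app SET synchronous_replication = on;
#   SET synchronous_replication = on;   -- per session/transaction
```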


25.2.6.2. Planning for Performance
Synchronous replication usually requires carefully planned and placed
standby servers to ensure applications perform acceptably. Waiting
doesn't utilise system resources, but transaction locks continue to be
held until the transfer is confirmed. As a result, incautious use of
synchronous replication will reduce performance for database
applications because of increased response times and higher contention. 

PostgreSQL allows the application developer to specify the durability
level required via replication. This can be specified for the system
overall, though it can also be specified for specific users or
connections, or even individual transactions. 

For example, an application workload might consist of: 10% of changes
are important customer details, while 90% of changes are less important
data that the business can more easily survive if it is lost, such as
chat messages between users. 

With synchronous replication options specified at the application level
(on the master) we can offer sync rep for the most important changes,
without slowing down the bulk of the total workload. Application level
options are an important and practical tool for allowing the benefits of
synchronous replication for high performance applications. This feature
is unique to PostgreSQL. 


25.2.6.3. Planning for High Availability
The easiest and safest method of gaining High Availability using
synchronous replication is to configure at least two standby servers. To
understand why, we need to examine what can happen when you lose all
standby servers. 

Commits made when synchronous_replication is set will wait until at
least one standby responds. The response may never occur if the last, or
only, standby should crash or the network drops. What should we do in
that situation? 

Sitting and waiting will typically cause operational problems because it
is an effective outage of the 

Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 12:32 PM, Robert Haas robertmh...@gmail.com wrote:
 On Thu, Dec 30, 2010 at 11:49 AM, Tom Lane t...@sss.pgh.pa.us wrote:
 One possibility is to break it down like this:

        ERROR: foo is a sequence
        DETAIL: Triggers can only be used on tables and views.

 This could still be emitted by a function such as you suggest, and
 indeed that would be the most practical way from both a consistency
 and code-size standpoint.

 Great idea.  I should have thought of that.

On further reflection, this can still turn into a laundry list in certain cases.

DETAIL: You can only comment on columns of tables, views, and composite types.

seems less helpful than:

DETAIL: Comments on relations with system-generated column names are
not supported.

I think that for rules, triggers, constraints, and anything that only
works on a single relkind, we can't do much better than to list the
specific object types.  But where there's some sort of guiding
principle involved I think we'd do well to articulate it.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] estimating # of distinct values

2010-12-30 Thread Tomas Vondra
On 30.12.2010 15:43, Florian Pflug wrote:
 On Dec27, 2010, at 23:49 , Kevin Grittner wrote:
 Robert Haas robertmh...@gmail.com wrote:

 With respect to (b), I think I'd need to see a much more detailed
 design for how you intend to make this work.  Off the top of my
 head there seems to be some pretty serious feasibility problems.

 I had one random thought on that -- it seemed like a large concern
 was that there would need to be at least an occasional scan of the
 entire table to rebuild the distinct value information.
 
 I believe we could actually avoid that.
 
 First, the paper "An Optimal Algorithm for the Distinct Elements Problem"
 actually contains an algorithm which *does* handle deletes - it's called
 "L_0 estimate" there.

Hmmm, that's interesting. I know there's a part about L_0 estimation,
but that's about estimating the Hamming norm of a vector - so I ignored
it, as I thought we couldn't use it to estimate the number of distinct
values. But if it really handles deletions and if we can use it, then
it's really interesting.

 Second, as Tomas pointed out, the stream-based estimator is essentially a
 simplified version of a bloom filter. It starts out with a field of
 N zero bits, and sets K of them to 1 for each value v in the stream.
 Which bits are set to 1 depends on some hash function(s) H_i(v). It's
 then easy to compute how many 1-bits you'd expect to find in the bit
 field after seeing D distinct values, and by reversing that you can
 estimate D from the number of 1-bits in the bit field.

No, I haven't said the stream-based estimators are simplified versions
of a Bloom filter. I said the approach is very similar - all the
algorithms use bitmaps and hash functions, but the algorithms (Bloom
filter vs. probabilistic counting and adaptive sampling) are very different.

The Bloom filter is much more straightforward. The other algorithms are
much more sophisticated, which allows them to use less space.
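As a concrete illustration of the quoted bit-field idea, here is a rough
sketch using a single hash function (K = 1), which is essentially "linear
counting"; the names and parameters below are illustrative and do not come
from either mail:

```python
import hashlib
import math
import random

N_BITS = 1 << 16  # size of the bit field (N); illustrative choice

def add_value(bits, value):
    # Set one bit per value (K = 1 hash function).
    h = int(hashlib.sha256(str(value).encode()).hexdigest(), 16)
    bits[h % len(bits)] = 1

def estimate_distinct(bits):
    # After D distinct inserts, the expected fraction of zero bits is
    # roughly exp(-D/N); invert that to estimate D from the 1-bit count.
    n = len(bits)
    zeros = bits.count(0)
    if zeros == 0:
        return float('inf')  # bit field saturated: N was chosen too small
    return -n * math.log(zeros / n)

random.seed(1)
bits = [0] * N_BITS
values = [random.randrange(5000) for _ in range(50000)]
for v in values:
    add_value(bits, v)

true_d = len(set(values))
est = estimate_distinct(bits)
print(true_d, round(est))
```

Duplicates hash to the same bits, so repeated values do not inflate the
estimate; and, as discussed in this thread, a plain bit field like this
cannot handle deletes.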

 To avoid having to rescan large tables, instead of storing one such
 bit field, we'd store one per B pages of data. We'd then only need
 to scan a range of B pages around every updated or deleted tuple,
 and could afterwards compute a new global estimate of D by combining
 the individual bit fields with bitwise and.

I don't think this could help.

1) This works just with the Bloom filters, not with the other
   algorithms (you can't combine the segments using bitmap OR).

2) With heavily modified tables the updates are usually 'spread'
   through the whole table, so you'll have to rebuild all the
   segments anyway.

 Since the need to regularly VACUUM tables hit by updates or deletes
 won't go away any time soon, we could piggy-back the bit field
 rebuilding onto VACUUM to avoid a second scan.

Well, I guess it's a bit more complicated. First of all, there's a local
VACUUM when doing HOT updates. Second, you need to handle inserts too
(what if the table just grows?).

But I'm not a VACUUM expert, so maybe I'm wrong and this is the right
place to handle rebuilds of distinct stats.

I was thinking about something else - we could 'attach' the rebuild to
an actual seq scan if the amount of changes since the last rebuild
reaches some threshold. Only if the amount of changes reaches a higher
threshold would we rebuild the stats on our own.

Something like

IF (# of updates * deletes > 5%) THEN wait for seq scan
IF (# of updates * deletes > 10%) THEN rebuild the stats
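The threshold logic above might look like the following sketch; the 5%/10%
figures come from the mail, while the function shape and names are made up,
reading the condition as the fraction of the table modified since the last
rebuild:

```python
def stats_rebuild_action(n_changes, n_rows):
    """Decide how to refresh distinct-value statistics, treating the
    thresholds as fractions of rows modified since the last rebuild."""
    changed_fraction = n_changes / n_rows
    if changed_fraction > 0.10:
        return 'rebuild'    # too stale: rebuild the stats on our own
    if changed_fraction > 0.05:
        return 'piggyback'  # moderately stale: wait for the next seq scan
    return 'none'           # fresh enough: do nothing

print(stats_rebuild_action(3000, 100000))   # 3% changed
print(stats_rebuild_action(7000, 100000))   # 7% changed
print(stats_rebuild_action(15000, 100000))  # 15% changed
```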

I've found a nice ppt describing how Oracle does this:

   http://www.oraclegeek.net/downloads/One_Pass_Distinct_Sampling.ppt

and there's a PDF version:

   http://www.oraclegeek.net/downloads/OnePassDistinctSampling.pdf

According to this, Oracle is using the probabilistic counting approach
(see slide 26). And once they find out there were too many changes in the
table, they rebuild the whole thing.

I'm not saying we should do exactly the same, just that this might be a
good direction.

regards
Tomas



Re: [HACKERS] C++ keywords in headers

2010-12-30 Thread Chris Browne
pete...@gmx.net (Peter Eisentraut) writes:

 On Mon, 2010-12-27 at 12:33 -0500, Andrew Dunstan wrote:
 On a more general point, it would be useful to have some
 infrastructure for running quality checks like this and publishing
 the results. We should be way beyond the point where we rely on
 individuals doing this sort of stuff.

 I had a Hudson service set up for things like this, but the hosting
 was unreliable and then the thing faded away.  I could try to revive
 it.

Careful, Oracle has been trying to claim proprietary ownership of
that...
  http://hudson-labs.org/content/whos-driving-thing
-- 
``God decided to take the  devil to court and settle their differences
once and for all.  When Satan heard of this, he grinned and said, And
just where do you think you're going to find a lawyer?''



Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Robert Treat
On Thu, Dec 30, 2010 at 2:04 PM, Simon Riggs si...@2ndquadrant.com wrote:

 On Thu, 2010-12-30 at 18:42 +0100, Stefan Kaltenbrunner wrote:

  it would help if this would just be a simple text-only description of
  the design that people can actually comment on inline. I don't think
  sending technical design proposals as a pdf (which seems to be written
  in doc-style as well) is a good idea to encourage discussion on -hackers
 :(

 25.2.6. Synchronous Replication
 Streaming replication is by default asynchronous. Transactions on the
 primary server write commit records to WAL, yet do not know whether or
 when a standby has received and processed those changes. So with
 asynchronous replication, if the primary crashes, transactions committed
 on the primary might not have been received by any standby. As a result,
 failover from primary to standby could cause data loss because
 transaction completions are absent, relative to the primary. The amount
 of data loss is proportional to the replication delay at the time of
 failover.

 Synchronous replication offers the ability to guarantee that all changes
 made by a transaction have been transferred to at least one remote
 standby server. This is an extension to the standard level of durability
 offered by a transaction commit. This is referred to as semi-synchronous
 replication.

 When synchronous replication is requested, the commit of a write
 transaction will wait until confirmation that the commit record has been
 transferred successfully to at least one standby server. Waiting for
 confirmation increases the user's confidence that the changes will not
 be lost in the event of server crashes but it also necessarily increases
 the response time for the requesting transaction. The minimum wait time
 is the roundtrip time from primary to standby.

 Read only transactions and transaction rollbacks need not wait for
 replies from standby servers. Subtransaction commits do not wait for
 responses from standby servers, only final top-level commits. Long
 running actions such as data loading or index building do not wait until
 the very final commit message.


 25.2.6.1. Basic Configuration
 Synchronous replication must be enabled on both the primary and at least
 one standby server. If synchronous replication is disabled on the
 master, or enabled on the primary but not enabled on any slaves, the
 primary will use asynchronous replication by default.

 We use a single parameter to enable synchronous replication, set in
 postgresql.conf on both primary and standby servers:

 synchronous_replication = off (default) | on

 On the primary, synchronous_replication can be set for particular users
 or databases, or dynamically by applications programs.


This seems like a potential issue, where I start a server with this off, and
then I start turning it on for specific transactions; it isn't exactly clear
what happens, since there may or may not be a running synchronous rep slave
available.  (I love the idea though)


 If more than one standby server specifies synchronous_replication, then
 whichever standby replies first will release waiting commits.


I don't want you to think I am setting an expectation, but I'm curious about
the possibility of requiring more than 1 server to reply?


 Turning this setting off for a standby allows the administrator to
 exclude certain standby servers from releasing waiting transactions.
 This is useful if not all standby servers are designated as potential
 future primary servers. On the standby, this parameter only takes effect
 at server start.


 25.2.6.2. Planning for Performance
 Synchronous replication usually requires carefully planned and placed
 standby servers to ensure applications perform acceptably. Waiting
 doesn't utilise system resources, but transaction locks continue to be
 held until the transfer is confirmed. As a result, incautious use of
 synchronous replication will reduce performance for database
 applications because of increased response times and higher contention.

 PostgreSQL allows the application developer to specify the durability
 level required via replication. This can be specified for the system
 overall, though it can also be specified for specific users or
 connections, or even individual transactions.

 For example, an application workload might consist of: 10% of changes
 are important customer details, while 90% of changes are less important
 data that the business can more easily survive if it is lost, such as
 chat messages between users.

 With synchronous replication options specified at the application level
 (on the master) we can offer sync rep for the most important changes,
 without slowing down the bulk of the total workload. Application level
 options are an important and practical tool for allowing the benefits of
 synchronous replication for high performance applications. This feature
 is unique to PostgreSQL.


 25.2.6.3. Planning for High Availability
 The easiest and safest 

Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Marti Raudsepp
Most of your doc uses the terms "primary" and "standby", but a few
instances of "master" and "slave" have slipped in. I think it's better
to stick to consistent terminology.

On Thu, Dec 30, 2010 at 21:04, Simon Riggs si...@2ndquadrant.com wrote:
 With synchronous replication options specified at the application level
 (on the master) we can offer sync rep for the most important changes,
 without slowing down the bulk of the total workload. Application level
 options are an important and practical tool for allowing the benefits of
 synchronous replication for high performance applications. This feature
 is unique to PostgreSQL.

I think a comment about the head-of-line blocking nature of
streaming replication is in order. If you execute massive writes in
async mode and then run a transaction in sync mode, its commit will be
delayed until all the async transactions before it have been applied
on the slave.

 synchronous_replication_timeout (boolean)

Doesn't look like a boolean to me :)

Regards,
Marti



Re: [HACKERS] estimating # of distinct values

2010-12-30 Thread Alvaro Herrera
Excerpts from Tomas Vondra's message of Thu Dec 30 16:38:03 -0300 2010:

  Since the need to regularly VACUUM tables hit by updates or deletes
  won't go away any time soon, we could piggy-back the bit field
  rebuilding onto VACUUM to avoid a second scan.
 
 Well, I guess it's a bit more complicated. First of all, there's a local
 VACUUM when doing HOT updates. Second, you need to handle inserts too
 (what if the table just grows?).
 
 But I'm not a VACUUM expert, so maybe I'm wrong and this is the right
 place to handle rebuilds of distinct stats.

I was thinking that we could have two different ANALYZE modes, one
"full" and one "incremental"; autovacuum could be modified to use one or
the other depending on how many changes there are (of course, the user
could request one or the other, too; not sure what should be the default
behavior).  So the incremental one wouldn't worry about deletes, only
inserts, and could be called very frequently.  The other one would
trigger a full table scan (or nearly so) to produce a better estimate in
the face of many deletions.

I haven't followed this discussion closely so I'm not sure that this
would be workable.

-- 
Álvaro Herrera alvhe...@commandprompt.com
The PostgreSQL Company - Command Prompt, Inc.
PostgreSQL Replication, Consulting, Custom Development, 24x7 support



Re: [HACKERS] pg_dump --split patch

2010-12-30 Thread Robert Treat
On Thu, Dec 30, 2010 at 2:13 AM, Joel Jacobson j...@gluefinance.com wrote:

 2010/12/29 Dimitri Fontaine dimi...@2ndquadrant.fr

 Please have a look at getddl:

  https://github.com/dimitri/getddl


 Nice! Looks like a nifty tool.
 When I tried it, "./getddl.py -f -F /crypt/funcs -d glue", I got the error
 "No such file or directory: 'sql/schemas.sql'".

 While the task of splitting objects into separate files could be solved by
 an external wrapper tool like yours around pg_dump,
 I argue it makes more sense to put the (minimal required) logic into
 pg_dump, for a number of reasons, most importantly because it's simpler
 and less complex, thus less error-prone.

 My patch is only a few lines of code and doesn't add any logic to pg_dump;
 it merely reroutes the fwrite() calls based on the TOC entries.

 Just the fact that you and others had to create your own tools to do the
 splitting shows the feature is important, and I think it should be included
 in the normal pg_dump tool.


As someone whose own version of getddl helped inspire Dimitri to create
his own version, I've both enjoyed reading this thread and seeing this wheel
reinvented yet again, and wholeheartedly +1 the idea of building this
directly into pg_dump. (The only thing better would be to make everything
sql-callable, but that's a problem for another day.)


Robert Treat
http://www.xzilla.net


Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 15:07 -0500, Robert Treat wrote:
  If more than one standby server specifies synchronous_replication,
 then
  whichever standby replies first will release waiting commits.

 I don't want you to think I am setting an expectation, but I'm curious
 about the possibility of requiring more than 1 server to reply?

I was initially interested in this myself, but after a long discussion
on quorum commit it was decided to go with first past post.

That is easier to manage, requires one less parameter, performs better
and doesn't really add that much additional confidence.

It was also discussed that we would have a plugin API, but I'm less sure
about that now. Perhaps we can add that option in the future, but its
not high on my list of things for this release.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 15:07 -0500, Robert Treat wrote:
  We use a single parameter to enable synchronous replication, set in
  postgresql.conf on both primary and standby servers:
 
  synchronous_replication = off (default) | on
 
  On the primary, synchronous_replication can be set for particular
 users
  or databases, or dynamically by applications programs.
 
 
 This seems like a potential issue, where I start a server with this
 off, and then I start turning it on for specific transactions; it
 isn't exactly clear what happens, since there may or may not be a
 running synchronous rep slave available.  (I love the idea though)

Not really an issue. Even if there was a standby there a moment ago, the
standby can go away at any time. So we must cope gracefully with what
happens if you do this. By default, the parameters specify that in the
case you mention we will just use async replication (no wait!).
Options exist to change that, since some people want to wait until the
sysadmin adds a standby.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Stefan Kaltenbrunner

On 12/30/2010 08:04 PM, Simon Riggs wrote:

On Thu, 2010-12-30 at 18:42 +0100, Stefan Kaltenbrunner wrote:


it would help if this would just be a simple text-only description of
the design that people can actually comment on inline. I don't think
sending technical design proposals as a pdf (which seems to be written
in doc-style as well) is a good idea to encourage discussion on -hackers :(


25.2.6. Synchronous Replication
Streaming replication is by default asynchronous. Transactions on the
primary server write commit records to WAL, yet do not know whether or
when a standby has received and processed those changes. So with
asynchronous replication, if the primary crashes, transactions committed
on the primary might not have been received by any standby. As a result,
failover from primary to standby could cause data loss because
transaction completions are absent, relative to the primary. The amount
of data loss is proportional to the replication delay at the time of
failover.

Synchronous replication offers the ability to guarantee that all changes
made by a transaction have been transferred to at least one remote
standby server. This is an extension to the standard level of durability
offered by a transaction commit. This is referred to as semi-synchronous
replication.

When synchronous replication is requested, the commit of a write
transaction will wait until confirmation that the commit record has been
transferred successfully to at least one standby server. Waiting for
confirmation increases the user's confidence that the changes will not
be lost in the event of server crashes but it also necessarily increases
the response time for the requesting transaction. The minimum wait time
is the roundtrip time from primary to standby.


hmm - this is one of the main problems I see with the proposed "master is 
sometimes aware of the standby" (as in the feedback mode) concept this 
proposal has. If it waits for only one of the standbys, there is some 
issue with the terminology. As a DBA I would expect the master to only 
return if ALL of the declared sync replication nodes replied ok.





Read only transactions and transaction rollbacks need not wait for
replies from standby servers. Subtransaction commits do not wait for
responses from standby servers, only final top-level commits. Long
running actions such as data loading or index building do not wait until
the very final commit message.


25.2.6.1. Basic Configuration
Synchronous replication must be enabled on both the primary and at least
one standby server. If synchronous replication is disabled on the
master, or enabled on the primary but not enabled on any slaves, the
primary will use asynchronous replication by default.

We use a single parameter to enable synchronous replication, set in
postgresql.conf on both primary and standby servers:


this reads as if you can only set it there



synchronous_replication = off (default) | on

On the primary, synchronous_replication can be set for particular users
or databases, or dynamically by applications programs.


this says otherwise



If more than one standby server specifies synchronous_replication, then
whichever standby replies first will release waiting commits.


see above for why I think this violates the configuration promise - if I 
say this is a sync standby I better expect it to be...




Turning this setting off for a standby allows the administrator to
exclude certain standby servers from releasing waiting transactions.
This is useful if not all standby servers are designated as potential
future primary servers. On the standby, this parameter only takes effect
at server start.


25.2.6.2. Planning for Performance
Synchronous replication usually requires carefully planned and placed
standby servers to ensure applications perform acceptably. Waiting
doesn't utilise system resources, but transaction locks continue to be
held until the transfer is confirmed. As a result, incautious use of
synchronous replication will reduce performance for database
applications because of increased response times and higher contention.

PostgreSQL allows the application developer to specify the durability
level required via replication. This can be specified for the system
overall, though it can also be specified for specific users or
connections, or even individual transactions.

For example, an application workload might consist of: 10% of changes
are important customer details, while 90% of changes are less important
data that the business can more easily survive if it is lost, such as
chat messages between users.

With synchronous replication options specified at the application level
(on the master) we can offer sync rep for the most important changes,
without slowing down the bulk of the total workload. Application level
options are an important and practical tool for allowing the benefits of
synchronous replication for high performance applications. This feature
is unique to PostgreSQL.


that seems to be 

Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Aidan Van Dyk
On Thu, Dec 30, 2010 at 3:07 PM, Robert Treat r...@xzilla.net wrote:

 If primary crashes while commits are waiting for acknowledgement, those
 transactions will be marked fully committed if the primary database
 recovers, no matter how allow_standalone_primary is set.

 This seems backwards; if you are waiting for acknowledgement, wouldn't the
 normal assumption be that the transactions *didn't* make it to any standby,
 and should be rolled back?

This is the standard 2-phase commit problem.  The primary server *has*
committed it, its fsync has returned, and the only thing keeping it
from returning the commit to the client is that it's waiting on a
synchronous ack from a slave.

You've got 2 options:
1) initiate fsync on the slave first
   - In this case, the slave is farther ahead than the primary, and if
the primary fails, you're *forced* to have a failover.  The standby is
ahead of the primary, so the primary recovering can cause divergence.
And you'll likely have to do a base-backup style sync to get a new
primary/standby setup.
2) initiate fsync on the primary first
   - In this case, the slave is always slightly behind.  If your
primary falls over, you don't give commit messages to the clients, but
if it recovers, it might have committed data, and slaves will still be
able to catch up.

The thing is that currently, even without replication, #2 can happen.
If your db falls over before it gets the commit packet stuffed out the
network, you're in the same boat.  The data might be committed, even
though you didn't get the commit packet, and when your DB recovers,
it's got the committed data that you never knew was committed.
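The window in option 2 can be sketched as a toy timeline (illustrative only;
the state names and crash points below are not PostgreSQL internals):

```python
def commit_option2(crash_point=None):
    """Simulate option 2: the primary fsyncs its own commit record first,
    then waits for the standby's ack before replying to the client."""
    state = {'primary_durable': False, 'standby_acked': False,
             'client_got_commit': False}
    state['primary_durable'] = True       # step 1: local WAL fsync
    if crash_point == 'before_ack':
        return state                      # crash while waiting on standby
    state['standby_acked'] = True         # step 2: standby confirms receipt
    if crash_point == 'before_reply':
        return state                      # crash before replying to client
    state['client_got_commit'] = True     # step 3: commit packet sent
    return state

# The key window: the data is durable on the primary even though the
# client never received a commit acknowledgement.
s = commit_option2('before_ack')
print(s['primary_durable'], s['client_got_commit'])
```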

a.

-- 
Aidan Van Dyk                                             Create like a god,
ai...@highrise.ca                                       command like a king,
http://www.highrise.ca/                                   work like a slave.

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Robert Treat
On Thu, Dec 30, 2010 at 3:36 PM, Simon Riggs si...@2ndquadrant.com wrote:

 On Thu, 2010-12-30 at 15:07 -0500, Robert Treat wrote:
   If more than one standby server specifies synchronous_replication,
  then
   whichever standby replies first will release waiting commits.

  I don't want you to think I am setting an expectation, but I'm curious
  about the possibility of requiring more than 1 server to reply?

 I was initially interested in this myself, but after a long discussion
 on quorum commit it was decided to go with first past post.

 That is easier to manage, requires one less parameter, performs better
 and doesn't really add that much additional confidence.


Yes, I think with a single master, you are probably right (been
dealing with more than my fair share of multi-master based nosql
solutions lately).

Still, one thing that has me concerned is that in the case of two
slaves, you don't know which one is the more up-to-date one if you
need to failover. It'd be nice if you could just guarantee they both
are, but in lieu of that, I guess whatever decision tree is being
used, it needs to look at current xlog location of any potential
failover targets.

 It was also discussed that we would have a plugin API, but I'm less sure
 about that now. Perhaps we can add that option in the future, but it's
 not high on my list of things for this release.


Agreed.

Robert Treat
http://www.xzilla.net



Re: [HACKERS] Old git repo

2010-12-30 Thread Jeff Davis
On Thu, 2010-12-30 at 11:02 -0500, Tom Lane wrote:
 I'm with Magnus on this: the risk of confusion seems to greatly
 outweigh any possible benefit from keeping it.  There is no reason for
 anyone to use that old repo unless they are still working with a local
 clone of it, and even if they do have a local clone, such a clone is
 self-sufficient.

The reason I originally asked for it to be kept around was not because
it's hard to rebase, but because there might be references to SHA1s from
that repo floating around.

I don't think these would be very common, nor critical, but I know I 
wrote a few emails that included things like "look at this commit".
Personally, I have little use for the old repo (if it was anything 
important, I wouldn't have relied on the unofficial repo). But we should 
probably give a little bit of warning for folks that might want to
rebase or translate some old notes.

Regards,
Jeff Davis




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 15:07 -0500, Robert Treat wrote:
  When allow_standalone_primary is set, a user will stop waiting once
 the
  replication_timeout has been reached for their specific session.
 Users
  are not waiting for a specific standby to reply, they are waiting
 for a
  reply from any standby, so the unavailability of any one standby is
 not
  significant to a user. It is possible for user sessions to hit
 timeout
  even though standbys are communicating normally. In that case, the
  setting of replication_timeout is probably too low.
 
 
 will a notice or warning be thrown in these cases? I'm thinking
 something
 like the checkpoint timeout warning, but could be something else; it
 just
 seems to me you need some way to know you're timing out.

We can do that, yes.

  The standby sends regular status messages to the primary. If no
 status
  messages have been received for replication_timeout the primary
 server
  will assume the connection is dead and terminate it. This happens
  whatever the setting of allow_standalone_primary.
 
 
 Does the standby attempt to reconnect in these scenarios?

Yes it would, but the reason we terminated the connection was that it
wasn't talking any more, so it is probably dead.

  If primary crashes while commits are waiting for acknowledgement,
 those
  transactions will be marked fully committed if the primary database
  recovers, no matter how allow_standalone_primary is set.
 
 
 This seems backwards; if you are waiting for acknowledgement, wouldn't
 the
 normal assumption be that the transactions *didn't* make it to any
 standby,
 and should be rolled back ?

Well, we can't roll it back. We have already written the commit record
to WAL.

  There is no way
  to be certain that all standbys have received all outstanding WAL
 data
  at time of the crash of the primary. Some transactions may not show
 as
  committed on the standby, even though they show as committed on the
  primary. The guarantee we offer is that the application will not
 receive
  explicit acknowledgement of the successful commit of a transaction
 until
  the WAL data is known to be safely received by the standby. Hence
 this
  mechanism is technically semi synchronous rather than fully
  synchronous replication. Note that replication will still not be fully
  synchronous even if we wait for all standby servers, though this
 would
  reduce availability, as described previously.
 
 
 I think we ought to have an example of the best configuration for
 cannot
 afford to lose any data scenarios, where we would prefer an overall
 service
 interruption over the chance of having the primary / secondary out of
 synch.

I say use two or more standbys more than once...

 
 
 somewhat concerned that we seem to need to use double negatives to
 describe
 whats going on here. it makes me think we ought to rename this to
 require_synchronous_standby or similar.

Don't see why we can't use double negatives. ;-)

The parameter is named directly from Fujii Masao's suggestion.

  18.5.6. Standby Servers
  These settings control the behavior of a standby server that is to
  receive replication data.
 

...

 i was expecting this section to mention the synchronous_replication
 (bool)
 somewhere, to control if the standby will participate synchronously or
 asynch; granted it's the same config as listed in 18.5.5 right? Just
 that
 the heading of that section specifically targets the primary.

OK, good idea.

 HTH, looks pretty good at first glance. 

Thanks.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] pg_streamrecv for 9.1?

2010-12-30 Thread Stefan Kaltenbrunner

On 12/29/2010 07:42 PM, Robert Haas wrote:

On Dec 29, 2010, at 1:01 PM, Tom Lanet...@sss.pgh.pa.us  wrote:

Is it really stable enough for bin/?  My impression of the state of
affairs is that there is nothing whatsoever about replication that
is really stable yet.


Well, that's not stopping us from shipping a core feature called "replication". 
 I'll defer to others on how mature pg_streamrecv is, but if it's no worse than 
replication in general I think putting it in bin/ is the right thing to do.


well I have not looked at how good pg_streamrecv really is, but we 
desperately need to fix the basic usability issues in our current 
replication implementation, and pg_streamrecv seems to be a useful tool 
to help with some of them.
All the people I talked to about SR were surprised at how 
complex and fragile the initial setup procedure is - it comes down to the 
lack of a simple and reliable tool to do a base backup over libpq, and 
a simple way to have that tool tell the master "keep the WAL 
segments I need for starting the standby". I do realize we need to keep 
the ability to do the base backup out-of-line, but for 99% of the users it 
is too complex, scary and failure-prone (I know nobody who got the 
procedure right the first time - which is a strong hint that we need to 
work on that).




Stefan



Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 15:51 -0500, Robert Treat wrote:

 Still, one thing that has me concerned is that in the case of two
 slaves, you don't know which one is the more up-to-date one if you
 need to failover. It'd be nice if you could just guarantee they both
 are...

Regrettably, nobody can know that, without checking.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] small table left outer join big table

2010-12-30 Thread Dimitri Fontaine
Tom Lane t...@sss.pgh.pa.us writes:
 I can't get all *that* excited about complicating hash joins as
 proposed.  The query is still fundamentally going to be slow because
 you won't get out of having to seqscan the large table.  The only way
 to make it really fast is to not read all of the large table, and
 nestloop-with-inner-indexscan is the only plan type with a hope of
 doing that.

That sounds somewhat like Loose Indexscan as described in the following
wiki page, right?

  http://wiki.postgresql.org/wiki/Loose_indexscan

Regards,
-- 
Dimitri Fontaine
http://2ndQuadrant.fr PostgreSQL : Expertise, Formation et Support



Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Stefan Kaltenbrunner

On 12/30/2010 10:01 PM, Simon Riggs wrote:

On Thu, 2010-12-30 at 15:51 -0500, Robert Treat wrote:


Still, one thing that has me concerned is that in the case of two
slaves, you don't know which one is the more up-to-date one if you
need to failover. It'd be nice if you could just guarantee they both
are...


Regrettably, nobody can know that, without checking.


how exactly would you check? - this seems like something that needs to 
be doable from the SQL and the CLI level, and also very well documented 
(which I cannot see in your proposal).




Stefan




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 22:11 +0200, Marti Raudsepp wrote:

 I think a comment about the head-of-line blocking nature of
 streaming replication is in order. If you execute massive writes in
 async mode and then run a transaction in sync mode, its commit will be
 delayed until all the async transactions before it have been applied
 on the slave.

Not really sure I understand what you want me to add there. The case you
mention is identical whether we use the word async or sync where you
mention in async mode.

Replication doesn't wait until a sync commit is requested; it is
continuously active.

Sync rep's only addition is the reply messages.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 21:42 +0100, Stefan Kaltenbrunner wrote:
 
  Synchronous replication offers the ability to guarantee that all changes
  made by a transaction have been transferred to at least one remote
  standby server. This is an extension to the standard level of durability
  offered by a transaction commit. This is referred to as semi-synchronous
  replication.
 
  When synchronous replication is requested, the commit of a write
  transaction will wait until confirmation that the commit record has been
  transferred successfully to at least one standby server. Waiting for
  confirmation increases the user's confidence that the changes will not
  be lost in the event of server crashes but it also necessarily increases
  the response time for the requesting transaction. The minimum wait time
  is the roundtrip time from primary to standby.
 
 hmm this is one of the main problems I see with the proposed "master is 
 sometimes aware of the standby" (as in the feedback mode) concept this 
 proposal has. If it waits for only one of the standbys there is some 
 issue with the terminology. As a DBA I would expect the master to only 
 return if ALL of the sync replication declared nodes replied ok.

Well, as a DBA, I expect it to work with just one. That's how MySQL and
Oracle work, at least. If ALL standbys must reply, it takes longer, makes the
code harder, and how do you determine what "all" is robustly, etc. Plus it's
been discussed already.

 What I'm really missing with that proposal is how people expect that 
 solution to be managed - 

What aspect do you wish to monitor? I'm happy to consider your
suggestions.

 given there is only sometimes a feedback 
 channel into the master you can't do the monitoring.

Not sure what you mean. Please explain more.

 Even if you could (which we really need!) there is nothing in the 
 proposal yet that will help to determine what the most recent standby 
 (in the case of more than 1 sync standby) might be.

Functions to determine that already exist.

  - but it would require a real standby 
 registration or at least standby management possibility on the master 
 not a halfway done one - so do we really need hot_standby_feedback as 
 part of the inital sync-rep patch?

It is a Hot Standby feature, but so tightly integrated with this code
that it isn't possible for me to submit as two separate patches.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 2:04 PM, Simon Riggs si...@2ndquadrant.com wrote:
 We use a single parameter to enable synchronous replication, set in
 postgresql.conf on both primary and standby servers:

 synchronous_replication = off (default) | on

 On the primary, synchronous_replication can be set for particular users
 or databases, or dynamically by applications programs.

 If more than one standby server specifies synchronous_replication, then
 whichever standby replies first will release waiting commits.

 Turning this setting off for a standby allows the administrator to
 exclude certain standby servers from releasing waiting transactions.
 This is useful if not all standby servers are designated as potential
 future primary servers. On the standby, this parameter only takes effect
 at server start.

I think it's a bad idea to use the same parameter to mean different
things on the master and standby.  You proposed this kind of double
meaning for the hot_standby parameter (possibly back when it was
called standby_connections, or something like that) and we (rightly, I
think) did not adopt that, instead ending up with wal_level to control
the master's behavior and hot_standby to control the slave's behavior.

 synchronous_replication (boolean)
        Specifies whether transaction commit will wait for WAL records
        to be replicated before the command returns a success
        indication to the client.

The word "replicated" here could be taken to mean different things,
most obviously:

- slave has received the WAL
- slave has fsync'd the WAL
- slave has applied the WAL
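The three readings correspond to successively stronger durability guarantees. A toy ordering (names are illustrative only, not actual GUC values or PostgreSQL code):

```python
# Sketch: the three possible meanings of "replicated", ordered by the
# strength of the durability guarantee each gives the committing client.
from enum import IntEnum

class ReplyLevel(IntEnum):
    RECEIVED = 1   # slave has received the WAL into memory
    FSYNCED = 2    # slave has fsync'd the WAL to disk
    APPLIED = 3    # slave has replayed the WAL (visible to queries there)

def satisfies(ack: ReplyLevel, required: ReplyLevel) -> bool:
    """A standby reply at level `ack` can release a commit waiting for
    `required` only if it is at least as strong."""
    return ack >= required

assert satisfies(ReplyLevel.FSYNCED, ReplyLevel.RECEIVED)
assert not satisfies(ReplyLevel.RECEIVED, ReplyLevel.APPLIED)
```

The point of the ordering is simply that a spec saying "replicated" without picking one of these levels leaves the commit guarantee ambiguous.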

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company



Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 22:08 +0100, Stefan Kaltenbrunner wrote:
 On 12/30/2010 10:01 PM, Simon Riggs wrote:
  On Thu, 2010-12-30 at 15:51 -0500, Robert Treat wrote:
 
  Still, one thing that has me concerned is that in the case of two
  slaves, you don't know which one is the more up-to-date one if you
  need to failover. It'd be nice if you could just guarantee they both
  are...
 
  Regrettably, nobody can know that, without checking.
 
 how exactly would you check? - this seems like something that needs to 
 be doable from the SQL and the CLI level, and also very well documented 
 (which I cannot see in your proposal).

This is a proposal for sync rep, not multi-node failover. I'm definitely
not going to widen the scope of this project.

Functions already exist to check the thing you're asking.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] Snapshot synchronization, again...

2010-12-30 Thread Heikki Linnakangas

On 30.12.2010 16:49, Florian Pflug wrote:

On Dec30, 2010, at 13:31 , Joachim Wieland wrote:

We return snapshot information as a chunk of data to the client. At
the same time however, we set a checksum in shared memory to protect
against modification of the snapshot. A publishing backend can revoke
its snapshot by deleting the checksum and a backend that is asked to
install a snapshot can verify that the snapshot is correct and current
by calculating the checksum and comparing it with the one in shared
memory.
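The publish/verify handshake quoted above can be sketched roughly as follows. The function names are hypothetical, a dict stands in for shared memory, and SHA-1 stands in for whichever checksum the patch actually uses:

```python
# Illustrative sketch of the proposed scheme: the snapshot itself travels
# through the client as an opaque blob; only a checksum lives in shared
# memory, letting the publisher revoke it and importers verify it.
import hashlib

shared_memory = {}  # stands in for the per-backend checksum slot

def publish_snapshot(backend_id, snapshot_bytes):
    """Record a checksum in 'shared memory'; hand the blob to the client."""
    shared_memory[backend_id] = hashlib.sha1(snapshot_bytes).hexdigest()
    return snapshot_bytes

def revoke_snapshot(backend_id):
    """Publisher withdraws its snapshot by deleting the checksum."""
    shared_memory.pop(backend_id, None)

def install_snapshot(backend_id, snapshot_bytes):
    """Importer recomputes the checksum and compares before installing."""
    expected = shared_memory.get(backend_id)
    if expected is None:
        raise ValueError("snapshot revoked or unknown")
    if hashlib.sha1(snapshot_bytes).hexdigest() != expected:
        raise ValueError("snapshot was modified in transit")
    return True

blob = publish_snapshot(42, b"xmin=100 xmax=110 xip=103,105")
assert install_snapshot(42, blob)                 # unmodified: accepted
try:
    install_snapshot(42, b"xmin=100 xmax=110 xip=103")
except ValueError:
    pass                                          # tampered: rejected
revoke_snapshot(42)
try:
    install_snapshot(42, blob)
except ValueError:
    pass                                          # revoked: rejected
```

Florian's question below is whether this verification machinery is worth having at all, given that only read-only importers are envisaged.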


We'd still have to stream these checksums to the standbys though,
or would they be exempt from the checksum checks?

I still wonder whether these checks are worth the complexity. I
believe we'd only allow snapshot modifications for read-only queries
anyway, so what point is there in preventing clients from setting
broken snapshots?


Hmm, our definition of "read-only" is a bit fuzzy. While a transaction 
doesn't modify the database itself, it could still send NOTIFYs or call 
a PL function to do all sorts of things outside the database. Imagine 
that you're paranoid about data integrity, and have a security definer 
function that runs cross checks on the data. If it finds any 
anomalies, it wakes up the operator or forces shutdown or similar.


Now a malicious user could set a snapshot that passes the basic validity 
checks, i.e. xmin >= GlobalXmin, but contains a combination of still 
in-progress transactions that never existed in reality. If he then calls the 
paranoia-function, it would see an inconsistent state of committed 
tuples and get upset.


Maybe that's a bit far-fetched, but it's not entirely clear that 
running with an inconsistent snapshot is harmless.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com



Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 3:42 PM, Stefan Kaltenbrunner
ste...@kaltenbrunner.cc wrote:
 synchronous replication for high performance applications. This feature
 is unique to PostgreSQL.

 that seems to be a bit too much marketing for a reference level document

+1.

 It also does not address the more general (not sync rep specific) problem of
 how to deal with max_keep_segments which is a wart and I was hoping we could
 get rid of in 9.1 - but it would require a real standby registration or at
 least standby management possibility on the master not a halfway done one -
 so do we really need hot_standby_feedback as part of the inital sync-rep
 patch?

And this is really the key point on which previous discussions of sync
rep stalled.  Simon is clearly of the opinion that any system where
the slaves have individual identities (aka standby registration)
is a bad idea, but the only justification he's offered for that
position is the assertion that it doesn't allow any added
functionality.  As you point out, and as has been pointed out before,
this is not true, but unless Simon has changed his position since the
last time we discussed this, he will not only refuse to include any
kind of standby identifier in any of his proposals, but will also
argue against including any such code even if it is written by someone
else.  I don't understand why, but that's how it is.

Synchronous replication would probably be done and committed by now if
it weren't for this issue.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company



Re: [HACKERS] Re: Vacuum of newly activated 8.3.12 standby receives warnings page xxx is uninitialized --- fixing

2010-12-30 Thread Heikki Linnakangas

On 30.12.2010 10:55, Mark Kirkwood wrote:

Removing the offending

--delete --exclude=backup_label

options from the base backup step makes everything work properly again.


I don't see why --delete would make any difference, but you shouldn't 
exclude backup_label from the base backup. The backup label file is an 
important part of the online backup; the backup cannot be recovered safely 
without it.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com



Re: [HACKERS] Re: Vacuum of newly activated 8.3.12 standby receives warnings page xxx is uninitialized --- fixing

2010-12-30 Thread Mark Kirkwood

On 31/12/10 11:01, Heikki Linnakangas wrote:

On 30.12.2010 10:55, Mark Kirkwood wrote:

Removing the offending

--delete --exclude=backup_label

options from the base backup step makes everything work properly again.


I don't see why --delete would make any difference, but you shouldn't 
exclude backup_label from the base backup. The backup label file is an 
important part of the online backup; the backup cannot be recovered safely 
without it.




Yes, you (and Robert) are entirely correct, I was confused in my 
understanding of --delete --exclude=backup_label and thought it 
meant "exclude the backup label from the delete". Yeah, the --delete is 
harmless; it is the exclusion of backup_label that is causing the problem.


Note to all current Pitrtools users, this impacts you! We need to get a 
corrected version out soon I would think.


I note that this "uninitialized pages" issue with standbys has cropped up from 
time to time - I wonder if in most/all the cases folk were using Pitrtools?


regards

Mark





Re: [HACKERS] Re: Vacuum of newly activated 8.3.12 standby receives warnings page xxx is uninitialized --- fixing

2010-12-30 Thread Mark Kirkwood

On 31/12/10 11:11, Mark Kirkwood wrote:


Yes, you (and Robert) are entirely correct, I was confused in my 
understanding of --delete --exclude=backup_label and thought it 
meant "exclude the backup label from the delete". Yeah, the --delete 
is harmless; it is the exclusion of backup_label that is causing the problem.


Note to all current Pitrtools users, this impacts you! We need to get 
a corrected version out soon I would think.




Also (not surprisingly) I can confirm that data corruption is possible:

1/ Perform approx 14 transactions against the primary
2/ Cancel Pgbench
3/ Issue SELECT pg_switch_xlog() on primary
4/ Bring up standby after checking it has applied last log

The resulting primary and standby should be identical, but:

primary:

bench=# SELECT count(*) FROM branches;
 count
---
   100

bench=# SELECT count(*) FROM accounts;
  count
--
 1000

standby:

bench=# SELECT count(*) FROM branches;
 count
---
   132

bench=# SELECT count(*) FROM accounts;
  count
-
 9998269

The other counts are the same. We have lost some accounts records, but 
have gained duplicates in branches:


bench=# REINDEX TABLE branches;
ERROR:  could not create unique index "branches_pkey"
DETAIL:  Table contains duplicated values.

regards

Mark




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Jim Nasby
On Dec 30, 2010, at 3:27 PM, Robert Haas wrote:
 synchronous_replication (boolean)
Specifies whether transaction commit will wait for WAL records
to be replicated before the command returns a success
indication to the client.
 
 The word replicated here could be taken to mean different things,
 most obviously:
 
 - slave has received the WAL
 - slave has fsync'd the WAL
 - slave has applied the WAL

I think that comment is valid for the entire set of docs, actually. The 
document goes out of its way to avoid simple phrases like "replicated", but 
doesn't spell out exactly what is happening, i.e.:

Synchronous replication offers the ability to guarantee that all changes
made by a transaction have been transferred to at least one remote
standby server. This is an extension to the standard level of durability
offered by a transaction commit. This is referred to as semi-synchronous
replication.

Reading that, I'm left with the sense that this isn't a simple matter of "Oh, 
the data has been replicated to the slave before commit returns", but nothing 
does a good job of clearly explaining what the distinction is and what it 
means. This section:

The guarantee we offer is that the application will not receive
explicit acknowledgement of the successful commit of a transaction until
the WAL data is known to be safely received by the standby. Hence this
mechanism is technically semi synchronous rather than fully
synchronous replication.

does provide some enlightenment, but it's at the end of the section. I think it 
would be best if there was a section right at the beginning that talked about 
the data quality issue of sync replication and how we're avoiding it with our 
semi-sync solution.
--
Jim C. Nasby, Database Architect   j...@nasby.net
512.569.9461 (cell) http://jim.nasby.net





Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 16:27 -0500, Robert Haas wrote:

 I think it's a bad idea to use the same parameter to mean different
 things on the master and standby.  

Obviously if you phrase it like that, nobody would disagree. I would say
I have used the same parameter on both sides in a balanced way to
simplify the configuration, which had been an important factor in the
debate.

"You need to set parameter X on both primary and standby" seems simple
and clear. It certainly works OK for MySQL.

It's no bother to change, whichever way we decide and I'm happy to do
so.

My previous patch had two parameters:

primary: synchronous_replication = ...
standby: synchronous_replication_service = on | off

Which do people prefer?

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 16:47 -0500, Robert Haas wrote:
 On Thu, Dec 30, 2010 at 3:42 PM, Stefan Kaltenbrunner
 ste...@kaltenbrunner.cc wrote:
  synchronous replication for high performance applications. This feature
  is unique to PostgreSQL.
 
  that seems to be a bit too much marketing for a reference level document
 
 +1.

I've removed the "This feature is unique to PostgreSQL" sentence, which I agree
belongs in a press release, not docs. The explanation of a use case that
would benefit from the feature seems valid and I've left that in.

PostgreSQL docs are more technical and precise than any other DBMS, even
DB2. Having read everybody else's docs, I'm inclined to say it would be
easier to explain if I left out the details, as they do. You won't find
a detailed explanation of commit guarantees in MySQL docs, for example.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] Avoiding rewrite in ALTER TABLE ALTER TYPE

2010-12-30 Thread Noah Misch
On Thu, Dec 30, 2010 at 12:57:45AM -0500, Robert Haas wrote:
 On Thu, Dec 30, 2010 at 12:24 AM, Noah Misch n...@leadboat.com wrote:
  On Wed, Dec 29, 2010 at 11:14:37PM -0500, Robert Haas wrote:
  I think for any pair of types (T1, T2) we should first determine
  whether we can skip the scan altogether. ?If yes, we're done. ?If no,
  then we should have a way of determining whether a verify-only scan is
  guaranteed to be sufficient (in your terminology, the verification
  scan is guaranteed to return either positive or error, not negative).
  If yes, then we do a verification scan. ?If no, we do a rewrite.
 
  How would we answer the second question in general?
 
 I am not sure - I guess we'd need to design some sort of mechanism for that.

Okay, here goes.  Given a Var varexpr representing the column we're changing
and an expression tree expr we need to answer two questions (argument lists
simplified -- assume the same RTEs in all cases):

always-noop: Will datumIsEqual(ExecEvalExpr(varexpr), ExecEvalExpr(expr))
return true or yield an error for all possible tuples?
never-error: Will ExecEvalExpr(expr) never throw an error?

Currently we're only interested in the second question when the first is also
true; I'm not sure if there's something fundamental there, or just an artifact
of current needs.  To support answering these questions, extend the CREATE CAST
changes from my earlier proposal, modifying the exemptor signature to return an
int, a bitmask containing one bit for each of these two questions.  Call the
function in find_typmod_coercion_function.  If its return value answers yes to
both questions, return COERCION_PATH_NONE, resulting in omission of the length
coercion node.  For other verdicts, generate the FuncExpr as normal and insert
the verdict in a new FuncExpr field funcexempt.  (That need not increase the
size of FuncExpr, if that's a concern.)

ATPrepAlterColumnType, having generated its transformation expression, will call
a new function that recursively walks the tree to answer the two questions.  The
walker will apply these rules:

1. For a Var with the varno/varattno in question, intrinsically yes to both.
2. A RelabelType node inherits the answers of its sole argument.
3. A CoerceToDomain node inherits the always-noop answer of its sole argument.
When GetDomainConstraints() == NIL, it also inherits the never-error answer.
Otherwise, never-error becomes no.
4. A FuncExpr node has answers given by the bitwise-AND of its funcexempt field
and the answers from its first argument.
5. Any other node answers no to both questions.

If the transformation expression root has yes to both questions, we're done
with no scan.  If only always-noop is true, we do a verification scan only.
Otherwise, we optimize nothing and do a rewrite.
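A rough model of that walker, with rules 1-5 applied bottom-up. This is illustrative Python, not the actual C node structures; `funcexempt` uses bit 0 for always-noop and bit 1 for never-error, as in the proposal:

```python
# Sketch of the proposed transformation-expression walker. Each node
# yields (always_noop, never_error) per Noah's rules 1-5.
from dataclasses import dataclass, field
from typing import List

@dataclass
class Node:
    kind: str                         # 'Var', 'RelabelType', 'CoerceToDomain', 'FuncExpr', ...
    args: List["Node"] = field(default_factory=list)
    varattno: int = 0                 # for Var
    funcexempt: int = 0               # for FuncExpr: bit 0 = always-noop, bit 1 = never-error
    domain_constraints: bool = False  # for CoerceToDomain: GetDomainConstraints() != NIL

def walk(node, target_attno):
    if node.kind == "Var":
        ok = node.varattno == target_attno
        return ok, ok                                        # rule 1 (else rule 5)
    if node.kind == "RelabelType":
        return walk(node.args[0], target_attno)              # rule 2
    if node.kind == "CoerceToDomain":
        noop, noerr = walk(node.args[0], target_attno)
        return noop, (noerr and not node.domain_constraints) # rule 3
    if node.kind == "FuncExpr":
        noop, noerr = walk(node.args[0], target_attno)
        return (noop and bool(node.funcexempt & 1),
                noerr and bool(node.funcexempt & 2))         # rule 4
    return False, False                                      # rule 5

# E.g. a widening length coercion whose exemptor answers yes to both:
expr = Node("FuncExpr", [Node("Var", varattno=1)], funcexempt=0b11)
assert walk(expr, 1) == (True, True)    # (yes, yes): skip the scan entirely
# A narrowing coercion that may error on over-length values:
expr2 = Node("FuncExpr", [Node("Var", varattno=1)], funcexempt=0b01)
assert walk(expr2, 1) == (True, False)  # always-noop only: verification scan
```

The root's pair then maps directly onto the three outcomes in the last paragraph: (yes, yes) means no scan, (yes, no) means verify-only, anything else means rewrite.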

Thoughts?

Thanks,
nm

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 18:47 -0600, Jim Nasby wrote:
 On Dec 30, 2010, at 3:27 PM, Robert Haas wrote:
  synchronous_replication (boolean)
 Specifies whether transaction commit will wait for WAL records
 to be replicated before the command returns a success
 indication to the client.
  
  The word "replicated" here could be taken to mean different things,
  most obviously:
  
  - slave has received the WAL
  - slave has fsync'd the WAL
  - slave has applied the WAL
 
 I think that comment is valid for the entire set of docs, actually. The 
 document goes out of its way to avoid simple phrases like "replicated", but 
 doesn't spell out exactly what is happening, ie:
 
 Synchronous replication offers the ability to guarantee that all changes
 made by a transaction have been transferred to at least one remote
 standby server. This is an extension to the standard level of durability
 offered by a transaction commit. This is referred to as semi-synchronous
 replication.
 
 Reading that, I'm left with the sense that this isn't a simple matter of "Oh, 
 the data has been replicated to the slave before commit returns", but nothing 
 does a good job of clearly explaining what the distinction is and what it 
 means. This section:
 
 The guarantee we offer is that the application will not receive
 explicit acknowledgement of the successful commit of a transaction until
 the WAL data is known to be safely received by the standby. Hence this
 mechanism is technically semi synchronous rather than fully
 synchronous replication.
 
 does provide some enlightenment, but it's at the end of the section. I think 
 it would be best if there was a section right at the beginning that talked 
 about the data quality issue of sync replication and how we're avoiding it 
 with our semi-sync solution.

I'm happy to change the docs. It's the first draft...

If that's the only problem you've got, then I'm feeling good.

Any problems with the user interface itself?

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Simon Riggs
On Thu, 2010-12-30 at 16:47 -0500, Robert Haas wrote:

  It also does not address the more general (not sync-rep-specific) problem of
  how to deal with max_keep_segments, which is a wart I was hoping we could
  get rid of in 9.1 - but that would require real standby registration, or at
  least a standby management possibility on the master, not a halfway-done one -
  so do we really need hot_standby_feedback as part of the initial sync-rep
  patch?
 
 And this is really the key point on which previous discussions of sync
 rep stalled.  Simon is clearly of the opinion that any system where
 the slaves have individual identities (aka standby registration)
 is a bad idea, but the only justification he's offered for that
 position is the assertion that it doesn't allow any added
 functionality.  As you point out, and as has been pointed out before,
 this is not true, but unless Simon has changed his position since the
 last time we discussed this, he will not only refuse to include any
 kind of standby identifier in any of his proposals, but will also
 argue against including any such code even if it is written by someone
 else.  I don't understand why, but that's how it is.
 
 Synchronous replication would probably be done and committed by now if
 it weren't for this issue.

I'm not very clear what your response has to do with Stefan's comments.

My general perspective is that MySQL released a simple design a year
ahead of us, which should be to our collective shame. I will be working
towards delivering something useful in this release.

Standby registration is complicated and not necessary. If anybody needs
to justify anything, it is the people that claim it is somehow
essential. If you want increased complexity and features, you can have
it, one day, but don't prevent everybody else from benefiting from
simplicity, now. What we do need is performance, otherwise the feature
is mostly unusable for production systems, without splitting your
application into pieces.

I would rather concentrate on a minimal set of functionality that we can
all agree on. To show that, I have gone out of my way to include
features specified by others, including exact names and behaviours of
parameters.

-- 
 Simon Riggs   http://www.2ndQuadrant.com/books/
 PostgreSQL Development, 24x7 Support, Training and Services
 




Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Tom Lane
Robert Haas robertmh...@gmail.com writes:
 On further reflection, this can still turn into a laundry list in certain 
 cases.

 DETAIL: You can only comment on columns of tables, views, and composite types.

 seems less helpful than:

 DETAIL: Comments on relations with system-generated column names are
 not supported.

 I think that for rules, triggers, constraints, and anything that only
 works on a single relkind, we can't do much better than to list the
 specific object types.  But where there's some sort of guiding
 principle involved I think we'd do well to articulate it.

I'm unconvinced, because the guiding principle is likely to be an
implementation detail that won't actually mean much to users.  Your
example above is a case in point --- I do *not* think the average
user will see that as an improvement.

regards, tom lane



Re: [HACKERS] estimating # of distinct values

2010-12-30 Thread Tom Lane
Alvaro Herrera alvhe...@commandprompt.com writes:
 I was thinking that we could have two different ANALYZE modes, one
 full and one incremental; autovacuum could be modified to use one or
 the other depending on how many changes there are (of course, the user
 could request one or the other, too; not sure what should be the default
 behavior).

How is an incremental ANALYZE going to work at all?  It has no way to
find out the recent changes in the table, for *either* inserts or
deletes.  Unless you want to seqscan the whole table looking for tuples
with xmin later than something-or-other ... which more or less defeats
the purpose.

regards, tom lane



Re: [HACKERS] Old git repo

2010-12-30 Thread Tom Lane
Jeff Davis pg...@j-davis.com writes:
 Personally, my utility for the old repo is not much (if it was anything
 important, I wouldn't have relied on the unofficial repo). But we should
 probably give a little bit of warning for folks that might want to
 rebase or translate some old notes.

Well, I guess the question is how much warning.  I suggested O(1 week)
but Robert seems to want O(1 year).  As long as there's some agreed
deadline, I'm not very picky about what it is.

regards, tom lane




Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 8:58 PM, Tom Lane t...@sss.pgh.pa.us wrote:
 Robert Haas robertmh...@gmail.com writes:
 On further reflection, this can still turn into a laundry list in certain 
 cases.

 DETAIL: You can only comment on columns of tables, views, and composite 
 types.

 seems less helpful than:

 DETAIL: Comments on relations with system-generated column names are
 not supported.

 I think that for rules, triggers, constraints, and anything that only
 works on a single relkind, we can't do much better than to list the
 specific object types.  But where there's some sort of guiding
 principle involved I think we'd do well to articulate it.

 I'm unconvinced, because the guiding principle is likely to be an
 implementation detail that won't actually mean much to users.  Your
 example above is a case in point --- I do *not* think the average
 user will see that as an improvement.

I think this thread has worked itself around to where it's entirely
pointless.  My original complaint was about error messages like this:

%s is not a table, view, composite type, or index

which, once we have foreign tables, needs to be changed to read:

%s is not a table, view, composite type, index, or foreign table

I think that message is the epitome of worthless, and several other
people agreed.  After various proposals of greater and lesser merit,
we've somehow worked around to the suggestion that this should be
reworded to:

ERROR: %s is a sequence
DETAIL: Only attributes of tables, views, composite types, indexes, or
foreign tables can be renamed.

While that may be a marginal improvement in clarity, it does
absolutely nothing to address my original complaint, which is that
adding a relkind forces trivial revisions of messages all over the
system, some of which are already excessively long-winded.  This
message also does nothing to help the user understand WHY we don't
allow renaming the attributes of his sequence or TOAST table, whereas
the proposed revision does.

The absolute worst offenders are messages of the form:

blah is not supported on X, Y, Z, or T.

which now have to be revised to read:

blah is not supported on X, Y, Z, T, or W.

This problem could be avoided by writing:

blah is supported on A and B

Or:

blah is supported only for relation types which quack

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company



Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Tom Lane
Robert Haas robertmh...@gmail.com writes:
 I think this thread has worked itself around to where it's entirely
 pointless.

I understand your frustration, but it's not clear to me that there *is*
any simple solution to this problem.  Fundamentally, adding new relkinds
to the system is always going to require running around and looking at a
lot of code to see what's affected; and that goes for the error messages
too.  I put no stock at all in the idea that writing a guiding
principle in the error messages will avoid anything, because as often
as not, adding a fundamentally new relkind is going to involve some
tweaking of what those principles are.

 ... This message also does nothing to help the user understand WHY we don't
 allow renaming the attributes of his sequence or TOAST table, whereas
 the proposed revision does.

I remain unconvinced that the average user cares, or will be able to
extrapolate the message to understand what's supported or not, even
if he does care about the reason for the restriction.

regards, tom lane



Re: [HACKERS] Snapshot synchronization, again...

2010-12-30 Thread Joachim Wieland
On Thu, Dec 30, 2010 at 9:40 AM, Alvaro Herrera
alvhe...@commandprompt.com wrote:
 Disadvantage of b: It doesn't allow a snapshot to be installed on a
 different server. It requires a serializable open transaction to hold
 the snapshot.

 Why does it require a serializable transaction?  You could simply
 register the snapshot in any transaction.  (Of course, the net effect
 would be pretty similar to a serializable transaction).

I am not assuming that the publishing transaction blocks until its
snapshot is picked up. A read-committed transaction would get a new
snapshot for every subsequent query, so the published snapshot is no
longer represented by an actual backend until it is picked up by
one. Since nobody is holding off xmin/GlobalXmin, eventually vacuum
would remove tuples that the published-but-not-yet-picked-up snapshot
should still be able to see, no?

Joachim



Re: [HACKERS] Snapshot synchronization, again...

2010-12-30 Thread Joachim Wieland
On Thu, Dec 30, 2010 at 9:49 AM, Florian Pflug f...@phlo.org wrote:
 On Dec30, 2010, at 13:31 , Joachim Wieland wrote:
 We return snapshot information as a chunk of data to the client. At
 the same time however, we set a checksum in shared memory to protect
 against modification of the snapshot. A publishing backend can revoke
 its snapshot by deleting the checksum and a backend that is asked to
 install a snapshot can verify that the snapshot is correct and current
 by calculating the checksum and comparing it with the one in shared
 memory.

 We'd still have to stream these checksums to the standbys though,
 or would they be exempt from the checksum checks?

I am not talking about having synchronized snapshots among standby
servers at all.

I am only proposing a client API that will work for this future idea as well.


 I still wonder whether these checks are worth the complexity. I
 believe we'd only allow snapshot modifications for read-only queries
 anyway, so what point is there in preventing clients from setting
 broken snapshots?

What's the use case for it? As long as nobody comes up with a
reasonable use case for it, let's aim for the robust version.
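
The publish/verify scheme quoted above can be modeled in a few lines (a toy
Python sketch; the dict standing in for shared memory and the SHA-256 choice
are illustrative assumptions, not anything a patch specifies):

```python
import hashlib

# Stand-in for the per-backend checksum slots in shared memory.
shared_memory = {}

def publish_snapshot(backend_id, snapshot_bytes):
    # Hand the snapshot data to the client, keeping only its checksum
    # server-side so the snapshot itself need not be stored.
    shared_memory[backend_id] = hashlib.sha256(snapshot_bytes).hexdigest()
    return snapshot_bytes

def revoke_snapshot(backend_id):
    # The publishing backend withdraws its snapshot by deleting the checksum.
    shared_memory.pop(backend_id, None)

def install_snapshot(backend_id, snapshot_bytes):
    # A backend asked to adopt a snapshot recomputes the checksum and
    # compares it against shared memory; mismatch means stale or tampered.
    digest = hashlib.sha256(snapshot_bytes).hexdigest()
    if shared_memory.get(backend_id) != digest:
        raise ValueError("snapshot is stale or was modified")
    return True
```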


Joachim



Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Joshua Tolley
On Thu, Dec 30, 2010 at 03:24:09PM -0500, Aidan Van Dyk wrote:
 On Thu, Dec 30, 2010 at 3:07 PM, Robert Treat r...@xzilla.net wrote:
 
  If primary crashes while commits are waiting for acknowledgement, those
  transactions will be marked fully committed if the primary database
  recovers, no matter how allow_standalone_primary is set.
 
  This seems backwards; if you are waiting for acknowledgement, wouldn't the
  normal assumption be that the transactions *didn't* make it to any standby,
  and should be rolled back?
 
 This is the standard 2-phase commit problem.  The primary server *has*
 committed it, its fsync has returned, and the only thing keeping it
 from returning the commit to the client is that it's waiting on a
 synchronous ack from a slave.

snip

 2) initiate fsync on the primary first
 - In this case, the slave is always slightly behind.  If your
 primary falls over, you don't give commit messages to the clients, but
 if it recovers, it might have committed data, and slaves will still be
 able to catch up.
 
 The thing is that currently, even without replication, #2 can happen.

For what little it's worth, I vote for this option, because it's a problem
that can already happen (as opposed to adding an entirely new type of problem
to the mix).

--
Joshua Tolley / eggyknap
End Point Corporation
http://www.endpoint.com




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 8:57 PM, Simon Riggs si...@2ndquadrant.com wrote:
 I'm not very clear what your response has to do with Stefan's comments.

 My general perspective is that MySQL released a simple design a year
 ahead of us, which should be to our collective shame. I will be working
 towards delivering something useful in this release.

I don't feel ashamed of our feature set and I am not out to beat MySQL
or anyone else, just to deliver the best product that we can.  Our
community has different interests than the MySQL community and that is
fine.  Still, I don't disagree that we should be aiming at feature
parity.

*reads MySQL documentation*

I see now that you've tried to design this feature in a way that is
similar to MySQL's offering, which does have some value.  But it
appears to me that the documentation you've written here is
substantially similar to the MySQL 5.5 reference documentation.  That
could get us into a world of legal trouble - that documentation is not
even open source, let alone BSD.

http://dev.mysql.com/doc/refman/5.5/en/replication-semisync.html

 I would rather concentrate on a minimal set of functionality that we can
 all agree on.

Me too; and perhaps your proposal is it.  But I think it's a shame we
didn't put more work into standby registration when we had time to get
that done.  It might not be necessary, but it would have delivered
some nice functionality that we are now not going to have for 9.1.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company



Re: [HACKERS] and it's not a bunny rabbit, either

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 9:30 PM, Tom Lane t...@sss.pgh.pa.us wrote:
 Robert Haas robertmh...@gmail.com writes:
 I think this thread has worked itself around to where it's entirely
 pointless.

 I understand your frustration, but it's not clear to me that there *is*
 any simple solution to this problem.  Fundamentally, adding new relkinds
 to the system is always going to require running around and looking at a
 lot of code to see what's affected; and that goes for the error messages
 too.  I put no stock at all in the idea that writing a guiding
 principle in the error messages will avoid anything, because as often
 as not, adding a fundamentally new relkind is going to involve some
 tweaking of what those principles are.

I think that's true in some cases but not all.  The system-generated
attribute names thing actually applies in several cases, and I think
it's pretty cut-and-dried.  When you get into something like which
kinds of relations support triggers, that's a lot more arbitrary.

 ... This message also does nothing to help the user understand WHY we don't
 allow renaming the attributes of his sequence or TOAST table, whereas
 the proposed revision does.

 I remain unconvinced that the average user cares, or will be able to
 extrapolate the message to understand what's supported or not, even
 if he does care about the reason for the restriction.

I'm convinced, but that only makes one of us.  I think for now what I
had better do is try to get this SQL/MED patch finished up by
soldiering through this mess rather than trying to fix it.  I think
it's going to be kind of ugly, but if we haven't got another plan then
we're just going to have to live with the ugliness.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company



Re: [HACKERS] Problems with autovacuum and vacuum

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 12:56 PM, JotaComm jota.c...@gmail.com wrote:
 Last week I had a serious problem with my PostgreSQL database. My autovacuum
 is OFF, but in September it started running to prevent transaction wraparound;
 however, last week the following message appeared continuously in my log:

 WARNING: database "production" must be vacuumed within 4827083 transactions
 HINT: To avoid a database shutdown, execute a full-database VACUUM in
 "production".

 This message appeared for five to six hours; after that, the message
 disappeared from log. Any idea about what could have happened?

I'm thinking that autovacuum kicked into gear to prevent transaction
wraparound.  Once it did enough work to stave off disaster, the
warning messages stopped appearing in the log.

 Every day a vacuum is executed on some tables, and on Sundays it's
 executed on all tables. But since autovacuum has been running since September,
 and it runs for a long time, the manual vacuum was blocked because autovacuum
 had been running on the same table. How should I proceed in this case?

I guess the obvious thing to do would be to turn on autovacuum and
forget about manual vacuums.
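
For what it's worth, the remaining wraparound headroom can be watched with a
standard catalog query (age() and these catalogs exist in 8.3; the exact
warning threshold depends on configuration):

```sql
-- How close is each database to the wraparound limit?
SELECT datname, age(datfrozenxid) AS xid_age
FROM pg_database
ORDER BY xid_age DESC;

-- And the oldest tables within the current database:
SELECT relname, age(relfrozenxid) AS xid_age
FROM pg_class
WHERE relkind = 'r'
ORDER BY xid_age DESC
LIMIT 10;
```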

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company



Re: [HACKERS] Avoiding rewrite in ALTER TABLE ALTER TYPE

2010-12-30 Thread Robert Haas
On Thu, Dec 30, 2010 at 8:35 PM, Noah Misch n...@leadboat.com wrote:
 4. A FuncExpr node has answers given by the bitwise-AND of its funcexempt
 field and the answers from its first argument.

Why its first argument?

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company



Re: [HACKERS] Avoiding rewrite in ALTER TABLE ALTER TYPE

2010-12-30 Thread Noah Misch
On Fri, Dec 31, 2010 at 12:34:50AM -0500, Robert Haas wrote:
 On Thu, Dec 30, 2010 at 8:35 PM, Noah Misch n...@leadboat.com wrote:
  4. A FuncExpr node has answers given by the bitwise-AND of its funcexempt
  field and the answers from its first argument.
 
 Why its first argument?

funcexempt would only be nonzero for FuncExpr of length coercion casts.  Those
have the subject datum as a first argument, typmod as second, and is-explicit
boolean as third.  The other arguments are effectively already validated.

That brings up a point -- the exemptor function also needs an is-explicit
argument, as that affects the decision for some types.
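
To make the shape concrete, here is a hypothetical exemptor for a
varchar-style length coercion (Python purely for illustration; real typmods
encode the length with a header offset, which this sketch ignores, and the
bit values are assumptions):

```python
# Bitmask answering the two questions from the proposal.
ALWAYS_NOOP = 0x1
NEVER_ERROR = 0x2

def varchar_exemptor(old_typmod, new_typmod, is_explicit):
    # Widening the limit, or removing it (typmod < 0), can neither change
    # the value nor raise an error.
    if new_typmod < 0 or (old_typmod >= 0 and new_typmod >= old_typmod):
        return ALWAYS_NOOP | NEVER_ERROR
    # An explicit narrowing cast truncates rather than erroring: not a
    # no-op, but it can never raise.
    if is_explicit:
        return NEVER_ERROR
    # Implicit narrowing may raise "value too long".
    return 0
```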



Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Hannu Krosing

On 30.12.2010 22:27, Robert Haas wrote:

 On Thu, Dec 30, 2010 at 2:04 PM, Simon Riggs si...@2ndquadrant.com wrote:

  synchronous_replication (boolean)
  Specifies whether transaction commit will wait for WAL records
  to be replicated before the command returns a success
  indication to the client.

 The word "replicated" here could be taken to mean different things,
 most obviously:

 - slave has received the WAL
 - slave has fsync'd the WAL
 - slave has applied the WAL
Perhaps the level of replication guarantee should be decided on the
slave side, by having a configuration parameter there

report_as_replicated = received|written_to_disk|fsynced|applied

since different types of hosts may have wildly different guarantees and
performance parameters for these. One could envision a WAL-archive type
standby which is there for data persistence only and will never apply
WAL.

Of course we could put a bitmap in the status update messages from the
slave and have some quorum options on the master for when the data is
considered in sync, say "need 5 received or (1 applied and 1 fsynced)",
but I am pretty sure that trying to get anywhere with this before
applying the basic sync rep patch would push back sync rep to at least
9.2 if not 9.5.
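
That quorum idea can be sketched as follows (illustrative Python; the level
names follow the report_as_replicated values above, and the rule encoding is
my own assumption):

```python
# Ordered weakest-to-strongest: a standby at a later level implicitly
# satisfies all earlier ones.
LEVELS = ["received", "written_to_disk", "fsynced", "applied"]

def at_least(reports, level, n):
    # Count standbys whose reported level is at least `level`.
    rank = LEVELS.index(level)
    return sum(1 for r in reports if LEVELS.index(r) >= rank) >= n

def quorum_satisfied(reports):
    # Example rule: "need 5 received or (1 applied and 1 fsynced)".
    return at_least(reports, "received", 5) or (
        at_least(reports, "applied", 1) and at_least(reports, "fsynced", 1))
```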


-
Hannu Krosing




Re: [HACKERS] Sync Rep Design

2010-12-30 Thread Hannu Krosing

On 31.12.2010 6:02, Robert Haas wrote:

 On Thu, Dec 30, 2010 at 8:57 PM, Simon Riggs si...@2ndquadrant.com wrote:

  I'm not very clear what your response has to do with Stefan's comments.

  My general perspective is that MySQL released a simple design a year
  ahead of us, which should be to our collective shame. I will be working
  towards delivering something useful in this release.

 I don't feel ashamed of our feature set and I am not out to beat MySQL
 or anyone else, just to deliver the best product that we can.
The key word here is "deliver".  The aim is to deliver sync rep, not
just specify it, leaving out controversial details. The registration
part has been left out for a reason - while the registration itself is
easy, deciding all the interactions with already-running replication is
not. Doing just the minimal support for sync rep (needing an
acknowledgement from at least one standby) and leaving the management
of standbys to the user enables us to get to actual working code
instead of a pie-in-the-sky wishlist.


 Our community has different interests than the MySQL community and that
 is fine.  Still, I don't disagree that we should be aiming at feature
 parity.

 *reads MySQL documentation*

 I see now that you've tried to design this feature in a way that is
 similar to MySQL's offering, which does have some value.  But it
 appears to me that the documentation you've written here is
 substantially similar to the MySQL 5.5 reference documentation.  That
 could get us into a world of legal trouble - that documentation is not
 even open source, let alone BSD.

 http://dev.mysql.com/doc/refman/5.5/en/replication-semisync.html
Maybe we should get someone who has not read the MySQL docs to re-write
a spec in a clean-room fashion, by just inspecting the code and asking
Simon et al.


  I would rather concentrate on a minimal set of functionality that we can
  all agree on.

 Me too; and perhaps your proposal is it.  But I think it's a shame we
 didn't put more work into standby registration when we had time to get
 that done.

When you need _just_ the registration, then make a table and two
functions, pg_standby_register(name) and pg_standby_unregister(name).
For a little more added functionality, add a third one,
pg_standby_last_seen(name), to update a last-seen timestamp, and a
script that polls all standbys and calls it.
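
A minimal sketch of such a registry (hypothetical names, illustrative SQL
only, not a proposed patch):

```sql
CREATE TABLE standby_registry (
    name      text PRIMARY KEY,
    last_seen timestamptz
);

CREATE FUNCTION pg_standby_register(n text) RETURNS void AS $$
    INSERT INTO standby_registry(name) VALUES (n);
$$ LANGUAGE sql;

CREATE FUNCTION pg_standby_unregister(n text) RETURNS void AS $$
    DELETE FROM standby_registry WHERE name = n;
$$ LANGUAGE sql;

CREATE FUNCTION pg_standby_last_seen(n text) RETURNS void AS $$
    UPDATE standby_registry SET last_seen = now() WHERE name = n;
$$ LANGUAGE sql;
```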

 It might not be necessary, but it would have delivered
 some nice functionality that we are now not going to have for 9.1.
There are tons of nice functionality we are not going to have for 9.1;
let's just not let this cause even more nice functionality to be left
out!

-
Hannu Krosing

