Re: [HACKERS] B-tree parent pointer and checkpoints
On 11.11.2010 00:49, Tom Lane wrote:
> I wrote: What happens if you error out in between? Or is it assumed that the *entire* sequence is a critical section? If it has to be that way, one might wonder what's the point of trying to split it into multiple WAL records.
>
> Or, to be more concrete: I'm wondering if this *entire* mechanism isn't a bad idea that we should just rip out. The question that ought to be asked here, I think, is whether it shouldn't be required that every inter-WAL-record state is a valid consistent state that doesn't require post-crash fixups. If that isn't the case, then a simple ERROR or FATAL exit out of the backend that was creating the sequence originally will leave the system in an unacceptable state. We could prevent such an exit by wrapping the whole sequence in a critical section, but if you have to do that then it's not apparent why you shouldn't fold it into one WAL record.
>
> IOW, forget this patch. Take out the logic that tries to complete pending splits during replay, instead. I believe this is perfectly safe for btree: loss of a parent record isn't fatal, as proven by the fact that searches don't have to be locked out while a split proceeds. (We might want to make btree_page_del not think that a missing parent record is an error, but it shouldn't think that anyway, because of the possibility of a non-crashing failure during the original split.) This approach might not be safe for GIST or GIN; but if it isn't, they need fixes anyway.

GIN is similar to b-tree: the incomplete-split logic there is for inserting the parent pointers in the b-tree within the GIN index, just like nbtree.

GiST is different. When you insert a key to a leaf page, you (sometimes) need to adjust the parent pointer to reflect the new key as well. B-tree tolerates incomplete splits thanks to the 'next page' pointer, but that is not applicable to GiST.
Teodor described the issue back in 2005 when WAL-logging was added to GiST (http://archives.postgresql.org/pgsql-hackers/2005-06/msg00555.php):

> The problem with incomplete inserts is: when a new entry is installed into a leaf page, in the worst case the whole chain of keys from the root page down to the leaf should be updated; usually only part of the chain needs updating. Each key on an inner page contains the union of the keys (a bounding box, in the case of rtree, for example) on the page it points to. This union can be formed only with the help of a user-defined function of the opclass, because GiST doesn't know anything about the nature of the keys.
>
> Returning to WAL: the GiST core writes an xlog entry with all the necessary information for restoration before writing the page, but at that moment it doesn't yet know whether it will need to update keys on the parent page or whether the key is unchanged. So GiST's WAL restoration code should remember this page's update as an incomplete insert. When the insert is completed, the GiST core writes to the log that the insert is completed, and the restore code can clean up the stored incomplete insert. If there was a crash, the record of the completed insert may be absent from the log, and GiST's restore code should then complete it: it knows which page was changed, and walks up toward the root, on each step forming the union for the page and inserting it into the parent.

Reading that I wonder: what harm would an incomplete insert cause if we just left it in the tree? Imagine that you insert a key to a leaf page, but crash before updating the parent. If you search for the key starting from the root, you'll fail to find it, because the parent pointer claims that there are no entries with such a key on the child page. But that's OK: the inserting transaction aborted with the crash! Do some of the other GiST algorithms get confused if there's a key on a page that's not represented by the parent pointer?
It's possible that you insert a key to the leaf, update the leaf's immediate parent, but crash before updating the parent's parent. As far as search is concerned, that's OK as well, but it creates a hazard for subsequent inserts. If you later insert a tuple with the same key to the same leaf page, the insertion will see that the parent pointer already includes the key, and will fail to update the parent's parent. That's a problem.

Would it be hard to change the algorithm to update the parent keys top-down rather than bottom-up?

--
Heikki Linnakangas
EnterpriseDB http://www.enterprisedb.com

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
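To make the hazard concrete, here is a hypothetical toy model (in Python, not PostgreSQL code) of bottom-up union propagation, with one-dimensional intervals standing in for bounding-box union keys. The `Node`/`insert` names are invented for illustration; the point is that an insert stops walking up as soon as an ancestor already covers the key, so a crash-stale grandparent is never repaired.

```python
# Hypothetical 1-D interval model of GiST union keys (illustration only).
# Each inner node stores the union (lo, hi) of all keys below it.

class Node:
    def __init__(self, union, parent=None):
        self.union = union      # (lo, hi) covering the subtree
        self.parent = parent

def covers(union, key):
    lo, hi = union
    return lo <= key <= hi

def insert(ancestors_bottom_up, key, crash_after_level=None):
    """Walk from the leaf's parent up to the root, enlarging unions.
    The insert stops as soon as an ancestor already covers the key;
    crash_after_level simulates a crash partway up the chain."""
    for level, node in enumerate(ancestors_bottom_up):
        if covers(node.union, key):
            return              # ancestors assumed consistent -- the hazard
        lo, hi = node.union
        node.union = (min(lo, key), max(hi, key))
        if crash_after_level is not None and level == crash_after_level:
            return              # simulated crash: nodes above stay stale

root = Node((0, 10))
parent = Node((0, 5), root)

# Crash after updating the immediate parent but before the root:
insert([parent, root], 20, crash_after_level=0)
assert covers(parent.union, 20) and not covers(root.union, 20)

# A later insert of the same key stops at the parent and never fixes
# the root -- exactly the problem described above.
insert([parent, root], 20)
assert not covers(root.union, 20)
```

Updating the chain top-down, as suggested, would avoid this: every ancestor would be enlarged before the level below it, so a crash could never leave a covered child under an uncovered grandparent.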
Re: [HACKERS] improved parallel make support
On Wed, Nov 10, 2010 at 6:13 PM, Andrew Dunstan and...@dunslane.net wrote:
> > Yeah, it's complaining about not finding bison, but configure managed to find bison just fine.
>
> Are you sure the right make was installed? It looks suspicious because it's not talking about msys virtual paths like the old make did. It needs to be make-3.81-3-msys-1.0.13 http://sourceforge.net/projects/mingw/files/MSYS/make/make-3.81-3/make-3.81-3-msys-1.0.13-bin.tar.lzma/download You'll need another couple of libraries as well (libiconv and libintl) if they are not already installed. Making this change took me a while to get right on dawn_bat.

I installed the latest make from gnu.org (which I've now uninstalled). The Msys installation on this box is old, and doesn't support the lzma packages used by the latest releases - and from what I can tell, it would take a major upgrade of the installation to get that support. I'm not sure that's a path I want to go down, as I have no idea how much will break if I do that, and I don't exactly have much in the way of spare time to fix it if that happens.

I'm currently leaning towards removing the 9.1 build from the machine; on a purely selfish note, I have no interest in mingw/msys builds anymore anyway. However, I'm open to suggestions if anyone knows a relatively safe way to resolve this.

/D
Re: [HACKERS] improved parallel make support
On 11/11/2010 06:58 AM, Dave Page wrote:
> On Wed, Nov 10, 2010 at 6:13 PM, Andrew Dunstan and...@dunslane.net wrote:
> > > Yeah, it's complaining about not finding bison, but configure managed to find bison just fine.
> >
> > Are you sure the right make was installed? It looks suspicious because it's not talking about msys virtual paths like the old make did. It needs to be make-3.81-3-msys-1.0.13 http://sourceforge.net/projects/mingw/files/MSYS/make/make-3.81-3/make-3.81-3-msys-1.0.13-bin.tar.lzma/download You'll need another couple of libraries as well (libiconv and libintl) if they are not already installed. Making this change took me a while to get right on dawn_bat.
>
> I installed the latest make from gnu.org (which I've now uninstalled). The Msys installation on this box is old, and doesn't support the lzma packages used by the latest releases - and from what I can tell, it would take a major upgrade of the installation to get that support. I'm not sure that's a path I want to go down, as I have no idea how much will break if I do that, and I don't exactly have much in the way of spare time to fix it if that happens. I'm currently leaning towards removing the 9.1 build from the machine; on a purely selfish note, I have no interest in mingw/msys builds anymore anyway. However, I'm open to suggestions if anyone knows a relatively safe way to resolve this.

No, all you need to unpack those is the basic-bsdtar package. But to save you the pain of all this, I have copied the three objects I installed to get this working on my likewise pretty old Msys to where you can get them. Just grab http://developer.postgresql.org/~adunstan/msys-make.tgz

As a matter of policy, I do not want to drop support for a FOSS build tool chain on Windows if at all avoidable.

cheers

andrew
Re: [HACKERS] multi-platform, multi-locale regression tests
Robert Haas robertmh...@gmail.com writes:
> I think the big hurdle with contrib isn't that it's called contrib but that it's not part of the core server and, in many cases, enabling a contrib module means editing postgresql.conf and bouncing the server. Of course, there are certainly SOME people who wouldn't mind editing postgresql.conf and bouncing the server but are scared off by the name contrib, but I suspect the hassle-factor is the larger issue by a substantial margin.

You're forgetting about the dump and restore problems you now have as soon as you're using any contrib. They are more visible at upgrade time, of course, but still bad enough otherwise.

Regards,
--
Dimitri Fontaine
http://2ndQuadrant.fr PostgreSQL : Expertise, Formation et Support
Re: [HACKERS] renaming contrib.
Robert Haas robertmh...@gmail.com writes:
> work will help with that somewhat, but there's still that nasty business of needing to update shared_preload_libraries and bounce the server, at least for some modules.

We have 45 contribs (ls -l contrib | grep -c ^d), out of which:

- auto_explain is shared_preload_libraries, but I think could be local_preload_libraries
- pg_stat_statements is shared_preload_libraries (needs SHM)

and that's it. So my reading is that currently the only contrib module that needs more than a server reload is pg_stat_statements, because it needs some shared memory. Am I missing anything?

Ok, now I'll add the custom_variable_classes setting to the control files in the extension's patch for the contribs that expose some of them.

Regards,
--
Dimitri Fontaine
http://2ndQuadrant.fr PostgreSQL : Expertise, Formation et Support
Re: [HACKERS] renaming contrib. (was multi-platform, multi-locale regression tests)
On 11/10/2010 07:51 PM, Robert Haas wrote:
> > (And no, don't you dare breathe a word about git making that all automagically better. I have enough back-patching experience with git by now to be unimpressed; in fact, I notice that its rename-tracking feature falls over entirely when trying to back-patch further than 8.3. Apparently there's some hardwired limit on the number of files it can cope with.)
>
> That's very sad. Did you file a bug?

It's intentional behavior. It gives up when there are too many differences, to avoid being slow.

We should adopt that philosophy. I suggest we limit all tables in future to 1m rows in the interests of speed.

cheers

andrew
Re: [HACKERS] improved parallel make support
On Thu, Nov 11, 2010 at 1:04 PM, Andrew Dunstan and...@dunslane.net wrote:
> No, all you need to unpack those is the basic-bsdtar package.

Ahh, OK. That seems to be in the MinGW (compiler) section of the downloads for some reason.

> But to save you the pain of all this, I have copied the three objects I installed to get this working on my likewise pretty old Msys to where you can get them. Just grab http://developer.postgresql.org/~adunstan/msys-make.tgz

Thanks - installed.

> As a matter of policy, I do not want to drop support for a FOSS build tool chain on Windows if at all avoidable.

Nor I; however, I only have limited time to dedicate to that goal.

--
Dave Page
Blog: http://pgsnake.blogspot.com
Twitter: @pgsnake
EnterpriseDB UK: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
Re: [HACKERS] Exposing an installation's default value of unix_socket_directory
Peter Eisentraut pete...@gmx.net writes:
> On tor, 2010-10-21 at 16:59 -0400, Tom Lane wrote:
> > Actually, the only reason this is even up for discussion is that there's no configure option to set DEFAULT_PGSOCKET_DIR. If there were, and debian were using it, then pg_config --configure would tell what I wish to know. I thought for a bit about proposing we add such an option, but given the current state of play it might be more misleading than helpful: as long as distros are accustomed to changing this setting via a patch, you couldn't trust pg_config --configure to tell you what a given installation actually has compiled into it.
>
> Presumably, if a configure option were added, they couldn't change it via patch anymore.

Hm, you're right: we'd remove the pg_config_manual.h entry, so the existing patches would stop working, and presumably maintainers would figure out that they ought to use the configure switch instead. So that argument holds little water.

> Btw., a configure option for this was rejected years ago to discourage people from actually changing the default.

Yeah, I remember that discussion now that you mention it. It still seems like a good policy ... but given that some popular packages are changing the default whether we think it's a good idea or not, maybe it's better to acknowledge that reality. We could still have some text in the manual pointing out the compatibility hazards of using the switch, I guess.

regards, tom lane
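For reference, the compiled-in default under discussion is the pg_config_manual.h entry mentioned above; in a stock source tree it looks like this (the comment is paraphrased, and it is this line that distribution packages patch, e.g. to /var/run/postgresql):

```c
/* src/include/pg_config_manual.h */

/*
 * Default directory in which Unix-domain sockets are created.
 * Distributions that relocate the socket patch this definition,
 * which is exactly why pg_config --configure cannot report it.
 */
#define DEFAULT_PGSOCKET_DIR  "/tmp"
```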
[HACKERS] MULTISET and additional functions for ARRAY
Postgres supports ARRAY data types well, but there are some more array functions in the SQL standard. Also, the standard has a MULTISET data type, which is an unordered collection. It looks easy to support the additional array functions. There might be some confusion in treating multi-dimensional arrays with them, but we could treat all arrays as one-dimensional, as unnest() does.

MULTISET support is more difficult. We have corresponding type IDs for each array type, but we might not want to add an additional multiset type ID for every element type. Any ideas for this issue? If we reuse the type IDs of arrays for multisets, the multisets would have some special typmod. For example, typmod = 0 would mean a multiset, and a positive value an array with that max cardinality. Note that the SQL standard doesn't mention multi-dimensional arrays, so we can keep typmod = -1 as a free-size, free-dimensional array for backward compatibility.

If there are troublesome issues in supporting multiset data types, I'm thinking of adding, as a first step, multiset functions that receive ARRAY types instead, because an ARRAY is a MULTISET by definition. Some of the functions for multisets seem to be useful for arrays, too.

Comments and suggestions welcome.
=== Array functions ===
- [FUNCTION] cardinality(anyarray) => integer
- [FUNCTION] trim_array(anyarray, nTrimmed integer) => anyarray

=== Multiset functions ===
- [FUNCTION] cardinality(anymultiset) => integer
- [FUNCTION] element(anymultiset) => anyelement
- [FUNCTION] multiset_member_of(anymultiset, anyelement) => boolean
  [SYNTAX] $2 MEMBER OF $1
- [FUNCTION] multiset_is_a_set(anymultiset) => boolean
  [SYNTAX] $1 IS A SET
- [FUNCTION] multiset_sub_multiset_of(anymultiset, anymultiset) => boolean
  [SYNTAX] $2 SUBMULTISET OF $1
- [FUNCTION] multiset_union[_all](anymultiset, anymultiset) => anymultiset
  [SYNTAX] $1 MULTISET UNION [ALL | DISTINCT] $2
- [FUNCTION] multiset_intersect[_all](anymultiset, anymultiset) => anymultiset
  [SYNTAX] $1 MULTISET INTERSECT [ALL | DISTINCT] $2
- [FUNCTION] multiset_except[_all](anymultiset, anymultiset) => anymultiset
  [SYNTAX] $1 MULTISET EXCEPT [ALL | DISTINCT] $2
- [AGGREGATE] collect(anyelement) => anymultiset
- [AGGREGATE] fusion(anymultiset) => anymultiset
- [AGGREGATE] intersection(anymultiset) => anymultiset

See also these secondary sources:
http://waelchatila.com/2005/05/18/1116485743467.html
http://farrago.sourceforge.net/design/CollectionTypes.html
http://publib.boulder.ibm.com/infocenter/db2luw/v9r8/index.jsp?topic=/com.ibm.db2.luw.apdv.sqlpl.doc/doc/t0053486.html
http://download.oracle.com/docs/cd/B28359_01/server.111/b28286/conditions006.htm
http://download.oracle.com/docs/cd/B28359_01/server.111/b28286/operators006.htm

--
Itagaki Takahiro
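Pending a decision on real multiset types, several of the proposed functions can already be sketched over plain arrays with existing machinery. The following definitions are illustrative only: the names follow the proposal above, semantics are simplified to one-dimensional arrays, and NULL handling is ignored.

```sql
-- Illustrative array-based stand-ins for some of the proposed functions.

-- cardinality(anyarray): count all elements regardless of dimensionality.
CREATE FUNCTION cardinality(anyarray) RETURNS integer
AS $$ SELECT count(*)::integer FROM unnest($1) $$ LANGUAGE sql IMMUTABLE;

-- $2 MEMBER OF $1
CREATE FUNCTION multiset_member_of(anyarray, anyelement) RETURNS boolean
AS $$ SELECT $2 = ANY ($1) $$ LANGUAGE sql IMMUTABLE;

-- MULTISET UNION ALL: for one-dimensional arrays, just concatenation.
CREATE FUNCTION multiset_union_all(anyarray, anyarray) RETURNS anyarray
AS $$ SELECT $1 || $2 $$ LANGUAGE sql IMMUTABLE;

-- $1 IS A SET: true when no element appears more than once.
CREATE FUNCTION multiset_is_a_set(anyarray) RETURNS boolean
AS $$ SELECT count(*) = count(DISTINCT x) FROM unnest($1) AS t(x) $$
LANGUAGE sql IMMUTABLE;
```

The DISTINCT variants of union/intersect/except, and the collect/fusion/intersection aggregates, need per-element multiplicity counting, which is where real multiset semantics (and the typmod question above) start to matter.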
Re: [HACKERS] BUG #5748: Invalid oidvector data during binary recv
Hello list,

Sorry for not replying to the bug list, but I didn't receive that message. It's about http://archives.postgresql.org/pgsql-bugs/2010-11/msg00065.php

The test case there with the remark about LBOUND is incorrect; we first found the bug on a different result. In the process of finding the most simple test case we apparently found one where another check failed (ARR_NDIM(result) != 1). The following test case has a value for ndim that passes the check, but an lbound value of 0 right after the call of array_recv in oidvectorrecv:

postgres=# copy (select '{1}'::oidvector[]) to '/tmp/test' with binary;
COPY 1
postgres=# copy a from '/tmp/test' with binary;

(gdb) p *result
$6 = {vl_len_ = 112, ndim = 1, dataoffset = 0, elemtype = 26, dim1 = 1, lbound1 = 0, values = {1}}

The same behaviour is also seen in int2vectorrecv.

regards,
Yeb Havinga
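To make the missing check concrete, here is a hypothetical sketch (in Python, not the actual C) of the header validation that oidvectorrecv would need after array_recv returns. The function name is invented; the field names and the constant 26 (the oid type's OID) follow the gdb dump above.

```python
# Hypothetical sketch of post-array_recv validation for oidvectorrecv.
# Field names mirror the ArrayType fields shown in the gdb dump.

OIDOID = 26  # pg_type OID of the 'oid' type, per elemtype in the dump

def validate_oidvector_header(ndim, lbound1, dataoffset, elemtype):
    """Reject any received array that is not a plain 1-D, lower-bound-1,
    null-free array of oids -- the invariants an oidvector must hold."""
    if ndim != 1:
        raise ValueError("oidvector must be one-dimensional")
    if lbound1 != 1:  # the check the report shows is missing
        raise ValueError("oidvector must have a lower bound of 1")
    if dataoffset != 0:
        raise ValueError("oidvector must not contain nulls")
    if elemtype != OIDOID:
        raise ValueError("oidvector elements must be of type oid")
    return True

# A well-formed header passes:
assert validate_oidvector_header(1, 1, 0, 26)

# The header from the gdb dump above (lbound1 = 0) is now rejected:
try:
    validate_oidvector_header(1, 0, 0, 26)
    rejected = False
except ValueError:
    rejected = True
assert rejected
```

The same validation would apply to int2vectorrecv, with the element type swapped accordingly.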
Re: [HACKERS] B-tree parent pointer and checkpoints
Heikki Linnakangas heikki.linnakan...@enterprisedb.com writes:
> GiST is different. When you insert a key to a leaf page, you (sometimes) need to adjust the parent pointer to reflect the new key as well. B-tree tolerates incomplete splits with the 'next page' pointer, but that is not applicable to gist.
>
> Teodor described the issue back in 2005 when WAL-logging was added to GiST (http://archives.postgresql.org/pgsql-hackers/2005-06/msg00555.php):
>
> Reading that I wonder: what harm would an incomplete insert cause if we just left it in the tree? Imagine that you insert a key to a leaf page, but crash before updating the parent. If you search for the key starting from the root, you'll fail to find it, because the parent pointer claims that there are no entries with such a key on the child page. But that's OK, the inserting transaction aborted with the crash!

I think it'd be okay as far as that one entry is concerned, since as you say it doesn't matter whether a search finds it. (We'd have to be sure that VACUUM would still find it to remove it, of course, but that doesn't use a normal search.) You're right that it poses a hazard of subsequent inserts deciding that they don't need to do work on upper levels because the lower ones look OK already. But depending on the details of the search algorithm, this might be a non-problem: if you remember that the upper level entries didn't cover your key when you descended, you'd still know you need to recompute them.

Something else I just noticed is that WAL replay isn't capable of completely fixing the index anyway:

 * To complete insert we can't use basic insertion algorithm because
 * during insertion we can't call user-defined support functions of opclass.
 * So, we insert 'invalid' tuples without real key and do it by separate
 * algorithm.  'invalid' tuple should be updated by vacuum full.

Given that there's no more vacuum full, and nobody has been expected to run it routinely for a long time anyway, this fixup approach seems pretty completely broken anyhow.

regards, tom lane
Re: [HACKERS] renaming contrib. (was multi-platform, multi-locale regression tests)
On Thu, Nov 11, 2010 at 8:28 AM, Andrew Dunstan and...@dunslane.net wrote:
> It's intentional behavior. It gives up when there are too many differences to avoid being slow.

And it's configurable, at least for diff and merge. If it's not available in all the other porcelains, yes, those would be bugs that should be fixed:

  -l<num>
      The -M and -C options require O(n^2) processing time where n is the
      number of potential rename/copy targets. This option prevents
      rename/copy detection from running if the number of rename/copy
      targets exceeds the specified number.

And it can even be specified via the config options diff.renameLimit and merge.renameLimit.

> We should adopt that philosophy. I suggest we limit all tables in future to 1m rows in the interests of speed.

As long as it's configurable, and if it would make operations on smaller tables faster, then go for it. And we should by default limit shared_buffers to 32MB. Oh wait. There are always tradeoffs when picking defaults, à la postgresql.conf. We as a community are generally pretty quick to pick up the "the defaults are very conservative, make sure you tune ..." song when people complain about pg being too slow ;-)

a.

--
Aidan Van Dyk                    Create like a god,
ai...@highrise.ca                command like a king,
http://www.highrise.ca/          work like a slave.
Re: [HACKERS] renaming contrib. (was multi-platform, multi-locale regression tests)
Aidan Van Dyk ai...@highrise.ca writes:
> > It's intentional behavior. It gives up when there are too many differences to avoid being slow.
>
> And, it's configurable, at least to diff and merge. If it's not available in all the other porcelains, yes, that would be bugs that should be fixed:

FWIW, I was seeing this with git cherry-pick, whose man page gives no hint of supporting any such option.

> -l<num>: The -M and -C options require O(n^2) processing time where n is the number of potential rename/copy targets. This option prevents rename/copy detection from running if the number of rename/copy targets exceeds the specified number.

Given that we have, in fact, never renamed any files in the history of the project, I'm wondering exactly why it thinks that the number of potential rename/copy targets isn't zero. The whole thing smells broken to me, which is why I am unhappy about the idea of suddenly starting to depend on it in a big way.

regards, tom lane
Re: [HACKERS] renaming contrib. (was multi-platform, multi-locale regression tests)
On Thu, Nov 11, 2010 at 17:24, Tom Lane t...@sss.pgh.pa.us wrote:
> Given that we have, in fact, never renamed any files in the history of the project, I'm wondering exactly why it thinks that the number of potential rename/copy targets isn't zero. The whole thing smells broken to me, which is why I am unhappy about the idea of suddenly starting to depend on it in a big way.

Because git doesn't do rename tracking at all -- a rename operation is no different from a delete+add operation. Instead it tracks how lines of code move around in the tree: https://git.wiki.kernel.org/index.php/GitFaq#Why_does_git_not_.22track.22_renames.3F

Regards,
Marti
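For what it's worth, the limit quoted above can be raised persistently rather than per invocation; a quick demonstration in a throwaway repository (the value 5000 is arbitrary, and these are the real diff.renameLimit / merge.renameLimit settings mentioned earlier in the thread):

```shell
# Demonstration in a throwaway repository.
tmp=$(mktemp -d) && cd "$tmp" && git init -q demo && cd demo

# Cap the number of rename/copy candidates considered by diff and merge;
# these correspond to the -l<num> option described above.
git config diff.renameLimit 5000
git config merge.renameLimit 5000

# Confirm the setting took effect:
git config diff.renameLimit
```

Whether cherry-pick's internal merge honors merge.renameLimit is exactly the open question here; the config at least removes the limit for the diff- and merge-family commands that document it.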
Re: [HACKERS] renaming contrib. (was multi-platform, multi-locale regression tests)
Aidan Van Dyk ai...@highrise.ca writes:
> Can you share what commit you were trying to cherry-pick, and what your resulting commit was? I can try and take a quick look at them and see if there is something obviously fishy with how git's trying to merge the new commit on the old tree...

See yesterday's line_construct_pm() patches. I committed in HEAD and then did "git cherry-pick master" in each back branch. These all worked, which would be the minimum expectation for a single-file patch against a function that hasn't changed since 1999. But in the older branches it bleated about shutting off rename detection because of too many files (sorry, don't have the exact message in front of me, but that was the gist of it). Not the sort of thing that gives one a warm feeling about the tool. I've seen this before when trying to use git cherry-pick, but I forget on which other patches exactly.

Oh, for the record:

$ git --version
git version 1.7.3

regards, tom lane
Re: [HACKERS] renaming contrib. (was multi-platform, multi-locale regression tests)
Marti Raudsepp ma...@juffo.org writes:
> On Thu, Nov 11, 2010 at 17:24, Tom Lane t...@sss.pgh.pa.us wrote:
> > Given that we have, in fact, never renamed any files in the history of the project, I'm wondering exactly why it thinks that the number of potential rename/copy targets isn't zero.
>
> Because git doesn't do rename tracking at all -- a rename operation is no different from a delete+add operation. Instead it tracks how lines of code move around in the tree: https://git.wiki.kernel.org/index.php/GitFaq#Why_does_git_not_.22track.22_renames.3F

Hmmm ... so rename tracking is O(N^2) in the total number of patches applied, or lines patched, or some such measure, between the branches you're trying to patch between? Ugh. Doesn't sound like something we want to grow dependent on.

regards, tom lane
Re: [HACKERS] renaming contrib. (was multi-platform, multi-locale regression tests)
On 11/11/2010 10:17 AM, Aidan Van Dyk wrote:
> > We should adopt that philosophy. I suggest we limit all tables in future to 1m rows in the interests of speed.
>
> As long as it's configurable, and if it would make operations on smaller tables faster, then go for it. And we should by default limit shared_buffers to 32MB. Oh wait. There are always tradeoffs when picking defaults, à la postgresql.conf. We as a community are generally pretty quick to pick up the "the defaults are very conservative, make sure you tune ..." song when people complain about pg being too slow ;-)

Well, I was of course being facetious. But since you mention it, Postgres is conservative about its defaults because it's a server. I don't think quite the same considerations apply to developer software that will be running on a workstation. And Tom's complaint was about what he saw as incorrect behavior. Our defaults might hurt performance, but I don't think they trade speed for incorrect behavior.

Anyway, revenons à nos moutons.

cheers

andrew
Re: [HACKERS] Re: [COMMITTERS] pgsql: Don't unblock SIGQUIT in the SIGQUIT handler This was possibly
On ons, 2010-11-10 at 20:30 +0900, Fujii Masao wrote:
> On Thu, Dec 17, 2009 at 8:05 AM, Peter Eisentraut pet...@postgresql.org wrote:
> > Log Message:
> > -----------
> > Don't unblock SIGQUIT in the SIGQUIT handler
> >
> > This was possibly linked to a deadlock-like situation in glibc syslog code invoked by the ereport call in quickdie(). In any case, a signal handler should not unblock its own signal unless there is a specific reason to.
> >
> > Modified Files:
> > --------------
> > pgsql/src/backend/tcop: postgres.c (r1.577 -> r1.578) (http://anoncvs.postgresql.org/cvsweb.cgi/pgsql/src/backend/tcop/postgres.c?r1=1.577&r2=1.578)
> > pgsql/src/include/libpq: pqsignal.h (r1.35 -> r1.36) (http://anoncvs.postgresql.org/cvsweb.cgi/pgsql/src/include/libpq/pqsignal.h?r1=1.35&r2=1.36)
>
> Why wasn't this patch backported? Recently my customer encountered the bug which this patch fixed, in 8.3.

It was at the time a perhaps experimental behavioral change. I'm also running an 8.3 version with this patched in, though.
Re: [HACKERS] wCTE behaviour
On Thu, Nov 11, 2010 at 04:15:34AM +0200, Marko Tiikkaja wrote: Hi all, The discussion around wCTE during the last week or so has brought to my attention that we don't actually have a consensus on how exactly wCTEs should behave. The question seems to be whether or not a statement should see the modifications of statements ran before it. While I think making the modifications visible would be a lot more intuitive, it's not clear how we'd optimize the execution in the future without changing the behaviour (triggers are a big concern). +1 for letting writeable CTEs see the results of previous CTEs, just as current non-writeable ones do. A lot of the useful cases for this feature depend on this visibility. Cheers, David. -- David Fetter da...@fetter.org http://fetter.org/ Phone: +1 415 235 3778 AIM: dfetter666 Yahoo!: dfetter Skype: davidfetter XMPP: david.fet...@gmail.com iCal: webcal://www.tripit.com/feed/ical/people/david74/tripit.ics Remember to vote! Consider donating to Postgres: http://www.postgresql.org/about/donate -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] improved parallel make support
Dave Page wrote: Thanks - installed. As a matter of policy, I do not want to drop support for a FOSS build tool chain on Windows if at all avoidable. Nor I, however I only have limited time to dedicate to that goal. One thing to think about is that since PostGIS uses MingW/PGXS on Windows, we use MingW builds in order to generate the Makefiles we need (there is no native MSVC build for Windows). Not being able to do this would cause us great inconvenience :( ATB, Mark. -- Mark Cave-Ayland - Senior Technical Architect PostgreSQL - PostGIS Sirius Corporation plc - control through freedom http://www.siriusit.co.uk t: +44 870 608 0063 Sirius Labs: http://www.siriusit.co.uk/labs -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] wCTE behaviour
On 2010-11-11 6:41 PM +0200, David Fetter wrote: On Thu, Nov 11, 2010 at 04:15:34AM +0200, Marko Tiikkaja wrote: The discussion around wCTE during the last week or so has brought to my attention that we don't actually have a consensus on how exactly wCTEs should behave. The question seems to be whether or not a statement should see the modifications of statements ran before it. While I think making the modifications visible would be a lot more intuitive, it's not clear how we'd optimize the execution in the future without changing the behaviour (triggers are a big concern). +1 for letting writeable CTEs see the results of previous CTEs, just as current non-writeable ones do. A lot of the useful cases for this feature depend on this visibility. Just to be clear, the main point is whether they see the data modifications or not. The simplest case to point out this behaviour is: WITH t AS (DELETE FROM foo) SELECT * FROM foo; And the big question is: what state of foo should the SELECT statement see? Regards, Marko Tiikkaja -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] improved parallel make support
On 11/11/2010 11:43 AM, Mark Cave-Ayland wrote: Dave Page wrote: Thanks - installed. As a matter of policy, I do not want to drop support for a FOSS build tool chain on Windows if at all avoidable. Nor I, however I only have limited time to dedicate to that goal. One thing to think about is that since PostGIS uses MingW/PGXS on Windows, we use MingW builds in order to generate the Makefiles we need (there is no native MSVC build for Windows). Not being able to do this would cause us great inconvenience :( Interesting. Doesn't EDB's PostgresPlus package include PostGIS, and isn't its Windows version built with MSVC? cheers andrew
Re: [HACKERS] wCTE behaviour
On 11 November 2010 16:50, Marko Tiikkaja marko.tiikk...@cs.helsinki.fi wrote: On 2010-11-11 6:41 PM +0200, David Fetter wrote: On Thu, Nov 11, 2010 at 04:15:34AM +0200, Marko Tiikkaja wrote: The discussion around wCTE during the last week or so has brought to my attention that we don't actually have a consensus on how exactly wCTEs should behave. The question seems to be whether or not a statement should see the modifications of statements ran before it. While I think making the modifications visible would be a lot more intuitive, it's not clear how we'd optimize the execution in the future without changing the behaviour (triggers are a big concern). +1 for letting writeable CTEs see the results of previous CTEs, just as current non-writeable ones do. A lot of the useful cases for this feature depend on this visibility. Just to be clear, the main point is whether they see the data modifications or not. The simplest case to point out this behaviour is: WITH t AS (DELETE FROM foo) SELECT * FROM foo; And the big question is: what state of foo should the SELECT statement see? I would expect that select to return nothing. And if the user wished to reference what was deleted, they could use RETURNING anyway. /probable ignorance WITH t AS (UPDATE foo SET col = true) SELECT * FROM foo WHERE col = false; ... Wouldn't it be more practical to have foo's UPDATEs applied prior to the SELECT? Otherwise what would the use case be? -- Thom Brown Twitter: @darkixion IRC (freenode): dark_ixion Registered Linux user: #516935
Re: [HACKERS] improved parallel make support
On Thu, Nov 11, 2010 at 4:51 PM, Andrew Dunstan and...@dunslane.net wrote: Interesting. Doesn't EDB's PostgresPlus package include PostGIS, and isn't its Windows version built with MSVC? Yes - it's a PITA as we have to have a dummy build of the server in mingw/msys to compile PostGIS and Slony. We're probably going to be looking at that in the not-too-distant future as we want 64bit builds of both and will be using VC++. -- Dave Page Blog: http://pgsnake.blogspot.com Twitter: @pgsnake EnterpriseDB UK: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: [HACKERS] wCTE behaviour
Marko Tiikkaja marko.tiikk...@cs.helsinki.fi writes: On 2010-11-11 6:41 PM +0200, David Fetter wrote: On Thu, Nov 11, 2010 at 04:15:34AM +0200, Marko Tiikkaja wrote: The discussion around wCTE during the last week or so has brought to my attention that we don't actually have a consensus on how exactly wCTEs should behave. The question seems to be whether or not a statement should see the modifications of statements ran before it. +1 for letting writeable CTEs see the results of previous CTEs, just as current non-writeable ones do. A lot of the useful cases for this feature depend on this visibility. Just to be clear, the main point is whether they see the data modifications or not. The simplest case to point out this behaviour is: WITH t AS (DELETE FROM foo) SELECT * FROM foo; And the big question is: what state of foo should the SELECT statement see? You've already predetermined the outcome of the argument by phrasing it that way: if you assume that the CTE runs before the main statement then the conclusion is foregone. To my mind, they should be thought of as running in parallel, or at least in an indeterminate order, just exactly the same way that different data modifications made in a single INSERT/UPDATE/DELETE command are considered to be made simultaneously. If someone came to us and complained because his ON UPDATE trigger couldn't reliably see changes made to other rows by the same UPDATE command, and could we please make UPDATE more deterministic, we'd tell him to rethink what he was doing. This is the same thing. It is already the case that a user who pushes on things hard enough can see that a WITH isn't really run before the main command. 
For example,

regression=# create sequence s1;
CREATE SEQUENCE
regression=# with tt(x,y) as (select x, nextval('s1') from generate_series(1,10) x)
regression-# select x, y, nextval('s1') as z from tt;
 x  | y  | z
----+----+----
  1 |  1 |  2
  2 |  3 |  4
  3 |  5 |  6
  4 |  7 |  8
  5 |  9 | 10
  6 | 11 | 12
  7 | 13 | 14
  8 | 15 | 16
  9 | 17 | 18
 10 | 19 | 20
(10 rows)

If we establish a precedent that WITHs can be thought of as executing before the main command, we will eventually have to de-optimize existing WITH behavior. Or else make up reasons why the inconsistency is okay in some cases and not others, but that will definitely be a case of rationalizing after the fact. regards, tom lane
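Tom's session can be mimicked outside the database. Here is a minimal Python sketch (purely illustrative, not the executor's actual implementation) of why the CTE's nextval() calls interleave with the outer query's: the CTE is pulled one row at a time rather than materialized up front.

```python
from itertools import count

nextval = count(1)  # stand-in for the sequence s1

def tt():
    # the CTE: pairs each x with a sequence value, produced lazily
    for x in range(1, 11):
        yield x, next(nextval)

# The outer query pulls one CTE row, then evaluates its own nextval(),
# so the sequence values interleave: y takes the odd values, z the even ones.
rows = [(x, y, next(nextval)) for x, y in tt()]
print(rows[:3])  # [(1, 1, 2), (2, 3, 4), (3, 5, 6)]
```

Materializing `tt()` fully before the outer loop would instead give y = 1..10 and z = 11..20, which is exactly the de-optimization Tom warns about.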
Re: [HACKERS] wCTE behaviour
On Nov 11, 2010, at 9:13 AM, Tom Lane wrote: If we establish a precedent that WITHs can be thought of as executing before the main command, we will eventually have to de-optimize existing WITH behavior. Or else make up reasons why the inconsistency is okay in some cases and not others, but that will definitely be a case of rationalizing after the fact. I can see that, but if one can't see the result of the write, or can't determine whether or not it will be visible in advance, what's the point of writeable CTEs? Best, David -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] improved parallel make support
Dave Page wrote: On Thu, Nov 11, 2010 at 4:51 PM, Andrew Dunstan and...@dunslane.net wrote: Interesting. Doesn't EDB's PostgresPlus package include PostGIS, and isn't its Windows version build with MSVC? Yes - it's a PITA as we have to have a dummy build of the server in mingw/msys to compile PostGIS and Slony. We're probably going to be looking at that in the not-to-distant future as we want 64bit builds of both and will be using VC++. Just for the record, a lot of work was done in the 1.4 release series to make MSVC builds possible, and indeed several people have reported success: http://postgis.refractions.net/pipermail/postgis-devel/2009-March/005102.html http://postgis.refractions.net/pipermail/postgis-devel/2010-September/010299.html The two main outstanding issues as I see it are: 1) The GTK-based GUI for shp2pgsql (although if someone wanted to sponsor work to convert to wxWidgets to bring us in line with pgAdmin, that would be strongly considered). 2) Maintenance of the MSVC build system. So far we have had some complaints about not using MSVC, but then no-one has stepped up to maintain the build system for it. Forcing all existing developers to suddenly start maintaining the Windows build is a total non-starter. My hope is that one day CMake will enable us to come up with a universal solution, but we're some way from that yet. ATB, Mark. -- Mark Cave-Ayland - Senior Technical Architect PostgreSQL - PostGIS Sirius Corporation plc - control through freedom http://www.siriusit.co.uk t: +44 870 608 0063 Sirius Labs: http://www.siriusit.co.uk/labs -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] MULTISET and additional functions for ARRAY
On Nov 11, 2010, at 7:02 AM, Itagaki Takahiro wrote: MULTISET support is more difficult. We have corresponding type IDs for each array, but we might not want to add additional IDs for multisets for each type. Any ideas for the issue? Why not? If we reuse the type IDs of arrays for multisets, the multisets would have some special typmod. For example, typmod = 0 means multiset, and a positive value means an array with that max cardinality. Note that the SQL standard doesn't mention multi-dimensional arrays. So, we can use typmod = -1 as a free-size and free-dimensional array for backward compatibility. If there are troublesome issues in supporting multiset data types, I'm thinking of adding, at first, multiset functions that receive ARRAY types instead, because an ARRAY is a MULTISET by definition. An array is a superset of MULTISET, I guess? Some of the functions for multisets seem to be useful for arrays, too. Comments and suggestions welcome. So are you planning to implement multisets? It's a feature I'd love to see… Best, David
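The typmod convention Itagaki proposes can be made concrete with a small sketch. Note that `describe_typmod` is a hypothetical helper invented for illustration, not an existing PostgreSQL function:

```python
def describe_typmod(typmod):
    """Decode the proposed typmod convention for reusing array type IDs:
    -1 = free-size, free-dimensional array (backward compatible),
     0 = multiset,
    >0 = array with that maximum cardinality."""
    if typmod == -1:
        return "array (unbounded, any dimension)"
    if typmod == 0:
        return "multiset"
    if typmod > 0:
        return f"array (max cardinality {typmod})"
    raise ValueError("invalid typmod")

print(describe_typmod(-1))  # array (unbounded, any dimension)
print(describe_typmod(0))   # multiset
print(describe_typmod(5))   # array (max cardinality 5)
```

The design choice here is that existing arrays keep typmod -1 untouched, so only code that explicitly asks for multiset or bounded-array behavior sees anything new.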
Re: [HACKERS] wCTE behaviour
David E. Wheeler da...@kineticode.com writes: I can see that, but if one can't see the result of the write, or can't determine whether or not it will be visible in advance, what's the point of writeable CTEs? The writeable CTE returns a RETURNING set, which you can and should use in the outer query. The thing that is being argued about here is what you see if you look directly at the target table rather than making use of RETURNING. Essentially, I'm arguing that we shouldn't promise any particular behavior at that level, just as we don't promise that UPDATE updates different rows in any determinate order. regards, tom lane -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] wCTE behaviour
Thom Brown t...@linux.com writes: WITH t AS (UPDATE foo SET col = true) SELECT * FROM foo WHERE col = false; ... Wouldn't this be more practical to have foo's UPDATEs applied prior to SELECT? Otherwise what would the usecase be? If that's what you want, you might as well just issue two separate statements. There is no use-case for this at all unless the WITH produces some RETURNING data that the SELECT makes use of. regards, tom lane -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] wCTE behaviour
On 11 Nov 2010, at 19:13, Tom Lane t...@sss.pgh.pa.us wrote: Marko Tiikkaja marko.tiikk...@cs.helsinki.fi writes: On 2010-11-11 6:41 PM +0200, David Fetter wrote: On Thu, Nov 11, 2010 at 04:15:34AM +0200, Marko Tiikkaja wrote: The discussion around wCTE during the last week or so has brought to my attention that we don't actually have a consensus on how exactly wCTEs should behave. The question seems to be whether or not a statement should see the modifications of statements ran before it. +1 for letting writeable CTEs see the results of previous CTEs, just as current non-writeable ones do. A lot of the useful cases for this feature depend on this visibility. Just to be clear, the main point is whether they see the data modifications or not. The simplest case to point out this behaviour is: WITH t AS (DELETE FROM foo) SELECT * FROM foo; And the big question is: what state of foo should the SELECT statement see? You've already predetermined the outcome of the argument by phrasing it that way: if you assume that the CTE runs before the main statement then the conclusion is foregone. To my mind, they should be thought of as running in parallel, or at least in an indeterminate order, just exactly the same way that different data modifications made in a single INSERT/UPDATE/DELETE command are considered to be made simultaneously. .. If we establish a precedent that WITHs can be thought of as executing before the main command, we will eventually have to de-optimize existing WITH behavior. Or else make up reasons why the inconsistency is okay in some cases and not others, but that will definitely be a case of rationalizing after the fact. I apologize, I had misunderstood what you are suggesting. But now that I do, it seems to be an even worse idea to go your way. Based on my research, I'm almost certain that the SQL standard says that the execution order is deterministic if there is at least one DML statement in the WITH list. Can anyone confirm this? 
Regards, Marko Tiikkaja
Re: [HACKERS] wCTE behaviour
On Thu, Nov 11, 2010 at 12:13 PM, Tom Lane t...@sss.pgh.pa.us wrote: then the conclusion is foregone. To my mind, they should be thought of as running in parallel, or at least in an indeterminate order, just exactly the same way that different data modifications made in a single INSERT/UPDATE/DELETE command are considered to be made simultaneously. +1 merlin
Re: [HACKERS] wCTE behaviour
On Nov 11, 2010, at 9:29 AM, Tom Lane wrote: I can see that, but if one can't see the result of the write, or can't determine whether or not it will be visible in advance, what's the point of writeable CTEs? The writeable CTE returns a RETURNING set, which you can and should use in the outer query. The thing that is being argued about here is what you see if you look directly at the target table rather than making use of RETURNING. Essentially, I'm arguing that we shouldn't promise any particular behavior at that level, just as we don't promise that UPDATE updates different rows in any determinate order. Yes, if RETURNING guarantees the execution order, then great. That was the first thing I tried to do before I realized that the current CTE implementation doesn't support w. David -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] MULTISET and additional functions for ARRAY
David E. Wheeler da...@kineticode.com writes: So are you planning to implement multisets? It's a feature I'd love to see What actual functionality does it buy? AFAICT from Itagaki-san's description, it's an array only you ignore the specific element order. So what? You can write functions that work that way now. regards, tom lane -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] MULTISET and additional functions for ARRAY
On Nov 11, 2010, at 10:05 AM, Tom Lane wrote: So are you planning to implement multisets? It's a feature I'd love to see What actual functionality does it buy? AFAICT from Itagaki-san's description, it's an array only you ignore the specific element order. So what? You can write functions that work that way now. Also, no dupes. David
Re: [HACKERS] wCTE behaviour
David E. Wheeler da...@kineticode.com writes: On Nov 11, 2010, at 9:29 AM, Tom Lane wrote: The writeable CTE returns a RETURNING set, which you can and should use in the outer query. The thing that is being argued about here is what you see if you look directly at the target table rather than making use of RETURNING. Essentially, I'm arguing that we shouldn't promise any particular behavior at that level, just as we don't promise that UPDATE updates different rows in any determinate order. Yes, if RETURNING guarantees the execution order, then great. That was the first thing I tried to do before I realized that the current CTE implementation doesn't support w. Well, it doesn't guarantee the execution order, it's just that that's the defined conduit for getting information out of the WITH and into the parent query. Looking directly at the table is not that conduit. I misspoke by saying that the behavior would be nondeterministic. What I think we should do is run all elements of the tree with the same snapshot, which would provide perfectly deterministic behavior: if you look at the target table, you see the prior state. You don't see the updated state, which is what allows us to possibly optimize things so that the updates aren't completely made before execution of the main query starts. regards, tom lane -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
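Tom's same-snapshot proposal can be sketched in miniature. This is a Python illustration only; `run_wcte`, the list-based "tables", and the predicate are all invented for the example. The point it demonstrates: the DELETE's effects reach the outer query only through its RETURNING set, while a direct scan of the target table sees the pre-statement snapshot.

```python
foo = [1, 2, 3]

def run_wcte(table, predicate):
    # Take one snapshot for the whole statement.
    snapshot = list(table)
    # The CTE deletes from the real table and collects its RETURNING rows.
    returning = [row for row in table if predicate(row)]
    table[:] = [row for row in table if not predicate(row)]
    # The outer SELECT scans the *snapshot*, so it still sees every row.
    outer_select = list(snapshot)
    return returning, outer_select

deleted, seen = run_wcte(foo, lambda r: True)
print(deleted, seen, foo)  # [1, 2, 3] [1, 2, 3] []
```

Deterministic, but deliberately not "CTE runs first": the outer scan never observes the deletion, which is what leaves the optimizer free to interleave the two.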
Re: [HACKERS] B-tree parent pointer and checkpoints
On 11.11.2010 17:16, Tom Lane wrote: Heikki Linnakangas heikki.linnakan...@enterprisedb.com writes: GiST is different. When you insert a key to a leaf page, you (sometimes) need to adjust the parent pointer to reflect the new key as well. B-tree tolerates incomplete splits with the 'next page' pointer, but that is not applicable to gist. Teodor described the issue back in 2005 when WAL-logging was added to GiST (http://archives.postgresql.org/pgsql-hackers/2005-06/msg00555.php): Reading that I wonder: what harm would an incomplete insert cause if we just left it in the tree? Imagine that you insert a key to a leaf page, but crash before updating the parent. If you search for the key starting from the root, you'll fail to find it, because the parent pointer claims that there are no entries with such a key on the child page. But that's OK, the inserting transaction aborted with the crash! I think it'd be okay as far as that one entry is concerned, since as you say it doesn't matter whether a search finds it. (We'd have to be sure that VACUUM would still find it to remove it, of course, but that doesn't use a normal search.) You're right that it poses a hazard of subsequent inserts deciding that they don't need to do work on upper levels because the lower ones look OK already. But depending on the details of the search algorithm, this might be a non-problem: if you remember that the upper level entries didn't cover your key when you descended, you'd still know you need to recompute them. Hmm, we don't currently keep track of that when we descend the tree to choose the target page, but perhaps an extra Consistent call to check that would be acceptable. We already call Penalty for every tuple on each internal node on the way, so compared to that one more call should not be too expensive. 
If we do that, I think it would simplify the algorithm quite a bit to just update all the parents on the way down, instead of traversing up from the bottom after inserting the tuple to the leaf. Something else I just noticed is that WAL replay isn't capable of completely fixing the index anyway: * To complete insert we can't use basic insertion algorithm because * during insertion we can't call user-defined support functions of opclass. * So, we insert 'invalid' tuples without real key and do it by separate algorithm. * 'invalid' tuple should be updated by vacuum full. Given that there's no more vacuum full, and nobody has been expected to run it routinely for a long time anyway, this fixup approach seems pretty completely broken anyhow. The 'invalid' tuples don't affect correctness, but are a drag on performance, so they are similar to incomplete b-tree splits. I suspect the overhead of an invalid gist pointer is much bigger than the overhead of an incomplete b-tree split, though. I agree we should get rid of that, it's not comforting to get a stream of messages in the logs saying you should run VACUUM FULL. -- Heikki Linnakangas EnterpriseDB http://www.enterprisedb.com -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
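The "update all the parents on the way down" idea can be illustrated with 1-D intervals standing in for GiST union keys. This is a toy sketch: the `Node` class, the interval keys, and the `penalty` function are invented for the example, whereas real GiST delegates Union and Penalty to user-defined opclass support functions. The point is that enlarging each node's key while descending leaves no incomplete-insert state to fix up later.

```python
class Node:
    def __init__(self, key, children=None, leaf_values=None):
        self.key = key                      # (lo, hi) interval covering the subtree
        self.children = children or []
        self.leaf_values = leaf_values or []

def union(key, value):
    lo, hi = key
    return (min(lo, value), max(hi, value))

def insert(node, value):
    # Enlarge this node's key on the way down, so no upward pass is needed.
    node.key = union(node.key, value)
    if not node.children:
        node.leaf_values.append(value)
        return
    # Choose the child needing the least enlargement (the "penalty").
    def penalty(child):
        lo, hi = union(child.key, value)
        return (hi - lo) - (child.key[1] - child.key[0])
    insert(min(node.children, key=penalty), value)

leaf = Node((0, 10))
root = Node((0, 10), children=[leaf])
insert(root, 25)
print(root.key, leaf.key)  # (0, 25) (0, 25)
```

A crash between the two key updates leaves the parent's key too large, which merely costs a few false-positive descents in searches; it never hides an entry, which is why the top-down order is crash-safe.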
Re: [HACKERS] MULTISET and additional functions for ARRAY
I think that it would be best to implement MULTISET in the same way that a TABLE is implemented. Logically and structurally they are the same thing, except that a MULTISET is typically used as a field value of a table row. Aka, a table and a multiset are just different names for a relation, loosely speaking. The association of a multiset-typed attribute of a table with said table is like the association of a child and parent table in a many-to-one. So reuse your structure for tables to hold multisets. -- Darren Duncan Itagaki Takahiro wrote: Postgres supports ARRAY data types well, but there are some more array functions in the SQL standard. Also, the standard has a MULTISET data type, which is an unordered array. It looks easy to support the additional array functions. There might be some confusion in treating multi-dimensional arrays with them, but we could treat all arrays as one-dimensional, as unnest() does. MULTISET support is more difficult. We have corresponding type IDs for each array, but we might not want to add additional IDs for multisets for each type. Any ideas for the issue? If we reuse the type IDs of arrays for multisets, the multisets would have some special typmod. For example, typmod = 0 means multiset, and a positive value means an array with that max cardinality. Note that the SQL standard doesn't mention multi-dimensional arrays. So, we can use typmod = -1 as a free-size and free-dimensional array for backward compatibility. If there are troublesome issues in supporting multiset data types, I'm thinking of adding, at first, multiset functions that receive ARRAY types instead, because an ARRAY is a MULTISET by definition. Some of the functions for multisets seem to be useful for arrays, too. Comments and suggestions welcome.
Re: [HACKERS] MULTISET and additional functions for ARRAY
2010/11/11 David E. Wheeler da...@kineticode.com: On Nov 11, 2010, at 10:05 AM, Tom Lane wrote: So are you planning to implement multisets? It's a feature I'd love to see What actual functionality does it buy? AFAICT from Itagaki-san's description, it's an array only you ignore the specific element order. So what? You can write functions that work that way now. Also, no dupes. The multi in multiset indicates that duplicate elements are explicitly allowed and tracked. Nicolas -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
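Nicolas's point is easy to demonstrate with Python's `collections.Counter`, which is essentially a multiset: equality ignores order but not multiplicity.

```python
from collections import Counter

# A multiset ignores order but keeps duplicate counts, unlike a set.
a = Counter(['x', 'x', 'y'])
b = Counter(['y', 'x', 'x'])
print(a == b)                    # True: order is irrelevant
print(a['x'])                    # 2: duplicates are tracked
print(a != Counter(['x', 'y']))  # True: cardinality matters
```

This is also why "just use an array and ignore order" isn't the whole story: two arrays with the same distinct elements but different duplicate counts are different multisets.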
Re: [HACKERS] B-tree parent pointer and checkpoints
Heikki Linnakangas heikki.linnakan...@enterprisedb.com writes: Hmm, we don't currently keep track of that when we descend the tree to choose the target page, but perhaps an extra Consistent call to check that would be acceptable. We already call Penalty for every tuple on each internal node on the way, so compared to that one more call should not be too expensive. If we do that, I think it would simplify the algorithm quite a bit to just update all the parents on the way down, instead of traversing up from the bottom after inserting the tuple to the leaf. Oh, that's a really good idea, I think. But what about page splits? I guess in the case of a split, you'd be replacing the parent entry anyway, so having previously updated it to something larger doesn't really cause a problem other than wasting a few cycles --- which are probably still less than you save by not having to traverse back up. If we supported UNIQUE GIST indexes then you could criticize this plan on the grounds that parent entries would get uselessly enlarged before detecting a uniqueness failure; but we don't and I know of no plans to. So on the whole I think it sounds good. Teodor, what do you think? regards, tom lane -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] MULTISET and additional functions for ARRAY
On Nov 11, 2010, at 10:19 AM, Darren Duncan wrote: I think that it would be best to implement MULTISET in the same way that a TABLE is implemented. Logically and structurally they are the same thing, but that a MULTISET typically is used as a field value of a table row. Aka, a table and a multiset are just different names for a relation, loosely speaking. That sounds like a composite type to me. Best, David -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] MULTISET and additional functions for ARRAY
On Nov 11, 2010, at 10:24 AM, Nicolas Barbier wrote: Also, no dupes. The multi in multiset indicates that duplicate elements are explicitly allowed and tracked. D'oh! Right. D
Re: [HACKERS] wCTE behaviour
On Thu, Nov 11, 2010 at 12:36:38PM -0500, Merlin Moncure wrote: On Thu, Nov 11, 2010 at 12:13 PM, Tom Lane t...@sss.pgh.pa.us wrote: then the conclusion is foregone. To my mind, they should be thought of as running in parallel, or at least in an indeterminate order, just exactly the same way that different data modifications made in a single INSERT/UPDATE/DELETE command are considered to be made simultaneously. +1 -1. When people want to see what has gone before, they can use RETURNING clauses. With the indeterminate order proposal, they cannot. Cheers, David.
Re: [HACKERS] wCTE behaviour
On Thu, Nov 11, 2010 at 12:34:55PM -0500, Tom Lane wrote: Thom Brown t...@linux.com writes: WITH t AS (UPDATE foo SET col = true) SELECT * FROM foo WHERE col = false; ... Wouldn't this be more practical to have foo's UPDATEs applied prior to SELECT? Otherwise what would the usecase be? If that's what you want, you might as well just issue two separate statements. There is no use-case for this at all unless the WITH produces some RETURNING data that the SELECT makes use of. There are lots of use cases where it does exactly this. One simple example is maintaining a rollup table, so as less-rolled data get deleted, they get aggregated into an INSERT into that table. Think of RRDtool, only with a real data store. Cheers, David.
Re: [HACKERS] wCTE behaviour
David Fetter da...@fetter.org writes: On Thu, Nov 11, 2010 at 12:36:38PM -0500, Merlin Moncure wrote: On Thu, Nov 11, 2010 at 12:13 PM, Tom Lane t...@sss.pgh.pa.us wrote: then the conclusion is foregone. To my mind, they should be thought of as running in parallel, or at least in an indeterminate order, just exactly the same way that different data modifications made in a single INSERT/UPDATE/DELETE command are considered to be made simultaneously. +1 -1. When people want to see what has gone before, they can use RETURNING clauses. With the indeterminate order proposal, they cannot. Say what? The RETURNING data is well defined in any case. regards, tom lane -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] wCTE behaviour
On Thu, Nov 11, 2010 at 1:53 PM, David Fetter da...@fetter.org wrote: On Thu, Nov 11, 2010 at 12:36:38PM -0500, Merlin Moncure wrote: On Thu, Nov 11, 2010 at 12:13 PM, Tom Lane t...@sss.pgh.pa.us wrote: then the conclusion is foregone. To my mind, they should be thought of as running in parallel, or at least in an indeterminate order, just exactly the same way that different data modifications made in a single INSERT/UPDATE/DELETE command are considered to be made simultaneously. +1 -1. When people want to see what has gone before, they can use RETURNING clauses. With the indeterminate order proposal, they cannot. If you want to see what happened 'before' you *must* use a returning clause. It's the link that pipelines data from one query to another. There is in fact no 'before', just a way to define hook output into input. ISTM you have a lot more available routes of CTE optimization if you go this way. but, can you present an example of a case that depends on execution order w/o returning? maybe I'm not seeing something... merlin -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] wCTE behaviour
David Fetter da...@fetter.org writes: On Thu, Nov 11, 2010 at 12:34:55PM -0500, Tom Lane wrote: If that's what you want, you might as well just issue two separate statements. There is no use-case for this at all unless the WITH produces some RETURNING data that the SELECT makes use of. There are lots of use cases where it does exactly this. Name *one*. If there is no RETURNING data, there is absolutely no reason to use WITH instead of issuing the query separately. In fact, I would assume that a DML query without RETURNING would not even be syntactically legal in WITH. One simple example is maintaining a rollup table, so as less-rolled data get deleted, they get aggregated into an INSERT into that table. Yes, exactly. The way you would do that is something like WITH del AS (DELETE FROM foo WHERE whatever RETURNING *) INSERT INTO rollup SELECT * FROM del; I am very interested to see how you will do the same thing without using RETURNING and with the behavior you claim to want that the DELETE is visibly complete before the INSERT starts. Where's the INSERT gonna get the already-deleted data from? With my proposal (ie, both queries using same snapshot) you could actually do it without RETURNING, like this: WITH useless_cte AS (DELETE FROM foo WHERE whatever) INSERT INTO rollup SELECT * FROM foo WHERE same-whatever; But I don't see any reason to think that that's a superior way to write the query, especially since it might be subject to weird race conditions against other concurrent modifications of the table. RETURNING is just a lot saner way to be sure that you're looking at exactly what the DELETE deleted. regards, tom lane -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
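The rollup pattern Tom describes can be written out concretely. A minimal sketch, assuming the wCTE feature under discussion and two hypothetical tables (names invented for illustration): a detail table `measurements` and a `daily_rollup` table.

```sql
-- Hypothetical schema, for illustration only.
CREATE TABLE measurements (ts timestamptz, val int);
CREATE TABLE daily_rollup (day date, total bigint);

-- Delete old detail rows and aggregate exactly the rows that were
-- deleted, using RETURNING to pipeline them into the INSERT.
WITH del AS (
    DELETE FROM measurements
    WHERE ts < now() - interval '30 days'
    RETURNING *
)
INSERT INTO daily_rollup
SELECT date_trunc('day', ts)::date, sum(val)
FROM del
GROUP BY 1;
```

Because the INSERT reads from the RETURNING set rather than re-scanning `measurements`, it aggregates exactly what the DELETE removed, independent of any concurrent modifications to the table.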
Re: [HACKERS] MULTISET and additional functions for ARRAY
Excerpts from David E. Wheeler's message of jue nov 11 15:45:55 -0300 2010: On Nov 11, 2010, at 10:19 AM, Darren Duncan wrote: I think that it would be best to implement MULTISET in the same way that a TABLE is implemented. Logically and structurally they are the same thing, but that a MULTISET typically is used as a field value of a table row. Aka, a table and a multiset are just different names for a relation, loosely speaking. That sounds like a composite type to me. No, it's perpendicular in the sense that while a composite type allows you to have different columns, this multiset thing lets you have rows (I initially thought about them as sets of scalars, but AFAIU they could in turn be rows) -- Álvaro Herrera alvhe...@commandprompt.com The PostgreSQL Company - Command Prompt, Inc. PostgreSQL Replication, Consulting, Custom Development, 24x7 support -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] renaming contrib. (was multi-platform, multi-locale regression tests)
On Thu, Nov 11, 2010 at 6:08 PM, Tom Lane t...@sss.pgh.pa.us wrote: Marti Raudsepp ma...@juffo.org writes: On Thu, Nov 11, 2010 at 17:24, Tom Lane t...@sss.pgh.pa.us wrote: Given that we have, in fact, never renamed any files in the history of the project, I'm wondering exactly why it thinks that the number of potential rename/copy targets isn't zero. Because git doesn't do rename tracking at all -- a rename operation is no different from a delete+add operation. Instead it tracks how lines of code move around in the tree: https://git.wiki.kernel.org/index.php/GitFaq#Why_does_git_not_.22track.22_renames.3F Hmmm ... so rename tracking is O(N^2) in the total number of patches applied, or lines patched, or some such measure, between the branches you're trying to patch between? Ugh. Doesn't sound like something we want to grow dependent on. No, it's dependent on the number of files changed between two trees. It does not analyze history when doing rename tracking. The default limit is 200. It should be easy to calculate what's needed for Postgres. -- marko -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] improved parallel make support
On Thu, Nov 11, 2010 at 5:19 PM, Mark Cave-Ayland mark.cave-ayl...@siriusit.co.uk wrote: Dave Page wrote: On Thu, Nov 11, 2010 at 4:51 PM, Andrew Dunstan and...@dunslane.net wrote: Interesting. Doesn't EDB's PostgresPlus package include PostGIS, and isn't its Windows version built with MSVC? Yes - it's a PITA as we have to have a dummy build of the server in mingw/msys to compile PostGIS and Slony. We're probably going to be looking at that in the not-too-distant future as we want 64-bit builds of both and will be using VC++. Just for the record, a lot of work was done in the 1.4 release series to make MSVC builds possible, and indeed several people have reported success: http://postgis.refractions.net/pipermail/postgis-devel/2009-March/005102.html http://postgis.refractions.net/pipermail/postgis-devel/2010-September/010299.html Cool - that will help. The two main outstanding issues as I see it are: 1) The GTK-based GUI for shp2pgsql (although if someone wanted to sponsor work to convert to wxWidgets to bring us in line with pgAdmin, that would be strongly considered). :-) 2) Maintenance of the MSVC build system. So far we have had some complaints about not using MSVC, but then no-one has stepped up to maintain the build system for it. Forcing all existing developers to suddenly start maintaining the Windows build is a total non-starter. Unless you're making major architectural changes, it shouldn't take any real effort to add/remove the occasional source file. I'm sure there are folks that could be persuaded to do that occasionally. My hope is that one day CMake will enable us to come up with a universal solution, but we're some way from that yet. We used CMake for a couple of projects, but ended up abandoning it for new stuff. It just didn't work as nicely as we wanted. 
-- Dave Page Blog: http://pgsnake.blogspot.com Twitter: @pgsnake EnterpriseDB UK: http://www.enterprisedb.com The Enterprise PostgreSQL Company -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] MULTISET and additional functions for ARRAY
On Nov 11, 2010, at 12:08 PM, Alvaro Herrera wrote: That sounds like a composite type to me. No, it's perpendicular in the sense that while a composite type allows you to have different columns, this multiset thing lets you have rows (I initially thought about them as sets of scalars, but AFAIU they could in turn be rows) How is that different from an array of RECORDs? Best, David -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] MULTISET and additional functions for ARRAY
On Thu, Nov 11, 2010 at 3:42 PM, David E. Wheeler da...@kineticode.com wrote: On Nov 11, 2010, at 12:08 PM, Alvaro Herrera wrote: That sounds like a composite type to me. No, it's perpendicular in the sense that while a composite type allows you to have different columns, this multiset thing lets you have rows (I initially thought about them as sets of scalars, but AFAIU they could in turn be rows) How is that different from an array of RECORDs? not a whole lot outside of syntax details...there is a decent summary here: http://waelchatila.com/2005/05/18/1116485743467.html I like this part: Alternatively the SQL standard also permits the same construct with the bracket trigraphs ??( and ??) :-D merlin -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] MULTISET and additional functions for ARRAY
Merlin Moncure wrote: On Thu, Nov 11, 2010 at 3:42 PM, David E. Wheeler da...@kineticode.com wrote: On Nov 11, 2010, at 12:08 PM, Alvaro Herrera wrote: That sounds like a composite type to me. No, it's perpendicular in the sense that while a composite type allows you to have different columns, this multiset thing lets you have rows (I initially thought about them as sets of scalars, but AFAIU they could in turn be rows) How is that different from an array of RECORDs? I could ask the same question about a TABLE, the ordering issue aside. This is one place that SQL made things more complicated than they needed to be. Multisets have generally the same structure *and* operators (union, etc) as tables, but they use different syntax for each. A better design would be to make tables and multisets interchangeable. It's an unnecessary distinction. not a whole lot outside of syntax details...there is a decent summary here: http://waelchatila.com/2005/05/18/1116485743467.html I like this part: Alternatively the SQL standard also permits the same construct with the bracket trigraphs ??( and ??) :-D As I recall, the concept of using stuff like ?( or ?) etc was so that SQL could be written in EBCDIC, which natively lacks some of the bracketing characters that ASCII has. Hence, such is an alternative way to spell either { } or [ ] (I forget which). -- Darren Duncan -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] improved parallel make support
On 11/11/2010 03:19 PM, Dave Page wrote: My hope is that one day CMake will enable us to come up with a universal solution, but we're some way from that yet. We used CMake for a couple of projects, but ended up abandoning it for new stuff. It just didn't work as nicely as we wanted. Yes, it's been discussed before here too and didn't really go anywhere :-( cheers andrew -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
[HACKERS] Restructuring plancache.c API
I've been thinking about supporting automatic replan of cached plans using specific parameter values, as has been discussed several times, at greatest length in this thread: http://archives.postgresql.org/pgsql-hackers/2010-02/msg00607.php There doesn't seem to be full consensus about what the control method ought to be, but right at the moment I'm thinking about mechanism not policy. I think that what we need to do is restructure the API of plancache.c to make it more amenable to returning throwaway plans. It can already do that to some extent using the fully_planned = false code path, but that's not the design center and it was shoehorned in in perhaps a less than clean fashion. I want to rearrange it so there's an explicit notion of three levels of cacheable object: 1. Raw parse tree + source string. These obviously never change. 2. The result tree of parsing and rewriting (ie, the output of pg_analyze_and_rewrite applied to level 1). This can change, but only as a result of schema changes on the tables and other objects referenced in the query. We already have entirely adequate mechanisms for recognizing when this has to be rebuilt. 3. The finished plan (ie, the output of pg_plan_queries applied to level 2). This might be either cached for reuse, or a throwaway object, depending on the control mechanism's decisions. I think we could get rid of the fully_planned switch and instead design the API around caching levels 1 and 2. Then there's a GetCachedPlan function (replacing RevalidateCachedPlan) that returns a finished plan, but it's unspecified whether you get a persistent cached plan or a throwaway one. The control mechanism would execute inside this function. We'd still have ReleaseCachedPlan, which would take care of throwing away the plan if it's throwaway. Right now the API is structured so that the initial creator of a cacheable plan has to build levels 2 and 3 first, and the plancache.c code just copies that data into persistent storage. 
I'm thinking that might have been a mistake. Maybe we should just have the caller hand over the data for level 1, with parse analysis + rewrite done solely internally within plancache.c. The level-2 data wouldn't be exposed outside plancache.c at all. With this focus, the name plancache becomes a little bit of a misnomer, but I am inclined to stick with it because a better name isn't apparent. rewritecache isn't an improvement really. Comments? regards, tom lane -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
[HACKERS] Delimited identifier behavior
Hi, I'm wondering if the following delimited identifier behavior is correct or not:

test=# create table t1(i int);
CREATE TABLE
test=# create table t1_foo(i int, j int);
CREATE TABLE
test=# select * from "t1";
 i
---
(0 rows)

test=# select * from "t1_foo";
 i | j
---+---
(0 rows)

test=# select * from "t1"_foo;
 i
---
(0 rows)

It seems PostgreSQL thinks "t1"_foo is equivalent to "t1". Is this an expected behavior? -- Tatsuo Ishii SRA OSS, Inc. Japan English: http://www.sraoss.co.jp/index_en.php Japanese: http://www.sraoss.co.jp -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Delimited identifier behavior
Tatsuo Ishii is...@postgresql.org wrote: It seems PostgreSQL thinks "t1"_foo is equivalent to "t1". It thinks you've given "t1" an alias of _foo in that query, same as if you'd had a space between "t1" and _foo. -Kevin -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Delimited identifier behavior
Tatsuo Ishii wrote: test=# select * from "t1"_foo; i --- (0 rows) It seems PostgreSQL thinks "t1"_foo is equivalent to "t1". Is this an expected behavior? That code looks badly written in any event. Delimiters should be put around each part of an identifier chain, or around the identifier as a whole, such as: select * from "t1_foo"; Or with schema-qualified objects, for example, any of these: "schema"."table", "schema".table, schema."table", schema.table. Personally, I treat all of my identifiers as being case-sensitive. Knowing that Pg treats non-delimited identifiers as being lowercase, I write them undelimited when the identifier is entirely lowercase, and I delimit ones that have any uppercase. By doing this consistently, everything works correctly. Since most of my identifiers are lowercase anyway, the code also reads cleanly in general. -- Darren Duncan -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Delimited identifier behavior
On 11/11/2010 06:03 PM, Tatsuo Ishii wrote: Hi, I'm wondering if the following delimited identifier behavior is correct or not: test=# create table t1(i int); CREATE TABLE test=# create table t1_foo(i int, j int); CREATE TABLE test=# select * from "t1"; i --- (0 rows) test=# select * from "t1_foo"; i | j ---+--- (0 rows) test=# select * from "t1"_foo; i --- (0 rows) It seems PostgreSQL thinks "t1"_foo is equivalent to "t1". Is this an expected behavior? It's treating _foo as an alias for "t1" in the query. So the behaviour is quite correct, I think. cheers andrew -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
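The parse becomes obvious once the implicit whitespace is written out. A minimal sketch, assuming a table created as `create table t1(i int)` and another as `create table t1_foo(i int, j int)`:

```sql
-- These three queries are parsed identically: table "t1" with alias _foo.
SELECT * FROM "t1"_foo;
SELECT * FROM "t1" _foo;
SELECT * FROM "t1" AS _foo;

-- To reference the table actually named t1_foo, delimit the whole
-- identifier (or write it undelimited, since it is all lowercase):
SELECT * FROM "t1_foo";
SELECT * FROM t1_foo;
```

The closing quote ends the identifier, so whatever follows starts a new token; no intervening space is required, just as `"t1"WHERE` would also lex as two tokens.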
Re: [HACKERS] MULTISET and additional functions for ARRAY
On Fri, Nov 12, 2010 at 03:05, Tom Lane t...@sss.pgh.pa.us wrote: David E. Wheeler da...@kineticode.com writes: So are you planning to implement multisets? It's a feature I'd love to see What actual functionality does it buy? AFAICT from Itagaki-san's description, it's an array only you ignore the specific element order. So what? You can write functions that work that way now. I think there is almost no difference between a multiset and an array in terms of the functions I described in the first mail. However, if we had a separate multiset data type, we could have special comparison operators for them; array = array returns true only if they have the same elements in the same order, but multiset = multiset only checks the elements, ignoring order. Also, we could optimize the on-disk structure of multisets for fast UNION operations, or for datasets that have many duplicates. For example, we could use a sorted array of {value, count} pairs. If we decide to have data type IDs for multisets, I'll go for it (ex. int4, _int4, and an additional $int4), but it consumes +50% of type OIDs. If that is not preferable, function-only support might be better as a first try. -- Itagaki Takahiro -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
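The proposed comparison-semantics difference can be illustrated with arrays today. A sketch of order-insensitive ("multiset-style") equality emulated by sorting elements before comparing:

```sql
-- Arrays compare element-by-element, in order:
SELECT ARRAY[1,2,2] = ARRAY[2,1,2];   -- false

-- Multiset-style equality (order ignored, duplicates still counted)
-- can be emulated by normalizing both sides with a sorted aggregate:
SELECT (SELECT array_agg(x ORDER BY x) FROM unnest(ARRAY[1,2,2]) AS t(x))
     = (SELECT array_agg(x ORDER BY x) FROM unnest(ARRAY[2,1,2]) AS t(x));
-- true
```

A dedicated multiset type could make the second comparison the default `=` operator, and (as suggested above) store the normalized form on disk so no sort is needed at comparison time.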
Re: [HACKERS] Delimited identifier behavior
It seems PostgreSQL thinks "t1"_foo is equivalent to "t1". It thinks you've given "t1" an alias of _foo in that query, same as if you'd had a space between "t1" and _foo. Oh, ok. I thought we always needed at least one space character between the table name and the alias. -- Tatsuo Ishii SRA OSS, Inc. Japan English: http://www.sraoss.co.jp/index_en.php Japanese: http://www.sraoss.co.jp -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] MULTISET and additional functions for ARRAY
On Fri, Nov 12, 2010 at 06:06, Darren Duncan dar...@darrenduncan.net wrote: This is one place that SQL made things more complicated than they needed to be. Multisets have generally the same structure *and* operators (union, etc) as tables, but they use different syntax for each. A better design would be to make tables and multisets interchangeable. Its an unnecessary distinction. We can use unnest() to convert MULTISET into TABLE, and collect() agg function from TABLE to MULTISET. I don't think they need to have the same on-disk structure; they can share operators and constructor syntax even now. -- Itagaki Takahiro -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
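The standard's `collect()` aggregate has no direct PostgreSQL equivalent, but `array_agg()` plays the same role with arrays standing in for multisets. A sketch of the round trip in today's PostgreSQL:

```sql
-- MULTISET -> TABLE is unnest(); the aggregate goes the other way.
-- With arrays standing in for multisets:
SELECT array_agg(x ORDER BY x) FROM unnest(ARRAY[3,1,2]) AS t(x);
-- {1,2,3}
```

This is why the on-disk representations need not match: as long as conversions in both directions exist, tables and multisets can share operators and constructor syntax.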
Re: [HACKERS] duplicate connection failure messages
Peter Eisentraut wrote: On tor, 2010-10-14 at 07:30 +0200, Magnus Hagander wrote: And I agree it's not very friendly in this specific case - I wonder if we should log it as localhost (127.0.0.1) and localhost (::1) (and similar for any other case that returns more than one address). That looks good. I have developed the attached patch to report whether IPv4 or IPv6 is being used. I could not find the numeric value always populated, and this seems clearer too:

$ psql -h localhost test
psql: could not connect to server: Connection refused
	Is the server running on host "localhost" (IPv4) and accepting
	TCP/IP connections on port 5432?

$ psql -h 127.0.0.1 test
psql: could not connect to server: Connection refused
	Is the server running on host "127.0.0.1" (IPv4) and accepting
	TCP/IP connections on port 5432?

-- Bruce Momjian br...@momjian.us http://momjian.us EnterpriseDB http://enterprisedb.com + It's impossible for everything to be true. +

diff --git a/src/interfaces/libpq/fe-connect.c b/src/interfaces/libpq/fe-connect.c
index 8f318a1..bf85b49 100644
*** a/src/interfaces/libpq/fe-connect.c
--- b/src/interfaces/libpq/fe-connect.c
*** connectFailureMessage(PGconn *conn, int
*** 962,968 ****
  	{
  		appendPQExpBuffer(&conn->errorMessage,
  						  libpq_gettext("could not connect to server: %s\n"
! 				"\tIs the server running on host \"%s\" and accepting\n"
  				"\tTCP/IP connections on port %s?\n"),
  						  SOCK_STRERROR(errorno, sebuf, sizeof(sebuf)),
  						  conn->pghostaddr
--- 962,968 ----
  	{
  		appendPQExpBuffer(&conn->errorMessage,
  						  libpq_gettext("could not connect to server: %s\n"
! 				"\tIs the server running on host \"%s\" (%s) and accepting\n"
  				"\tTCP/IP connections on port %s?\n"),
  						  SOCK_STRERROR(errorno, sebuf, sizeof(sebuf)),
  						  conn->pghostaddr
*** 970,975 ****
--- 970,980 ----
  						  : (conn->pghost ? conn->pghost : "???"),
+ 						  (conn->addr_cur->ai_family == AF_INET) ? "IPv4" :
+ #ifdef HAVE_IPV6
+ 						  (conn->addr_cur->ai_family == AF_INET6) ? "IPv6" :
+ #endif
+ 						  "???",
  						  conn->pgport);
  	}

-- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Re: [BUGS] BUG #5650: Postgres service showing as stopped when in fact it is running
Magnus Hagander wrote: On Fri, Sep 17, 2010 at 05:51, Ashesh Vashi ashesh.va...@enterprisedb.com wrote: Hi Mark, One of my colleagues (Sujeet) has found a way to reproduce the same behaviour. 1. Installed PG 9.0 on Win XP SP3 2. Stop the postgresql-9.0 service from the service manager console 3. Create pgpass.conf in the postgres (service account) user's profile with a deliberately incorrect password. (Refer: http://www.postgresql.org/docs/8.4/interactive/libpq-pgpass.html) 4. Now start the postgresql-9.0 service; it will return an error and the status shows "stopped" 5. However, I could connect to the psql shell and get the prompt, which means the server is running. I took a quick look at the code, and from what I can tell this is because PQconnectionNeedsPassword() always returns false if a pgpass.conf has been used. There is no handling of the case where pgpass is used but has an incorrect password. Does anybody recall the specific reason for this? Do we need a way for pg_ctl to figure this out, or do we need to change it in PQconnectionNeedsPassword()? I was not able to reproduce this failure on my BSD system using git head: $ psql test psql: FATAL: password authentication failed for user "postgres" password retrieved from file "/u/postgres/.pgpass" $ pg_ctl status pg_ctl: server is running (PID: 710) /usr/var/local/pgsql/bin/postgres "-i" -- Bruce Momjian br...@momjian.us http://momjian.us EnterpriseDB http://enterprisedb.com + It's impossible for everything to be true. + -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Exposing an installation's default value of unix_socket_directory
Tom Lane wrote: Peter Eisentraut pete...@gmx.net writes: On tor, 2010-10-21 at 16:59 -0400, Tom Lane wrote: Actually, the only reason this is even up for discussion is that there's no configure option to set DEFAULT_PGSOCKET_DIR. If there were, and debian were using it, then pg_config --configure would tell what I wish to know. I thought for a bit about proposing we add such an option, but given the current state of play it might be more misleading than helpful: as long as distros are accustomed to changing this setting via a patch, you couldn't trust pg_config --configure to tell you what a given installation actually has compiled into it. Presumably, if a configure option were added, they couldn't change it via patch anymore. Hm, you're right: we'd remove the pg_config_manual.h entry, so the existing patches would stop working, and presumably maintainers would figure out that they ought to use the configure switch instead. So that argument holds little water. Btw., a configure option for this was rejected years ago to discourage people from actually changing the default. Yeah, I remember that discussion now that you mention it. It still seems like a good policy ... but given that some popular packages are changing the default whether we think it's a good idea or not, maybe it's better to acknowledge that reality. We could still have some text in the manual pointing out the compatibility hazards of using the switch, I guess. Might have been a nice change for 9.0. :-( I don't think there is much defense for it being in /tmp except for backward compatibility, and for non-root installs. For a package installer, I think moving it out of /tmp makes sense, hence a configure flag. -- Bruce Momjian br...@momjian.us http://momjian.us EnterpriseDB http://enterprisedb.com + It's impossible for everything to be true. 
+ -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Simplifying replication
Robert Haas wrote: On Thu, Oct 28, 2010 at 1:13 AM, Josh Berkus j...@agliodbs.com wrote: I sort of agree with you that the current checkpoint_segments parameter is a bit hard to tune, at least if your goal is to control the amount of disk space that will be used by WAL files. But I'm not sure your proposal is better. Instead of having a complicated formula for predicting how much disk space would get used by a given value for checkpoint_segments, we'd have a complicated formula for the amount of WAL that would force a checkpoint based on max_wal_size. Yes, but the complicated formula would then be *in our code* instead of being inflicted on the user, as it now is. I don't think so - I think it will just be inflicted on the user in a different way. We'd still have to document what the formula is, because people will want to understand how often a checkpoint is going to get forced. So here's an example of how this could happen. Someone sets max_wal_size = 480MB. Then, they hear about the checkpoint_completion_target parameter, and say, ooh, goody, let me boost that. So they raise it from 0.5 to 0.9. Now, all of a sudden, they're getting more frequent checkpoints. Performance may get worse. Uh, checkpoint_completion_target only controls flushing of buffers between checkpoints, not the frequency of checkpoints. It is hard to believe that, for tuning, the number of 16MB files is more meaningful than the raw file size. -- Bruce Momjian br...@momjian.us http://momjian.us EnterpriseDB http://enterprisedb.com + It's impossible for everything to be true. + -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Simplifying replication
Robert Haas wrote: On Wed, Oct 27, 2010 at 3:53 AM, Simon Riggs si...@2ndquadrant.com wrote: On Tue, 2010-10-26 at 22:03 -0400, Robert Haas wrote: On Tue, Oct 26, 2010 at 9:59 PM, Josh Berkus j...@agliodbs.com wrote: If you set wal_keep_segments=0, archive_mode=on, and archive_command=something, you might run out of disk space. If you set wal_keep_segments=-1, you might run out of disk space. Are you any more screwed in the second case than you are in the first case? It is the same to the user either way. In either case you have to change some settings and restart the master. Except that changing wal_keep_segments doesn't require restarting the master. The point of allowing -1 was to allow someone to set it to that value temporarily, to be able to do a hot backup without having to guess how large to set it. If you don't have enough disk space for a backup to complete, you're kind of hosed either way. You're not hosed either way. Fujii designed this carefully to avoid that and it works. The case of archive_command failing isn't comparable because that is a failure case, not a normal working server. You don't need to guess the setting of wal_keep_segments. It's a safety net that has been deliberately created to avoid the crash that would otherwise happen. I've not heard a better proposal, yet, though I too am hopeful there is a better one. I think you might be confused about what the use case Bruce and I are imagining, because this doesn't make any sense at all in that context. The specific use case is that you have archive_mode=off, wal_level=archive or wal_level=hot_standby, and you want to take a hot backup. If you do pg_start_backup(), copy the data directory, and do pg_stop_backup(), you won't necessarily end up with enough xlog to ... This is a clear case of protecting people from themselves (make them specify a max wal size), and making the feature easy to use. We can't have both, folks. For 9.0, we picked the former. 
The same tradeoff often exists for flexibility and ease of use. -- Bruce Momjian br...@momjian.ushttp://momjian.us EnterpriseDB http://enterprisedb.com + It's impossible for everything to be true. + -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Simplifying replication
On Thu, Nov 11, 2010 at 10:13 PM, Bruce Momjian br...@momjian.us wrote: Robert Haas wrote: On Thu, Oct 28, 2010 at 1:13 AM, Josh Berkus j...@agliodbs.com wrote: I sort of agree with you that the current checkpoint_segments parameter is a bit hard to tune, at least if your goal is to control the amount of disk space that will be used by WAL files. But I'm not sure your proposal is better. Instead of having a complicated formula for predicting how much disk space would get used by a given value for checkpoint_segments, we'd have a complicated formula for the amount of WAL that would force a checkpoint based on max_wal_size. Yes, but the complicated formula would then be *in our code* instead of being inflicted on the user, as it now is. I don't think so - I think it will just be inflicted on the user in a different way. We'd still have to document what the formula is, because people will want to understand how often a checkpoint is going to get forced. So here's an example of how this could happen. Someone sets max_wal_size = 480MB. Then, they hear about the checkpoint_completion_target parameter, and say, ooh, goody, let me boost that. So they raise it from 0.5 to 0.9. Now, all of a sudden, they're getting more frequent checkpoints. Performance may get worse. Uh, checkpoint_completion_target only controls flushing of buffers between checkpoints, not the frequency of checkpoints. According to the formula in our fine documentation, if you increase checkpoint_completion_target, the maximum number of WAL files also increases. This makes sense: the files from the last checkpoint can't be removed until further along into the next cycle. Therefore, if you wanted to increase the checkpoint_completion_target while keeping the maximum amount of WAL on disk the same, you'd need to trigger checkpoints more frequently. 
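Plugging numbers into the documented bound makes the interaction concrete. The documentation gives the maximum number of segment files as (2 + checkpoint_completion_target) * checkpoint_segments + 1, each 16MB; a sketch assuming checkpoint_segments = 10 (a value chosen purely for illustration):

```sql
-- Max WAL = ((2 + checkpoint_completion_target) * checkpoint_segments + 1)
--           * 16MB per segment file.
SELECT (2 + 0.5) * 10 + 1 AS segment_files,
       ((2 + 0.5) * 10 + 1) * 16 AS max_wal_mb;   -- 26 files, 416 MB
SELECT (2 + 0.9) * 10 + 1 AS segment_files,
       ((2 + 0.9) * 10 + 1) * 16 AS max_wal_mb;   -- 30 files, 480 MB
```

So with the same checkpoint_segments, raising the target from 0.5 to 0.9 raises the ceiling from roughly 416MB to 480MB; conversely, holding a fixed max_wal_size while raising the target forces a smaller effective checkpoint_segments, i.e. more frequent checkpoints.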
-- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] MULTISET and additional functions for ARRAY
On Thu, Nov 11, 2010 at 10:02 AM, Itagaki Takahiro itagaki.takah...@gmail.com wrote: If we reuse type IDs of arrays for multisets, the multisets would have some special typmod. For example, typmod = 0 means multiset, and a positive value means an array with that max cardinality. Note that the SQL standard doesn't say anything about multi-dimensional arrays, so we can use typmod = -1 as a free-size, free-dimensional array for backward compatibility. I would really like to see us fix our type system so that it doesn't require this type of awful hack. But maybe that's asking too much of a patch to implement this feature. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: [HACKERS] duplicate connection failure messages
Bruce Momjian br...@momjian.us writes: I have developed the attached patch to report whether IPv4 or IPv6 are being used. What's the use of that exactly? It doesn't really respond to Peter's concern, I think. regards, tom lane
Re: [HACKERS] MULTISET and additional functions for ARRAY
Robert Haas robertmh...@gmail.com writes: On Thu, Nov 11, 2010 at 10:02 AM, Itagaki Takahiro itagaki.takah...@gmail.com wrote: If we reuse type IDs of arrays for multisets, the multisets would have some special typmod. For example, typmod = 0 means multiset, and positive value means array with max cardinality. Note that the SQL standard doesn't mention about multi-dimensional arrays. So, we can use typmod = -1 as a free-size and free-dimensional array for backward compatibility. I would really like to see us fix our type system so that it doesn't require this type of awful hack. But maybe that's asking too much of a patch to implement this feature. The problem is not with the type system: as long as you give multisets different type OIDs from arrays, everything will work fine. It will absolutely not work to try to use typmod to make the behavior vary like that ... but Itagaki-san knew that already. regards, tom lane
Re: [HACKERS] MULTISET and additional functions for ARRAY
On Fri, Nov 12, 2010 at 12:21 AM, Tom Lane t...@sss.pgh.pa.us wrote: Robert Haas robertmh...@gmail.com writes: On Thu, Nov 11, 2010 at 10:02 AM, Itagaki Takahiro itagaki.takah...@gmail.com wrote: If we reuse type IDs of arrays for multisets, the multisets would have some special typmod. For example, typmod = 0 means multiset, and positive value means array with max cardinality. Note that the SQL standard doesn't mention about multi-dimensional arrays. So, we can use typmod = -1 as a free-size and free-dimensional array for backward compatibility. I would really like to see us fix our type system so that it doesn't require this type of awful hack. But maybe that's asking too much of a patch to implement this feature. The problem is not with the type system: as long as you give multisets different type OIDs from arrays, everything will work fine. It will absolutely not work to try to use typmod to make the behavior vary like that ... but Itagaki-san knew that already. And thus you must create a THIRD copy of every entry in pg_type. That doesn't qualify as a problem? -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
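For context, the encoding Itagaki-san floated (and that Tom rejects) can be stated as a small sketch. This is purely hypothetical -- it is the proposal under discussion, not PostgreSQL's actual typmod semantics:

```python
# Hypothetical typmod encoding for a shared array/multiset type OID,
# as proposed in this thread -- NOT how PostgreSQL actually works.
def describe_collection(typmod):
    if typmod == -1:
        # existing behavior kept for backward compatibility
        return "array (unbounded size, any dimensionality)"
    if typmod == 0:
        return "multiset"
    if typmod > 0:
        return "array with max cardinality %d" % typmod
    raise ValueError("invalid typmod: %d" % typmod)
```

Tom's objection is that behavior (not just a display constraint) would vary with typmod, which the type system can't support; Robert's counter is that distinct OIDs mean a third copy of every pg_type entry alongside the base type and its array type.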
Re: [HACKERS] Re: [BUGS] BUG #5650: Postgres service showing as stopped when in fact it is running
On Fri, Nov 12, 2010 at 03:49, Bruce Momjian br...@momjian.us wrote: Magnus Hagander wrote: On Fri, Sep 17, 2010 at 05:51, Ashesh Vashi ashesh.va...@enterprisedb.com wrote: Hi Mark, One of my colleagues (Sujeet) has found a way to reproduce the same behaviour. 1. Installed PG 9.0 on Win XP SP3 2. Stop the Postgresql-9.0 service from the service manager console 3. Create pgpass.conf in the postgres (service account) user's profile with an incorrect password deliberately. (Refer: http://www.postgresql.org/docs/8.4/interactive/libpq-pgpass.html) 4. Now start the postgresql-9.0 service; it will return an error and the status shows stopped 5. However, I could connect to the psql shell and get the prompt, which means the server is running. I took a quick look at the code, and from what I can tell this is because PQconnectionNeedsPassword() always returns false if a pgpass.conf has been used. There is no handling of the case where pgpass is used but has an incorrect password. Does anybody recall the specific reason for this? Do we need a way for pg_ctl to figure this out, or do we need to change it in PQconnectionNeedsPassword()? I was not able to reproduce this failure on my BSD system using git head: $ psql test psql: FATAL: password authentication failed for user postgres password retrieved from file /u/postgres/.pgpass $ pg_ctl status pg_ctl: server is running (PID: 710) /usr/var/local/pgsql/bin/postgres -i The problem is not in pg_ctl status, it's in pg_ctl start. They're different codepaths - status never tries to actually connect, it just checks if the process is alive. -- Magnus Hagander Me: http://www.hagander.net/ Work: http://www.redpill-linpro.com/
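The failure mode Magnus describes can be modeled in a few lines. This is a simplified sketch of the decision logic, not pg_ctl's actual code: a pg_ctl-style startup probe treats "auth failed but no password was sent" as proof the server is up, while a wrong pgpass password makes PQconnectionNeedsPassword() return false, so the same auth failure is misread as "server not running":

```python
# Simplified model of the pg_ctl start misreport -- not real pg_ctl code.
# pgpass_password is None (no pgpass), "correct", or "wrong" (assumed names).
def probe(server_running, pgpass_password):
    """What a pg_ctl-style startup probe would conclude."""
    if not server_running:
        return "stopped"              # connection refused: genuinely down
    if pgpass_password == "correct":
        return "running"              # test connection succeeds
    if pgpass_password is None:
        # Auth fails, but PQconnectionNeedsPassword() is true:
        # the server answered, so it must be running.
        return "running"
    # Wrong pgpass password: auth fails AND PQconnectionNeedsPassword()
    # returns false (a password *was* sent), so the probe wrongly
    # concludes the server never started.
    return "stopped"
```

Under this model, probe(True, "wrong") reports "stopped" even though the server is up -- exactly the symptom in the bug report. pg_ctl status is unaffected because it only checks the process, never connects.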