Re: [HACKERS] Support for N synchronous standby servers - take 2
On Wed, Jul 15, 2015 at 5:03 AM, Michael Paquier michael.paqu...@gmail.com wrote:
> Group labels are essential. OK, so this is leading us to the following points:
> - Use a JSON object to define the quorum/priority groups for the sync state.
> - Store it as a GUC, and use the check hook to validate its format, which is what we have now with s_s_names.
> - Rely on SIGHUP to maintain an in-memory image of the quorum/priority sync state.
> - Have the possibility to define group labels in this JSON blob, and be able to use those labels in a quorum or priority sync definition.
> - For backward-compatibility, use for example s_s_names = 'json' to switch to the new system.

Personally, I think we're going to find that using JSON for this rather than a custom syntax makes the configuration strings two or three times as long for no discernible benefit. But I just work here.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Support for N synchronous standby servers - take 2
On 16 July 2015 at 18:27, Robert Haas robertmh...@gmail.com wrote:
> On Wed, Jul 15, 2015 at 5:03 AM, Michael Paquier michael.paqu...@gmail.com wrote:
>> Group labels are essential. OK, so this is leading us to the following points:
>> - Use a JSON object to define the quorum/priority groups for the sync state.
>> - Store it as a GUC, and use the check hook to validate its format, which is what we have now with s_s_names.
>> - Rely on SIGHUP to maintain an in-memory image of the quorum/priority sync state.
>> - Have the possibility to define group labels in this JSON blob, and be able to use those labels in a quorum or priority sync definition.
>> - For backward-compatibility, use for example s_s_names = 'json' to switch to the new system.
>
> Personally, I think we're going to find that using JSON for this rather than a custom syntax makes the configuration strings two or three times as long for

They may well be 2-3 times as long. Why is that a negative?

> no discernible benefit.

Benefits:
* More readable
* Easy to validate
* No additional code required in the server to support this syntax (so no bugs)
* Developers will immediately understand the format
* Easy to programmatically manipulate in a range of languages

-- 
Simon Riggs
http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training Services
Re: [HACKERS] Support for N synchronous standby servers - take 2
On Thu, Jul 16, 2015 at 1:32 PM, Simon Riggs si...@2ndquadrant.com wrote:
>> Personally, I think we're going to find that using JSON for this rather than a custom syntax makes the configuration strings two or three times as long for
>
> They may well be 2-3 times as long. Why is that a negative?

In my opinion, brevity makes things easier to read and understand. We also don't support multi-line GUCs, so if your configuration takes 140 characters, you're going to have a very long line in your postgresql.conf (and in your pg_settings output, etc.)

> * No additional code required in the server to support this syntax (so no bugs)

I think you'll find that this is far from true. Presumably not just any arbitrary JSON object will be acceptable. You'll have to parse it as JSON, and then validate that it is of the expected form. It may not be MORE code than implementing a mini-language from scratch, but I wouldn't expect to save much.

> * Developers will immediately understand the format

I doubt it. I think any format that we pick will have to be carefully documented. People may know what JSON looks like in general, but they will not immediately know what bells and whistles are available in this context.

> * Easy to programmatically manipulate in a range of languages

I agree that JSON has that advantage, but I doubt that it is important here. I would expect that people might need to generate a new config string and dump it into postgresql.conf, but that should be easy with any reasonable format. I think it will be rare to need to parse the postgresql.conf string, manipulate it programmatically, and then put it back. As we've already said, most configurations are simple and shouldn't change frequently. If they're not, or they do, that's a problem in itself.

However, I'm not trying to ram my idea through; I'm just telling you my opinion.
-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
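For comparison's sake, a quorum/priority definition in the JSON form discussed upthread might look something like this. This is purely illustrative -- no format was agreed on in the thread, and the key names and group names here are invented:

```json
{
  "sync_standbys": {
    "quorum": 2,
    "groups": {
      "london": { "priority": ["node_a", "node_b"] },
      "nyc":    { "quorum": 1, "members": ["node_c", "node_d"] }
    }
  }
}
```

A hypothetical compact custom syntax for the same thing might be `2(london: node_a, node_b; nyc: 1(node_c, node_d))` -- which illustrates Robert's point about length and Simon's point about easy validation, respectively.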
Re: [HACKERS] [PATCH] Generalized JSON output functions
On Wed, Jul 15, 2015 at 12:58 PM, Andrew Dunstan and...@dunslane.net wrote:
> The approach taken was both invasive and broken.

Well, then let's not do it that way.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
Re: [HACKERS] [PATCH] Generalized JSON output functions
On Wed, Jul 15, 2015 at 1:10 PM, Ryan Pedela rped...@datalanche.com wrote:
> Like I said previously, the situation with Javascript will hopefully be remedied in a few years with ES7 anyway.

I don't understand these issues in great technical depth, but if somebody is arguing that it's OK for PostgreSQL to be difficult to use for a certain category of user for several years until the next language rev becomes mainstream, then I disagree. The fact that somebody wrote a patch to try to solve a problem means that the thing in question is a problem for at least that one user. If he's the only one, maybe we don't need to care all that much. If his needs are representative of a significant user community, we should not turn our backs on that community, regardless of whether we like the patch he wrote, and regardless of how well we are meeting the needs of other communities (like node.js users).

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
Re: [HACKERS] [PATCH] Generalized JSON output functions
2015-07-16 19:51 GMT+02:00 Robert Haas robertmh...@gmail.com:
> On Wed, Jul 15, 2015 at 1:10 PM, Ryan Pedela rped...@datalanche.com wrote:
>> Like I said previously, the situation with Javascript will hopefully be remedied in a few years with ES7 anyway.
>
> I don't understand these issues in great technical depth, but if somebody is arguing that it's OK for PostgreSQL to be difficult to use for a certain category of user for several years until the next language rev becomes mainstream, then I disagree. The fact that somebody wrote a patch to try to solve a problem means that the thing in question is a problem for at least that one user. If he's the only one, maybe we don't need to care all that much. If his needs are representative of a significant user community, we should not turn our backs on that community, regardless of whether we like the patch he wrote, and regardless of how well we are meeting the needs of other communities (like node.js users).

I don't think this issue is so hot. How long have we supported XML? The output format is static - the date format is fixed. How many issues were there? Was there any issue that was not solvable by casting? If somebody needs different quoting, then it can be solved by an explicit cast in the SQL query, not by hacking our output routines.

Regards

Pavel
[HACKERS] Bugs in our qsort implementation
I've been trying to figure out the crash in qsort reported here: http://www.postgresql.org/message-id/flat/cal8hzunr2fr1owzhwg-p64gjtnfbbmpx1y2oxmj_xuq3p8y...@mail.gmail.com

I first noticed that our qsort code uses an int to hold some transient values representing numbers of elements. Since the passed array size argument is a size_t, this is broken; the integer could overflow. I do not think this is a live bug so far as our core code is concerned, because tuplesort.c still limits itself to no more than INT_MAX items to be sorted, and nothing else in the core would even approach that. However, it's in principle a hazard for third-party modules that might try to sort more than that; and in any case it's a bug waiting to bite us on the rear whenever somebody decides they have enough RAM that they should be able to sort more than INT_MAX items.

However, Yiqing reported the crash as occurring here:

Program terminated with signal 11, Segmentation fault.
#0 0x00785180 in med3_tuple (a=0x7f31613f1028, b=0x7f31613f1040, c=0x3ffd, cmp_tuple=0x7f43613f1010, state=0x1) at qsort_tuple.c:66
66 {

which is a bit curious, because that function does not itself access any of the data --- it just calls the cmp_tuple function, so even if we'd somehow computed a bad address, the crash should occur inside the comparator function, not here. After a while a theory occurred to me: the qsort functions recurse without bothering with a stack depth check, so maybe the SEGV actually represents running out of stack space. And after a bit of research, that theory seems pretty plausible. It turns out that qsort is guaranteed to recurse no deeper than log(N) levels, but *only if you take care to recurse on the smaller partition and iterate on the larger one*. And we're not doing that, we just blindly recurse on the left partition. So given a fairly huge amount of data and some bad luck in partition-picking, it seems possible that stack overflow explains this report.
I propose to (1) fix the code to use a size_t variable rather than int where appropriate; (2) teach it to recurse on the smaller partition. It's possible that this issue can only manifest on 9.4 and up where we have the ability for tuplesort to allocate work arrays approaching INT_MAX elements. But I don't have a lot of faith in that; I think the worst-case stack depth for the way we have it now could be as bad as O(N), so in principle a crash could be possible with significantly smaller input arrays. I think we'd better back-patch this all the way.

regards, tom lane
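The recurse-on-smaller / iterate-on-larger pattern Tom describes can be sketched in a few lines. This is an illustrative toy on int arrays with invented names, not the actual qsort_tuple.c code:

```c
#include <stddef.h>

/* Partition a[0..n-1] around a pivot (Lomuto scheme), returning the
 * pivot's final index.  Both resulting partitions exclude the pivot and
 * are therefore strictly smaller than n, guaranteeing progress. */
static size_t
partition(int *a, size_t n)
{
    int     pivot = a[n - 1];
    size_t  store = 0;

    for (size_t k = 0; k + 1 < n; k++)
    {
        if (a[k] < pivot)
        {
            int swap = a[k];

            a[k] = a[store];
            a[store] = swap;
            store++;
        }
    }
    a[n - 1] = a[store];
    a[store] = pivot;
    return store;
}

static void
sketch_qsort(int *a, size_t n)
{
    while (n > 1)
    {
        size_t  p = partition(a, n);
        size_t  left = p;            /* elements before the pivot */
        size_t  right = n - p - 1;   /* elements after the pivot */

        /* Recurse on the smaller partition and loop on the larger one:
         * the recursed side is always <= n/2, so the stack depth is
         * bounded by O(log N) instead of the O(N) worst case you get
         * from blindly recursing on the left partition. */
        if (left < right)
        {
            sketch_qsort(a, left);
            a += p + 1;
            n = right;
        }
        else
        {
            sketch_qsort(a + p + 1, right);
            n = left;
        }
    }
}
```

The tail-call-turned-loop is the whole trick: each iteration of the while loop handles the larger half without consuming a stack frame.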
Re: [HACKERS] TABLESAMPLE patch is really in pretty sad shape
On 2015-07-13 00:36, Tom Lane wrote:
> We have a far better model to follow already, namely the foreign data wrapper API. In that, there's a single SQL-visible function that returns a dummy datatype indicating that it's an FDW handler, and when called, it hands back a struct containing pointers to all the other functions that the particular wrapper needs to supply (and, if necessary, the struct could have non-function-pointer fields containing other info the core system might need to know about the handler). We could similarly invent a pseudotype tablesample_handler and represent each tablesample method by a single SQL function returning tablesample_handler. Any future changes in the API for tablesample handlers would then appear as changes in the C definition of the struct returned by the handler, which requires no SQL-visible thrashing, hence creates no headaches for pg_upgrade, and it is pretty easy to design it so that failure to update an external module that implements the API results in C compiler errors and/or sane fallback behavior.

(back from vacation, going over this thread)

Yes, this sounds very sane (and we use something similar also for logical decoding plugins, not just FDWs). I wish this had occurred to me before; I would not have had to spend time on the DDL support, which didn't even get in.

> Once we've done that, I think we don't even need a special catalog representing tablesample methods. Given FROM foo TABLESAMPLE bernoulli(...), the parser could just look for a function bernoulli() returning tablesample_handler, and it's done. The querytree would have an ordinary function dependency on that function, which would be sufficient to handle DROP dependency behaviors properly.

It seems possible that we might not need a catalog indeed. This would also simplify the parser part, which currently contains specialized function search code, as we could most likely just reuse the generic code.
> (On reflection, maybe better if it's bernoulli(internal) returns tablesample_handler, so as to guarantee that such a function doesn't create a conflict with any user-defined function of the same name.)

The probability of conflict seems high with system(), so yeah, we'd need some kind of differentiator.

> PS: now that I've written this rant, I wonder why we don't redesign the index AM API along the same lines. It probably doesn't matter much at the moment, but if we ever get serious about supporting index AM extensions, I think we ought to consider doing that.

+1

I think this is very relevant to the proposed sequence am patch as well.

-- 
Petr Jelinek
http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training Services
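For readers unfamiliar with the FdwRoutine pattern being proposed here, a self-contained sketch of the idea follows. The struct and function names are invented for illustration (they are not the API that was eventually committed), and plain C stand-ins replace the PostgreSQL typedefs and Datum plumbing:

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdlib.h>

/* Stand-ins for PostgreSQL typedefs, to keep the sketch self-contained. */
typedef uint32_t BlockNumber;
typedef uint16_t OffsetNumber;

/*
 * One struct of function pointers describes an entire tablesample method,
 * mirroring how FdwRoutine describes a foreign data wrapper.  Adding a
 * callback later changes only this C definition -- no SQL-visible or
 * catalog changes, hence no pg_upgrade headaches.
 */
typedef struct TsmRoutine
{
    void  (*InitSample) (void *state, uint32_t seed);
    bool  (*NextBlock)  (void *state, BlockNumber *blockno);
    bool  (*NextTuple)  (void *state, OffsetNumber *offset);
    void  (*EndSample)  (void *state);
} TsmRoutine;

/* Trivial stub implementations, just so the handler has something to return. */
static void stub_init(void *state, uint32_t seed) { (void) state; (void) seed; }
static bool stub_next_block(void *state, BlockNumber *blockno)
{ (void) state; *blockno = 0; return false; }
static bool stub_next_tuple(void *state, OffsetNumber *offset)
{ (void) state; *offset = 0; return false; }
static void stub_end(void *state) { (void) state; }

/*
 * The single "handler" function.  In PostgreSQL this would be the
 * SQL-visible function returning the tablesample_handler pseudotype, which
 * the parser resolves by name from FROM foo TABLESAMPLE bernoulli(...).
 */
TsmRoutine *
bernoulli_handler(void)
{
    TsmRoutine *tsm = calloc(1, sizeof(TsmRoutine));

    assert(tsm != NULL);
    tsm->InitSample = stub_init;
    tsm->NextBlock = stub_next_block;
    tsm->NextTuple = stub_next_tuple;
    tsm->EndSample = stub_end;
    return tsm;
}
```

The core system would call the handler once and then drive the scan entirely through the returned pointers, which is what makes a method swappable, and makes a stale external module fail at the C level rather than with subtle catalog mismatches.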
Re: [HACKERS] [PROPOSAL] VACUUM Progress Checker.
Hello,

> Naming the GUC pgstat* seems a little inconsistent.

Sorry, there is a typo in the mail. The GUC name is 'track_activity_progress'.

> Also, adding the new GUC to src/backend/utils/misc/postgresql.conf.sample might be helpful

Yes. I will update.

Thank you,
Rahila Syed
Re: [HACKERS] LWLock deadlock and gdb advice
On Wed, Jul 15, 2015 at 8:44 AM, Heikki Linnakangas hlinn...@iki.fi wrote:
> Both. Here's the patch.
>
> Previously, LWLockAcquireWithVar set the variable associated with the lock atomically with acquiring it. Before the lwlock-scalability changes, that was straightforward because you held the spinlock anyway, but it's a lot harder/expensive now. So I changed the way acquiring a lock with a variable works. There is now a separate flag, LW_FLAG_VAR_SET, which indicates that the current lock holder has updated the variable. The LWLockAcquireWithVar function is gone - you now just use LWLockAcquire(), which always clears the LW_FLAG_VAR_SET flag, and you can call LWLockUpdateVar() after that if you want to set the variable immediately. LWLockWaitForVar() always waits if the flag is not set, i.e. it will not return, regardless of the variable's value, if the current lock-holder has not updated it yet.

I ran this for a while without casserts and it seems to work. But with casserts, I get failures in the autovac process on the GIN index. I don't see how this is related to the LWLock issue, but I didn't see it without your patch. Perhaps the system just didn't survive long enough to uncover it without the patch (although it shows up pretty quickly). It could just be an overzealous Assert, since with casserts off it didn't show problems.

bt and bt full are shown below.
Cheers,

Jeff

#0 0x003dcb632625 in raise () from /lib64/libc.so.6
#1 0x003dcb633e05 in abort () from /lib64/libc.so.6
#2 0x00930b7a in ExceptionalCondition (
    conditionName=0x9a1440 "!(((PageHeader) (page))->pd_special >= (__builtin_offsetof (PageHeaderData, pd_linp)))",
    errorType=0x9a12bc "FailedAssertion", fileName=0x9a12b0 "ginvacuum.c", lineNumber=713) at assert.c:54
#3 0x004947cf in ginvacuumcleanup (fcinfo=0x7fffee073a90) at ginvacuum.c:713
#4 0x0093b53a in FunctionCall2Coll (flinfo=0x7fffee073e60, collation=0, arg1=140737186840448, arg2=0) at fmgr.c:1323
#5 0x004d5c7a in index_vacuum_cleanup (info=0x7fffee073f80, stats=0x0) at indexam.c:717
#6 0x00664f69 in lazy_cleanup_index (indrel=0x7faafbcace20, stats=0x0, vacrelstats=0x28809c8) at vacuumlazy.c:1400
#7 0x00664142 in lazy_scan_heap (onerel=0x7faafbcab6d0, vacrelstats=0x28809c8, Irel=0x2881090, nindexes=2, scan_all=1 '\001') at vacuumlazy.c:
#8 0x00662905 in lazy_vacuum_rel (onerel=0x7faafbcab6d0, options=65, params=0x286ea00, bstrategy=0x286ea90) at vacuumlazy.c:247
#9 0x006623e4 in vacuum_rel (relid=16409, relation=0x7fffee074550, options=65, params=0x286ea00) at vacuum.c:1377
#10 0x00660bea in vacuum (options=65, relation=0x7fffee074550, relid=16409, params=0x286ea00, va_cols=0x0, bstrategy=0x286ea90, isTopLevel=1 '\001') at vacuum.c:295
#11 0x0075caa9 in autovacuum_do_vac_analyze (tab=0x286e9f8, bstrategy=0x286ea90) at autovacuum.c:2811
#12 0x0075be67 in do_autovacuum () at autovacuum.c:2331
#13 0x0075ac1c in AutoVacWorkerMain (argc=0, argv=0x0) at autovacuum.c:1648
#14 0x0075a84d in StartAutoVacWorker () at autovacuum.c:1455
#15 0x0076f745 in StartAutovacuumWorker () at postmaster.c:5261
#16 0x0076effd in sigusr1_handler (postgres_signal_arg=10) at postmaster.c:4918
#17 signal handler called
#18 0x003dcb6e1353 in __select_nocancel () from /lib64/libc.so.6
#19 0x0076a8f0 in ServerLoop () at postmaster.c:1582
#20 0x0076a141 in PostmasterMain (argc=4, argv=0x27b2cf0) at postmaster.c:1263
#21 0x006c1e6e in main (argc=4, argv=0x27b2cf0) at main.c:223

#0 0x003dcb632625 in raise () from /lib64/libc.so.6
No symbol table info available.
#1 0x003dcb633e05 in abort () from /lib64/libc.so.6
No symbol table info available.
#2 0x00930b7a in ExceptionalCondition (
    conditionName=0x9a1440 "!(((PageHeader) (page))->pd_special >= (__builtin_offsetof (PageHeaderData, pd_linp)))",
    errorType=0x9a12bc "FailedAssertion", fileName=0x9a12b0 "ginvacuum.c", lineNumber=713) at assert.c:54
No locals.
#3 0x004947cf in ginvacuumcleanup (fcinfo=0x7fffee073a90) at ginvacuum.c:713
        buffer = 186
        page = 0x7faafc565b20
        info = 0x7fffee073f80
        stats = 0x2880858
        index = 0x7faafbcace20
        needLock = 1 '\001'
        npages = 22569
        blkno = 12025
        totFreePages = 11502
        ginstate = {index = 0x7faafbcace20, oneCol = 1 '\001', origTupdesc = 0x7faafbcad150, tupdesc = {0x7faafbcad150, 0x0 <repeats 31 times>}, compareFn = {{fn_addr = 0x90224d <bttextcmp>, fn_oid = 360, fn_nargs = 2, fn_strict = 1 '\001', fn_retset = 0 '\000', fn_stats = 2 '\002', fn_extra = 0x0, fn_mcxt = 0x285e3c0, fn_expr = 0x0}, {fn_addr = 0, fn_oid = 0, fn_nargs = 0, fn_strict = 0 '\000', fn_retset = 0 '\000', fn_stats = 0 '\000', fn_extra = 0x0, fn_mcxt = 0x0, fn_expr = 0x0} <repeats 31 times>}, extractValueFn = {{
        fn_addr = 0x494adc
Re: [HACKERS] Re: [COMMITTERS] pgsql: Map basebackup tablespaces using a tablespace_map file
Amit Kapila wrote:
> This can be tracked either in 9.5 Open Items or for next CF, any opinions? If nobody else has any opinion on this, I will add it to 9.5 Open Items list.

I think this belongs in the open items list, yeah.

-- 
Álvaro Herrera
http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training Services
Re: [HACKERS] Patch to fix spelling mistake in error message
On Thu, Jul 16, 2015 at 2:23 AM, David Rowley david.row...@2ndquadrant.com wrote:
> Attached is a small patch to fix a spelling mistake in an error message

Applied, thanks.

-- 
Magnus Hagander
Me: http://www.hagander.net/
Work: http://www.redpill-linpro.com/
Re: [HACKERS] [PATCH] Comment fix for miscinit.c
On Wed, Jul 15, 2015 at 11:18 PM, David Christensen da...@endpoint.com wrote:
> Quick comment fix for edit issue.

Applied, thanks.

-- 
Magnus Hagander
Me: http://www.hagander.net/
Work: http://www.redpill-linpro.com/
Re: [HACKERS] TABLESAMPLE patch is really in pretty sad shape
Petr Jelinek wrote:
> On 2015-07-13 00:36, Tom Lane wrote:
>> PS: now that I've written this rant, I wonder why we don't redesign the index AM API along the same lines. It probably doesn't matter much at the moment, but if we ever get serious about supporting index AM extensions, I think we ought to consider doing that.
>
> +1
>
> I think this is very relevant to the proposed sequence am patch as well.

Hmm, how would this work? Would we have index AM implementations run some function that registers their support methods somehow at startup? Hopefully we're not going to have the index AMs become shared libraries.

In any case, if index AMs and sequence AMs go this route, that probably means the column store AM we're working on will have to go the same route too.

-- 
Álvaro Herrera
http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training Services
Re: [HACKERS] TABLESAMPLE patch is really in pretty sad shape
On 2015-07-13 15:39, Tom Lane wrote:
> Datum
> tsm_system_rows_init(PG_FUNCTION_ARGS)
> {
>     TableSampleDesc *tsdesc = (TableSampleDesc *) PG_GETARG_POINTER(0);
>     uint32 seed = PG_GETARG_UINT32(1);
>     int32 ntuples = PG_ARGISNULL(2) ? -1 : PG_GETARG_INT32(2);
>
> This is rather curious coding. Why should this function check only one of its arguments for nullness? If it needs to defend against any of them being null, it really needs to check all. But in point of fact, this function is declared STRICT, which means it's a violation of the function call protocol if the core code ever passes a null to it, and so this test ought to be dead code. A similar pattern of ARGISNULL checks in declared-strict functions exists in all the tablesample modules, not just this one, showing that this is an overall design error not just a thinko here. My inclination would be to make the core code enforce non-nullness of all tablesample arguments so as to make it unnecessary to check strictness of the tsm*init functions, but in any case it is Not Okay to just pass nulls willy-nilly to strict C functions.

The reason for this ugliness came from having to strike a balance between modularity and following the SQL Standard error codes for BERNOULLI and SYSTEM, which came up as an issue during reviews (the original code did the checks before calling the sampling method's functions, but produced just a generic error code about a wrong parameter). I considered it okayish mainly because those functions are not SQL callable and the code which calls them knows how to handle it correctly, but I understand why you don't. I guess if we did what you proposed in another email in this thread - don't have the API exposed at the SQL level - we could send the additional parameters as a List * and leave the validation completely to the function.
(And maybe not allow NULLs at all.)

> Also, I find this coding pretty sloppy even without the strictness angle, because the net effect of this way of dealing with nulls is that an argument-must-not-be-null complaint is reported as out of range, which is both confusing and the wrong ERRCODE.
>
>     if (ntuples < 1)
>         ereport(ERROR,
>                 (errcode(ERRCODE_NUMERIC_VALUE_OUT_OF_RANGE),
>                  errmsg("invalid sample size"),
>                  errhint("Sample size must be positive integer value.")));
>
> I don't find this to be good error message style. The secondary comment is not a hint, it's an ironclad statement of what you did wrong, so if we wanted to phrase it like this it should be an errdetail not errhint. But the whole thing is overly cute anyway, because there is no reason at all not to just say what we mean in the primary error message, eg
>
>     ereport(ERROR,
>             (errcode(ERRCODE_NUMERIC_VALUE_OUT_OF_RANGE),
>              errmsg("sample size must be greater than zero")));

Same as above, although now that I re-read the standard I am sure I misunderstood it the first time - it says: "If S is the null value or if S < 0 (zero) or if S > 100, then an exception condition is raised: data exception — invalid sample size." I took it as a literal error message originally, but they just mean the status code by it.

> While we're on the subject, what's the reason for disallowing a sample size of zero? Seems like a reasonable edge case.

Yes, that's a bug.

>     /* All blocks have been read, we're done */
>     if (sampler->doneblocks > sampler->nblocks ||
>         sampler->donetuples >= sampler->ntuples)
>         PG_RETURN_UINT32(InvalidBlockNumber);
>
> Okay, I lied, I *am* going to complain about this comment. Comments that do not accurately describe the code they're attached to are worse than useless.

That's a copy-pasto from tsm_system_time.

> In short, I'd do something more like
>
>     uint32 r;
>
>     /* Safety check to avoid infinite loop or zero result for small n. */
>     if (n <= 1)
>         return 1;
>
>     /*
>      * This should only take 2 or 3 iterations, as the probability of 2
>      * numbers being relatively prime is ~61%; but just in case, we'll
>      * include a CHECK_FOR_INTERRUPTS in the loop.
>      */
>     do
>     {
>         CHECK_FOR_INTERRUPTS();
>         r = (uint32) (sampler_random_fract(randstate) * n);
>     } while (r == 0 || gcd(r, n) > 1);
>
> Note however that this coding would result in an unpredictable number of iterations of the RNG, which might not be such a good thing if we're trying to achieve repeatability. It doesn't matter in the context of this module since the RNG is not used after initialization, but it would matter if we then went on to do Bernoulli-style sampling. (Possibly that could be dodged by reinitializing the RNG after the initialization steps.)

Bernoulli-style sampling does not need this kind of code, so it's not really an issue. That is unless you'd like to combine the linear probing and bernoulli of course, but I don't see any benefit in doing that.

-- 
Petr Jelinek
http://www.2ndQuadrant.com/
Re: [HACKERS] [DESIGN] Incremental checksums
>>>> - pg_disable_checksums(void) => turn checksums off for a cluster. Sets the state to disabled, which means bg_worker will not do anything.
>>>> - pg_request_checksum_cycle(void) => if checksums are enabled, increment the data_checksum_cycle counter and set the state to enabling.
>>>
>>> If the cluster is already enabled for checksums, then what is the need for any other action?
>>
>> You are assuming this is a one-way action.
>
> No, I was confused by the state (enabling) this function will set.

Okay.

>> Requesting an explicit checksum cycle would be desirable in the case where you want to proactively verify there is no cluster corruption to be found.
>
> Sure, but I think that is different from setting the state to enabling. In your proposal above, in the enabling state the cluster needs to write checksums, where for such a feature you only need read validation.

With "revalidating", since your database is still actively making changes, you need to validate writes too (think new tables, etc). "Enabling" needs reads unvalidated because you're starting from an unknown state (i.e., not checksummed already).

Thanks,
David

-- 
David Christensen
PostgreSQL Team Manager
End Point Corporation
da...@endpoint.com
785-727-1171
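The distinction David is drawing (enabling must leave reads unverified, while a revalidation cycle verifies both directions) can be captured as a small state table. This is an illustrative sketch only, loosely following the state names used in the thread, not code from the actual proposal:

```c
#include <stdbool.h>

/* Cluster-wide checksum states for an incremental-checksums scheme;
 * names are loosely borrowed from the thread, details are invented. */
typedef enum ChecksumState
{
    CHECKSUMS_DISABLED,     /* bgworker idle; pages not checksummed */
    CHECKSUMS_ENABLING,     /* writes get checksums, but reads are NOT
                             * verified: not-yet-rewritten pages are in
                             * an unknown (unchecksummed) state */
    CHECKSUMS_REVALIDATING, /* a requested cycle: writes get checksums
                             * AND reads are verified, since every page
                             * was already checksummed once */
    CHECKSUMS_ENABLED       /* steady state: checksum on write, verify
                             * on read */
} ChecksumState;

/* Should a page's checksum be verified when it is read in this state? */
static bool
verify_checksum_on_read(ChecksumState state)
{
    return state == CHECKSUMS_REVALIDATING || state == CHECKSUMS_ENABLED;
}

/* Should checksums be computed and written in this state? */
static bool
write_checksums(ChecksumState state)
{
    return state != CHECKSUMS_DISABLED;
}
```

The asymmetry between `CHECKSUMS_ENABLING` and `CHECKSUMS_REVALIDATING` is exactly the point of the exchange above: both write checksums, but only the latter may safely verify on read.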
Re: [HACKERS] TABLESAMPLE patch is really in pretty sad shape
On 2015-07-13 18:00, Tom Lane wrote:
> And here's that. I do *not* claim that this is a complete list of design issues with the patch, it's just things I happened to notice in the amount of time I've spent so far (which is already way more than I wanted to spend on TABLESAMPLE right now).
>
> I'm not sure that we need an extensible sampling-method capability at all, let alone that an API for that needs to be the primary focus of a patch. Certainly the offered examples of extension modules aren't inspiring: tsm_system_time is broken by design (more about that below) and nobody would miss tsm_system_rows if it weren't there. What is far more important is to get sampling capability into indexscans, and designing an API ahead of time seems like mostly a distraction from that. I'd think seriously about tossing the entire executor-stage part of the patch, creating a stateless (history-independent) sampling filter that just accepts or rejects TIDs, and sticking calls to that into all the table scan node types. Once you had something like that working well it might be worth considering whether to try to expose an API to generalize it. But even then it's not clear that we really need any more options than true-Bernoulli and block-level sampling.

I think this is not really an API issue so much as an opposite approach on what to implement first. I prioritized true block sampling support in the first iteration, as that's what I've been mostly asked for. But my plan was to add, at some later stage (9.6+), the ability to do subquery scans etc. - basically a new SamplingFilter executor node which would pass the tuples to examinetuple(), which would then decide what to do with them. The selection between using nextblock/nexttuple and examinetuple was supposed to be an extension of the API, where the sampling method would say if it supports examinetuple, nextblock/nexttuple, or both.
And eventually I wanted to rewrite bernoulli to just use examinetuple() on top of whatever scan, once this additional support was in. I think I explained this during the review stage.

> The IBM paper I linked to in the other thread mentions that their optimizer will sometimes choose to do Bernoulli sampling even if SYSTEM was requested. Probably that happens when it decides to do a simple indexscan, because then there's no advantage to trying to cluster the sampled rows.

Yeah, it happens when there is an index which is used in the WHERE clause and has bigger selectivity than the percentage specified in the TABLESAMPLE clause. This of course breaks one of the common use-cases though: SELECT count(*) * 100 FROM table TABLESAMPLE SYSTEM(1) WHERE foo = bar;

> But in the case of a bitmap scan, you could very easily do either true Bernoulli or block-level sampling simply by adjusting the rules about which bits you keep or clear in the bitmap (given that you apply the filter between collection of the index bitmap and accessing the heap, which seems natural). The only case where a special scan type really seems to be needed is if you want block-level sampling, the query would otherwise use a seqscan, *and* the sampling percentage is pretty low --- if you'd be reading every second or third block anyway, you're likely better off with a plain seqscan so that the kernel sees sequential access and does prefetching. The current API doesn't seem to make it possible to switch between seqscan and read-only-selected-blocks based on the sampling percentage, but I think that could be an interesting optimization.

Well, you can do that if you write your own sampling method. We don't do that in the optimizer, and that's a design choice, because you can't really do that at a high level like that if you want to keep extensibility.
And given the number of people who asked if they could do their own sampling when I talked to them about this during the design stage, I consider the extensibility more important. Especially if extensibility gives you the option to do the switching anyway, albeit at a lower level and not out of the box.

> (Another bet that's been missed is having the samplescan logic request prefetching when it is doing selective block reads. The current API can't support that AFAICS, since there's no expectation that nextblock calls could be done asynchronously from nexttuple calls.)

Not sure I follow; would it not be possible to achieve this using tsmseqscan set to true (it's a misnomer then, I know)?

> Another issue with the API as designed is the examinetuple() callback. Allowing sample methods to see invisible tuples is quite possibly the worst idea in the whole patch. They can *not* do anything with such tuples, or they'll totally break reproducibility: if the tuple is invisible to your scan, it might well be or soon become invisible to everybody, whereupon it would be subject to garbage collection at the drop of a hat. So if an invisible tuple affects the sample method's behavior at all, repeated scans in the same query would
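Tom's "stateless (history-independent) sampling filter that just accepts or rejects TIDs" is easy to sketch outside of any executor machinery: hash the TID together with the seed and accept it when the hash falls below the sampling fraction. The function names and the mixing constants here are invented for illustration (the mixer is a MurmurHash3-style 64-bit finalizer), not anything from the patch:

```c
#include <stdbool.h>
#include <stdint.h>

/* Mix block number, offset, and seed into a pseudo-random 32-bit value.
 * Any decent integer mixer would do; this one is a MurmurHash3-style
 * 64-bit finalizer. */
static uint32_t
tid_hash(uint32_t blockno, uint16_t offset, uint32_t seed)
{
    uint64_t x = ((uint64_t) seed << 32) ^ ((uint64_t) blockno << 16) ^ offset;

    x ^= x >> 33;
    x *= UINT64_C(0xff51afd7ed558ccd);
    x ^= x >> 33;
    x *= UINT64_C(0xc4ceb9fe1a85ec53);
    x ^= x >> 33;
    return (uint32_t) x;
}

/*
 * History-independent Bernoulli filter: the verdict depends only on the
 * TID, the seed, and the fraction, so seqscans, indexscans, and bitmap
 * scans all sample the same rows, and repeated scans are repeatable.
 */
static bool
sample_accepts_tid(uint32_t blockno, uint16_t offset,
                   uint32_t seed, double fraction)
{
    if (fraction <= 0.0)
        return false;
    if (fraction >= 1.0)
        return true;
    return tid_hash(blockno, offset, seed) < (uint32_t) (fraction * 4294967296.0);
}

/* SYSTEM-style block sampling is the same test with the offset ignored. */
static bool
sample_accepts_block(uint32_t blockno, uint32_t seed, double fraction)
{
    return sample_accepts_tid(blockno, 0, seed, fraction);
}
```

In a bitmap scan, such a filter could be applied between collecting the index bitmap and visiting the heap, as suggested above: clear the bits for individual TIDs (Bernoulli) or whole blocks (SYSTEM-style) that the filter rejects.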
Re: [HACKERS] Freeze avoidance of very large table.
On Wed, Jul 15, 2015 at 3:07 AM, Sawada Masahiko sawada.m...@gmail.com wrote: On Wed, Jul 15, 2015 at 12:55 AM, Simon Riggs si...@2ndquadrant.com wrote: On 10 July 2015 at 15:11, Sawada Masahiko sawada.m...@gmail.com wrote: Oops, I had forgotten to add the new file heapfuncs.c. Latest patch is attached.

I think we've established that the approach is desirable and defined the way forwards for this, so this is looking good. If we want to move stuff like pg_stattuple, pg_freespacemap into core, we could move them into heapfuncs.c. Some of my requests haven't been actioned yet, so I personally would not commit this yet. I am happy to continue as reviewer/committer unless others wish to take over. The main missing item is pg_upgrade support, which won't happen by end of CF1, so I am marking this as Returned With Feedback. Hopefully we can review this again before CF2.

I appreciate your reviewing. Yeah, the pg_upgrade support and regression test for the VFM patch are almost done now; I will submit the patch this week after testing it.

Attached patch is the latest v9 patch. I added:
- regression test for visibility map (visibilitymap.sql and visibilitymap.out files)
- pg_upgrade support (rewriting vm file to vfm file)
- regression test for pg_upgrade

Please review it.

Regards,

--
Masahiko Sawada

diff --git a/contrib/pgstattuple/pgstatapprox.c b/contrib/pgstattuple/pgstatapprox.c
index 22c5f7a..b1b6a06 100644
--- a/contrib/pgstattuple/pgstatapprox.c
+++ b/contrib/pgstattuple/pgstatapprox.c
@@ -87,7 +87,7 @@ statapprox_heap(Relation rel, output_type *stat)
 	 * If the page has only visible tuples, then we can find out the free
 	 * space from the FSM and move on.
 	 */
-	if (visibilitymap_test(rel, blkno, vmbuffer))
+	if (visibilitymap_test(rel, blkno, vmbuffer, VISIBILITYMAP_ALL_VISIBLE))
 	{
 		freespace = GetRecordedFreeSpace(rel, blkno);
 		stat->tuple_len += BLCKSZ - freespace;
diff --git a/src/backend/access/heap/Makefile b/src/backend/access/heap/Makefile
index b83d496..806ce27 100644
--- a/src/backend/access/heap/Makefile
+++ b/src/backend/access/heap/Makefile
@@ -12,6 +12,7 @@ subdir = src/backend/access/heap
 top_builddir = ../../../..
 include $(top_builddir)/src/Makefile.global

-OBJS = heapam.o hio.o pruneheap.o rewriteheap.o syncscan.o tuptoaster.o visibilitymap.o
+OBJS = heapam.o hio.o pruneheap.o rewriteheap.o syncscan.o tuptoaster.o visibilitymap.o \
+	heapfuncs.o

 include $(top_srcdir)/src/backend/common.mk
diff --git a/src/backend/access/heap/heapam.c b/src/backend/access/heap/heapam.c
index 86a2e6b..796b76f 100644
--- a/src/backend/access/heap/heapam.c
+++ b/src/backend/access/heap/heapam.c
@@ -2131,8 +2131,9 @@ heap_insert(Relation relation, HeapTuple tup, CommandId cid,
 	CheckForSerializableConflictIn(relation, NULL, InvalidBuffer);

 	/*
-	 * Find buffer to insert this tuple into.  If the page is all visible,
-	 * this will also pin the requisite visibility map page.
+	 * Find buffer to insert this tuple into.  If the page is all visible
+	 * or all frozen, this will also pin the requisite visibility map and
+	 * frozen map page.
 	 */
 	buffer = RelationGetBufferForTuple(relation, heaptup->t_len,
 									   InvalidBuffer, options, bistate,
@@ -2147,7 +2148,11 @@ heap_insert(Relation relation, HeapTuple tup, CommandId cid,
 	if (PageIsAllVisible(BufferGetPage(buffer)))
 	{
 		all_visible_cleared = true;
+
+		/* all-frozen information is also cleared at the same time */
 		PageClearAllVisible(BufferGetPage(buffer));
+		PageClearAllFrozen(BufferGetPage(buffer));
+
 		visibilitymap_clear(relation,
 							ItemPointerGetBlockNumber(&(heaptup->t_self)),
 							vmbuffer);
@@ -2448,7 +2453,11 @@ heap_multi_insert(Relation relation, HeapTuple *tuples, int ntuples,
 		if (PageIsAllVisible(page))
 		{
 			all_visible_cleared = true;
+
+			/* all-frozen information is also cleared at the same time */
 			PageClearAllVisible(page);
+			PageClearAllFrozen(page);
+
 			visibilitymap_clear(relation,
 								BufferGetBlockNumber(buffer),
 								vmbuffer);
@@ -2731,9 +2740,9 @@ heap_delete(Relation relation, ItemPointer tid,
 	/*
 	 * If we didn't pin the visibility map page and the page has become all
-	 * visible while we were busy locking the buffer, we'll have to unlock and
-	 * re-lock, to avoid holding the buffer lock across an I/O.  That's a bit
-	 * unfortunate, but hopefully shouldn't happen often.
+	 * visible or all frozen while we were busy locking the buffer, we'll
+	 * have to unlock and re-lock, to avoid holding the buffer lock across an
+	 * I/O.  That's a bit unfortunate, but hopefully shouldn't happen often.
 	 */
 	if (vmbuffer == InvalidBuffer && PageIsAllVisible(page))
 	{
@@ -2925,10 +2934,15 @@ l1:
 	 */
 	PageSetPrunable(page, xid);

+	/* clear PD_ALL_VISIBLE and PD_ALL_FROZEN flags */
 	if (PageIsAllVisible(page))
 	{
 		all_visible_cleared = true;
+
+		/* all-frozen information is also cleared at the same time */
 		PageClearAllVisible(page);
+
Re: [HACKERS] optimizing vacuum truncation scans
On Thu, Jul 16, 2015 at 11:21 AM, Haribabu Kommi kommi.harib...@gmail.com wrote: On Wed, Jul 15, 2015 at 11:19 PM, Amit Kapila amit.kapil...@gmail.com wrote: One case where this patch can behave differently than the current code is when, before taking the Exclusive lock on the relation for truncation, some backend clears the vm bit and then inserts-deletes-prunes that page. I think with the patch it will not consider truncating such pages, whereas the current code will allow truncating them as it re-verifies each page. I think such a case would be rare and we might not want to bother about it, but it is still worth thinking about once.

Thanks for your review. Corrected the code: instead of returning the blkno after a visibility map check failure, the code now just continues to verify the contents as in the earlier approach.

Okay, I think this should work.

Here I attached updated patches: 1) without prefetch logic, 2) with combined vm and prefetch logic.

I think it is better to just get the first patch in, as that in itself is a clear win, and then we can evaluate the second one (prefetching during truncation) separately. I think after the first patch, the use case for doing prefetch seems to be less beneficial, and I think it could hurt by loading pages that are not required into the OS buffer cache (assume you have prefetched 32 pages and after inspecting the first page it decides to quit the loop because that is a non-empty page; another case is when it has prefetched just because for one page the visibility map bit was cleared, but for the others it is set). Having said that, I am not against prefetching in this scenario as it can help in more cases than it could hurt, but I think it will be better to evaluate that separately.

With Regards, Amit Kapila. EnterpriseDB: http://www.enterprisedb.com
Re: [HACKERS] TABLESAMPLE patch is really in pretty sad shape
Petr Jelinek wrote: On 2015-07-13 15:39, Tom Lane wrote: I don't find this to be good error message style. The secondary comment is not a hint, it's an ironclad statement of what you did wrong, so if we wanted to phrase it like this it should be an errdetail not errhint. But the whole thing is overly cute anyway, because there is no reason at all not to just say what we mean in the primary error message, e.g.

ereport(ERROR,
        (errcode(ERRCODE_NUMERIC_VALUE_OUT_OF_RANGE),
         errmsg("sample size must be greater than zero")));

Same as above, although now that I re-read the standard I am sure I misunderstood it the first time - it says:

If S is the null value or if S < 0 (zero) or if S > 100, then an exception condition is raised: data exception — invalid sample size.

I took it as a literal error message originally, but they just mean the status code by it.

Yes, we must use a new errcode ERRCODE_INVALID_SAMPLE_SIZE defined to 2202H here.

--
Álvaro Herrera                http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training Services

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Re: [COMMITTERS] pgsql: Map basebackup tablespaces using a tablespace_map file
On Fri, Jul 3, 2015 at 12:15 PM, Amit Kapila amit.kapil...@gmail.com wrote: On Thu, Jul 2, 2015 at 7:44 PM, Alvaro Herrera alvhe...@2ndquadrant.com wrote: Amit Kapila wrote: Added the above log messages in the attached patch, with the small change that in the message the file names will be displayed with quotes, as most other usages of rename (failure) in that file use quotes to display filenames.

Why emit two messages? Not necessary. Can we reduce that to a single one? Maybe the first one could be errdetail or something.

I think it is better the other way (basically have the second one as errdetail). We already have one similar message in xlog.c that way:

ereport(LOG,
        (errmsg("online backup mode canceled"),
         errdetail("\"%s\" was renamed to \"%s\".",
                   BACKUP_LABEL_FILE, BACKUP_LABEL_OLD)));

Attached patch consolidates the errmsg into one message.

Here are some minor comments:

+				ereport(LOG,
+						(errmsg("ignoring \"%s\" file because no \"%s\" file exists",
+								TABLESPACE_MAP, BACKUP_LABEL_FILE),
+						 errdetail("could not rename file \"%s\" to \"%s\": %m",
+								   TABLESPACE_MAP, TABLESPACE_MAP_OLD)));

WARNING is better than LOG here because it indicates a problematic case?

In the detail message, the first word of the sentence needs to be capitalized.

+						 errdetail("renamed file \"%s\" to \"%s\"",
+								   TABLESPACE_MAP, TABLESPACE_MAP_OLD)));

In a detail message, we should basically use a complete sentence. So like other similar detail messages in xlog.c, I think that it's better to use "\"%s\" was renamed to \"%s\"." as the detail message here.

Regards,

--
Fujii Masao
[HACKERS] Volatility of pg_xact_commit_timestamp() and pg_last_committed_xact()
Hi,

The volatilities of pg_xact_commit_timestamp() and pg_last_committed_xact() are currently STABLE. But ISTM that those functions can return different results even within a single statement. So shouldn't we change their volatilities to VOLATILE?

Regards,

--
Fujii Masao
Re: [HACKERS] RFC: replace pg_stat_activity.waiting with something more descriptive
On Jul 14, 2015, at 5:25 PM, Tom Lane t...@sss.pgh.pa.us wrote: Robert Haas robertmh...@gmail.com writes: I really think we should do the simple thing first. If we make this complicated and add lots of bells and whistles, it is going to be much harder to get anything committed, because there will be more things for somebody to object to. If we start with something simple, we can always improve it later, if we are confident that the design for improving it is good. The hardest thing about a proposal like this is going to be getting down the overhead to a level that is acceptable, and every expansion of the basic design that has been proposed - gathering more than one byte of information, or gathering times, or having the backend update a tracking hash - adds *significant* overhead to the design I proposed.

FWIW, I entirely share Robert's opinion that adding gettimeofday() overhead in routinely-taken paths is likely not to be acceptable. But there's no need to base this solely on opinions. I suggest somebody try instrumenting just one hotspot in the various ways that are being proposed, and then we can actually measure what it costs, instead of guessing about that.

regards, tom lane

I made a benchmark of gettimeofday() (gtd below). I believe it is certainly usable for monitoring.

Testing configuration: 24 cores, Intel Xeon CPU X5675 @ 3.07GHz, RAM 24 GB

54179703 - microseconds total
2147483647 - (INT_MAX), the number of gettimeofday() calls

54179703 / 2147483647.0 = 0.025229390256679331

Here we have the average duration of one gettimeofday() in microseconds. Now we get the count of all waits in one minute (pgbench -i -s 500, 2 GB shared buffers, started with -c 96 -j 4, it is almost 100% cpu load):

b1=# select sum(wait_count) from pg_stat_wait_profile;
   sum
---------
 2113608

So, we can estimate the time spent in gtd in one minute (multiplied by two, because we are going to call it twice for each wait):

2113608 * 0.025229390256679331 * 2 = 106650.08216327897

This is the time in microseconds that we spend on gtd in one minute. Calculation of overhead (in percent):

106650 / 60000000.0 * 100 = 0.17775
Re: [HACKERS] TABLESAMPLE patch is really in pretty sad shape
On 2015-07-16 15:59, Tom Lane wrote: Alvaro Herrera alvhe...@2ndquadrant.com writes: Petr Jelinek wrote: On 2015-07-13 00:36, Tom Lane wrote: PS: now that I've written this rant, I wonder why we don't redesign the index AM API along the same lines. It probably doesn't matter much at the moment, but if we ever get serious about supporting index AM extensions, I think we ought to consider doing that. I think this is very relevant to the proposed sequence am patch as well. Hmm, how would this work? Would we have index AM implementation run some function that register their support methods somehow at startup? Well, registration of new index AMs is an unsolved question ATM anyhow. But what I'm imagining is that pg_am would reduce to about two columns, amname and a handler function OID, and everything else that constitutes the API for AMs would get moved down to the C level. We have to keep that catalog because we still need index AMs to have OIDs that will represent them in pg_opclass etc; but we don't need to nail the exact set of AM interface functions into the catalog. (I'm not sure whether we'd want to remove all the bool columns from pg_am. At the C level it would be about as convenient to have them in a struct returned by the handler function. But it's occasionally useful to have those properties visible to SQL queries.) This is along the lines of how I was thinking also (when I read your previous email). I think the properties of the index will have to be decided on individual basis once somebody actually starts working on this. But functions can clearly go into C struct if they are called only from C anyway. I'm not clear on whether sequence AMs would need explicit catalog representation, or could be folded down to just a single SQL function with special signature as I suggested for tablesample handlers. Is there any need for a sequence AM to have additional catalog infrastructure like index AMs need? 
That depends on the route we will choose to take with the storage there. If we allow custom columns for sequence AMs (which is what both Heikki and me seem to be inclined to do) then I think it will still need a catalog, plus it's also easier to just reuse the relam behavior than coming up with something completely new IMHO.

In any case, if index AMs and sequence AMs go this route, that probably means the column store AM we're working on will have to go the same route too.

It's worth considering anyway. The FDW API has clearly been far more successful than the index AM API in terms of being practically usable by extensions.

Yep, I now consider it to be a clear mistake that I modeled both sequence AM and tablesample after indexes, given that I targeted both at extensibility.

--
Petr Jelinek                  http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training Services
Re: [HACKERS] TABLESAMPLE patch is really in pretty sad shape
On Thu, Jul 16, 2015 at 10:33 PM, Alvaro Herrera alvhe...@2ndquadrant.com wrote: Petr Jelinek wrote: On 2015-07-13 00:36, Tom Lane wrote: PS: now that I've written this rant, I wonder why we don't redesign the index AM API along the same lines. It probably doesn't matter much at the moment, but if we ever get serious about supporting index AM extensions, I think we ought to consider doing that.

+1

I think this is very relevant to the proposed sequence AM patch as well.

Hmm, how would this work? Would we have index AM implementations run some function that registers their support methods somehow at startup? Hopefully we're not going to have the index AMs become shared libraries.

I recall a proposal by Alexander Korotkov about extensible access methods, although his proposal also included a CREATE AM command that would add a pg_am row, so that perhaps differs from what Tom seems to allude to here.

http://www.postgresql.org/message-id/capphfdsxwzmojm6dx+tjnpyk27kt4o7ri6x_4oswcbyu1rm...@mail.gmail.com

Thanks,
Amit
Re: [HACKERS] RFC: replace pg_stat_activity.waiting with something more descriptive
Ildus Kurbangaliev i.kurbangal...@postgrespro.ru writes: I made a benchmark of gettimeofday(). I believe it is certainly usable for monitoring. Testing configuration: 24 cores, Intel Xeon CPU X5675 @ 3.07GHz, RAM 24 GB. 54179703 - microseconds total; 2147483647 - (INT_MAX), the number of gettimeofday() calls; 54179703 / 2147483647.0 = 0.025229390256679331. Here we have the average duration of one gettimeofday() in microseconds.

25 nsec per gettimeofday() is in the same ballpark as what I measured on a new-ish machine last year:
http://www.postgresql.org/message-id/flat/31856.1400021...@sss.pgh.pa.us

The problem is that (a) on modern hardware that is not a small number, it's the equivalent of 100 or more instructions; and (b) the results look very much worse on less-modern hardware, particularly machines where gettimeofday requires a kernel call.

regards, tom lane
Re: [HACKERS] [PATCH] postgres_fdw extension support
On Thu, Jul 16, 2015 at 11:06 PM, Tom Lane t...@sss.pgh.pa.us wrote: Amit Langote langote_amit...@lab.ntt.co.jp writes: On 2015-07-16 PM 12:43, Tom Lane wrote: The basic issue here is "how can a user control which functions/operators can be sent for remote execution?". While it's certainly true that sometimes you might want function-by-function control of that, Paul's point was that extension-level granularity would be extremely convenient for PostGIS, and probably for other extensions.

Perhaps just paranoid, but is the extension version number of any significance?

In any scenario for user control of sending functions to the far end, it's on the user's head to make sure that he's telling us the truth about which functions are compatible between local and remote servers. That would extend to checking cross-version compatibility if he's running different versions, too. We already have risks of that kind with built-in functions, really, and I've not heard complaints about it.

Yeah, that's true.

Thanks,
Amit
Re: [HACKERS] TABLESAMPLE patch is really in pretty sad shape
Amit Langote amitlangot...@gmail.com writes: On Thu, Jul 16, 2015 at 10:33 PM, Alvaro Herrera alvhe...@2ndquadrant.com wrote: Hmm, how would this work? Would we have index AM implementations run some function that registers their support methods somehow at startup? I recall a proposal by Alexander Korotkov about extensible access methods, although his proposal also included a CREATE AM command that would add a pg_am row, so that perhaps differs from what Tom seems to allude to here.

I think we'd still need to invent CREATE AM if we wanted to allow index AMs to be created as extensions; we'd still have to have the pg_am catalog, and extensions still couldn't write rows directly into that, for the same reasons I pointed out with respect to tablesample methods. However, if the contents of pg_am could be boiled down to just a name and a handler function, then that would represent a simple and completely stable definition for CREATE AM's arguments, which would be a large improvement over trying to reflect the current contents of pg_am directly in a SQL statement. We add new columns to pg_am all the time, and that would create huge backward-compatibility headaches if we had to modify the behavior of a CREATE AM statement every time.

regards, tom lane
Re: [HACKERS] [PATCH] postgres_fdw extension support
Amit Langote langote_amit...@lab.ntt.co.jp writes: On 2015-07-16 PM 12:43, Tom Lane wrote: The basic issue here is "how can a user control which functions/operators can be sent for remote execution?". While it's certainly true that sometimes you might want function-by-function control of that, Paul's point was that extension-level granularity would be extremely convenient for PostGIS, and probably for other extensions.

Perhaps just paranoid, but is the extension version number of any significance?

In any scenario for user control of sending functions to the far end, it's on the user's head to make sure that he's telling us the truth about which functions are compatible between local and remote servers. That would extend to checking cross-version compatibility if he's running different versions, too. We already have risks of that kind with built-in functions, really, and I've not heard complaints about it.

regards, tom lane
Re: [HACKERS] TABLESAMPLE doesn't actually satisfy the SQL spec, does it?
Petr Jelinek p...@2ndquadrant.com writes: On 2015-07-12 18:02, Tom Lane wrote: A possible way around this problem is to redefine the sampling rule so that it is not history-dependent but depends only on the tuple TIDs. For instance, one could hash the TID of a candidate tuple, xor that with a hash of the seed being used for the current query, and then select the tuple if (hash/MAXINT) < P.

That would work for bernoulli for physical tuples, yes. Only thing that worries me is future extensibility for data sources that only provide virtual tuples.

Well, repeatability of a TABLESAMPLE attached to a join seems like an unsolved and possibly unsolvable problem anyway. I don't think we should assume that the API we define today will cope with that. But that is another reason why the current API is inadequate: there's no provision for specifying whether or how a tablesample method can be applied to non-base-table RTEs. (I re-read the thread and noted that Peter E. complained about that some time ago, but nothing was done about it. I'm fine with not supporting the case right now, but nonetheless it's another reason why we'd better make the API more easily extensible.)

regards, tom lane
Re: [HACKERS] TABLESAMPLE patch is really in pretty sad shape
Petr Jelinek p...@2ndquadrant.com writes: On 2015-07-16 15:59, Tom Lane wrote: I'm not clear on whether sequence AMs would need explicit catalog representation, or could be folded down to just a single SQL function with special signature as I suggested for tablesample handlers. Is there any need for a sequence AM to have additional catalog infrastructure like index AMs need?

That depends on the route we will choose to take with the storage there. If we allow custom columns for sequence AMs (which is what both Heikki and me seem to be inclined to do) then I think it will still need a catalog, plus it's also easier to just reuse the relam behavior than coming up with something completely new IMHO.

Hm. I've not been following the sequence AM stuff carefully, but if you want to use relam to point at a sequence's AM, then really sequence AMs have to be represented in pg_am. (It would be quite broken from a relational theory standpoint if relam could point to either of two catalogs; not to mention that we have no way to guarantee OID uniqueness across multiple catalogs.) As things stand today, putting both kinds of AM into one catalog would be pretty horrible, but it seems not hard to make it work if we did this sort of refactoring first. We'd end up with three columns in pg_am: amname, amkind (index or sequence), and amhandler. The type and contents of the struct returned by amhandler could be different depending on amkind, which means that the APIs could be as different as we need, despite sharing just one catalog. The only real restriction is that index and sequence AMs could not have the same names, which doesn't seem like much of a problem from here.

regards, tom lane
Re: [HACKERS] TABLESAMPLE doesn't actually satisfy the SQL spec, does it?
On 2015-07-16 16:22, Tom Lane wrote: Petr Jelinek p...@2ndquadrant.com writes: On 2015-07-12 18:02, Tom Lane wrote: A possible way around this problem is to redefine the sampling rule so that it is not history-dependent but depends only on the tuple TIDs. For instance, one could hash the TID of a candidate tuple, xor that with a hash of the seed being used for the current query, and then select the tuple if (hash/MAXINT) < P.

That would work for bernoulli for physical tuples, yes. Only thing that worries me is future extensibility for data sources that only provide virtual tuples.

Well, repeatability of a TABLESAMPLE attached to a join seems like an unsolved and possibly unsolvable problem anyway. I don't think we should assume that the API we define today will cope with that.

OK, it's true that the implementations I've seen in other databases so far only concern themselves with sampling physical relations and ignore the rest.

But that is another reason why the current API is inadequate: there's no provision for specifying whether or how a tablesample method can be applied to non-base-table RTEs. (I re-read the thread and noted that Peter E. complained about that some time ago, but nothing was done about it. I'm fine with not supporting the case right now, but nonetheless it's another reason why we'd better make the API more easily extensible.)

Nothing in terms of implementation, yes; I did write up my idea of how this could be done by extending the current API in the future. I won't try to pretend that I am absolutely sure that the API might not need some breaking change to do that, though.

--
Petr Jelinek                  http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training Services
Re: [HACKERS] RFC: replace pg_stat_activity.waiting with something more descriptive
On Thu, Jul 16, 2015 at 10:54 AM, Tom Lane t...@sss.pgh.pa.us wrote: Ildus Kurbangaliev i.kurbangal...@postgrespro.ru writes: I made a benchmark of gettimeofday(). I believe it is certainly usable for monitoring. Testing configuration: 24 cores, Intel Xeon CPU X5675 @ 3.07GHz, RAM 24 GB. 54179703 - microseconds total; 2147483647 - (INT_MAX), the number of gettimeofday() calls; 54179703 / 2147483647.0 = 0.025229390256679331. Here we have the average duration of one gettimeofday() in microseconds.

25 nsec per gettimeofday() is in the same ballpark as what I measured on a new-ish machine last year:
http://www.postgresql.org/message-id/flat/31856.1400021...@sss.pgh.pa.us

The problem is that (a) on modern hardware that is not a small number, it's the equivalent of 100 or more instructions; and (b) the results look very much worse on less-modern hardware, particularly machines where gettimeofday requires a kernel call.

Yes, we've been through this many times before. All you have to do is look at how much slower a query gets when you run EXPLAIN ANALYZE vs. when you run it without EXPLAIN ANALYZE. The slowdown there is platform-dependent, but I think it's significant even on platforms where gettimeofday is fast, like modern Linux machines. That overhead is precisely the reason why we added EXPLAIN (ANALYZE, TIMING OFF) - so that if you want to, you can see the row-count estimates without incurring the timing overhead.

There is *plenty* of evidence that using gettimeofday in contexts where it may be called many times per query measurably hurts performance. It is possible that we can have an *optional feature* where timing can be turned on, but it is dead certain that turning it on unconditionally will be unacceptable.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
Re: [HACKERS] Retrieve the snapshot's LSN
On Thu, Jul 16, 2015 at 12:54 PM, Andres Freund and...@anarazel.de wrote: Well, in combination with logical decoding it kinda has one: It should allow you to take a dump of the database with a certain snapshot and replay all transactions with a commit LSN bigger than the snapshot's LSN and end up with a continually consistent database. Visibility for HS actually works precisely in commit LSN order, even if that is possibly different than on the primary...

That makes sense, and hopefully answers Florent's question about why this is only exposed through the slot mechanism.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
Re: [HACKERS] Retrieve the snapshot's LSN
On 2015-07-16 12:40:07 -0400, Robert Haas wrote: On Wed, Jul 15, 2015 at 12:51 PM, Florent Guiliani flor...@guiliani.fr wrote: During slot creation, the snapshot building and exporting code seems highly coupled with the logical decoding stuff. It doesn't seem very reusable for retrieving the snapshot's LSN outside of logical decoding.

I don't think the snapshot's LSN has a well-defined meaning in general. The obvious meaning would be the LSN such that all commits prior to that LSN are visible and all later commits are invisible, but such an LSN need not exist. Suppose A writes a commit record at LSN 0/1, and then B writes a commit record at 0/10100, and then B calls ProcArrayEndTransaction(). At this point, B is visible and A is not visible, even though A's commit record precedes that of B.

Well, in combination with logical decoding it kinda has one: It should allow you to take a dump of the database with a certain snapshot and replay all transactions with a commit LSN bigger than the snapshot's LSN and end up with a continually consistent database. Visibility for HS actually works precisely in commit LSN order, even if that is possibly different than on the primary...
Re: [HACKERS] Re: [COMMITTERS] pgsql: Map basebackup tablespaces using a tablespace_map file
On Thu, Jul 16, 2015 at 9:41 AM, Fujii Masao masao.fu...@gmail.com wrote: Here are some minor comments:

+				ereport(LOG,
+						(errmsg("ignoring \"%s\" file because no \"%s\" file exists",
+								TABLESPACE_MAP, BACKUP_LABEL_FILE),
+						 errdetail("could not rename file \"%s\" to \"%s\": %m",
+								   TABLESPACE_MAP, TABLESPACE_MAP_OLD)));

WARNING is better than LOG here because it indicates a problematic case?

No, that's not the right distinction. Remember that, when sending messages to the client, WARNING > LOG, and when sending messages to the log, LOG > WARNING. So messages that a user is more likely to care about than the administrator should be logged at WARNING; those that the administrator is more likely to care about should be LOG. I think LOG is clearly the appropriate thing here.

In the detail message, the first word of the sentence needs to be capitalized.

+						 errdetail("renamed file \"%s\" to \"%s\"",
+								   TABLESPACE_MAP, TABLESPACE_MAP_OLD)));

In a detail message, we should basically use a complete sentence. So like other similar detail messages in xlog.c, I think that it's better to use "\"%s\" was renamed to \"%s\"." as the detail message here.

Right, that's what the style guidelines say.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
Re: [HACKERS] Retrieve the snapshot's LSN
On Wed, Jul 15, 2015 at 12:51 PM, Florent Guiliani flor...@guiliani.fr wrote: During slot creation, the snapshot building and exporting code seems highly coupled with the logical decoding stuff. It doesn't seem very reusable for retrieving the snapshot's LSN outside of logical decoding. I don't think the snapshot's LSN has a well-defined meaning in general. The obvious meaning would be the LSN such that all commits prior to that LSN are visible and all later commits are invisible, but such an LSN need not exist. Suppose A writes a commit record at LSN 0/1, and then B writes a commit record at 0/10100, and then B calls ProcArrayEndTransaction(). At this point, B is visible and A is not visible, even though A's commit record precedes that of B. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: [HACKERS] Re: [COMMITTERS] pgsql: Map basebackup tablespaces using a tablespace_map file
Robert Haas wrote: On Thu, Jul 16, 2015 at 9:41 AM, Fujii Masao masao.fu...@gmail.com wrote: Here are some minor comments: +ereport(LOG, +(errmsg("ignoring \"%s\" file because no \"%s\" file exists", +TABLESPACE_MAP, BACKUP_LABEL_FILE), + errdetail("could not rename file \"%s\" to \"%s\": %m", + TABLESPACE_MAP, TABLESPACE_MAP_OLD))); WARNING is better than LOG here because it indicates a problematic case? No, that's not the right distinction. Remember that, when sending messages to the client, WARNING > LOG, and when sending messages to the log, LOG > WARNING. So messages that a user is more likely to care about than the administrator should be logged at WARNING; those that the administrator is more likely to care about should be LOG. I think LOG is clearly the appropriate thing here. Right. Now that we have errors marked with criticality, it will be pretty obvious what message is indicating a system problem rather than just a problem that can be ignored without taking any action. ... oh, actually we don't have the criticality marking yet, do we. I think we need to fix that at some point. (Some guys have said to grep the log for certain SQLSTATE codes, but, uh, really?) -- Álvaro Herrera http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Remote DBA, Training Services
Re: [HACKERS] [PATCH] postgres_fdw extension support
Michael, thanks so much for the review! On July 15, 2015 at 7:35:11 PM, Michael Paquier (michael.paqu...@gmail.com) wrote: This patch includes some diff noise, it would be better to remove that. Done. - if (!is_builtin(fe->funcid)) + if (!is_builtin(fe->funcid) && (!is_in_extension(fe->funcid, fpinfo))) Extra parentheses are not needed. OK, will remove. + if ( (!is_builtin(oe->opno)) && (!is_in_extension(oe->opno, fpinfo)) ) ... And this does not respect the project code format. See here for more details for example: http://www.postgresql.org/docs/devel/static/source.html I’m sorry, that link doesn’t clarify for me what’s stylistically wrong here (it’s almost all about error messages), could you help (is it the padding around the conditionals? removed that) + /* PostGIS metadata */ + List *extensions; + bool use_postgis; + Oid postgis_oid; This addition in PgFdwRelationInfo is surprising. What's the point of keeping use_postgis and postgis_oid when they are actually used nowhere? Whoops, a couple of old pieces from my proof-of-concept survived the conversion to a generic feature. Removed. appendStringInfo(buf, "::%s", - format_type_with_typemod(node->consttype, - node->consttypmod)); + format_type_be_qualified(node->consttype)); What's the reason for this change? Good question. As I recall, the very sparse search path the FDW connection makes can sometimes leave remote functions failing to find other functions they need, so we want to force the calls to be schema-qualified. Unfortunately there’s no perfect public call for that; ideally it would be return format_type_internal(type_oid, typemod, true, false, true), but there’s no function like that, so I settled for format_type_be_qualified(), which forces qualification at the expense of ignoring the typmod. Thinking a bit wider, why is this just limited to extensions? You may have as well other objects defined locally and remotely like functions or operators that are not defined in extensions, but as normal objects.
Hence I think that what you are looking for is not this option, but a boolean option enforcing the behavior of code paths using is_builtin() in foreign_expr_walker such that the sanity checks on existing objects are not limited to FirstBootstrapObjectId but extend to other objects in the catalogs. Well, as I see it there are three broad categories of behavior available: 1- Forward nothing non-built-in (current behavior) 2- Use options to forward only specified non-built-in things (either in chunks (extensions, as in this patch) or one-by-one (mark your desired functions / ops)) 3- Forward everything if a “forward everything” option is set I hadn’t actually considered the possibility of option 3, but for my purposes it would work just as well, with the added efficiency bonus of not having to check whether particular funcs/ops are inside declared extensions. Both the current state of FDW and the patch I’m providing expect a *bit* of smarts on the part of users, to make sure their remote/local environments are in sync (in particular versions of pgsql and of extensions). Option 3 just ups the ante on that requirement. I’d be fine w/ this; it makes the patch very simple and fast. For option 2, marking things one at a time really isn’t practical for a package like PostGIS, with several hundred functions and operators. Using the larger block of “extension” makes more sense. I think in general, expecting users to itemize every func/op they want to forward, particularly if they just want an extension to “work” over FDW, is too big an expectation. That’s not to minimize the utility of being able to mark individual functions/ops for forwarding, but I think that’s a separate use case that doesn’t eliminate the need for extension-level forwarding. Thanks again for the review! P.
That's a risky option because it could lead to inconsistencies among objects, so obviously the default is false, but by correctly documenting the risks of using this option we may be able to get something integrated (aka be sure that each object is defined consistently across the servers queried or you'll have surprising results!). In short, it seems to me that you are taking the wrong approach. -- fdw-extension-support-2.diff Description: Binary data
Re: [HACKERS] proposal: multiple psql option -c
On Thu, Jul 16, 2015 at 4:42 PM, Pavel Stehule pavel.steh...@gmail.com wrote: Hi, can we support multiple -c options? Why? Because some statements like VACUUM cannot be used together with any other statements with a single -c option. The current solution is using echo and a pipe, but it is a complication in some complex scripts - a higher complication when you run psql via multiple sudo statements. Example: psql -c "select pg_stat_reset()" -c "vacuum full analyze" dbname or on all dbs psql -At -c "select datname from pg_database" postgres | \ xargs -n 1 -P 3 psql -c ... -c ... Ideas, notes, comments? Why do you want it if we already have the -f option that covers this use case? Regards, -- Fabrízio de Royes Mello Consultoria/Coaching PostgreSQL Timbira: http://www.timbira.com.br Blog: http://fabriziomello.github.io Linkedin: http://br.linkedin.com/in/fabriziomello Twitter: http://twitter.com/fabriziomello Github: http://github.com/fabriziomello
[HACKERS] proposal: multiple psql option -c
Hi, can we support multiple -c options? Why? Because some statements like VACUUM cannot be used together with any other statements with a single -c option. The current solution is using echo and a pipe, but it is a complication in some complex scripts - a higher complication when you run psql via multiple sudo statements. Example: psql -c "select pg_stat_reset()" -c "vacuum full analyze" dbname or on all dbs psql -At -c "select datname from pg_database" postgres | \ xargs -n 1 -P 3 psql -c ... -c ... Ideas, notes, comments? Regards Pavel
Re: [HACKERS] proposal: multiple psql option -c
2015-07-16 23:10 GMT+02:00 Rosser Schwarz rosser.schw...@gmail.com: On Thu, Jul 16, 2015 at 1:44 PM, Pavel Stehule pavel.steh...@gmail.com wrote: 2015-07-16 22:07 GMT+02:00 Fabrízio de Royes Mello fabriziome...@gmail.com: Why do you want it if we already have the -f option that covers this use case? It doesn't help me - we want to run a script, possibly remotely (via ssh), without the necessity of creating (and later dropping) files on production servers. Does piping a series of commands into psql work in your scenario? You can even say things like: cat $local_file | ssh $production_server 'psql $database' probably not - the first remote command is sudo su - due to security reasons -- :wq
Re: [HACKERS] proposal: multiple psql option -c
2015-07-16 22:07 GMT+02:00 Fabrízio de Royes Mello fabriziome...@gmail.com : On Thu, Jul 16, 2015 at 4:42 PM, Pavel Stehule pavel.steh...@gmail.com wrote: Hi, can we support multiple -c options? Why? Because some statements like VACUUM cannot be used together with any other statements with a single -c option. The current solution is using echo and a pipe, but it is a complication in some complex scripts - a higher complication when you run psql via multiple sudo statements. Example: psql -c "select pg_stat_reset()" -c "vacuum full analyze" dbname or on all dbs psql -At -c "select datname from pg_database" postgres | \ xargs -n 1 -P 3 psql -c ... -c ... Ideas, notes, comments? Why do you want it if we already have the -f option that covers this use case? It doesn't help me - we want to run a script, possibly remotely (via ssh), without the necessity of creating (and later dropping) files on production servers. Remote execution of scripts is much simpler if you don't need to create any script files. Regards Pavel Regards, -- Fabrízio de Royes Mello Consultoria/Coaching PostgreSQL Timbira: http://www.timbira.com.br Blog: http://fabriziomello.github.io Linkedin: http://br.linkedin.com/in/fabriziomello Twitter: http://twitter.com/fabriziomello Github: http://github.com/fabriziomello
Re: [HACKERS] Memory prefetching while sequentially fetching from SortTuple array, tuplestore
Peter Geoghegan p...@heroku.com writes: Attached patch adds a portable Postgres wrapper on the GCC intrinsic. Meh. I don't like the assumption that non-GCC compilers will be smart enough to optimize away the useless-to-them if() tests this adds. Please refactor that so that there is exactly 0 new code when the intrinsic doesn't exist. regards, tom lane
Re: [HACKERS] RFC: replace pg_stat_activity.waiting with something more descriptive
On Tue, Jul 14, 2015 at 7:25 AM, Tom Lane t...@sss.pgh.pa.us wrote: FWIW, I entirely share Robert's opinion that adding gettimeofday() overhead in routinely-taken paths is likely not to be acceptable. I think that it can depend on many factors. For example, the availability of vDSO support on Linux/glibc. I've heard that clock_gettime() with CLOCK_REALTIME_COARSE, or with CLOCK_MONOTONIC_COARSE can have significantly lower overhead than gettimeofday(). -- Peter Geoghegan
Re: [HACKERS] Re: [COMMITTERS] pgsql: Map basebackup tablespaces using a tablespace_map file
On Thu, Jul 16, 2015 at 12:30 PM, Alvaro Herrera alvhe...@2ndquadrant.com wrote: Amit Kapila wrote: This can be tracked either in 9.5 Open Items or for next CF, any opinions? If nobody else has any opinion on this, I will add it to 9.5 Open Items list. I think this belongs in the open items list, yeah. Okay, added to the list of 9.5 Open Items. With Regards, Amit Kapila. EnterpriseDB: http://www.enterprisedb.com
Re: [HACKERS] Re: [COMMITTERS] pgsql: Map basebackup tablespaces using a tablespace_map file
On Fri, Jul 17, 2015 at 1:28 AM, Robert Haas robertmh...@gmail.com wrote: On Thu, Jul 16, 2015 at 9:41 AM, Fujii Masao masao.fu...@gmail.com wrote: Here are some minor comments: +ereport(LOG, +(errmsg("ignoring \"%s\" file because no \"%s\" file exists", +TABLESPACE_MAP, BACKUP_LABEL_FILE), + errdetail("could not rename file \"%s\" to \"%s\": %m", + TABLESPACE_MAP, TABLESPACE_MAP_OLD))); WARNING is better than LOG here because it indicates a problematic case? No, that's not the right distinction. Remember that, when sending messages to the client, WARNING > LOG, and when sending messages to the log, LOG > WARNING. So messages that a user is more likely to care about than the administrator should be logged at WARNING; those that the administrator is more likely to care about should be LOG. I think LOG is clearly the appropriate thing here. Isn't this rule confusing to administrators? ISTM that administrators would intuitively and literally pay more attention to WARNING than LOG. Also there are already some warning messages with WARNING level that administrators rather than clients should care about. For example, the warning message which is output when archive_command fails: ereport(WARNING, (errmsg("archiving transaction log file \"%s\" failed too many times, will try again later", xlog))); Regards, -- Fujii Masao
Re: [HACKERS] Memory Accounting v11
On 16 July 2015 at 05:07, Jeff Davis pg...@j-davis.com wrote: On Wed, 2015-07-15 at 12:59 +0530, Atri Sharma wrote: I think a heuristic would be more suited here and ignoring memory consumption for internal types means that we are not making the memory accounting useful for a set of usecases. OK, I will drop this patch and proceed with the HashAgg patch, with a heuristic for internal types. Should we mark the patch as returned with feedback in the commitfest app then? -- David Rowley http://www.2ndQuadrant.com/ http://www.2ndquadrant.com/ PostgreSQL Development, 24x7 Support, Training Services
Re: [HACKERS] Add CINE for ALTER TABLE ... ADD COLUMN
On Fri, Jun 26, 2015 at 12:41 AM, Fabrízio de Royes Mello fabriziome...@gmail.com wrote: On Wed, Jun 24, 2015 at 3:36 PM, Alvaro Herrera alvhe...@2ndquadrant.com wrote: Fabrízio de Royes Mello wrote: Another rebased version. There are a number of unrelated whitespace changes in this patch; also please update the comment on top of check_for_column_name_collision. Sorry, bad merging after a pgindent run. Comments on top of check_for_column_name_collision also updated. I had a look at this patch, and here are some minor comments: 1) In alter_table.sgml, you need a space here: [ IF NOT EXISTS ]<replaceable> 2) + check_for_column_name_collision(targetrelation, newattname, false); (void) needs to be added in front of check_for_column_name_collision where its return value is not checked, or static code analyzers are surely going to complain. 3) Something minor, some lines of code exceed 80 characters, see the declaration of check_for_column_name_collision for example... 4) This comment needs more precision: /* new name should not already exist */ - check_for_column_name_collision(rel, colDef->colname); + if (!check_for_column_name_collision(rel, colDef->colname, if_not_exists)) The new name can actually exist if if_not_exists is true. Except for that, the implementation looks sane to me. Regards, -- Michael
Re: [HACKERS] Re: [COMMITTERS] pgsql: Map basebackup tablespaces using a tablespace_map file
On Thu, Jul 16, 2015 at 9:54 PM, Fujii Masao masao.fu...@gmail.com wrote: Isn't this rule confusing the administrators? I'd like to think not, but yeah, it probably is. It is not like it isn't documented. There are even comments in postgresql.conf explaining it. But the fact that we have committers who are confused probably isn't a great sign. I really rather like the system we have and find it quite intuitive, but it's obvious that a lot of other people don't. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: [HACKERS] RFC: replace pg_stat_activity.waiting with something more descriptive
Peter Geoghegan p...@heroku.com writes: I've heard that clock_gettime() with CLOCK_REALTIME_COARSE, or with CLOCK_MONOTONIC_COARSE can have significantly lower overhead than gettimeofday(). It can, but it also has *much* lower precision, typically 1ms or so. regards, tom lane
Re: [HACKERS] pg_rewind failure by file deletion in source server
On Wed, Jul 1, 2015 at 9:31 PM, Fujii Masao masao.fu...@gmail.com wrote: On Wed, Jul 1, 2015 at 2:21 AM, Heikki Linnakangas hlinn...@iki.fi wrote: On 06/29/2015 09:44 AM, Michael Paquier wrote: On Mon, Jun 29, 2015 at 4:55 AM, Heikki Linnakangas wrote: But we'll still need to handle the pg_xlog symlink case somehow. Perhaps it would be enough to special-case pg_xlog for now. Well, sure, pg_rewind does not copy the soft links either way. Now it would be nice to have an option to be able to recreate the soft link of at least pg_xlog even if it can be scripted as well after a run. Hmm. I'm starting to think that pg_rewind should ignore pg_xlog entirely. In any non-trivial scenario, just copying all the files from pg_xlog isn't enough anyway, and you need to set up a recovery.conf after running pg_rewind that contains a restore_command or primary_conninfo, to fetch the WAL. So you can argue that by not copying pg_xlog automatically, we're actually doing a favour to the DBA, by forcing him to set up the recovery.conf file correctly. Because if you just test simple scenarios where not much time has passed between the failover and running pg_rewind, it might be enough to just copy all the WAL currently in pg_xlog, but it would not be enough if more time had passed and not all the required WAL is present in pg_xlog anymore. And by not copying the WAL, we can avoid some copying, as restore_command or streaming replication will only copy what's needed, while pg_rewind would copy all the WAL it can find in the target's data directory. pg_basebackup also doesn't include any WAL, unless you pass the --xlog option. It would be nice to also add an optional --xlog option to pg_rewind, but with pg_rewind it's possible that all the required WAL isn't present in the pg_xlog directory anymore, so you wouldn't always achieve the same effect of making the backup self-contained. So, I propose the attached. It makes pg_rewind ignore the pg_xlog directory in both the source and the target.
If pg_xlog is simply ignored, some old WAL files may remain in the target server. Don't these old files cause the subsequent startup of the target server as a new standby to fail? That is, it's the case where a WAL file with the same name but different content exists both in target and source. If that's harmful, pg_rewind also should remove the files in pg_xlog of the target server. This would reduce usability. The rewound node will replay WAL from the previous checkpoint where WAL forked up to the minimum recovery point of the source node where pg_rewind has been run. Hence if we remove completely the contents of pg_xlog we'd lose a portion of the logs that need to be replayed until the timeline is switched on the rewound node when recovering it (while streaming from the promoted standby, or whatever). I don't really see why recycled segments would be a problem, as that's perhaps what you are referring to, but perhaps I am missing something. Attached is a rebased version of the previous patch to ignore the contents of pg_xlog/ when rewinding. -- Michael 20150717_pgrewind_ignore_xlog.patch Description: binary/octet-stream
Re: [HACKERS] proposal: multiple psql option -c
it is one possible solution too multiple -c option has advantage of simple evaluation of backslash statements .. -c \l -c \dt - but this advantage is not highly important. Or just properly understand the ; ? -c "select * from foo; update bar set baz = 'bing'; vacuum bar;" JD Pavel Best Regards, Dinesh manojadinesh.blogspot.com http://manojadinesh.blogspot.com Regards Pavel -- Command Prompt, Inc. - http://www.commandprompt.com/ 503-667-4564 PostgreSQL Centered full stack support, consulting and development. Announcing I'm offended is basically telling the world you can't control your own emotions, so everyone else should do it for you.
Re: [HACKERS] proposal: multiple psql option -c
2015-07-17 6:26 GMT+02:00 Joshua D. Drake j...@commandprompt.com: it is one possible solution too multiple -c option has advantage of simple evaluation of backslash statements .. -c \l -c \dt - but this advantage is not highly important. Or just properly understand the ; ? -c "select * from foo; update bar set baz = 'bing'; vacuum bar;" there is a risk of compatibility issues - all statements run under one transaction implicitly Pavel JD Pavel Best Regards, Dinesh manojadinesh.blogspot.com http://manojadinesh.blogspot.com Regards Pavel -- Command Prompt, Inc. - http://www.commandprompt.com/ 503-667-4564 PostgreSQL Centered full stack support, consulting and development. Announcing I'm offended is basically telling the world you can't control your own emotions, so everyone else should do it for you.
Re: [HACKERS] Bugs in our qsort implementation
Peter Geoghegan p...@heroku.com writes: On Thu, Jul 16, 2015 at 5:05 PM, Tom Lane t...@sss.pgh.pa.us wrote: It's possible that this issue can only manifest on 9.4 and up where we have the ability for tuplesort to allocate work arrays approaching INT_MAX elements. But I don't have a lot of faith in that; I think the worst-case stack depth for the way we have it now could be as bad as O(N), so in principle a crash could be possible with significantly smaller input arrays. I think we'd better back-patch this all the way. +1. If you want to generate a worst case, McIlroy wrote a program that will generate one [1]. AFAIR, it will generate a series of self-consistent comparisons in the gas comparator that produce a worst-case outcome (as opposed to producing a simple worst-case input, which would be more convenient in this kind of scenario). This is known specifically to affect the Bentley McIlroy implementation, as the paper goes into. [1] http://www.cs.dartmouth.edu/~doug/mdmspe.pdf Wow, interesting article. I stuck that into a test program with our qsort, and found out that (1) our presort check totally beats that oracle, because it causes the oracle to freeze the gas in perfectly sequential order, so that the presort run gets all the way through and decides that the input is sorted. Hence, N-1 comparisons and no recursion. (2) if you dike out the presort check, you do see quadratic comparison behavior but the max recursion depth is only 1. Apparently, the oracle effectively always makes the right-hand partition as large as possible, so that recursing on the left partition is the optimal policy as far as stack depth goes.
However, you can trivially modify this oracle to break our code: just negate its comparison result, ie - return val[x] - val[y]; /* only the sign matters */ + return -(val[x] - val[y]); /* only the sign matters */ This stops the presort check immediately, since now the data looks to be reverse-sorted instead of correctly ordered; and now it makes the left-hand partition as large as possible, instead of the right. With our unmodified code and the tweaked oracle, I see maximum recursion depth in the vicinity of N/4. So it's definitely possible to get O(N) stack growth without a fix, though it may take very unusual input. With the correction to recurse to the smaller partition, we get max recursion depth of just 1, since that always chooses a very small partition to recurse to. regards, tom lane
Re: [HACKERS] proposal: multiple psql option -c
2015-07-17 0:03 GMT+02:00 dinesh kumar dineshkuma...@gmail.com: On Thu, Jul 16, 2015 at 12:42 PM, Pavel Stehule pavel.steh...@gmail.com wrote: Hi, can we support multiple -c options? Why? Because some statements like VACUUM cannot be used together with any other statements with a single -c option. The current solution is using echo and a pipe, but it is a complication in some complex scripts - a higher complication when you run psql via multiple sudo statements. Example: psql -c "select pg_stat_reset()" -c "vacuum full analyze" dbname or on all dbs psql -At -c "select datname from pg_database" postgres | \ xargs -n 1 -P 3 psql -c ... -c ... Ideas, notes, comments? IMO, rather than having multiple -c args, it would be good to have another flag like -C which accepts and executes multiple SQL statements sequentially. it is one possible solution too multiple -c option has advantage of simple evaluation of backslash statements .. -c \l -c \dt - but this advantage is not highly important. Pavel Best Regards, Dinesh manojadinesh.blogspot.com Regards Pavel
Re: [HACKERS] Support for N synchronous standby servers - take 2
On Thu, Jul 16, 2015 at 11:10 PM, Robert Haas robertmh...@gmail.com wrote: On Thu, Jul 16, 2015 at 1:32 PM, Simon Riggs si...@2ndquadrant.com wrote: * Developers will immediately understand the format I doubt it. I think any format that we pick will have to be carefully documented. People may know what JSON looks like in general, but they will not immediately know what bells and whistles are available in this context. I also think any format where the user has to carefully remember how to provide the values is not user-friendly, which is why I wonder whether an SQL-based syntax wouldn't be preferable here; with that we could even achieve consistency of this parameter across all servers. That is not of utmost importance for this feature, but I still think it would make users happy. With Regards, Amit Kapila. EnterpriseDB: http://www.enterprisedb.com
Re: [HACKERS] auto_explain sample rate
On 7 July 2015 at 21:37, Julien Rouhaud julien.rouh...@dalibo.com wrote: Well, I obviously missed that pg_srand48() is only used if the system lacks random/srandom, sorry for the noise. So yes, random() must be used instead of pg_lrand48(). I'm attaching a new version of the patch fixing this issue just in case. Thanks for picking this up. I've been trying to find time to come back to it but been swamped in priority work. -- Craig Ringer http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Training Services
[HACKERS] Re: Memory prefetching while sequentially fetching from SortTuple array, tuplestore
On Thu, Jul 16, 2015 at 4:01 PM, Peter Geoghegan p...@heroku.com wrote: Attached patch adds a portable Postgres wrapper on the GCC intrinsic. It also adds a client within tuplesort.c -- a common function that seems like a good generic choke point. I can get a speed-up of 6% - 9% for all of these cases by prefetching a few SortTuples ahead, for the tuple proper. (In-memory sorts only.) I added a silly bug during last minute clean-up. I attach a V2. -- Peter Geoghegan From 5d3093e9c0d10f4140072da8d18d0dddf6b84b1e Mon Sep 17 00:00:00 2001 From: Peter Geoghegan peter.geoghega...@gmail.com Date: Sun, 12 Jul 2015 13:14:01 -0700 Subject: [PATCH] Prefetch from memtuples array in tuplesort Testing shows that prefetching the tuple proper of a slightly later SortTuple in the memtuples array during each of many sequential, in-logical-order SortTuple fetches speeds up various sorting intense operations considerably. For example, B-Tree index builds are accelerated as leaf pages are created from the memtuples array. (i.e. The operation following actually performing the sort, but before a tuplesort_end() call is made as a B-Tree spool is destroyed.) Similarly, ordered set aggregates (all cases except the datumsort case with a pass-by-value type), and regular heap tuplesorts benefit to about the same degree. The optimization is only used when sorts fit in memory, though. Also, prefetch a few places ahead within the analogous fetching point in tuplestore.c. This appears to offer similar benefits in certain cases. For example, queries involving large common table expressions significantly benefit. 
--- config/c-compiler.m4| 17 + configure | 30 ++ configure.in| 1 + src/backend/utils/sort/tuplesort.c | 18 ++ src/backend/utils/sort/tuplestore.c | 11 +++ src/include/c.h | 19 +++ src/include/pg_config.h.in | 3 +++ src/include/pg_config.h.win32 | 3 +++ 8 files changed, 102 insertions(+) diff --git a/config/c-compiler.m4 b/config/c-compiler.m4 index 050bfa5..5776201 100644 --- a/config/c-compiler.m4 +++ b/config/c-compiler.m4 @@ -287,6 +287,23 @@ fi])# PGAC_C_BUILTIN_UNREACHABLE +# PGAC_C_BUILTIN_PREFETCH +# - +# Check if the C compiler understands __builtin_prefetch(), +# and define HAVE__BUILTIN_PREFETCH if so. +AC_DEFUN([PGAC_C_BUILTIN_PREFETCH], +[AC_CACHE_CHECK(for __builtin_prefetch, pgac_cv__builtin_prefetch, +[AC_COMPILE_IFELSE([AC_LANG_PROGRAM([], +[int i = 0;__builtin_prefetch(i, 0, 3);])], +[pgac_cv__builtin_prefetch=yes], +[pgac_cv__builtin_prefetch=no])]) +if test x$pgac_cv__builtin_prefetch = xyes ; then +AC_DEFINE(HAVE__BUILTIN_PREFETCH, 1, + [Define to 1 if your compiler understands __builtin_prefetch.]) +fi])# PGAC_C_BUILTIN_PREFETCH + + + # PGAC_C_VA_ARGS # -- # Check if the C compiler understands C99-style variadic macros, diff --git a/configure b/configure index 77048e8..f14e65d 100755 --- a/configure +++ b/configure @@ -11248,6 +11248,36 @@ if test x$pgac_cv__builtin_unreachable = xyes ; then $as_echo #define HAVE__BUILTIN_UNREACHABLE 1 confdefs.h fi +{ $as_echo $as_me:${as_lineno-$LINENO}: checking for __builtin_prefetch 5 +$as_echo_n checking for __builtin_prefetch... 6; } +if ${pgac_cv__builtin_prefetch+:} false; then : + $as_echo_n (cached) 6 +else + cat confdefs.h - _ACEOF conftest.$ac_ext +/* end confdefs.h. 
*/ + +int +main () +{ +int i = 0;__builtin_prefetch(i, 0, 3); + ; + return 0; +} +_ACEOF +if ac_fn_c_try_compile $LINENO; then : + pgac_cv__builtin_prefetch=yes +else + pgac_cv__builtin_prefetch=no +fi +rm -f core conftest.err conftest.$ac_objext conftest.$ac_ext +fi +{ $as_echo $as_me:${as_lineno-$LINENO}: result: $pgac_cv__builtin_prefetch 5 +$as_echo $pgac_cv__builtin_prefetch 6; } +if test x$pgac_cv__builtin_prefetch = xyes ; then + +$as_echo #define HAVE__BUILTIN_PREFETCH 1 confdefs.h + +fi { $as_echo $as_me:${as_lineno-$LINENO}: checking for __VA_ARGS__ 5 $as_echo_n checking for __VA_ARGS__... 6; } if ${pgac_cv__va_args+:} false; then : diff --git a/configure.in b/configure.in index 5724a4d..2132d69 100644 --- a/configure.in +++ b/configure.in @@ -1318,6 +1318,7 @@ PGAC_C_TYPES_COMPATIBLE PGAC_C_BUILTIN_BSWAP32 PGAC_C_BUILTIN_CONSTANT_P PGAC_C_BUILTIN_UNREACHABLE +PGAC_C_BUILTIN_PREFETCH PGAC_C_VA_ARGS PGAC_STRUCT_TIMEZONE PGAC_UNION_SEMUN diff --git a/src/backend/utils/sort/tuplesort.c b/src/backend/utils/sort/tuplesort.c index 435041a..70b53d1 100644 --- a/src/backend/utils/sort/tuplesort.c +++ b/src/backend/utils/sort/tuplesort.c @@ -1663,6 +1663,24 @@ tuplesort_gettuple_common(Tuplesortstate *state, bool forward, if (state-current state-memtupcount) { *stup = state-memtuples[state-current++]; + + /* + * Perform memory prefetch of tuple proper of the + * SortTuple that's
Re: [HACKERS] [PATCH] postgres_fdw extension support
On Fri, Jul 17, 2015 at 1:08 AM, Paul Ramsey wrote:

+ if ( (!is_builtin(oe->opno)) && (!is_in_extension(oe->opno, fpinfo)) )

... And this does not respect the project code format. See here for more details, for example: http://www.postgresql.org/docs/devel/static/source.html

I'm sorry, that link doesn't clarify for me what's stylistically wrong here (it's almost all about error messages). Could you help? (Is it the padding around the conditionals? Removed that.)

Two things: 1) Spaces between the parentheses at the beginning and the end of the expression. 2) The parentheses around is_in_extension are not really necessary.

appendStringInfo(buf, "::%s", - format_type_with_typemod(node->consttype, - node->consttypmod)); + format_type_be_qualified(node->consttype));

What's the reason for this change?

Good question. As I recall, the very sparse search path the FDW connection uses can sometimes leave remote functions failing to find other functions they need, so we want to force the calls to be schema-qualified. Unfortunately there's no perfect public call for that; ideally it would be format_type_internal(type_oid, typemod, true, false, true), but no such function is exposed, so I settled for format_type_be_qualified(), which forces qualification at the expense of ignoring the typmod.

Hm. I don't have my mind wrapped around that yet, but this needs to be checked with data types like char or varchar. It may matter.

Thinking a bit wider, why is this limited to extensions? You may have other objects defined locally and remotely, like functions or operators, that are not part of extensions but are normal objects. Hence I think that what you are looking for is not this option, but a boolean option enforcing the behavior of code paths using is_builtin() in foreign_expr_walker, so that the sanity checks on existing objects are not limited to FirstBootstrapObjectId but extend to other objects in the catalogs.
Well, as I see it, there are three broad categories of behavior available:

1- Forward nothing non-built-in (current behavior)
2- Use options to forward only specified non-built-in things, either in chunks (extensions, as in this patch) or one-by-one (mark your desired functions/ops)
3- Forward everything if a "forward everything" option is set

So what you are describing here is a parameter that switches among these modes: nothing (check only built-in stuff), an extension list, or all.

I hadn't actually considered the possibility of option 3, but for my purposes it would work just as well, with the added efficiency bonus of not having to check whether particular funcs/ops are inside declared extensions. Both the current state of the FDW and the patch I'm providing expect a *bit* of smarts on the part of users, to make sure their remote/local environments are in sync (in particular, the versions of pgsql and of extensions). Option 3 just ups the ante on that requirement. I'd be fine with this; it makes the patch very simple and fast.

Yeah, perhaps too simple though :) You may want something that is more advanced. For example, with a set of objects you may want to decide which ones to push down and which ones to keep evaluated locally.

For option 2, marking things one at a time really isn't practical for a package like PostGIS, with several hundred functions and operators. Using the larger block of "extension" makes more sense. I think in general, expecting users to itemize every func/op they want to forward, particularly if they just want an extension to "work" over FDW, is too big an expectation. That's not to minimize the utility of being able to mark individual functions/ops for forwarding, but I think that's a separate use case that doesn't eliminate the need for extension-level forwarding.

OK, that's good to know. Perhaps then using a parameter with an extension list is a good balance.
Now, would there be practical cases where it is actually useful to have an extension list? For example, let's say that there are two extensions installed: foo1 and foo2. Do you think (or know) if some users would be willing to include only the objects of foo1, but not those of foo2? Also, could it be possible to consider an exclude list? That is, ignoring all the objects in the given extension list and allowing the rest. I am just trying to consider all the possibilities here.

Thanks again for the review! Good to see that you added it in the CF app: https://commitfest.postgresql.org/6/304/

Regards, -- Michael -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] optimizing vacuum truncation scans
On Thu, Jul 16, 2015 at 10:17 PM, Amit Kapila amit.kapil...@gmail.com wrote: On Thu, Jul 16, 2015 at 11:21 AM, Haribabu Kommi kommi.harib...@gmail.com wrote:

Here I attached updated patches: 1) without prefetch logic; 2) with combined vm and prefetch logic.

I think it is better to just get the first patch in, as that in itself is a clear win, and then we can evaluate the second one (prefetching during truncation) separately. I think that after the first patch, the case for prefetching seems less beneficial, and I think it could hurt by loading unneeded pages into the OS buffer cache (say it has prefetched 32 pages and, after inspecting the first one, decides to quit the loop because that page is non-empty; or it has prefetched a page just because that page's visibility map bit was cleared while the bits for the others are set).

Yes, in the above cases the prefetch is an overhead. I am not sure how frequently such scenarios occur in practice. Having said that, I am not against prefetching in this scenario, as it can help in more cases than it could hurt, but I think it will be better to evaluate that separately.

Yes, the prefetch patch works better in the cases where the first vacuum skips the truncation because of lock waiters. If such cases are common in practice, then the prefetch patch is needed.

Regards, Hari Babu Fujitsu Australia -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
[HACKERS] Memory prefetching while sequentially fetching from SortTuple array, tuplestore
I've always thought that GCC intrinsics for software-based memory prefetching are a very sharp tool. While it's really hard to use GCC's __builtin_prefetch() effectively (I've tried enough times to know), I always thought that there'd probably end up being a handful of cases where using it presented a clear and worthwhile win. Hash joins seemed to me to be a likely candidate, because memory latency is a real killer there.

However, I stumbled upon another case where prefetching pays clear dividends while actually being quite simple: sequentially reading an already-sorted memtuples array (an array of SortTuple structs) during some kind of post-processing. This comes up for many common and important cases, including regular sort nodes (heaptuple sorts), datum sorts with pass-by-reference types (e.g. ordered set aggregates), and B-Tree index builds, where the sequential fetching occurs as actual B-Tree leaf pages are built from the array.

Patch
=====

Attached patch adds a portable Postgres wrapper on the GCC intrinsic. It also adds a client within tuplesort.c -- a common function that seems like a good generic choke point. I can get a speed-up of 6% - 9% for all of these cases by prefetching a few SortTuples ahead, for the tuple proper. (In-memory sorts only.)

ISTM that this alone makes the patch worthwhile, but we should still consider the break-down of costs in each of these operations, given that the cost of sequentially reading and processing SortTuples is all the patch touches. There are only about 3 real lines of code added to tuplesort.c.

Obviously, the dominant costs within each of these queries/utility commands are the sort proper and, to a lesser extent, the heap scan preceding it. I am not targeting those at all -- I'm only targeting the tertiary cost of sequentially scanning the memtuples array by hiding memory latency, and yet I end up with a notable overall speed-up.
It's only really fair to look at this final sequential fetching cost in isolation. The reduction in that cost, taken in isolation, seems to be huge. I tried out the proprietary Intel VTune Amplifier utility with this case, and it indicated that CPU time spent in calls to _bt_buildadd was less than 1/8 of the time in a comparable test of the master branch, with no other changes that I can attribute to this patch (i.e. there is a bit of noise). You can easily get some hint of the size of the improvement by reviewing trace_sort = on log output, and the reduction in time spent after performsort is done but before the sort is shut down.

Tuplestores
===========

tuplestore.c has a similar optimization added, on the theory that it should match tuplesort.c wherever possible, and because it seems to help a lot there too. For example, queries that perform a big CTE Scan seem to benefit to about the same degree (although, again, only when the array is fully in memory).

Portability concerns
====================

I started writing this patch with an intuition that this would help. I arrived at the fixed-up version posted here by frobbing the code based on feedback from Postgres running on my laptop. I iteratively tweaked the number of tuples to fetch ahead, and the value of __builtin_prefetch()'s temporal locality argument, which helped a little for a number of test cases. If that doesn't inspire much confidence in this patch, then good -- that's the idea. Obviously, if we are going to commit this, there needs to be testing on a reasonably representative selection of platforms. On the other hand, I couldn't find a case that was regressed, and I am encouraged by the fact that this helps the post-processing of SortTuples so effectively. A platform would have to have very different performance characteristics for this to be the wrong thing.
However, it seems possible that there is a case that has been regressed, since the prefetch is added at a very generic point within tuplesort.c and tuplestore.c.

-- Peter Geoghegan

From 3c44ec9a9c68da3408716e06d0e822acfc68a746 Mon Sep 17 00:00:00 2001
From: Peter Geoghegan peter.geoghega...@gmail.com
Date: Sun, 12 Jul 2015 13:14:01 -0700
Subject: [PATCH] Prefetch from memtuples array in tuplesort

Testing shows that prefetching the tuple proper of a slightly later SortTuple in the memtuples array, during each of many sequential, in-logical-order SortTuple fetches, considerably speeds up various sort-intensive operations. For example, B-Tree index builds are accelerated as leaf pages are created from the memtuples array (i.e. the operation following the sort proper, but before a tuplesort_end() call is made as a B-Tree spool is destroyed). Similarly, ordered set aggregates (all cases except the datum sort case with a pass-by-value type) and regular heap tuplesorts benefit to about the same degree. The optimization is only used when sorts fit in memory, though.

Also, prefetch a few places ahead within the analogous fetching point in tuplestore.c.
Re: [HACKERS] Bugs in our qsort implementation
On Thu, Jul 16, 2015 at 5:05 PM, Tom Lane t...@sss.pgh.pa.us wrote: It's possible that this issue can only manifest on 9.4 and up, where we have the ability for tuplesort to allocate work arrays approaching INT_MAX elements. But I don't have a lot of faith in that; I think the worst-case stack depth for the way we have it now could be as bad as O(N), so in principle a crash could be possible with significantly smaller input arrays. I think we'd better back-patch this all the way.

+1.

If you want to generate a worst case, McIlroy wrote a program that will do it [1]. AFAIR, it generates a series of self-consistent comparisons in its "gas" comparator that produce a worst-case outcome (as opposed to producing a simple worst-case input, which would be more convenient in this kind of scenario). This is known to specifically affect the Bentley/McIlroy implementation, as the paper goes into.

[1] http://www.cs.dartmouth.edu/~doug/mdmspe.pdf

-- Peter Geoghegan -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Size of TOAST pointer
Vignesh Raghunathan vignesh.pg...@gmail.com writes: I was looking at the documentation for TOAST ( http://www.postgresql.org/docs/devel/static/storage-toast.html) and it's specified that the toast pointer occupies 18 bytes. However, the struct representing the toast pointer is defined as follows typedef struct varatt_external The data that actually ends up on disk is a varattrib_1b_e wrapping a varatt_external, and that header is where the extra 2 bytes come from. regards, tom lane -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers