Re: [HACKERS] pg_background (and more parallelism infrastructure patches)
On Mon, Sep 8, 2014 at 10:39 AM, Amit Kapila amit.kapil...@gmail.com wrote:

On Sat, Jul 26, 2014 at 9:32 PM, Robert Haas robertmh...@gmail.com wrote:
On Fri, Jul 25, 2014 at 4:16 PM, Alvaro Herrera alvhe...@2ndquadrant.com wrote:
On Fri, Jul 25, 2014 at 02:11:32PM -0400, Robert Haas wrote:

+	pq_mq_busy = true;
+
+	iov[0].data = &msgtype;
+	iov[0].len = 1;
+	iov[1].data = s;
+	iov[1].len = len;
+
+	Assert(pq_mq_handle != NULL);
+	result = shm_mq_sendv(pq_mq_handle, iov, 2, false);
+
+	pq_mq_busy = false;

Don't you need a PG_TRY block here to reset pq_mq_busy?

No. If shm_mq_sendv is interrupted, we can't use the shm_mq any more. But since that should only happen if an interrupt arrives while the queue is full, I think that's OK.

I think it is not only an interrupt: any other error in the shm_mq_sendv() path (one example is WaitLatch()) could lead to the same behaviour.

(Think about the alternatives: if the queue is full, we have no way of notifying the launching process without waiting for it to retrieve the results, but it might not do that right away, and if we've been killed we need to die *now*, not later.)

So in such cases, what is the advice to users? Currently they will see the message below:

postgres=# select * from pg_background_result(5124) as (x int);
ERROR:  lost connection to worker process with PID 5124

One way is to ask them to check the logs, but what if they want to handle the error and take some action?

Another point about error handling: to execute the SQL in pg_background_worker_main(), it starts a transaction which I think doesn't get aborted if an error occurs. For this I was just referring to the error handling code of StartBackgroundWorker(); however, during shutdown of the process it will call AbortOutOfAnyTransaction(), which will take care of aborting the transaction. Similarly, handling for timeout seems to be missing in the error path.
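For reference, the PG_TRY-based alternative Alvaro is suggesting would look roughly like this (a non-runnable pseudocode sketch of PostgreSQL's error-cleanup idiom, not code from the patch):

```
pq_mq_busy = true;

PG_TRY();
{
	iov[0].data = &msgtype;
	iov[0].len = 1;
	iov[1].data = s;
	iov[1].len = len;

	Assert(pq_mq_handle != NULL);
	result = shm_mq_sendv(pq_mq_handle, iov, 2, false);
}
PG_CATCH();
{
	/* reset the flag even if shm_mq_sendv() errors out */
	pq_mq_busy = false;
	PG_RE_THROW();
}
PG_END_TRY();

pq_mq_busy = false;
```

Robert's point is that this cleanup buys nothing here: once shm_mq_sendv() has failed, the queue cannot be used any more, so leaving pq_mq_busy set is harmless.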
As we are anyway going to exit on error, I am not sure this will be required; however, having it for a clean exit seems better. With Regards, Amit Kapila. EnterpriseDB: http://www.enterprisedb.com
[HACKERS] Re: proposal: ignore null fields in not relation type composite type based constructors
2014-09-08 6:27 GMT+02:00 Stephen Frost sfr...@snowman.net: * Pavel Stehule (pavel.steh...@gmail.com) wrote:

ignore_nulls in array_to_json_pretty probably is not necessary. On the other hand, the cost is zero, and we can have it for API consistency.

I'm willing to be persuaded either way on this, really. I do think it would be nice for both array_to_json and row_to_json to be single functions which take default values, but as for whether array_to_json has an ignore_nulls option, I'm on the fence and would defer to people who use that function regularly (I don't today). Beyond that, I'm pretty happy moving forward with this patch.

OK. Regards, Pavel

Thanks, Stephen
Re: [HACKERS] gist vacuum gist access
On 09/07/2014 05:11 PM, Костя Кузнецов wrote:

Hello. I reworked vacuum for the GiST index. All tests pass. I also tested vacuum on a table with 2 million rows; all is OK. On my machine the old vacuum takes about 9 seconds; this version takes about 6-7 seconds. Please review.

If I'm reading this correctly, the patch changes gistbulkdelete to scan the index in physical order, while the old code starts from the root and scans the index from left to right, in logical order. Scanning the index in physical order is wrong if any index pages are split while vacuum runs. A page split could move some tuples to a lower-numbered page, so that the vacuum will not scan those tuples.

In the b-tree code, we solved that problem back in 2006, so it can be done, but it requires a bit more code. In b-tree, we solved it with a vacuum cycle ID number that's set on the page halves when a page is split. That allows VACUUM to identify pages that have been split concurrently, and jump back to vacuum them too. See commit http://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=5749f6ef0cc1c67ef9c9ad2108b3d97b82555c80. It should be possible to do something similar in GiST, and in fact you might be able to reuse the NSN field that's already set on the page halves on split, instead of adding a new vacuum cycle ID.

- Heikki

-- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Optimization for updating foreign tables in Postgres FDW
I wrote: I gave it a spin and could not find any undesirable behaviour, and the output of EXPLAIN ANALYZE looks like I'd expect. I noticed that you use the list length of fdw_private to check if the UPDATE or DELETE is pushed down to the remote server or not. While this works fine, I wonder if it wouldn't be better to have some explicit flag in fdw_private for that purpose. Future modifications that change the list length might easily overlook that it is used for this purpose, thereby breaking the code. Other than that it looks alright to me. Maybe I should have mentioned that I have set the patch to Waiting for Author because I'd like to hear your opinion on that, but I'm prepared to set it to Ready for Committer soon. Yours, Laurenz Albe
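For illustration, the explicit flag Laurenz suggests could be stored as a dedicated member of the fdw_private list, e.g. (a hypothetical pseudocode sketch using PostgreSQL's generic List/node helpers; the actual layout is the patch author's choice):

```
/* at plan time: record explicitly whether the statement is pushed down */
fdw_private = list_make2(makeInteger(pushed_down ? 1 : 0),
                         other_private_state);

/* at execution time: test the flag instead of the list length */
pushed_down = intVal(list_nth(fdw_private, 0)) != 0;
```

This keeps the check robust even if later patches append more items to the list.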
Re: [HACKERS] gist vacuum gist access
On Mon, Sep 8, 2014 at 11:13 AM, Heikki Linnakangas hlinnakan...@vmware.com wrote:

On 09/07/2014 05:11 PM, Костя Кузнецов wrote: Hello. I reworked vacuum for the GiST index. All tests pass. I also tested vacuum on a table with 2 million rows; all is OK. On my machine the old vacuum takes about 9 seconds; this version takes about 6-7 seconds. Please review.

If I'm reading this correctly, the patch changes gistbulkdelete to scan the index in physical order, while the old code starts from the root and scans the index from left to right, in logical order. Scanning the index in physical order is wrong if any index pages are split while vacuum runs. A page split could move some tuples to a lower-numbered page, so that the vacuum will not scan those tuples. In the b-tree code, we solved that problem back in 2006, so it can be done, but it requires a bit more code. In b-tree, we solved it with a vacuum cycle ID number that's set on the page halves when a page is split. That allows VACUUM to identify pages that have been split concurrently, and jump back to vacuum them too. See commit http://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=5749f6ef0cc1c67ef9c9ad2108b3d97b82555c80. It should be possible to do something similar in GiST, and in fact you might be able to reuse the NSN field that's already set on the page halves on split, instead of adding a new vacuum cycle ID.

The idea is right. But in fact, does GiST ever recycle any page? It has the F_DELETED flag, but ISTM this flag is never set. So, I think it's possible that this patch is working correctly. However, GiST probably sometimes leaves a new page unused due to a server crash. Anyway, I'm not a fan of committing the patch in this shape. We need to let GiST recycle pages first, then implement VACUUM similar to b-tree. -- With best regards, Alexander Korotkov.
Re: [HACKERS] gist vacuum gist access
On Mon, Sep 8, 2014 at 12:08 PM, Alexander Korotkov aekorot...@gmail.com wrote: On Mon, Sep 8, 2014 at 11:13 AM, Heikki Linnakangas hlinnakan...@vmware.com wrote:

On 09/07/2014 05:11 PM, Костя Кузнецов wrote: Hello. I reworked vacuum for the GiST index. All tests pass. I also tested vacuum on a table with 2 million rows; all is OK. On my machine the old vacuum takes about 9 seconds; this version takes about 6-7 seconds. Please review.

If I'm reading this correctly, the patch changes gistbulkdelete to scan the index in physical order, while the old code starts from the root and scans the index from left to right, in logical order. Scanning the index in physical order is wrong if any index pages are split while vacuum runs. A page split could move some tuples to a lower-numbered page, so that the vacuum will not scan those tuples. In the b-tree code, we solved that problem back in 2006, so it can be done, but it requires a bit more code. In b-tree, we solved it with a vacuum cycle ID number that's set on the page halves when a page is split. That allows VACUUM to identify pages that have been split concurrently, and jump back to vacuum them too. See commit http://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=5749f6ef0cc1c67ef9c9ad2108b3d97b82555c80. It should be possible to do something similar in GiST, and in fact you might be able to reuse the NSN field that's already set on the page halves on split, instead of adding a new vacuum cycle ID.

The idea is right. But in fact, does GiST ever recycle any page? It has the F_DELETED flag, but ISTM this flag is never set. So, I think it's possible that this patch is working correctly. However, GiST probably sometimes leaves a new page unused due to a server crash. Anyway, I'm not a fan of committing the patch in this shape. We need to let GiST recycle pages first, then implement VACUUM similar to b-tree.

Another note: assuming we have the NSN, which can play the role of the vacuum cycle ID, can we implement a sequential (with possible jump-back) index scan for GiST? -- With best regards, Alexander Korotkov.
Re: [HACKERS] gist vacuum gist access
On 09/08/2014 11:08 AM, Alexander Korotkov wrote: On Mon, Sep 8, 2014 at 11:13 AM, Heikki Linnakangas hlinnakan...@vmware.com wrote:

On 09/07/2014 05:11 PM, Костя Кузнецов wrote: Hello. I reworked vacuum for the GiST index. All tests pass. I also tested vacuum on a table with 2 million rows; all is OK. On my machine the old vacuum takes about 9 seconds; this version takes about 6-7 seconds. Please review.

If I'm reading this correctly, the patch changes gistbulkdelete to scan the index in physical order, while the old code starts from the root and scans the index from left to right, in logical order. Scanning the index in physical order is wrong if any index pages are split while vacuum runs. A page split could move some tuples to a lower-numbered page, so that the vacuum will not scan those tuples. In the b-tree code, we solved that problem back in 2006, so it can be done, but it requires a bit more code. In b-tree, we solved it with a vacuum cycle ID number that's set on the page halves when a page is split. That allows VACUUM to identify pages that have been split concurrently, and jump back to vacuum them too. See commit http://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=5749f6ef0cc1c67ef9c9ad2108b3d97b82555c80. It should be possible to do something similar in GiST, and in fact you might be able to reuse the NSN field that's already set on the page halves on split, instead of adding a new vacuum cycle ID.

The idea is right. But in fact, does GiST ever recycle any page? It has the F_DELETED flag, but ISTM this flag is never set. So, I think it's possible that this patch is working correctly. However, GiST probably sometimes leaves a new page unused due to a server crash.

Hmm. Its correctness depends on the fact that when a page is split, the new right page is always put on a higher-numbered page than the old page, which hinges on the fact that we never recycle pages. We used to delete pages in VACUUM FULL, but that got removed when old-style VACUUM FULL was changed to just rewrite the whole heap. So if you have pg_upgraded from an old index, it's possible that you still have some F_DELETED pages in the tree. And then there's the case of unused pages after a crash that you mentioned. So no, you can't really rely on that. We could of course remove the code that checks the FSM altogether, and always extend the relation on page split. But of course the long-term solution is to allow recycling pages.

Anyway, I'm not a fan of committing the patch in this shape. We need to let GiST recycle pages first, then implement VACUUM similar to b-tree.

+1. Although I guess we could implement the b-tree style strategy first, and implement page recycling later. It's just a bit hard to test that it's correct when there is no easy way to get deleted pages into the index to begin with. - Heikki
Re: [HACKERS] gist vacuum gist access
On 09/08/2014 11:19 AM, Alexander Korotkov wrote: On Mon, Sep 8, 2014 at 12:08 PM, Alexander Korotkov aekorot...@gmail.com wrote: On Mon, Sep 8, 2014 at 11:13 AM, Heikki Linnakangas hlinnakan...@vmware.com wrote:

In the b-tree code, we solved that problem back in 2006, so it can be done, but it requires a bit more code. In b-tree, we solved it with a vacuum cycle ID number that's set on the page halves when a page is split. That allows VACUUM to identify pages that have been split concurrently, and jump back to vacuum them too. See commit http://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=5749f6ef0cc1c67ef9c9ad2108b3d97b82555c80. It should be possible to do something similar in GiST, and in fact you might be able to reuse the NSN field that's already set on the page halves on split, instead of adding a new vacuum cycle ID.

...

Another note: assuming we have the NSN, which can play the role of the vacuum cycle ID, can we implement a sequential (with possible jump-back) index scan for GiST?

Yeah, I think it would work. It's pretty straightforward: the page split code already sets the NSN just when we need it. Vacuum needs to memorize the current NSN when it begins, and whenever it sees a page with a higher NSN (or with the FOLLOW_RIGHT flag set), follow the right-link if it points to a lower-numbered page. - Heikki
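Heikki's description of the b-tree-style strategy, adapted to GiST's NSN, can be summarized in pseudocode (a sketch of the idea only, not code from any patch):

```
vacuum_nsn = current NSN at the start of vacuum

for blkno in 0 .. last block of index:          /* physical order */
    page = read block blkno
    vacuum the tuples on page
    /* a concurrent split may have moved tuples to a lower-numbered page */
    while page's NSN > vacuum_nsn or FOLLOW_RIGHT is set on page:
        next = right-link of page
        if next > blkno:
            break           /* the physical scan will reach it anyway */
        page = read block next
        vacuum the tuples on page
```

The jump-back happens only when the right-link points below the current scan position; pages split to higher block numbers are simply picked up later by the physical scan.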
Re: [HACKERS] add modulo (%) operator to pgbench
Hi, Here is the review result.

#1. Patch compatibility
There is a little bit of hunk offset, but it applies to the latest master.

#2. Functionality
No problem.

#3. Documentation
I think the modulo operator explanation should be put last in the doc line, because the other operators are more frequently used.

#4. Algorithm
You proposed three modulo algorithms: 1. truncated ("general") modulo, 2. floored modulo, and 3. Euclidean modulo. They calculate different values when the divisor or the dividend is a negative number. Example calculations are here:

1. truncated modulo (patch1): 5 % 3 = 2, 5 % -3 = 2, -5 % 3 = -2 (remainder keeps the sign of the dividend)
2. floored modulo (patch2, 3): 5 % 3 = 2, 5 % -3 = -1, -5 % 3 = 1 (remainder keeps the sign of the divisor)
3. Euclidean modulo (patch2): 5 % 3 = 2, 5 % -3 = 2, -5 % 3 = 1 (remainder is always non-negative)

That's all. I think if we want an equal-probability and intuitive random generator, we should select floored modulo, as you can see in the example above: it can create the contrapositive random number; methods 1 and 3 cannot. I think the Euclidean modulo is not needed by most people; if we add it, many people will be confused, because they don't know the mathematical algorithms. So I like patch3, which is simple and practical. If you agree with or reply to my comments, I will mark the patch Ready for Committer.

Best Regards, -- Mitsumasa KONDO
Re: [HACKERS] Optimization for updating foreign tables in Postgres FDW
(2014/09/08 16:18), Albe Laurenz wrote: I wrote: I gave it a spin and could not find any undesirable behaviour, and the output of EXPLAIN ANALYZE looks like I'd expect.

Thank you for the review!

I noticed that you use the list length of fdw_private to check if the UPDATE or DELETE is pushed down to the remote server or not. While this works fine, I wonder if it wouldn't be better to have some explicit flag in fdw_private for that purpose. Future modifications that change the list length might easily overlook that it is used for this purpose, thereby breaking the code. Other than that it looks alright to me. Maybe I should have mentioned that I have set the patch to Waiting for Author because I'd like to hear your opinion on that, but I'm prepared to set it to Ready for Committer soon.

I agree with you on that point. So, I've updated the patch to have the explicit flag, as you proposed. Attached is the updated version of the patch. In this version, I've also revised the code and its comments a bit. Sorry for the delay.

Best regards, Etsuro Fujita

*** a/contrib/postgres_fdw/deparse.c
--- b/contrib/postgres_fdw/deparse.c
*** 188,197 **** is_foreign_expr(PlannerInfo *root,
  	if (!foreign_expr_walker((Node *) expr, &glob_cxt, &loc_cxt))
  		return false;
  
- 	/* Expressions examined here should be boolean, ie noncollatable */
- 	Assert(loc_cxt.collation == InvalidOid);
- 	Assert(loc_cxt.state == FDW_COLLATE_NONE);
- 
  	/*
  	 * An expression which includes any mutable functions can't be sent over
  	 * because its result is not stable.  For example, sending now() remote
--- 188,193 ----

*** 927,932 **** deparseUpdateSql(StringInfo buf, PlannerInfo *root,
--- 923,982 ----
  }
  
  /*
+  * deparse remote UPDATE statement
+  *
+  * The statement text is appended to buf, and we also create an integer List
+  * of the columns being retrieved by RETURNING (if any), which is returned
+  * to *retrieved_attrs.
+  */
+ void
+ deparseDirectUpdateSql(StringInfo buf, PlannerInfo *root,
+ 					   Index rtindex, Relation rel,
+ 					   List *remote_conds,
+ 					   List *targetlist,
+ 					   List *targetAttrs,
+ 					   List *returningList,
+ 					   List **retrieved_attrs)
+ {
+ 	RelOptInfo *baserel = root->simple_rel_array[rtindex];
+ 	List	   *params_list = NIL;
+ 	deparse_expr_cxt context;
+ 	bool		first;
+ 	ListCell   *lc;
+ 
+ 	/* Set up context struct for recursion */
+ 	context.root = root;
+ 	context.foreignrel = baserel;
+ 	context.buf = buf;
+ 	context.params_list = &params_list;
+ 
+ 	appendStringInfoString(buf, "UPDATE ");
+ 	deparseRelation(buf, rel);
+ 	appendStringInfoString(buf, " SET ");
+ 
+ 	first = true;
+ 	foreach(lc, targetAttrs)
+ 	{
+ 		int			attnum = lfirst_int(lc);
+ 		TargetEntry *tle = get_tle_by_resno(targetlist, attnum);
+ 
+ 		if (!first)
+ 			appendStringInfoString(buf, ", ");
+ 		first = false;
+ 
+ 		deparseColumnRef(buf, rtindex, attnum, root);
+ 		appendStringInfo(buf, " = ");
+ 		deparseExpr((Expr *) tle->expr, &context);
+ 	}
+ 	if (remote_conds)
+ 		appendWhereClause(buf, root, baserel, remote_conds,
+ 						  true, &params_list);
+ 
+ 	deparseReturningList(buf, root, rtindex, rel, false,
+ 						 returningList, retrieved_attrs);
+ }
+ 
+ /*
   * deparse remote DELETE statement
   *
   * The statement text is appended to buf, and we also create an integer List

*** 949,954 **** deparseDeleteSql(StringInfo buf, PlannerInfo *root,
--- 999,1031 ----
  }
  
  /*
+  * deparse remote DELETE statement
+  *
+  * The statement text is appended to buf, and we also create an integer List
+  * of the columns being retrieved by RETURNING (if any), which is returned
+  * to *retrieved_attrs.
+  */
+ void
+ deparseDirectDeleteSql(StringInfo buf, PlannerInfo *root,
+ 					   Index rtindex, Relation rel,
+ 					   List *remote_conds,
+ 					   List *returningList,
+ 					   List **retrieved_attrs)
+ {
+ 	RelOptInfo *baserel = root->simple_rel_array[rtindex];
+ 	List	   *params_list = NIL;
+ 
+ 	appendStringInfoString(buf, "DELETE FROM ");
+ 	deparseRelation(buf, rel);
+ 	if (remote_conds)
+ 		appendWhereClause(buf, root, baserel, remote_conds,
+ 						  true, &params_list);
+ 
+ 	deparseReturningList(buf, root, rtindex, rel, false,
+ 						 returningList, retrieved_attrs);
+ }
+ 
+ /*
   * Add a RETURNING clause, if needed, to an INSERT/UPDATE/DELETE.
   */
  static void

*** a/contrib/postgres_fdw/expected/postgres_fdw.out
--- b/contrib/postgres_fdw/expected/postgres_fdw.out
*** 998,1004 ****
--- 998,1025 ----
  (3 rows)
  
  INSERT INTO ft2 (c1,c2,c3) VALUES (1104,204,'ddd'), (1105,205,'eee');
+ EXPLAIN (verbose, costs off)
+ UPDATE ft2 SET c2 = c2 + 300, c3 = c3 || '_update3' WHERE c1 % 10 = 3;  -- can be pushed down
+                 QUERY PLAN
+
Re: [HACKERS] pg_receivexlog and replication slots
On Wed, Sep 3, 2014 at 11:40 PM, Robert Haas robertmh...@gmail.com wrote: On Sun, Aug 31, 2014 at 9:45 AM, Magnus Hagander mag...@hagander.net wrote:

Do we really want those Asserts? There is not a single Assert in bin/pg_basebackup today - as is the case for most things in bin/. We typically use regular if statements for things that can happen, and just ignore the others, I think - since the callers are fairly simple to trace.

I have no opinion on whether we want these particular Assert() calls, but I note that using Assert() in front-end code only became possible in February of 2013, as a result of commit e1d25de35a2b1f809e8f8d7b182ce0af004f3ec9. So the lack of assertions there may not be so much because people thought it was a bad idea as that it didn't use to work. Generally, I favor the use of Assert() in front-end code in the same scenarios in which we use it in back-end code: for checks that shouldn't burden production builds but are useful during development.

Well, that was exactly why they were added in the first place. The assertions have been placed in some functions to check for incompatible combinations of argument values when those functions are called. I don't mind re-adding them if people agree that they make sense. IMO they do, and they will help the development of future utilities using the replication protocol in src/bin/pg_basebackup, as much as the refactoring work done on this thread does. Regards, -- Michael
Re: [HACKERS] Spinlocks and compiler/memory barriers
On 2014-09-04 14:19:47 +0200, Andres Freund wrote: Yes. I plan to push the patch this weekend. Sorry for the delay. I'm about to push this. Is it ok to first push it to master and only backpatch once a couple buildfarm cycles haven't complained? Greetings, Andres Freund -- Andres Freund http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Training Services
Re: [HACKERS] gist vacuum gist access
On Mon, Sep 8, 2014 at 12:51 PM, Heikki Linnakangas hlinnakan...@vmware.com wrote: On 09/08/2014 11:19 AM, Alexander Korotkov wrote: On Mon, Sep 8, 2014 at 12:08 PM, Alexander Korotkov aekorot...@gmail.com wrote: On Mon, Sep 8, 2014 at 11:13 AM, Heikki Linnakangas hlinnakan...@vmware.com wrote:

In the b-tree code, we solved that problem back in 2006, so it can be done, but it requires a bit more code. In b-tree, we solved it with a vacuum cycle ID number that's set on the page halves when a page is split. That allows VACUUM to identify pages that have been split concurrently, and jump back to vacuum them too. See commit http://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=5749f6ef0cc1c67ef9c9ad2108b3d97b82555c80. It should be possible to do something similar in GiST, and in fact you might be able to reuse the NSN field that's already set on the page halves on split, instead of adding a new vacuum cycle ID.

...

Another note: assuming we have the NSN, which can play the role of the vacuum cycle ID, can we implement a sequential (with possible jump-back) index scan for GiST?

Yeah, I think it would work. It's pretty straightforward: the page split code already sets the NSN just when we need it. Vacuum needs to memorize the current NSN when it begins, and whenever it sees a page with a higher NSN (or with the FOLLOW_RIGHT flag set), follow the right-link if it points to a lower-numbered page.

I mean that a full index scan feature for SELECT queries might be implemented as well, not only a sequential VACUUM. -- With best regards, Alexander Korotkov.
Re: [HACKERS] postgres_fdw behaves oddly
(2014/09/02 18:55), Etsuro Fujita wrote: (2014/09/01 20:15), Etsuro Fujita wrote: While working on [1], I've found that postgres_fdw behaves oddly:

I've revised the patch a bit further. Please find attached a patch.

Thanks,

Best regards, Etsuro Fujita

*** a/contrib/postgres_fdw/deparse.c
--- b/contrib/postgres_fdw/deparse.c
*** 252,257 **** foreign_expr_walker(Node *node,
--- 252,265 ----
  			if (var->varno == glob_cxt->foreignrel->relid &&
  				var->varlevelsup == 0)
  			{
+ 				/*
+ 				 * System columns can't be sent to remote.
+ 				 *
+ 				 * XXX: we should probably send ctid to remote.
+ 				 */
+ 				if (var->varattno < 0)
+ 					return false;
+ 
  				/* Var belongs to foreign table */
  				collation = var->varcollid;
  				state = OidIsValid(collation) ? FDW_COLLATE_SAFE : FDW_COLLATE_NONE;

*** 1261,1266 **** deparseVar(Var *node, deparse_expr_cxt *context)
--- 1269,1279 ----
  	if (node->varno == context->foreignrel->relid &&
  		node->varlevelsup == 0)
  	{
+ 		/*
+ 		 * System columns shouldn't be examined here.
+ 		 */
+ 		Assert(node->varattno >= 0);
+ 
  		/* Var belongs to foreign table */
  		deparseColumnRef(buf, node->varno, node->varattno, context->root);
  	}

*** a/src/backend/optimizer/plan/createplan.c
--- b/src/backend/optimizer/plan/createplan.c
*** 20,25 ****
--- 20,26 ----
  #include <math.h>
  
  #include "access/skey.h"
+ #include "access/sysattr.h"
  #include "catalog/pg_class.h"
  #include "foreign/fdwapi.h"
  #include "miscadmin.h"

*** 1945,1950 **** create_foreignscan_plan(PlannerInfo *root, ForeignPath *best_path,
--- 1946,1953 ----
  	RelOptInfo *rel = best_path->path.parent;
  	Index		scan_relid = rel->relid;
  	RangeTblEntry *rte;
+ 	Bitmapset  *attrs_used = NULL;
+ 	ListCell   *lc;
  	int			i;
  
  	/* it should be a base rel... */

*** 1993,2008 **** create_foreignscan_plan(PlannerInfo *root, ForeignPath *best_path,
  	 * bit of a kluge and might go away someday, so we intentionally leave it
  	 * out of the API presented to FDWs.
  	 */
  	scan_plan->fsSystemCol = false;
  	for (i = rel->min_attr; i < 0; i++)
  	{
! 		if (!bms_is_empty(rel->attr_needed[i - rel->min_attr]))
  		{
  			scan_plan->fsSystemCol = true;
  			break;
  		}
  	}
  
  	return scan_plan;
  }
--- 1996,2030 ----
  	 * bit of a kluge and might go away someday, so we intentionally leave it
  	 * out of the API presented to FDWs.
  	 */
+ 
+ 	/*
+ 	 * Add all the attributes needed for joins or final output.  Note: we must
+ 	 * look at reltargetlist, not the attr_needed data, because attr_needed
+ 	 * isn't computed for inheritance child rels.
+ 	 */
+ 	pull_varattnos((Node *) rel->reltargetlist, rel->relid, &attrs_used);
+ 
+ 	/* Add all the attributes used by restriction clauses. */
+ 	foreach(lc, rel->baserestrictinfo)
+ 	{
+ 		RestrictInfo *rinfo = (RestrictInfo *) lfirst(lc);
+ 
+ 		pull_varattnos((Node *) rinfo->clause, rel->relid, &attrs_used);
+ 	}
+ 
+ 	/* Are any system columns requested from rel? */
  	scan_plan->fsSystemCol = false;
  	for (i = rel->min_attr; i < 0; i++)
  	{
! 		if (bms_is_member(i - FirstLowInvalidHeapAttributeNumber, attrs_used))
  		{
  			scan_plan->fsSystemCol = true;
  			break;
  		}
  	}
+ 
+ 	bms_free(attrs_used);
+ 
  	return scan_plan;
  }
Re: [HACKERS] Spinlocks and compiler/memory barriers
On Mon, Sep 8, 2014 at 8:07 AM, Andres Freund and...@2ndquadrant.com wrote: On 2014-09-04 14:19:47 +0200, Andres Freund wrote: Yes. I plan to push the patch this weekend. Sorry for the delay. I'm about to push this. Is it ok to first push it to master and only backpatch once a couple buildfarm cycles haven't complained? That will have the disadvantage that src/tools/git_changelog will show the commits separately instead of grouping them together; so it's probably best not to make a practice of it. But I think it's up to your discretion how to handle it in any particular case. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: [HACKERS] add modulo (%) operator to pgbench
Hello Mitsumasa-san,

#3. Documentation
I think the modulo operator explanation should be put last in the doc line, because the others are more frequently used.

So I like patch3, which is simple and practical.

Ok.

If you agree or reply to my comment, I will mark it ready for committer.

Please find attached v4, which is v3 plus improved documentation that is clearer about the sign of the remainder.

-- Fabien.

diff --git a/contrib/pgbench/pgbench.c b/contrib/pgbench/pgbench.c
index 2f7d80e..a815ad3 100644
--- a/contrib/pgbench/pgbench.c
+++ b/contrib/pgbench/pgbench.c
@@ -1574,6 +1574,22 @@ top:
 				}
 				snprintf(res, sizeof(res), INT64_FORMAT, ope1 / ope2);
 			}
+			else if (strcmp(argv[3], "%") == 0)
+			{
+				int64_t		remainder;
+
+				if (ope2 == 0)
+				{
+					fprintf(stderr, "%s: division by zero in modulo\n", argv[0]);
+					st->ecnt++;
+					return true;
+				}
+				/* divisor-signed remainder */
+				remainder = ope1 % ope2;
+				if ((ope2 > 0 && remainder < 0) ||
+					(ope2 < 0 && remainder > 0))
+					remainder += ope2;
+				snprintf(res, sizeof(res), INT64_FORMAT, remainder);
+			}
 			else
 			{
 				fprintf(stderr, "%s: unsupported operator %s\n", argv[0], argv[3]);
diff --git a/doc/src/sgml/pgbench.sgml b/doc/src/sgml/pgbench.sgml
index b479105..66ec622 100644
--- a/doc/src/sgml/pgbench.sgml
+++ b/doc/src/sgml/pgbench.sgml
@@ -735,7 +735,9 @@ pgbench <optional> <replaceable>options</> </optional> <replaceable>dbname</>
       Each <replaceable>operand</> is either an integer constant or a
       <literal>:</><replaceable>variablename</> reference to a variable
       having an integer value.  The <replaceable>operator</> can be
-      <literal>+</>, <literal>-</>, <literal>*</>, or <literal>/</>.
+      <literal>+</>, <literal>-</>, <literal>*</>, <literal>%</> or <literal>/</>.
+      The modulo operation (<literal>%</>) is based on floored division, so
+      that the remainder keeps the sign of the divisor.
      </para>
Re: [HACKERS] Spinlocks and compiler/memory barriers
Andres Freund and...@2ndquadrant.com writes: On 2014-09-04 14:19:47 +0200, Andres Freund wrote: Yes. I plan to push the patch this weekend. Sorry for the delay. I'm about to push this. Is it ok to first push it to master and only backpatch once a couple buildfarm cycles haven't complained? It makes for a cleaner commit history if you push concurrently into all the branches you intend to patch. That also gives more buildfarm runs, which seems like a good thing for this sort of patch. That is, assuming that we ought to backpatch at all, which to my mind is debatable. regards, tom lane
Re: [HACKERS] Spinlocks and compiler/memory barriers
On Mon, Sep 8, 2014 at 10:07 AM, Tom Lane t...@sss.pgh.pa.us wrote: It makes for a cleaner commit history if you push concurrently into all the branches you intend to patch. That also gives more buildfarm runs, which seems like a good thing for this sort of patch. That is, assuming that we ought to backpatch at all, which to my mind is debatable. We're not going to backpatch the main patch to make spinlock primitives act as compiler barriers - or at least, I will object loudly. But what we're talking about here is a bug fix for Sparc. And surely we ought to back-patch that. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: [HACKERS] PL/pgSQL 2
On Fri, Sep 5, 2014 at 6:18 PM, Andrew Dunstan and...@dunslane.net wrote: On 09/05/2014 12:37 PM, Merlin Moncure wrote: On Thu, Sep 4, 2014 at 6:40 PM, Florian Pflug f...@phlo.org wrote: Cost of hidden IO cast is negative too. If we can change it, then we can increase a sped. But the whole power of PL/pgSQL comes from the fact that it allows you to use the full set of postgres data types and operatores, and that it seamlessly integrated with SQL. Without that, PL/pgSQL is about as appealing as BASIC as a programming language... Right, and it's exactly those types and operators that are the cause of the performance issues. A compiled pl/pgsql would only get serious benefit for scenarios involving tons of heavy iteration or funky local data structure manipulation. Those scenarios are somewhat rare in practice for database applications and often better handled in a another pl should they happen. plv8 is emerging as the best non-sql it's JIT compiled by the plv8 runtime, the javascript language is designed for embedding. and the json data structure has nice similarities with postgres's arrays and types. In fact, if I *were* to attempt pl/pgsql compiling, I'd probably translate the code to plv8 and hand it off to the llvm engine. You'd still have to let postgres handle most of the operator and cast operations but you could pull some things into the plv8 engine. Probably, this would be a net loser since plv8 (unlike plpgsql) has to run everything through SPI. plpgsql makes extensive use of SPI. Just look at the source code if you don't believe me. oh, certainly. pl/pgsql also has the ability to bypass SPI for many simple expressions. Other pls generally don't do this because they can't if they want to guarantee SQL semanticsthat's ok then because they don't have to as the language runtime handles operations local to the function and everything runs under that language's rules. 
In a nutshell, my thinking here is to translate pl/pgsql to pl/v8 javascript and then let the optimizing v8 runtime take it from there. This is IMNSHO a tiny challenge relative to writing an optimization engine for pl/pgsql by hand. Think of it as coffeescript for databases. It's a nice thought, but there's a lot of roadblocks to making it happen -- starting with the lack of a javascript library that would wrap the C postgres datatype routines so you wouldn't have to call in to SPI for every little thing; as you know even i := i + 1; can't be handled by native javascript operations. plv8 also has a nice find_function gadget that lets you find and call another plv8 function directly instead of having to use an SPI call. Yeah -- this is another reason why pl/v8 is nice as a compilation target. javascript as we all know is a language with a long list of pros and cons but it's designed for embedding. merlin
Re: [HACKERS] Spinlocks and compiler/memory barriers
On Mon, Sep 8, 2014 at 10:08:04AM -0400, Robert Haas wrote: On Mon, Sep 8, 2014 at 8:07 AM, Andres Freund and...@2ndquadrant.com wrote: On 2014-09-04 14:19:47 +0200, Andres Freund wrote: Yes. I plan to push the patch this weekend. Sorry for the delay. I'm about to push this. Is it ok to first push it to master and only backpatch once a couple buildfarm cycles haven't complained? That will have the disadvantage that src/tools/git_changelog will show the commits separately instead of grouping them together; so it's probably best not to make a practice of it. But I think it's up to your discretion how to handle it in any particular case. Uh, git_changelog's timespan check is 24 hours, so if the delay is less than 24 hours, I think we are ok, e.g.: # Might want to make this parameter user-settable. my $timestamp_slop = 24 * 60 * 60; -- Bruce Momjian br...@momjian.us http://momjian.us EnterpriseDB http://enterprisedb.com + Everyone has their own god. +
Re: [HACKERS] add modulo (%) operator to pgbench
Hi Fabien-san, Thank you for your fast work! 2014-09-08 23:08 GMT+09:00 Fabien COELHO coe...@cri.ensmp.fr: Hello Mutsumara-san, #3. Documentation I think the modulo operator explanation should be put last in the doc, because the others are more frequently used. So I like patch3, which is simple and practical. Ok. If you agree or reply to my comment, I will mark it ready for committer. Please find attached v4, which is v3 plus improved documentation which is clearer about the sign of the remainder. The attached seems no problem, but I'd like to say something about the order of explanation in the five formulas. A fixed version is here. Please confirm, and I will mark it ready for committer. Best regards, -- Mitsumasa KONDO pgbench-modulo-4-1.patch Description: Binary data
Re: [HACKERS] Spinlocks and compiler/memory barriers
Robert Haas robertmh...@gmail.com writes: But what we're talking about here is a bug fix for Sparc. And surely we ought to back-patch that. Ah. OK, no objection. regards, tom lane
Re: [HACKERS] add modulo (%) operator to pgbench
The attached seems no problem, but I'd like to say something about the order of explanation in the five formulas. A fixed version is here. Please confirm, and I will mark it ready for committer. I'm ok with this version. -- Fabien.
Re: [HACKERS] 9.5: Memory-bounded HashAgg
On Wed, Sep 3, 2014 at 7:34 PM, Tomas Vondra t...@fuzzy.cz wrote: Well, I think you're certainly right that a hash table lookup is more expensive than modulo on a 32-bit integer; so much is obvious. But a hash join can increase the number of batches on the fly only by doubling it, so you might go from 4 batches to 8 when 5 would really have been enough. And a hash join also can't *reduce* the number of batches on the fly, which might matter a lot. Getting the number of batches right avoids I/O, which is a lot more expensive than CPU. Regarding the estimates, I don't see much difference between the two approaches when handling this issue. It's true you can wait with deciding how many partitions (aka batches) to create until work_mem is full, at which point you have more information than at the very beginning. You know how many tuples you've already seen, how many tuples you expect (which is however only an estimate etc.). And you may use that to estimate the number of partitions to create. I think it's significantly better than that. The first point I'd make is that if work_mem never fills up, you don't need to batch anything at all. That's a potentially huge win over batching a join we thought was going to overrun work_mem, but didn't. But even if work_mem does fill up, I think we still come out ahead, because we don't necessarily need to dump the *entirety* of each batch to disk. For example, suppose there are 900 distinct values and only 300 of them can fit in memory at a time. We read the input until work_mem is full and we see a previously-unseen value, so we decide to split the input up into 4 batches. We now finish reading the input. Each previously-seen value gets added to an existing in-memory group, and each new value gets written into one of four disk files. At the end of the input, 300 groups are complete, and we have four files on disk each of which contains the data for 150 of the remaining 600 groups.
Now, the alternative strategy is to batch from the beginning. Here, we decide right from the get-go that we're using 4 batches, so batch #1 goes into memory and the remaining 3 batches get written to three different disk files. At the end of the input, 225 groups are complete, and we have three files on disk each of which contains the data for 225 of the remaining 675 groups. This seems clearly inferior, because we have written 675 groups to disk when it would have been possible to write only 600. The gains can be even more significant when the input data is skewed. For example, suppose things are as above, but ten values account for 90% of all the inputs, and the remaining 890 values account for the other 10% of the inputs. Furthermore, let's suppose we have no table statistics or they are totally wrong. In Jeff's approach, as long as each of those values occurs at least once before work_mem fills up, they'll all be processed in the initial pass through the data, which means we will write at most 10% of the data to disk. In fact it will be a little bit less, because batch 1 will have not only the 10 frequently-occurring values but also 290 others, so our initial pass through the data will complete 300 groups covering (if the less-frequent values occur with uniform frequency) 93.258% of the data. The remaining ~6.7% will be split up into 4 files which we can then reread and process. But if we use the other approach, we'll only get 2 or 3 of the 10 commonly-occurring values in the first batch, so we expect to write about 75% of the data out to one of our three batch files. That's a BIG difference - more than 10x the I/O load that Jeff's approach would have incurred. Now, admittedly, we could use a skew optimization similar to the one we use for hash joins to try to get the MCVs into the first batch, and that would help a lot when the statistics are right - but sometimes the statistics are wrong, and Jeff's approach doesn't care. It just keeps on working.
That however comes at a cost - it's not really a memory-bounded hash aggregate, because you explicitly allow exceeding work_mem as more tuples for existing groups arrive. Well, that would be true for now, but as has been mentioned, we can add new methods to the aggregate infrastructure to serialize and de-serialize transition states. I guess I agree that, in the absence of such infrastructure, your patch might be a better way to handle cases like array_agg, but I'm pretty happy to see that infrastructure get added. Hmm. It occurs to me that it could also be really good to add a merge transition states operator to the aggregate infrastructure. That would allow further improvements to Jeff's approach for cases like array_agg. If we serialize a transition state to disk because it's not fitting in memory, we don't need to reload it before continuing to process the group, or at least not right away. We can instead just start a new transitions state and then merge all of the accumulated
Re: [HACKERS] proposal (9.5) : psql unicode border line styles
Hi I removed dynamic allocation and reduced the patch size. What I tested: the old unicode style is the same as the new unicode style; nothing was changed .. some fields are specified in the refresh_utf8format function Regards Pavel 2014-09-08 4:44 GMT+02:00 Stephen Frost sfr...@snowman.net: Pavel, * Pavel Stehule (pavel.steh...@gmail.com) wrote: 2014-07-23 8:38 GMT+02:00 Tomas Vondra t...@fuzzy.cz: OK, thanks. The new version seems OK to me. Thank you I've started looking over the patch and went back through the previous thread about it. For my part, I'm in favor of adding this capability, but I'm not terribly happy about how it was done. In particular, get_line_style() seems pretty badly hacked around, and I don't really like having the prepare_unicode_format call underneath it allocating memory and then passing back up the need to free that memory via a new field in the structure. Also, on a quick glance, are you sure that the new 'unicode' output matches the same as the old 'unicode' did (with pg_utf8format)? I would think we'd simply set up a structure which is updated when the linestyle is changed, which is surely going to happen much less frequently than the request for which linestyle to use, and handle all of the line styles in more-or-less the same way rather than doing something completely different for unicode than for the others.
Thanks, Stephen

commit 509f8a92525889651653a75356d3fa57b58f3141
Author: Pavel Stehule pavel.steh...@gooddata.com
Date:   Mon Sep 8 17:18:43 2014 +0200

    remove palloc

diff --git a/doc/src/sgml/ref/psql-ref.sgml b/doc/src/sgml/ref/psql-ref.sgml
index db314c3..84233d0 100644
--- a/doc/src/sgml/ref/psql-ref.sgml
+++ b/doc/src/sgml/ref/psql-ref.sgml
@@ -2299,6 +2299,42 @@ lo_import 152801
         </para>
         </listitem>
       </varlistentry>
+
+      <varlistentry>
+      <term><literal>unicode_border_style</literal></term>
+      <listitem>
+      <para>
+      Sets the border line drawing style to one
+      of <literal>single</literal> or <literal>double</literal>
+      This option only affects the <literal>unicode</>
+      linestyle
+      </para>
+      </listitem>
+      </varlistentry>
+
+      <varlistentry>
+      <term><literal>unicode_column_style</literal></term>
+      <listitem>
+      <para>
+      Sets the column line drawing style to one
+      of <literal>single</literal> or <literal>double</literal>
+      This option only affects the <literal>unicode</>
+      linestyle
+      </para>
+      </listitem>
+      </varlistentry>
+
+      <varlistentry>
+      <term><literal>unicode_header_style</literal></term>
+      <listitem>
+      <para>
+      Sets the header line drawing style to one
+      of <literal>single</literal> or <literal>double</literal>
+      This option only affects the <literal>unicode</>
+      linestyle
+      </para>
+      </listitem>
+      </varlistentry>

     </variablelist>
    </para>

diff --git a/src/bin/psql/command.c b/src/bin/psql/command.c
index a66093a..fd05aae 100644
--- a/src/bin/psql/command.c
+++ b/src/bin/psql/command.c
@@ -1054,6 +1054,9 @@ exec_command(const char *cmd,
 		"footer", "format", "linestyle", "null",
 		"numericlocale", "pager", "recordsep",
 		"tableattr", "title", "tuples_only",
+		"unicode_border_linestyle",
+		"unicode_column_linestyle",
+		"unicode_header_linestyle",
 		NULL
 	};
@@ -2248,6 +2251,40 @@ _align2string(enum printFormat in)
 	return "unknown";
 }

+/*
+ * Parse entered unicode linestyle. Returns true, when entered string is
+ * known linestyle: single, double else returns false.
+ */
+static bool
+set_unicode_line_style(printQueryOpt *popt, const char *value, size_t vallen,
+					   unicode_linestyle *linestyle)
+{
+	if (pg_strncasecmp("single", value, vallen) == 0)
+		*linestyle = UNICODE_LINESTYLE_SINGLE;
+	else if (pg_strncasecmp("double", value, vallen) == 0)
+		*linestyle = UNICODE_LINESTYLE_DOUBLE;
+	else
+		return false;
+
+	/* input is ok, generate new unicode style */
+	refresh_utf8format(&(popt->topt));
+
+	return true;
+}
+
+static const char *
+_unicode_linestyle2string(int linestyle)
+{
+	switch (linestyle)
+	{
+		case UNICODE_LINESTYLE_SINGLE:
+			return "single";
+			break;
+		case UNICODE_LINESTYLE_DOUBLE:
+			return "double";
+			break;
+	}
+	return "unknown";
+}

 bool
 do_pset(const char *param, const char *value, printQueryOpt *popt, bool quiet)
@@ -2305,6 +2342,42 @@ do_pset(const char *param, const char *value, printQueryOpt *popt, bool quiet)
 	}

+	/* set unicode border line style */
+	else if (strcmp(param, "unicode_border_linestyle") == 0)
+	{
+		if (!value)
+			;
+		else if (!set_unicode_line_style(popt, value, vallen,
+										 &popt->topt.unicode_border_linestyle))
+		{
+			psql_error("\\pset: allowed unicode border linestyle are single, double\n");
+			return false;
+		}
+	}
+
+	/* set unicode column line style */
+	else if (strcmp(param, "unicode_column_linestyle") == 0)
+	{
Re: [HACKERS] PQputCopyEnd doesn't adhere to its API contract
On Thu, Sep 4, 2014 at 6:38 PM, David Johnston david.g.johns...@gmail.com wrote: The implied suggestion is that if I do find any other areas that look like they need fixing - even in the same file - I should separate them out into a separate patch. Yes. Though I have seen various while I was in there I also fixed such-and-such commits previously so the line is at least somewhat fluid. Yep, it's a judgement call. As a general rule though, I think there's often a pretty clear connection between the main topic of the commit and the changes folded in. Another way to think about this is that, at least for non-committers (and frequently also for committers), any patch someone writes is going to need an upvote from *at least* one other person in order to go in. If somebody can look at the patch and easily determine that it solves some problem and doesn't make anything worse, then they're likely to like it (and if they're a committer, maybe commit it). But if they see changes they like mixed in with changes they don't like, then they're likely to either bounce it back, or just say, hmm, this looks like it will take some time to deal with, let me put that on my TODO list. And of course sometimes it never makes it off the TODO list. Now, it's a fair point that this makes it hard to get large-scale changes done. But, to some extent, that's a good thing. Prudence, indeed, will dictate that source code or documentation long established should not be changed for light or transient causes. For the rest, if you feel a large scale change is really needed, it's best to start with a proposal: I think we need to rehash the documentation in section X because of reason Y. You've made a few proposals to rehash sections of the documentation on pgsql-hackers, but I haven't actually seen clear and compelling justification for those reworkings. 
Clearly, you like it better the new way, but the person who did it the existing way likely prefers their version, and they've been around longer than you. :-) By stating your objectives up-front, you can see whether people agree with those objectives or not. If they do, you can hope to find the final patch criticized only on fine details which are easily remedied; but if they don't, then some rethinking may be needed. This seems pointless. Of course general documentation will be less specific than documentation for specific functions. The existing wording was being verbose in order to be correct. In a summary like this I'd trade being reasonably accurate and general for the precision that is included elsewhere. This is an example of a goal where you might solicit people's general thoughts before starting. Maybe people will agree that removing details in a certain place is useful, or maybe they won't, but it needs discussion. One of the trade-offs I mentioned... it's more style than anything, but removing the parenthetical (if there is not error in the command) and writing it more directly seemed preferable in an overview such as this. Better: The function will either throw an error or return a PGresult object[...] Nope. This is not C++, nor is it the backend. It will not throw anything. + <para> + Second, the application should use the functions in this + section to receive data rows or transmit buffer loads. Buffer loads are + not guaranteed to be processed until the copy transfer is completed. + </para> The main change here vs. the existing text is that you're now using the phrase buffer loads to refer to what gets transmitted, and data rows to refer to what gets received. The existing text uses the term data rows for both, which seems correct to me. My first reaction on reading your revised text was wait, what's a buffer load?
So, my generalization policy working in reverse - since the transmit side does not have to be in complete rows, implying that they are here is (albeit acceptably) inaccurate. The existing wording doesn't say that each call to one of the functions in question must contain only whole data rows of itself. It merely says that these are the functions you need to use to send data rows, which is true. - At this point further SQL commands can be issued via - <function>PQexec</function>. (It is not possible to execute other SQL - commands using the same connection while the <command>COPY</command> - operation is in progress.) Removing this text doesn't seem like a good idea. It's a quite helpful clarification. The note you've added in its place doesn't seem like a good substitute for it, and more generally, I think we should avoid the overuse of constructs like <note>. Emphasis needs to be used minimally or it loses its meaning. Was trying to remove repetition here - happy to consider an alternative way of doing so if the note is objectionable. I guess it doesn't seem repetitive to me. It's great to be able to read the documentation like a book but it also wants to be useful for scanning and a
Re: [HACKERS] gist vacuum gist access
On 09/08/2014 03:26 PM, Alexander Korotkov wrote: On Mon, Sep 8, 2014 at 12:51 PM, Heikki Linnakangas hlinnakan...@vmware.com wrote: On 09/08/2014 11:19 AM, Alexander Korotkov wrote: On Mon, Sep 8, 2014 at 12:08 PM, Alexander Korotkov aekorot...@gmail.com wrote: On Mon, Sep 8, 2014 at 11:13 AM, Heikki Linnakangas hlinnakan...@vmware.com wrote: In the b-tree code, we solved that problem back in 2006, so it can be done but requires a bit more code. In b-tree, we solved it with a vacuum cycle ID number that's set on the page halves when a page is split. That allows VACUUM to identify pages that have been split concurrently when it sees them, and jump back to vacuum them too. See commit http://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=5749f6ef0cc1c67ef9c9ad2108b3d97b82555c80. It should be possible to do something similar in GiST, and in fact you might be able to reuse the NSN field that's already set on the page halves on split, instead of adding a new vacuum cycle ID. ... Another note. Assuming we have an NSN which can play the role of vacuum cycle ID, can we implement a sequential (with possible jump back) index scan for GiST? Yeah, I think it would work. It's pretty straightforward, the page split code already sets the NSN, just when we need it. Vacuum needs to memorize the current NSN when it begins, and whenever it sees a page with a higher NSN (or the FOLLOW_RIGHT flag is set), follow the right-link if it points to a lower-numbered page. I mean a full index scan feature for SELECT queries might be implemented as well as sequential VACUUM. Oh, sorry, I missed that. If you implement a full-index scan like that, you might visit some tuples twice, so you'd have to somehow deal with the duplicates. For a bitmap index scan it would be fine. - Heikki
Re: PENDING_LIST_CLEANUP_SIZE - maximum size of GIN pending list Re: [HACKERS] HEAD seems to generate larger WAL regarding GIN index
On Sun, Aug 17, 2014 at 7:46 PM, Fujii Masao masao.fu...@gmail.com wrote: Thanks for reviewing the patch! ISTM that I failed to make the patch from my git repository... Attached is the rebased version. I get some compiler warnings on v2 of this patch: reloptions.c:219: warning: excess elements in struct initializer reloptions.c:219: warning: (near initialization for 'intRelOpts[15]') Cheers, Jeff
Re: [HACKERS] [v9.5] Custom Plan API
On Thu, Sep 4, 2014 at 7:57 PM, Kouhei Kaigai kai...@ak.jp.nec.com wrote: Regarding to the attached three patches: [1] custom-path and hook It adds register_custom_path_provider() interface for registration of custom-path entrypoint. Callbacks are invoked on set_plain_rel_pathlist to offer alternative scan path on regular relations. I may need to explain the terms in use. I calls the path-node custom-path that is the previous step of population of plan-node (like custom-scan and potentially custom-join and so on). The node object created by CreateCustomPlan() is called custom-plan because it is abstraction for all the potential custom-xxx node; custom-scan is the first of all. I don't think it's a good thing that add_custom_path_type is declared as void (*)(void *) rather than having a real type. I suggest we add the path-creation callback function to CustomPlanMethods instead, like this: void (*CreateCustomScanPath)(PlannerInfo *root, RelOptInfo *baserel, RangeTblEntry *rte); Then, register_custom_path_provider() can just take CustomPathMethods * as an argument; and create_customscan_paths can just walk the list of CustomPlanMethods objects and call CreateCustomScanPath for each one where that is non-NULL. This conflates the path generation mechanism with the type of path getting generated a little bit, but I don't see any real downside to that. I don't see a reason why you'd ever want two different providers to offer the same type of custompath. Don't the changes to src/backend/optimizer/plan/createplan.c belong in patch #2? [2] custom-scan node It adds custom-scan node support. The custom-scan node is expected to generate contents of a particular relation or sub-plan according to its custom-logic. Custom-scan provider needs to implement callbacks of CustomScanMethods and CustomExecMethods. Once a custom-scan node is populated from custom-path node, the backend calls back these methods in the planning and execution stage. 
It looks to me like this patch is full of holdovers from its earlier life as a more-generic CustomPlan node. In particular, it contains numerous defenses against the case where scanrelid != 0. These are confusingly written as scanrelid > 0, but I think really they're just bogus altogether: if this is specifically a CustomScan, not a CustomPlan, then the relid should always be filled in. Please consider what can be simplified here. The comment in _copyCustomScan looks bogus to me. I think we should *require* a static method table. In create_custom_plan, you do if (IsA(custom_plan, CustomScan)) { lots of stuff; } else elog(ERROR, ...). I think it would be clearer to write if (!IsA(custom_plan, CustomScan)) elog(ERROR, ...); lots of stuff; Also, regarding the use-case of the multi-exec interface: below is an EXPLAIN output of PG-Strom. It shows the custom GpuHashJoin has two sub-plans; GpuScan and MultiHash. GpuHashJoin is stacked on the GpuScan. It is a case when these nodes utilize the multi-exec interface for more efficient data exchange between the nodes. GpuScan already keeps a data structure that is suitable to send to/recv from GPU devices and constructed on a memory segment being DMA available. If we have to form a tuple, pass it via a row-by-row interface, then deform it, it will become a major performance degradation in this use case.
postgres=# explain select * from t10 natural join t8 natural join t9 where x > 10;
                                   QUERY PLAN
---
 Custom (GpuHashJoin)  (cost=10979.56..90064.15 rows=333 width=49)
   pseudo scan tlist: 1:(t10.bid), 3:(t10.aid), 4:t10.x, 2:t8.data, 5:[t8.aid], 6:[t9.bid]
   hash clause 1: ((t8.aid = t10.aid) AND (t9.bid = t10.bid))
   ->  Custom (GpuScan) on t10  (cost=1.00..88831.26 rows=327 width=16)
         Host References: aid, bid, x
         Device References: x
         Device Filter: (x > 10::double precision)
   ->  Custom (MultiHash)  (cost=464.56..464.56 rows=1000 width=41)
         hash keys: aid, bid
         ->  Hash Join  (cost=60.06..464.56 rows=1000 width=41)
               Hash Cond: (t9.data = t8.data)
               ->  Index Scan using t9_pkey on t9  (cost=0.29..357.29 rows=1 width=37)
               ->  Hash  (cost=47.27..47.27 rows=1000 width=37)
                     ->  Index Scan using t8_pkey on t8  (cost=0.28..47.27 rows=1000 width=37)
 Planning time: 0.810 ms
(15 rows)

Why can't the Custom(GpuHashJoin) node build the hash table internally instead of using a separate node? Also, for this patch we are only considering custom scan. Custom join is another patch. We don't need to provide infrastructure for that patch in this one. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: [HACKERS] Scaling shared buffer eviction
On Fri, Sep 5, 2014 at 6:47 AM, Amit Kapila amit.kapil...@gmail.com wrote:

Client Count / Patch_Ver (tps)    8      16      32      64      128
HEAD                              58614  107370  140717  104357  65010
Patch                             60092  113564  165014  213848  216065

This data is the median of 3 runs; a detailed report is attached with the mail. I have not repeated the test for all configurations, as there is no major change in design/algorithm which can affect performance. Mark has already taken tpc-b data which ensures that there is no problem with it, however I will also take it once with the latest version. Well, these numbers are pretty much amazing. Question: It seems there's obviously quite a bit of contention left; do you think there's still a significant amount of time in the clock sweep, or is the primary bottleneck the buffer mapping locks? merlin
Re: [HACKERS] pgcrypto: PGP signatures
On Fri, Sep 5, 2014 at 4:38 AM, Marko Tiikkaja ma...@joh.to wrote: Hi all, I've updated the patch with a number of changes: 1) I've documented the current limitations of signatures 2) I've expanded section F.25.3 to add information about signatures (though I'm not sure why this part is in the user-facing documentation in the first place). 3) I've changed the code to use ntohl() and pg_time_t as per Thomas' comments. 4) I've changed the code to consistently use while (1) instead of for (;;) (except for the math library, but I didn't touch that at all) I've also changed the behaviour when passing a message with a signature to the decrypt functions which don't verify signatures. They now report ERROR: Wrong key or corrupt data instead of decrypting and silently ignoring the signature. The behaviour is now backwards compatible, but I see two ways we could possibly improve this: 1) Produce a better error message (I'm sure most people don't know about the hidden debug=1 setting) 2) Provide an option to ignore the signature if decrypting the data is desirable even if the signature can't be verified If I understand the sequence here: the current git HEAD is that pgp_pub_decrypt would throw an error if given a signed and encrypted message, an earlier version of your patch changed that to decrypt the message and ignore the signature, and the current version went back to throwing an error. I think I prefer the middle of those behaviors. The original behavior seems like a bug to me, and I don't think we need to be backwards compatible with bugs. Why should a function called decrypt care if the message is also signed? That is not its job. If we decide to throw the error, a better error message certainly wouldn't hurt. And the output of 'debug=1' is generally not comprehensible unless you are familiar with the source code, so that is not a substitute. (By the way, there are now 2 patches in this series named pgcrypto_sigs.v3.patch--so be careful which one you look at.)
There seems to be a memory leak in pgp_sym_decrypt_verify that does not exist in pgp_sym_decrypt. It is about 58 bytes per decryption. Perl test script:

my $dbh = connect(...);
my $pub = `cat public.asc`;
my $pri = `cat private.asc`;
my $enc = $dbh->prepare("select armor(pgp_sym_encrypt_sign('asdlkfjsldkfjsadf',?,dearmor(?),'debug=1'))");
my $dec = $dbh->prepare("select pgp_sym_decrypt_verify(dearmor(?),?,dearmor(?),'debug=1')");
my $i = 1;
$enc->execute("foobar$i", $pri);
my ($message) = $enc->fetchrow();
foreach my $ii (1..100) {
    ## my $i = $ii;
    $dec->execute($message, "foobar$i", $pub);
    my ($message2) = $dec->fetchrow();
    die unless $message2 eq "asdlkfjsldkfjsadf";
    warn "$i\t", time() if $i % 1000 == 0;
};

Cheers, Jeff
Re: [HACKERS] implement subject alternative names support for SSL connections
On 09/05/2014 07:30 PM, Alexey Klyukin wrote: On Thu, Sep 4, 2014 at 10:23 AM, Heikki Linnakangas hlinnakan...@vmware.com wrote: Hmm. Perhaps we should use X509_NAME_get_index_by_NID + X509_NAME_get_entry instead of X509_NAME_get_text_by_NID. You could then pass the ASN1_STRING object to the certificate_name_entry_validate_match() function, and have it do the ASN1_STRING_length() and ASN1_STRING_data() calls too. ... I think we should: 1. Check if there's a common name, and if so, print that 2. Check if there is exactly one SAN, and if so, print that 3. Just print an error without mentioning names. There's a lot of value in printing the name if possible, so I'd really like to keep that. But I agree that printing all the names if there are several would get complicated and the error message could become very long. Yeah, the error message might need to be different for cases 1 and 2. Or maybe phrase it server certificate's name \%s\ does not match host name \%s\, which would be reasonable for both 1. and 2. Thank you, I've implemented both suggestions in the attached new version of the patch. On the error message, I've decided to show either a single name, or the first examined name and the number of other names present in the certificate, i.e: psql: server name example.com and 2 other names from server SSL certificate do not match host name example-foo.com The error does not state whether the names comes from the CN or from the SAN section. I'd reword that slightly, to: psql: server certificate for example.com (and 2 other names) does not match host name example-foo.com I never liked the current wording, server name foo very much. I think it's important to use the word server certificate in the error message, to make it clear where the problem is. For translations, that message should be pluralized, but there is no libpq_ngettext macro equivalent to ngettext(), like libpq_gettext. 
I wonder if that was left out on purpose, or if we just haven't needed that in libpq before. Anyway, I think we need to add that for this. This version also checks for the availability of the subject name, it looks like RFC 5280 does not require it at all. 4.1.2.6. Subject The subject field identifies the entity associated with the public key stored in the subject public key field. The subject name MAY be carried in the subject field and/or the subjectAltName extension. Ok. The pattern of allocating the name in the subroutine and then reporting it (and releasing when necessary) in the main function is a little bit ugly, but it looks to me the least ugly of anything else I could come up (i.e. extracting those names at the time the error message is shown). I reworked that a bit, see attached. What do you think? I think this is ready for commit, except for two things: 1. The pluralization of the message for translation 2. I still wonder if we should follow the RFC 6125 and not check the Common Name at all, if SANs are present. I don't really understand the point of that rule, and it seems unlikely to pose a security issue in practice, because surely a CA won't sign a certificate with a bogus/wrong CN, because an older client that doesn't look at the SANs at all would use the CN anyway. But still... what does a Typical Web Browser do? 
- Heikki

diff --git a/src/interfaces/libpq/fe-secure-openssl.c b/src/interfaces/libpq/fe-secure-openssl.c
index f950fc3..4f6f324 100644
--- a/src/interfaces/libpq/fe-secure-openssl.c
+++ b/src/interfaces/libpq/fe-secure-openssl.c
@@ -60,9 +60,13 @@
 #ifdef USE_SSL_ENGINE
 #include <openssl/engine.h>
 #endif
+#include <openssl/x509v3.h>

 static bool verify_peer_name_matches_certificate(PGconn *);
 static int	verify_cb(int ok, X509_STORE_CTX *ctx);
+static int	verify_peer_name_matches_certificate_name(PGconn *conn,
+									  ASN1_STRING *name,
+									  char **store_name);
 static void destroy_ssl_system(void);
 static int	initialize_SSL(PGconn *conn);
 static PostgresPollingStatusType open_client_SSL(PGconn *);
@@ -473,98 +477,229 @@ wildcard_certificate_match(const char *pattern, const char *string)

 /*
- * Verify that common name resolves to peer.
+ * Check if a name from a server's certificate matches the peer's hostname.
+ *
+ * Returns 1 if the name matches, and 0 if it does not. On error, returns
+ * -1, and sets the libpq error message.
+ *
+ * The name extracted from the certificate is returned in *store_name. The
+ * caller is responsible for freeing it.
  */
-static bool
-verify_peer_name_matches_certificate(PGconn *conn)
+static int
+verify_peer_name_matches_certificate_name(PGconn *conn, ASN1_STRING *name_entry,
+										  char **store_name)
 {
-	char
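For readers unfamiliar with the wildcard_certificate_match() helper referenced in the hunk header above, the single-label wildcard rule it embodies can be sketched in isolation. This is a simplified, standalone sketch — not libpq's actual code; the function name wildcard_name_match and its exact rules (wildcard only as the whole leftmost label, case-insensitive labels) are illustrative:

```c
#include <assert.h>
#include <ctype.h>
#include <string.h>

/*
 * Simplified sketch of single-label wildcard matching, in the spirit of
 * libpq's wildcard_certificate_match().  Not the actual implementation.
 * A pattern like "*.example.com" matches exactly one leftmost label.
 * Returns 1 on match, 0 otherwise.
 */
static int
wildcard_name_match(const char *pattern, const char *hostname)
{
	const char *h_dot;
	size_t		i;

	/* Plain pattern: case-insensitive exact comparison */
	if (pattern[0] != '*')
	{
		if (strlen(pattern) != strlen(hostname))
			return 0;
		for (i = 0; pattern[i] != '\0'; i++)
			if (tolower((unsigned char) pattern[i]) !=
				tolower((unsigned char) hostname[i]))
				return 0;
		return 1;
	}

	/* Wildcard must be the entire first label: a "*." prefix */
	if (pattern[1] != '.')
		return 0;

	/* Hostname must have at least one label before the matched suffix */
	h_dot = strchr(hostname, '.');
	if (h_dot == NULL || h_dot == hostname)
		return 0;

	/* Compare ".example.com" against the hostname's own suffix */
	return wildcard_name_match(pattern + 1, h_dot);
}
```

Note that with these rules "*.example.com" matches "foo.example.com" but neither "example.com" (no label to consume) nor "a.b.example.com" (the wildcard spans only one label) — the same behavior the thread relies on.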
Re: [HACKERS] BRIN indexes - TRAP: BadArgument
On Mon, September 8, 2014 18:02, Alvaro Herrera wrote: Here's version 18. I have renamed it: these are now BRIN indexes. I get into a BadArgument after:

$ cat crash.sql
-- drop table if exists t_100_000_000 cascade;
create table t_100_000_000 as
   select cast(i as integer) from generate_series(1, 100000000) as f(i);
-- drop index if exists t_100_000_000_i_brin_idx;
create index t_100_000_000_i_brin_idx on t_100_000_000 using brin(i);
select pg_size_pretty(pg_relation_size('t_100_000_000_i_brin_idx'));
select i from t_100_000_000 where i between 1 and 100;  -- ( + 99 )

Log file says:

TRAP: BadArgument("!(((context) != ((void *) 0) && ((((const Node *) ((context)))->type) == T_AllocSetContext)))", File: "mcxt.c", Line: 752)
2014-09-08 19:54:46.071 CEST 30151 LOG:  server process (PID 30336) was terminated by signal 6: Aborted
2014-09-08 19:54:46.071 CEST 30151 DETAIL:  Failed process was running: select i from t_100_000_000 where i between 1 and 100;

The crash is caused by the last select statement; the table and index create are OK. It only happens with a largish table; small tables are OK. Linux / Centos / 32 GB.
PostgreSQL 9.5devel_minmax_20140908_1809_0640c1bfc091 on x86_64-unknown-linux-gnu, compiled by gcc (GCC) 4.9.1, 64-bit

         setting          |             current_setting
--------------------------+---------------------------------------------
 autovacuum               | off
 port                     | 6444
 shared_buffers           | 100MB
 effective_cache_size     | 4GB
 work_mem                 | 10MB
 maintenance_work_mem     | 1GB
 checkpoint_segments      | 20
 data_checksums           | on
 server_version           | 9.5devel_minmax_20140908_1809_0640c1bfc091
 pg_postmaster_start_time | 2014-09-08 19:53 (uptime: 0d 0h 6m 54s)

'--prefix=/var/data1/pg_stuff/pg_installations/pgsql.minmax' '--with-pgport=6444' '--bindir=/var/data1/pg_stuff/pg_installations/pgsql.minmax/bin' '--libdir=/var/data1/pg_stuff/pg_installations/pgsql.minmax/lib' '--enable-depend' '--enable-cassert' '--enable-debug' '--with-perl' '--with-openssl' '--with-libxml' '--with-extra-version=_minmax_20140908_1809_0640c1bfc091'

pgpatches/0095/minmax/20140908/minmax-18.patch

thanks, Erik Rijkers -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] pgcrypto: PGP signatures
On 2014-09-08 7:30 PM, Jeff Janes wrote: On Fri, Sep 5, 2014 at 4:38 AM, Marko Tiikkaja ma...@joh.to wrote: I've also changed the behaviour when passing a message with a signature to the decrypt functions which don't verify signatures. They now report ERROR: Wrong key or corrupt data instead of decrypting and silently ignoring the signature. The behaviour is now backwards compatible, but I see two ways we could possibly improve this: 1) Produce a better error message (I'm sure most people don't know about the hidden debug=1 setting) 2) Provide an option to ignore the signature if decrypting the data is desirable even if the signature can't be verified If I understand the sequence here: the current git HEAD is that pgp_pub_decrypt would throw an error if given a signed and encrypted message, an earlier version of your patch changed that to decrypt the message and ignore the signature, and the current version went back to throwing an error. You got that right, yes. I think I prefer the middle of those behaviors. The original behavior seems like a bug to me, and I don't think we need to be backwards compatible with bugs. Why should a function called decrypt care if the message is also signed? That is not its job. Yeah, that seems reasonable, I guess. I'm kind of torn between the two behaviours, to be honest. But perhaps it would make sense to change back to the previous behaviour (i.e. go back to the way this patch was earlier) and document that somewhere. There seems to be a memory leak in pgp_sym_decrypt_verify that does not exist in pgp_sym_decrypt. It is about 58 bytes per decryption. Interesting. Thanks! I'll have a look. .marko
Re: [HACKERS] BRIN indexes - TRAP: BadArgument
Erik Rijkers wrote: Log file says: TRAP: BadArgument("!(((context) != ((void *) 0) && ((((const Node *) ((context)))->type) == T_AllocSetContext)))", File: "mcxt.c", Line: 752) 2014-09-08 19:54:46.071 CEST 30151 LOG: server process (PID 30336) was terminated by signal 6: Aborted 2014-09-08 19:54:46.071 CEST 30151 DETAIL: Failed process was running: select i from t_100_000_000 where i between 1 and 100; A double-free mistake -- here's a patch. Thanks. -- Álvaro Herrera http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Training Services

diff --git a/src/backend/access/brin/brin.c b/src/backend/access/brin/brin.c
index c89a167..6ac012c 100644
--- a/src/backend/access/brin/brin.c
+++ b/src/backend/access/brin/brin.c
@@ -388,10 +388,7 @@ bringetbitmap(PG_FUNCTION_ARGS)
 							   PointerGetDatum(key));
 				addrange = DatumGetBool(add);
 				if (!addrange)
-				{
-					brin_free_tuple(tup);
 					break;
-				}
 			}
 		}
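The fix above removes an explicit free because the same tuple is released again by later cleanup — a classic double free. A toy illustration of the ownership rule at work (this is not PostgreSQL's MemoryContext code; ToyContext and all names here are made up): once a chunk is owned by a context that frees everything at reset, an early free must also unregister the chunk, or the reset frees it a second time.

```c
#include <assert.h>
#include <stdlib.h>

/*
 * Toy "memory context": every allocation is registered with its context,
 * and ctx_reset() frees everything still registered.  Loosely mimics why
 * the explicit brin_free_tuple() before "break" was wrong: the tuple was
 * freed again by later cleanup.
 */
#define CTX_MAX 16

typedef struct ToyContext
{
	void	   *chunks[CTX_MAX];
	int			nchunks;
} ToyContext;

static void *
ctx_alloc(ToyContext *ctx, size_t size)
{
	void	   *p = malloc(size);

	assert(p != NULL && ctx->nchunks < CTX_MAX);
	ctx->chunks[ctx->nchunks++] = p;
	return p;
}

/* Free one chunk early AND forget it, so ctx_reset() won't free it again. */
static void
ctx_free_one(ToyContext *ctx, void *p)
{
	int			i;

	for (i = 0; i < ctx->nchunks; i++)
	{
		if (ctx->chunks[i] == p)
		{
			free(p);
			ctx->chunks[i] = ctx->chunks[--ctx->nchunks];
			return;
		}
	}
	/* A plain free() without this bookkeeping is the double-free bug. */
	assert(!"double free: chunk not owned by context");
}

static void
ctx_reset(ToyContext *ctx)
{
	while (ctx->nchunks > 0)
		free(ctx->chunks[--ctx->nchunks]);
}
```

The patch takes the simpler road available in a context-based allocator: skip the early free entirely and let the context reclaim everything at once.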
Re: [HACKERS] Scaling shared buffer eviction
On Fri, Sep 5, 2014 at 9:19 AM, Amit Kapila amit.kapil...@gmail.com wrote: On Fri, Sep 5, 2014 at 5:17 PM, Amit Kapila amit.kapil...@gmail.com wrote: Apart from the above, I think for this patch a catversion bump is required, as I have modified the system catalog. However I have not done the same in the patch, as otherwise it will be a bit difficult to take performance data. One regression failed on Linux due to a spacing issue, which is fixed in the attached patch. I took another read through this patch. Here are some further review comments:

1. In BgMoveBuffersToFreelist(), it seems quite superfluous to have both num_to_free and tmp_num_to_free. I'd get rid of tmp_num_to_free and move the declaration of num_to_free inside the outer loop. I'd also move the definitions of tmp_next_to_clean, tmp_recent_alloc, tmp_recent_backend_clocksweep into the innermost scope in which they are used.

2. Also in that function, I think the innermost bit of logic could be rewritten more compactly, and in such a way as to make it clearer for what set of instructions the buffer header will be locked.

LockBufHdr(bufHdr);
if (bufHdr->refcount == 0)
{
	if (bufHdr->usage_count > 0)
		bufHdr->usage_count--;
	else
		add_to_freelist = true;
}
UnlockBufHdr(bufHdr);
if (add_to_freelist && StrategyMoveBufferToFreeListEnd(bufHdr))
	num_to_free--;

3. This comment is now obsolete:

+	/*
+	 * If bgwriterLatch is set, we need to waken the bgwriter, but we should
+	 * not do so while holding freelist_lck; so set it after releasing the
+	 * freelist_lck. This is annoyingly tedious, but it happens at most once
+	 * per bgwriter cycle, so the performance hit is minimal.
+	 */

We're not actually holding any lock in need of releasing at that point in the code, so this can be shortened to "If bgwriterLatch is set, we need to waken the bgwriter."

* Ideally numFreeListBuffers should get called under freelist spinlock

That doesn't make any sense. numFreeListBuffers is a variable, so you can't call it.
The value should be *read* under the spinlock, but it is. I think this whole comment can be deleted and replaced with "If the number of free buffers has fallen below the low water mark, awaken the bgreclaimer to repopulate it."

4. StrategySyncStartAndEnd() is kind of a mess. One, it can return the same victim buffer that's being handed out - at almost the same time - to a backend running the clock sweep; if it does, they'll fight over the buffer. Two, the *end out parameter actually returns a count, not an endpoint. I think we should have BgMoveBuffersToFreelist() call StrategySyncNextVictimBuffer() at the top of the inner loop rather than the bottom, and change StrategySyncStartAndEnd() so that it knows nothing about victimbuf_lck. Let's also change StrategyGetBuffer() to call StrategySyncNextVictimBuffer() so that the logic is centralized in one place, and rename StrategySyncStartAndEnd() to something that better matches its revised purpose. Maybe StrategyGetReclaimInfo().

5. Have you tested that this new bgwriter statistic is actually working? Because it looks to me like BgMoveBuffersToFreelist is changing BgWriterStats but never calling pgstat_send_bgwriter(), which I'm thinking will mean the counters accumulate forever inside the reclaimer but never get forwarded to the stats collector.

6. StrategyInitialize() uses #defines for HIGH_WATER_MARK_FREELIST_BUFFERS_PERCENT and LOW_WATER_MARK_FREELIST_BUFFERS_PERCENT but inline constants (5, 2000) for clamping. Let's have constants for all of those (and omit mention of the specific values in the comments).

7. The line you've added to the definition of view pg_stat_bgwriter doesn't seem to be indented the same way as all the others. Tab vs. space problem?

-- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
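The rewrite Robert suggests in point 2 can be mocked up as a standalone loop. This is a simplified sketch only: the lock macros are no-ops here, and BufDesc, move_to_freelist, and sweep are illustrative stand-ins, not the real bufmgr code — the point is just the shape of the logic (decide under the header lock, act on the freelist after releasing it):

```c
#include <assert.h>
#include <stdbool.h>

/* Illustrative stand-in for a buffer header; not the real BufferDesc. */
typedef struct BufDesc
{
	int			refcount;		/* pins held */
	int			usage_count;	/* clock-sweep popularity counter */
	bool		on_freelist;
} BufDesc;

/* Stand-ins for LockBufHdr/UnlockBufHdr; a spinlock in the real code. */
#define LockBufHdr(b)	((void) 0)
#define UnlockBufHdr(b)	((void) 0)

static bool
move_to_freelist(BufDesc *buf)
{
	if (buf->on_freelist)
		return false;			/* already there: nothing freed */
	buf->on_freelist = true;
	return true;
}

/* One reclaimer pass: returns how many buffers were actually freed. */
static int
sweep(BufDesc *bufs, int nbufs, int num_to_free)
{
	int			freed = 0;
	int			i;

	for (i = 0; i < nbufs && freed < num_to_free; i++)
	{
		BufDesc    *buf = &bufs[i];
		bool		add_to_freelist = false;

		LockBufHdr(buf);
		if (buf->refcount == 0)
		{
			if (buf->usage_count > 0)
				buf->usage_count--;		/* give it another chance */
			else
				add_to_freelist = true; /* cold and unpinned: evict */
		}
		UnlockBufHdr(buf);

		/* freelist manipulation happens outside the header lock */
		if (add_to_freelist && move_to_freelist(buf))
			freed++;
	}
	return freed;
}
```

The compact form makes the locked region obvious: only the refcount/usage_count inspection happens under the header lock, while the freelist push is done afterwards.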
Re: [HACKERS] bad estimation together with large work_mem generates terrible slow hash joins
On Fri, Sep 5, 2014 at 3:23 PM, Tomas Vondra t...@fuzzy.cz wrote: as Heikki mentioned in his commitfest status message, this patch still has no reviewers :-( Is there anyone willing to pick that up, together with the 'dense allocation' patch (as those two are closely related)? I'm looking in Robert's direction, as he's the one who came up with the dense allocation idea, and also commented on the hashjoin bucket resizing patch ... I'd ask Pavel Stehule, but as he's sitting next to me in the office he's not really independent ;-) I'll do some reviews on the other patches over the weekend, to balance this of course.

Will any of those patches be, heh heh, mine?

I am a bit confused by the relationship between the two patches you posted. The combined patch applies, but the other one does not. If this is a sequence of two patches, it seems like it would be more helpful to post A and B rather than B and A+B. Perhaps the other patch is on a different thread, but there's not an obvious reference to said thread here.

In ExecChooseHashTableSize(), if buckets_bytes is independent of nbatch, could we just compute it before working out dbatch, and just deduct it from hash_table_bytes? That seems like it'd do the same thing for less work. Either way, what happens if buckets_bytes is already bigger than hash_table_bytes?

A few more stylistic issues that I see:

+ if ((hashtable->nbatch == 1)
+ if (hashtable->nbuckets_optimal <= (INT_MAX/2))
+ if (size > (HASH_CHUNK_SIZE/8))

While I'm all in favor of using parentheses generously where it's needed to add clarity, these and similar usages seem over the top to me. On a related note, the first of these reads like this

if (stuff)
{
	if (more stuff)
	{
		do things
	}
}

which makes one wonder why we need two if statements.
+
+	/* used for dense allocation of tuples (into linked chunks) */
+	HashChunk	chunks_batch;	/* one list for the whole batch */
+
 } HashJoinTableData;

If the original coding didn't have a blank line between the last structure member and the closing brace, it's probably not right to make it that way now. There are similar issues at the end of some functions you modified, and a few other places (like the new code in ExecChooseHashTableSize and chunk_alloc) where there are extra blank lines at the starts and ends of blocks.

+char * chunk_alloc(HashJoinTable hashtable, int size) {

Eh... no.

+ /* XXX maybe we should use MAXALIGN(size) here ... */

+1.

-- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
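For readers following along, the "dense allocation" idea under review — packing tuples into large chunks that are freed wholesale instead of one palloc per tuple — can be sketched independently. All names and sizes below are illustrative (this is not the patch's actual chunk_alloc); in particular, the real code keeps dedicated oversized chunks out of the way of the current chunk, which this sketch glosses over:

```c
#include <assert.h>
#include <stdlib.h>

/* Illustrative chunk size and alignment; not the patch's actual values. */
#define CHUNK_SIZE	(32 * 1024)
#define ALIGNOF_MAX	8
#define MAXALIGN(s)	(((s) + (ALIGNOF_MAX - 1)) & ~(size_t) (ALIGNOF_MAX - 1))

typedef struct Chunk
{
	struct Chunk *next;			/* singly linked list of chunks */
	size_t		maxlen;			/* capacity of data[] */
	size_t		used;			/* bytes handed out so far */
	char		data[];			/* tuples are packed in here */
} Chunk;

static Chunk *
chunk_create(Chunk **head, size_t datalen)
{
	Chunk	   *c = malloc(sizeof(Chunk) + datalen);

	assert(c != NULL);
	c->maxlen = datalen;
	c->used = 0;
	c->next = *head;
	*head = c;
	return c;
}

static char *
chunk_alloc(Chunk **head, size_t size)
{
	Chunk	   *c = *head;

	size = MAXALIGN(size);

	/* Oversized requests get a dedicated chunk of their own. */
	if (size > CHUNK_SIZE / 8)
	{
		Chunk	   *dedicated = chunk_create(head, size);

		dedicated->used = size;
		return dedicated->data;
	}

	if (c == NULL || c->used + size > c->maxlen)
		c = chunk_create(head, CHUNK_SIZE);

	c->used += size;
	return c->data + c->used - size;
}

/* The payoff: the whole batch is released by walking the chunk list. */
static void
chunk_free_all(Chunk **head)
{
	while (*head)
	{
		Chunk	   *next = (*head)->next;

		free(*head);
		*head = next;
	}
}
```

This is also why the bucket-resize patch builds on dense allocation: rebuilding buckets only needs a walk over the chunk list, and per-tuple allocator overhead (which the thread estimates as significant) disappears.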
Re: [HACKERS] pg_background (and more parallelism infrastructure patches)
On Mon, Sep 8, 2014 at 1:09 AM, Amit Kapila amit.kapil...@gmail.com wrote: Don't you need a PG_TRY block here to reset pq_mq_busy? No. If shm_mq_sendv is interrupted, we can't use the shm_mq any more. But since that should only happen if an interrupt arrives while the queue is full, I think that's OK. I think here not only on interrupt, but any other error in this function shm_mq_sendv() path (one example is WaitLatch()) could lead to the same behaviour. Probably true. But I think we generally count on that to be no-fail, or close to it, so I'm not really worried about it. (Think about the alternatives: if the queue is full, we have no way of notifying the launching process without waiting for it to retrieve the results, but it might not do that right away, and if we've been killed we need to die *now* not later.) So in such cases what is the advice to users? Currently they will see the below message: postgres=# select * from pg_background_result(5124) as (x int); ERROR: lost connection to worker process with PID 5124 One way is to ask them to check logs, but what about if they want to handle the error and take some action? They have to check the logs. To reiterate what I said before, there is no reasonable way to both have the background worker terminate quickly and also guarantee that the full error message is received by the process that started it. You have to pick one, and I stick by the one I picked. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: [HACKERS] pg_background (and more parallelism infrastructure patches)
On Mon, Sep 8, 2014 at 2:05 AM, Amit Kapila amit.kapil...@gmail.com wrote: Another point about error handling is that to execute the sql in function pg_background_worker_main(), it starts the transaction which I think doesn't get aborted if error occurs For this I was just referring error handling code of StartBackgroundWorker(), however during shutdown of process, it will call AbortOutOfAnyTransaction() which will take care of aborting transaction. Yeah. and similarly handling for timeout seems to be missing in error path. As we are anyway going to exit on error, so not sure, if this will be required, however having it for clean exit seems to be better. Can you be more specific? I'm generally of the view that there's little point in spending time cleaning things up that will go away anyway when the process exits. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: [HACKERS] ALTER TABLESPACE MOVE command tag tweak
Stephen Frost wrote: Alvaro, * Alvaro Herrera (alvhe...@2ndquadrant.com) wrote: ALTER TABLE ALL IN TABLESPACE xyz which AFAICS should work since ALL is already a reserved keyword. Pushed to master and REL9_4_STABLE. Thanks. One more tweak --- the whole reason for fiddling with this is to ensure that event triggers support this operation. Therefore this node should be handled by ProcessUtilitySlow, not standard_ProcessUtility, as in the attached patch. (I checked the documentation for necessary updates; turns out that the table in the event triggers chapter says that ddl_command_end etc. support the command ALTER TABLE, and since this is the tag returned by the new ALTER TABLE ALL IN TABLESPACE command, there is no update needed. In fact, one can argue that the table is wrong currently because it doesn't say that ALTER TABLE ALL IN TABLESPACE is not supported.) I propose this for 9.4 too. Apologies on it taking so long - things have been a bit interesting for me over the past month or two. :) I bet they have!
Have fun, -- Álvaro Herrera http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Training Services

diff --git a/src/backend/tcop/utility.c b/src/backend/tcop/utility.c
index 40ac47f..e2c2d3d 100644
--- a/src/backend/tcop/utility.c
+++ b/src/backend/tcop/utility.c
@@ -507,10 +507,6 @@ standard_ProcessUtility(Node *parsetree,
 			AlterTableSpaceOptions((AlterTableSpaceOptionsStmt *) parsetree);
 			break;

-		case T_AlterTableMoveAllStmt:
-			AlterTableMoveAll((AlterTableMoveAllStmt *) parsetree);
-			break;
-
 		case T_TruncateStmt:
 			ExecuteTruncate((TruncateStmt *) parsetree);
 			break;
@@ -1296,6 +1292,10 @@ ProcessUtilitySlow(Node *parsetree,
 			AlterTSConfiguration((AlterTSConfigurationStmt *) parsetree);
 			break;

+		case T_AlterTableMoveAllStmt:
+			AlterTableMoveAll((AlterTableMoveAllStmt *) parsetree);
+			break;
+
 		case T_DropStmt:
 			ExecDropStmt((DropStmt *) parsetree, isTopLevel);
 			break;
Re: [HACKERS] proposal: plpgsql - Assert statement
On Fri, Sep 5, 2014 at 2:16 AM, Pavel Stehule pavel.steh...@gmail.com wrote: Assert is usually implemented as custom functions and used via a PERFORM statement now -- the usual current solution: PERFORM Assert(some expression) I would like to implement Assert as a plpgsql internal statement due to bigger possibilities to design syntax and internal implementation, now and in the future. More - as a plpgsql statement it should be compatible with any current code - because there should not be a collision between the SQL and PLpgSQL space. So this design doesn't break any current code. I propose the following syntax with the following ecosystem:

ASSERT [ NOTICE, WARNING, EXCEPTION ]
  [ string expression or literal - explicit message ]
  [ USING clause - same as RAISE stmt (possible in future) ]
  ( ROW_COUNT ( = | <> ) ( 1 | 0 ) |
    ( QUERY some query should not be empty ) |
    ( CHECK some expression should be true ) |
    ( IS NOT NULL expression should not be null ) )

Every variant (ROW_COUNT, QUERY, CHECK, IS NOT NULL) has its own default message. That's probably not the ugliest syntax I've *ever* seen, but it's definitely the ugliest syntax I've seen today. I previously proposed RAISE ASSERT ... WHERE, which seems like a nice variant of what we've already got, but perhaps this whole discussion merely illustrates that it's hard to get more than 1 vote for any proposal in this area. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Re: [HACKERS] ALTER TABLESPACE MOVE command tag tweak
Alvaro, * Alvaro Herrera (alvhe...@2ndquadrant.com) wrote: Stephen Frost wrote: * Alvaro Herrera (alvhe...@2ndquadrant.com) wrote: ALTER TABLE ALL IN TABLESPACE xyz which AFAICS should work since ALL is already a reserved keyword. Pushed to master and REL9_4_STABLE. Thanks. One more tweak --- the whole reason for fiddling with this is to ensure that event triggers support this operation. Therefore this node should be handled by ProcessUtilitySlow, not standard_ProcessUtility, as in the attached patch. Ah, right, of course. I recall looking for what else might need to be changed and apparently missed that distinction in the call sites. (I checked the documentation for necessary updates; turns out that the table in the event triggers chapter says that ddl_command_end etc. support the command ALTER TABLE, and since this is the tag returned by the new ALTER TABLE ALL IN TABLESPACE command, there is no update needed. In fact, one can argue that the table is wrong currently because it doesn't say that ALTER TABLE ALL IN TABLESPACE is not supported.) Heh, yes, true. I propose this for 9.4 too. Agreed. Looks pretty straight-forward, will update soon. Apologies on it taking so long - things have been a bit interesting for me over the past month or two. :) I bet they have! Have fun, Thanks! :) Stephen
Re: [HACKERS] Scaling shared buffer eviction
On 5 September 2014 14:19, Amit Kapila amit.kapil...@gmail.com wrote: On Fri, Sep 5, 2014 at 5:17 PM, Amit Kapila amit.kapil...@gmail.com wrote: Apart from the above, I think for this patch a catversion bump is required, as I have modified the system catalog. However I have not done the same in the patch, as otherwise it will be a bit difficult to take performance data. One regression failed on Linux due to a spacing issue, which is fixed in the attached patch. Here's a set of test results against this patch:

8-core AMD FX-9590, 32GB RAM

Config:
checkpoint_segments = 256
checkpoint_timeout = 15min
shared_buffers = 8GB
pgbench scale factor = 3000
run duration time = 5min

HEAD
Client Count/No. Of Runs (tps)
            8     16     32     64    128
Run-1   31568  41890  48638  49266  41845
Run-2   31613  41879  48597  49332  41736
Run-3   31674  41987  48647  49320  41745

Patch=scalable_buffer_eviction_v7.patch
Client Count/No. Of Runs (tps)
            8     16     32     64    128
Run-1   31880  42943  49359  49901  43779
Run-2   32150  43078  48934  49828  43769
Run-3   32323  42894  49481  49600  43529

-- Thom
Re: [HACKERS] bad estimation together with large work_mem generates terrible slow hash joins
On 8.9.2014 22:44, Robert Haas wrote: On Fri, Sep 5, 2014 at 3:23 PM, Tomas Vondra t...@fuzzy.cz wrote: as Heikki mentioned in his commitfest status message, this patch still has no reviewers :-( Is there anyone willing to pick that up, together with the 'dense allocation' patch (as those two are closely related)? I'm looking in Robert's direction, as he's the one who came up with the dense allocation idea, and also commented on the hashjoin bucket resizing patch ... I'd ask Pavel Stehule, but as he's sitting next to me in the office he's not really independent ;-) I'll do some reviews on the other patches over the weekend, to balance this of course. Will any of those patches be, heh heh, mine? I'll exchange some of the credit for +1. Deal? ;-) I am a bit confused by the relationship between the two patches you posted. The combined patch applies, but the other one does not. If this is a sequence of two patches, it seems like it would be more helpful to post A and B rather than B and A+B. Perhaps the other patch is on a different thread but there's not an obvious reference to said thread here. Yeah, it's probably a bit confusing. The thing is the dense allocation idea was discussed in a different thread, so I posted the patch there: http://www.postgresql.org/message-id/53cbeb8a@fuzzy.cz The patch tweaking hash join buckets builds on the dense allocation, because it (a) makes up for the memory used for more buckets and (b) it's actually easier to rebuild the buckets this way. So I only posted the separate patch for those who want to do a review, and then a complete patch with both parts combined. But it sure may be a bit confusing. In ExecChooseHashTableSize(), if buckets_bytes is independent of nbatch, could we just compute it before working out dbatch, and just deduct it from hash_table_bytes? That seems like it'd do the same thing for less work. Well, we can do that. But I really don't think that's going to make a measurable difference.
And I think the code is more clear this way. Either way, what happens if buckets_bytes is already bigger than hash_table_bytes? Hm, I don't see how that could happen. FWIW, I think the first buckets_bytes formula is actually wrong:

buckets_bytes = sizeof(HashJoinTuple) * my_log2(ntuples / NTUP_PER_BUCKET);

and should be

buckets_bytes = sizeof(HashJoinTuple) * (1 << my_log2(ntuples / NTUP_PER_BUCKET));

instead. Also, we might consider that we never use less than 1024 buckets (but that's a minor issue, I guess). But once we get into the "need batching" branch, we do this:

lbuckets = 1 << my_log2(hash_table_bytes / (tupsize * NTUP_PER_BUCKET + sizeof(HashJoinTuple)));

which includes both the bucket (pointer) and tuple size, and IMHO guarantees that bucket_bytes will never be over work_mem (which is what hash_table_bytes is). The only case where this might happen is (tupsize < 8), thanks to the my_log2 (getting (50% work_mem + epsilon), doubled to 100% work_mem). But tupsize is defined as this:

tupsize = HJTUPLE_OVERHEAD + MAXALIGN(sizeof(MinimalTupleData)) + MAXALIGN(tupwidth);

and HJTUPLE_OVERHEAD alone is 12B, MinimalTupleData is 11B (ignoring alignment) etc. So I believe this is safe ... A few more stylistic issues that I see:

+ if ((hashtable->nbatch == 1)
+ if (hashtable->nbuckets_optimal <= (INT_MAX/2))
+ if (size > (HASH_CHUNK_SIZE/8))

While I'm all in favor of using parentheses generously where it's needed to add clarity, these and similar usages seem over the top to me. It seemed more readable for me at the time of coding it, and it still seems better this way, but ... https://www.youtube.com/watch?v=CxK_nA2iVXw On a related note, the first of these reads like this if (stuff) { if (more stuff) { do things } } which makes one wonder why we need two if statements. We probably don't ...
+
+	/* used for dense allocation of tuples (into linked chunks) */
+	HashChunk	chunks_batch;	/* one list for the whole batch */
+
 } HashJoinTableData;

If the original coding didn't have a blank line between the last structure member and the closing brace, it's probably not right to make it that way now. There are similar issues at the end of some functions you modified, and a few other places (like the new code in ExecChooseHashTableSize and chunk_alloc) where there are extra blank lines at the starts and ends of blocks. I'll fix that. FWIW, those issues seem to be in the 'dense allocation' patch. +char * chunk_alloc(HashJoinTable hashtable, int size) { Eh... no. + /* XXX maybe we should use MAXALIGN(size) here ... */ +1. I'm not sure what the 'no' is pointing at? Maybe that the parenthesis should be on the next line? And is the +1 about doing MAXALIGN? regards Tomas
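The difference between my_log2(n) and 1 << my_log2(n) that Tomas corrects above is easy to see in a standalone sketch. my_log2 below has the same shape as PostgreSQL's (smallest k with 2^k >= n); the buckets_bytes_for helper and its 1024-bucket floor are illustrative, not the patch's actual code:

```c
#include <assert.h>

#define NTUP_PER_BUCKET 10

/* Same shape as PostgreSQL's my_log2(): smallest k with 2^k >= num. */
static int
my_log2(long num)
{
	int			i = 0;
	long		limit = 1;

	while (limit < num)
	{
		limit <<= 1;
		i++;
	}
	return i;
}

/*
 * Bucket-array size for a given tuple count.  Multiplying by my_log2(n)
 * alone would multiply by the *exponent* (e.g. 10), not the bucket count
 * (e.g. 1 << 10 = 1024) -- the bug discussed in the thread.
 */
static long
buckets_bytes_for(long ntuples)
{
	long		nbuckets = 1L << my_log2(ntuples / NTUP_PER_BUCKET);

	if (nbuckets < 1024)		/* the minimum mentioned in the thread */
		nbuckets = 1024;
	return (long) sizeof(void *) * nbuckets;
}
```

With 10240 tuples the broken formula would charge sizeof(pointer) * 10 bytes for the bucket array, three orders of magnitude less than the sizeof(pointer) * 1024 actually needed — which is exactly why the accounting had to use the shifted value.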
Re: [HACKERS] PQputCopyEnd doesn't adhere to its API contract
On Mon, Sep 8, 2014 at 11:45 AM, Robert Haas [via PostgreSQL] ml-node+s1045698n5818200...@n5.nabble.com wrote: On Thu, Sep 4, 2014 at 6:38 PM, David Johnston [hidden email] wrote: One of the trade-offs I mentioned... it's more style than anything, but removing the parenthetical (if there is no error in the command) and writing it more directly seemed preferable in an overview such as this. Better: The function will either throw an error or return a PGresult object[...] Nope. This is not C++, nor is it the backend. It will not throw anything. The current sentence reads: The response to this (if there is no error in the command) will be a PGresult object bearing a status code of PGRES_COPY_OUT or PGRES_COPY_IN (depending on the specified copy direction). Maybe this is taken for granted by others, and thus can be excluded here, but I'm trying to specify what happens if there is an error in the command... Apparently one does not get back a PGresult object... Would simply saying: A successful response to this will be a PGresult object... be accurate (enough)? I appreciate the time you have taken on this and will look at my thoughts with the new understanding you have given me. Thank You! Thanks for getting involved. I apologize if I was brusque here or at other times in the past (or future). Unfortunately I've been insanely busy and it doesn't bring out the best in me, but I really do value your (and everyone's) efforts to move PostgreSQL forward. The tone of all your responses, including the first one pointing out my lack of supporting context and initially over-ambitious patch, has been very professional. A lot of what you've said resonates since I think subconsciously I understood that I needed to make fundamental changes to my approach and style but it definitely helps to have someone point out specific items and concerns to move those thoughts to the conscious part of the mind. Thank you again for doing that for me.
Ignoring style and such, did anything I wrote help to clarify your understanding of why pqPutCopyEnd does what it does? As I say this and start to try and read the C code, I think that it is a good source for understanding novice assumptions, but there is a gap between the docs and the code - though I'm not sure you've identified the only (or even proper) location. Unlike PQputCopyData there is no explicit logic (pqIsnonblocking(conn)) in PQputCopyEnd for non-blocking mode (there is an implicit one via pqFlush). This does seem like an oversight - if a minor one, since the likelihood of not being able to add the EOF to the connection's buffer seems highly unlikely - but I would expect, on the basis of symmetry alone, for both of them to have buffer-filled testing logic. And, depending on how large *errormsg is, the risk of being unable to place the data in the buffer isn't as small as in the expected EOF case. I'm getting way beyond my knowledge level here, but my assumption from the documentation was that the async mode behavior of returning zero revolved around retrying in order to place the data into the buffer - not retrying to make sure the buffer is empty. The caller deals with that on their own based upon their blocking mode decision. Though we would want to call pqFlush for blocking mode callers here since this function should block until the buffer is clear. So, I thought I agreed with your suggestion that if the final pqFlush returns a 1 then pqPutCopyEnd should return 0. Additionally, if in non-blocking mode, and especially if *errormsg is not NULL, the available connection buffer length logic in pqPutCopyData should be evaluated here as well. However, the 0 being returned due to the pqFlush really isn't anything special for a non-blocking mode user (they have already been told to call pqFlush themselves after calling pqPutCopyEnd) and doesn't apply for blocking mode.
Admittedly they could skip their call if they get a return value of 1 from pqPutCopyEnd but I'm not sure recommending that optimization has much going for it. And, again as you said (I am just discovering this for myself as much as possible), if the pqFlush caused the 0 you would not want to retry whereas in the filled buffer version you would. So we do have to pick one or the other situation and adjust the documentation accordingly. The most useful and compatible solution is to make pqPutCopyEnd synchronous regardless of the user selected blocking mode - which it mostly is now but the final pqFlush should be in a loop - and adjust the documentation in the areas noted in my patch-email to accommodate that fact. Regardless, the logic surrounding placing the *errormsg string into the buffer needs affirmation and a note whether it will block waiting to be put on a full connection buffer. Note that non-blocking users seeing a 1 on the pqPutCopyEnd probably still end up doing their own pqFlush looping calls but
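The flush-retry contract being debated above — a flush that returns 1 for "more to send, try again when the socket is writable" and 0 for "done" — can be mocked without libpq. Nothing below uses real libpq: mock_flush and finish_copy are hypothetical stand-ins whose return conventions merely mirror PQflush()'s (0 success, 1 retry, -1 error), and a pretend counter plays the role of buffered COPY data:

```c
#include <assert.h>

/* Pretend send buffer: number of chunks still queued for the socket. */
static int	pending_chunks = 3;

/*
 * Stand-in for PQflush(): drains one chunk per call, returning 1 while
 * data remains buffered, 0 once everything has been "sent".
 */
static int
mock_flush(void)
{
	if (pending_chunks > 0)
	{
		pending_chunks--;
		return pending_chunks > 0 ? 1 : 0;	/* 1 = call again later */
	}
	return 0;
}

/*
 * The caller-side loop the thread recommends after ending a COPY in
 * non-blocking mode: keep flushing until 0 (drained) or -1 (error).
 * Returns 0 on success, -1 on error, mirroring the libpq convention.
 */
static int
finish_copy(void)
{
	int			rc;

	while ((rc = mock_flush()) == 1)
	{
		/*
		 * Real code would wait here for the socket to become writable
		 * (select/poll/epoll) instead of spinning.
		 */
	}
	return rc;
}
```

The sketch makes the asymmetry in the discussion concrete: a return of 1 from the flush step means "retry the flush", whereas the question in the thread is whether the end-of-copy call itself should ever report that retryable state or absorb it internally.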
Re: [HACKERS] PQputCopyEnd doesn't adhere to its API contract
On Mon, Sep 8, 2014 at 6:19 PM, David Johnston david.g.johns...@gmail.com wrote: On Mon, Sep 8, 2014 at 11:45 AM, Robert Haas [via PostgreSQL] ml-node+s1045698n5818200...@n5.nabble.com wrote: On Thu, Sep 4, 2014 at 6:38 PM, David Johnston wrote:

One of the trade-offs I mentioned... it's more style than anything, but removing the parenthetical (if there is no error in the command) and writing it more directly seemed preferable in an overview such as this. Better: The function will either throw an error or return a PGresult object [...]

Nope. This is not C++, nor is it the backend. It will not throw anything.

The current sentence reads: The response to this (if there is no error in the command) will be a PGresult object bearing a status code of PGRES_COPY_OUT or PGRES_COPY_IN (depending on the specified copy direction). Maybe this is taken for granted by others, and thus can be excluded here, but I'm trying to specify what happens if there is an error in the command... Apparently one does not get back a PGresult object... Would simply saying: A successful response to this will be a PGresult object... be accurate (enough)?

Apparently, NULL is the correct answer. Can that just be assumed to be the case, or does saying that a failure scenario here returns NULL (or something other than a PGresult object) impart useful knowledge?

Dave
Re: [HACKERS] Scaling shared buffer eviction
On Mon, Sep 8, 2014 at 10:12 PM, Merlin Moncure mmonc...@gmail.com wrote: On Fri, Sep 5, 2014 at 6:47 AM, Amit Kapila amit.kapil...@gmail.com wrote:

Client Count:   8      16      32      64     128
HEAD        58614  107370  140717  104357   65010
Patch       60092  113564  165014  213848  216065

This data (tps) is the median of 3 runs; a detailed report is attached with the mail. I have not repeated the test for all configurations, as there is no major change in design/algorithm that could affect performance. Mark has already taken tpc-b data, which ensures there is no problem with it; however, I will also take it once with the latest version.

Well, these numbers are pretty much amazing.

Question: It seems there's obviously quite a bit of contention left; do you think there's still a significant amount of time in the clock sweep, or is the primary bottleneck the buffer mapping locks?

I think contention around the clock sweep is very minimal, and for buffer mapping locks it has reduced significantly (you can refer upthread to some of the LWLock stat data I posted after this patch), but maybe we can get more out of it by improving hash table concurrency. However, apart from both of the above, the next thing I have seen in profiles was palloc (at least that is what I remember from profiling I did a few months back during development of this patch). That seemed like a totally different optimisation at the time, so I left it for another patch. Another point is that the machine on which I am doing performance tests has 16 cores (64 hardware threads), so I am not sure how much scaling we can expect.

With Regards, Amit Kapila. EnterpriseDB: http://www.enterprisedb.com
Re: [HACKERS] proposal: plpgsql - Assert statement
On 09/05/2014 05:21 PM, Pavel Stehule wrote: *shrug* Doing it in SQL would probably break more stuff. I'm trying to contain the damage. And arguably, this is mostly only useful in PL/pgSQL.

I've wanted assertions in SQL enough that I often write trivial wrappers around `raise` in PL/pgSQL for use in `CASE` statements etc.

-- Craig Ringer http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Training & Services -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] proposal: plpgsql - Assert statement
On 09/09/2014 05:20 AM, Robert Haas wrote: I previously proposed RAISE ASSERT ... WHERE, which seems like a nice variant of what we've already got, but perhaps this whole discussion merely illustrates that it's hard to get more than one vote for any proposal in this area.

Well, you have Petr's for RAISE EXCEPTION ... WHEN, and I'd also like that or RAISE ASSERT ... WHEN. Much (much) saner than the other proposals on this thread, IMO.

-- Craig Ringer http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Training & Services