Re: [ADMIN] Index error on recovery

K P Manoj Thu, 22 Dec 2011 20:52:22 -0800

Hi

Could any one else give any pointers, or is there something obviously
missing in this request / thread ?. Index "IND_tbl_1_ObjId" and
  "IND_tbl_1_Act_ObjId" are B-Tree Index.


Thanks
Manoj

2011/12/19 K P Manoj <[email protected]>

> No, all my index B-Tree Index
>
> Thanks
> Manoj K P
>
>
> 2011/12/18 杨晓青 <[email protected]>
>
>> is your index  a hash index? xlog does not include the changes on hash
>> index,your must rebuild hash index after recovery from xlog.
>>
>>
>> 2011/12/16 K P Manoj <[email protected]>
>>
>>> Hi
>>>
>>> The details given below is for a test server, that I brought back up
>>> from the base (after running a day worth wals and by giving
>>> recovery_target_time before starting the recovery). The recovery went fine
>>> and the DB came online as expected. But I see that although trivial select
>>> queries seem to work well (while using indexes of the table concerned) it
>>> throws up errors which indicate that the index is corrupted when we try to
>>> run a query that requires access to all rows of the table.
>>>
>>> My question is that if this is index corruption (seems like it) where
>>> did we go wrong with the recovery (the live server is still working fine
>>> without any such error in the live-server-logs)? Do we need to reindex
>>> after such a recovery (I thought that was required only when we do
>>> something like pgresetxlog)? If this isn't a fault with recovery, does it
>>> mean that the primary DB (which is currently active) need strict checking?
>>> Any way we could do that, without bringing down the DB (i.e. anything
>>> besides a PGDump)?
>>>
>>> Thanks
>>> Manoj K P
>>>
>>>
>>> ============================
>>>
>>> PG Version:
>>> ==========
>>> mydb=# SELECT  version () ;
>>>                                                     version
>>>
>>>
>>> ------------------------------------------------------------------------------------------------------------------
>>> PostgreSQL 8.4.9 on x86_64-redhat-linux-gnu, compiled by GCC gcc (GCC)
>>> 4.1.2 20080704 (Red Hat 4.1.2-50), 64-bit
>>>
>>>
>>>
>>> Successful WAL Recovery:
>>> =======================
>>>
>>> Dec  6 14:25:47 database_host_name postgres[19329]: [3150-1] 2011-12-06
>>> 14:25:47.207 GMT 19329 LOG:  restored log file "000000010000011000000013"
>>> from archive
>>> Dec  6 14:25:48 database_host_name postgres[19329]: [3151-1] 2011-12-06
>>> 14:25:48.540 GMT 19329 LOG:  restored log file "000000010000011000000014"
>>> from archive
>>> Dec  6 14:25:49 database_host_name postgres[19329]: [3152-1] 2011-12-06
>>> 14:25:49.833 GMT 19329 LOG:  restored log file "000000010000011000000015"
>>> from archive
>>> Dec  6 14:25:51 database_host_name postgres[19329]: [3153-1] 2011-12-06
>>> 14:25:51.755 GMT 19329 LOG:  restored log file "000000010000011000000016"
>>> from archive
>>> Dec  6 14:25:53 database_host_name postgres[19329]: [3154-1] 2011-12-06
>>> 14:25:53.155 GMT 19329 LOG:  restored log file "000000010000011000000017"
>>> from archive
>>> Dec  6 14:25:54 database_host_name postgres[19329]: [3155-1] 2011-12-06
>>> 14:25:54.462 GMT 19329 LOG:  restored log file "000000010000011000000018"
>>> from archive
>>> Dec  6 14:25:54 database_host_name postgres[19329]: [3156-1] 2011-12-06
>>> 14:25:54.570 GMT 19329 LOG:  recovery stopping before commit of transaction
>>> 19723746, time 2011-11-24 12:39:00.005756+00
>>> Dec  6 14:25:54 database_host_name postgres[19329]: [3157-1] 2011-12-06
>>> 14:25:54.570 GMT 19329 LOG:  redo done at 110/18F069E8
>>> Dec  6 14:25:54 database_host_name postgres[19329]: [3158-1] 2011-12-06
>>> 14:25:54.570 GMT 19329 LOG:  last completed transaction was at log time
>>> 2011-11-24 12:38:59.966238+00
>>> Dec  6 14:25:54 database_host_name postgres[19329]: [3159-1] 2011-12-06
>>> 14:25:54.597 GMT 19329 LOG:  selected new timeline ID: 2
>>> Dec  6 14:25:54 database_host_name postgres[19329]: [3160-1] 2011-12-06
>>> 14:25:54.623 GMT 19329 LOG:  archive recovery complete
>>> Dec  6 14:25:55 database_host_name postgres[19382]: [20-1] 2011-12-06
>>> 14:25:55.845 GMT 19382 LOG:  restartpoint complete: wrote 396106 buffers
>>> (37.8%); write=318.653 s, sync=0.000 s, total=318.654 s
>>> Dec  6 14:25:55 database_host_name postgres[19382]: [21-1] 2011-12-06
>>> 14:25:55.845 GMT 19382 LOG:  recovery restart point at 10E/FAE0DFA8
>>> Dec  6 14:25:55 database_host_name postgres[19382]: [22-1] 2011-12-06
>>> 14:25:55.848 GMT 19382 LOG:  checkpoint starting: end-of-recovery immediate
>>> wait
>>> Dec  6 14:26:01 database_host_name postgres[19382]: [23-1] 2011-12-06
>>> 14:26:01.484 GMT 19382 LOG:  checkpoint complete: wrote 376660 buffers
>>> (35.9%); 0 transaction log file(s) added, 0 removed, 0 recycled;
>>> write=5.635 s, sync=0.000 s, total=5.636 s
>>> Dec  6 14:26:04 database_host_name postgres[19328]: [1-1] 2011-12-06
>>> 14:26:04.340 GMT 19328 LOG:  database system is ready to accept connections
>>>
>>>
>>> Basic Querying uses Index and works fine:
>>> ========================================
>>>
>>> mydb =# EXPLAIN  SELECT  * from tb1_1  order by "ObjId" desc limit 10 ;
>>>
>>>                                                              QUERY PLAN
>>>
>>>
>>>
>>> ------------------------------------------------------------------------------------------------------------------------------
>>> --------
>>> Limit  (cost=0.00..7.41 rows=10 width=28)
>>>   ->  Index Scan Backward using "IND_tbl_1_ObjId" on tbl_1
>>>  (cost=0.00..1174428208.24 rows=1584137511 width=28)
>>>
>>>
>>>
>>> mydb=# SELECT  * from users_activity  order by "ObjId" desc limit 10 ;
>>> ObjId      |         date                    | Usr    | Obj_typ      |
>>>  Id     | Act_id   | new_id
>>>
>>> ------------+---------------------------------+--------+--------------+-----------+----------+----------------
>>> 1907972745 | Thu 24 Nov 07:38:59.013068 2011 |    272 |            2 |
>>> 479937676 |        1 |
>>> 1907972743 | Thu 24 Nov 07:38:59.829819 2011 |    272 |            2 |
>>> 350683600 |        1 |
>>> 1907972742 | Thu 24 Nov 07:38:59.679056 2011 |      0 |            2 |
>>> 479937674 |       11 |
>>> 1907972741 | Thu 24 Nov 07:38:59.550655 2011 |    272 |            2 |
>>> 423704886 |        1 |
>>> 1907972740 | Thu 24 Nov 07:38:59.355022 2011 |    239 |            2 |
>>> 441683722 |        1 |
>>> 1907972739 | Thu 24 Nov 07:38:59.40387 2011  |    272 |            2 |
>>> 1061714 |        1 |
>>> 1907972738 | Thu 24 Nov 07:38:59.262335 2011 |    272 |            2 |
>>> 350683588 |        1 |
>>> 1907972737 | Thu 24 Nov 07:38:59.146628 2011 |    272 |            2 |
>>> 340702121 |        1 |
>>> 1907972736 | Thu 24 Nov 07:38:59.102374 2011 |    239 |            2 |
>>> 467763707 |        1 |
>>> 1907972735 | Thu 24 Nov 07:38:59.06309 2011  |    272 |            2 |
>>> 423704873 |        1 |
>>> (10 rows)
>>>
>>>
>>>
>>>
>>> Accessing Each Row:
>>> ==================
>>>
>>> mydb=# ALTER TABLE tbl_1 ADD new_id bigint ;
>>> mydb=# EXPLAIN UPDATE tbl_1 SET new_id=id ;
>>>                                  QUERY PLAN
>>>
>>>
>>> -------------------------------------------------------------------------------
>>> Seq Scan on tbl_1  (cost=0.00..30853774.16 rows=1554095373 width=34)
>>> (1 row)
>>>
>>>
>>> mydb=# UPDATE tbl_1 SET new_id=id ;
>>>
>>> ERROR:  right sibling's left-link doesn't match: block 41923 links to
>>> 4268638 instead of expected 41922 in index "IND_tbl_1_ObjId"
>>>
>>>
>>> Further, tried to work around the Index error:
>>> =============================================
>>> mydb=# DROP INDEX "IND_tb_1_ObjId" ;
>>> mydb=# UPDATE tbl_1 SET new_id=id ;
>>>
>>> ERROR:  right sibling's left-link doesn't match: block 52886 links to
>>> 4266706 instead of expected 52885 in index "IND_tbl_1_Act_ObjId"
>>>
>>> But now we have an error with the next index on the same table.
>>> ==============================================================
>>>
>>> Other notes:
>>> 1. Primary server was running when the base backup was taken.
>>> 2. Base backup was taken with start / stop and recovery time was 1-2
>>> days after that, so enough logs were played on the standby server.
>>>
>>>
>>>
>>
>>
>> --
>> -------------------------------------------
>> best regards
>>
>> 杨晓青
>>
>
>

Re: [ADMIN] Index error on recovery

Reply via email to