Re: [PERFORM] Massive performance issues

Arjen van der Meijden Thu, 01 Sep 2005 13:58:38 -0700

On 1-9-2005 19:42, Matthew Sackman wrote:

Obviously, to me, this is a problem, I need these queries to be under a
second to complete. Is this unreasonable? What can I do to make this "go
faster"? I've considered normalising the table but I can't work out
whether the slowness is in dereferencing the pointers from the index
into the table or in scanning the index in the first place. And
normalising the table is going to cause much pain when inserting values
and I'm not entirely sure if I see why normalising it should cause a
massive performance improvement.

In this case, I think normalising will give a major decrease in on-disktable-size of this large table and the indexes you have. If that's thecase, that in itself will speed-up all i/o-bound queries quite a bit.

locality_1, _2, city and county can probably be normalised away withoutmuch problem, but going from varchar's to integers will probably safeyou quite a bit of (disk)space.

But since it won't change the selectivity of indexes, so you won't getmore index-scans instead of sequential scans, I suppose.I think its not that hard to create a normalized set of tables from thisdata-set (using insert into tablename select distinct ... from addressand such, insert into address_new (..., city) select ... (select cityidfrom cities where city = address.city) from address)So its at least relatively easy to figure out the performanceimprovement from normalizing the dataset a bit.

If you want to improve your hardware, have a look at the Western DigitalRaptor-series SATA disks, they are fast scsi-like SATA drives. You mayalso have a look at the amount of memory available, to allow cachingthis (entire) table.


Best regards,

Arjen

---------------------------(end of broadcast)---------------------------
TIP 9: In versions below 8.0, the planner will ignore your desire to
      choose an index scan if your joining column's datatypes do not
      match

Re: [PERFORM] Massive performance issues

Reply via email to