Re: [PERFORM] SSD options, small database, ZFS

CSS Thu, 17 Nov 2011 19:45:20 -0800

Resurrecting this long-dormant thread...

On Oct 14, 2011, at 6:41 AM, Arjen van der Meijden wrote:


> On 14-10-2011 10:23, CSS wrote:
>> -I'm calling our combined databases at 133GB "small", fair
>> assumption?  -Is there any chance that a server with dual quad core
>> xeons, 32GB RAM, and 2 or 4 SSDs (assume mirrored) could be slower
>> than the 4 old servers described above?  I'm beating those on raw
>> cpu, quadrupling the amount of RAM (and consolidating said RAM),
>> and going from disks that top out at 4x300 IOPS with SSDs that
>> conservatively should provide 2000 IOPS.
> 
> Whether 133GB is small or not probably mostly depends on how much of it is 
> actually touched during use. But I'd agree that it isn't a terribly large 
> database, I'd guess a few simple SSDs would be plenty to achieve 2000 IOPs. 
> For lineair writes, they're still not really faster than normal disks, but if 
> that's combined with random access (either read or write) you ought to be ok.
> We went from 15x 15k sas-disks to 6x ssd several years back in our MySQL-box, 
> but since we also increased the ram from 16GB to 72GB, the io-load dropped so 
> much the ssd's are normally only lightly loaded...

Thanks for your input on this.  It's taken some time, but I do finally have 
some hardware on hand  (http://imgur.com/LEC5I) and as more trickles in over 
the coming days, I'll be putting together our first SSD-based postgres box.  I 
have much testing to do, and I'm going to have some questions regarding that 
subject in another thread.

> Btw, the 5500 and 5600 Xeons are normally more efficient with a multiple of 6 
> ram-modules, so you may want to have a look at 24GB (6x4), 36GB (6x4+6x2) or 
> 48GB (12x4 or 6x8) RAM.

Thanks - I really had a hard time wrapping my head around the rules on 
populating the banks.  If I understand it correctly, this is due to the memory 
controller moving from the south(?)bridge to being integrated in the CPU.

> Given the historical questions on the list, there is always a risk of getting 
> slower queries with hardware that should be much faster. For instance, the 
> huge increase in RAM may trigger a less efficient query-plan. Or the disks 
> abide by the flush-policies more correctly.
> Assuming the queries are still getting good plans and there are no such 
> special differences, I'd agree with the assumption that its a win on every 
> count.
> Or your update to a newer OS and PostgreSQL may trigger some worse query plan 
> or hardware-usage.

That's an interesting point, I'd not even considered that.  Is there any sort 
of simple documentation on the query planner that might cover how things like 
increased RAM could impact how a query is executed?

>> -Should I even be looking at the option of ZFS on SATA or low-end
>> SAS drives and ZIL and L2ARC on SSDs?  Initially this intrigued me,
>> but I can't quite get my head around how the SSD-based ZIL can deal
>> with flushing the metadata out when the whole system is under any
>> sort of extreme write-heavy load - I mean if the ZIL is absorbing
>> 2000 IOPS of metadata writes, at some point it has to get full as
>> it's trying to flush this data to much slower spinning drives.
> 
> A fail-safe set-up with SSD's in ZFS assumes at least 3 in total, i.e. a pair 
> of SSD's for ZIL and as many as you want for L2ARC. Given your database size, 
> 4x160GB SSD (in "raid10") or 2x 300GB should yield plenty of space. So given 
> the same choice, I wouldn't bother with a set of large capacity sata disks 
> and ZIL/L2ARC-SSD's, I'd just go with 4x160GB or 2x300GB SSD's.

Well, I've bought 4x160GB, so that's what I'll use.  I will still do some tests 
with two SATA drives plus ZIL, just to see what happens.

> 
>> -Should my standby box be the same configuration or should I look
>> at actual spinning disks on that?  How rough is replication on the
>> underlying storage?  Would the total data written on the slave be
>> less or equal to the master?
> 
> How bad is it for you if the performance of your database potentially drops a 
> fair bit when your slave becomes the master? If you have a read-mostly 
> database, you may not even need SSD's in your master-db (given your amount of 
> RAM). But honestly, I don't know the answer to this question :)

It's complicated - during the day we're mostly looking at very scattered reads 
and writes, probably a bit biased towards writes.  But each evening we kick off 
a number of jobs to pre-generate stats for more complex queries...  If the job 
could still complete in 6-8 hours, we'd probably be OK, but if it starts 
clogging up our normal queries during the day, that would be a problem.

Thanks again for your input!

Charles

> 
> Good luck with your choices,
> Best regards,
> 
> Arjen
> 
> -- 
> Sent via pgsql-performance mailing list ([email protected])
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-performance


-- 
Sent via pgsql-performance mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

Re: [PERFORM] SSD options, small database, ZFS

Reply via email to