On 07/30/1999 09:34 -0700, Roeland M.J. Meyer wrote:
>> > Actually, it might be useful to consider several different
>> > cases (you mentioned
>> > 1 and 4, but there are a couple other common cases):
>> > 1) RDBMS raw block device usage
>> > 2) small-file file system (i.e. news server)
>> > 3) large-file file system (i.e. images, archives, data
>> > warehousing, ftp servers)
>> > 4) general purpose file system (somewhere between 2 and 3)
>> >
>>
>> Data warehousing is the RDBMS case, usually done with exactly the same
>> hardware and software, but different optimizations. The schemas are, of
>> course much different (star queries are a PITA!). Much of it depends on
>> how the RDBMS is implemented. However, I don't think anyone does RAW
Don't know about under Linux, but I know of a number of sites still using
raw for databases (granted, I don't think any of them are on Oracle, they're
all either Sybase or Informix).
>> anymore. Image archives, for anyone doing a production implementation,
>> isuslaly served up as BLOBs, under an RDBMS (back to the RDBMS case).
I wasn't thinking of images only in the graphical sense - we've got a server
where we keep large images of databases (kind of a fast rollback/recovery
solution - don't ask).
>> This gives us case 1 and 2 as special/rare-use cases and drops RDBMS and
>> FTP servers into case 3.
>>
>> What do you think?
Another way of looking at it is:
1) constant (or near constant, anyway) medium IO block size mostly
sequential access (corresponds to 1 above)
2) small random access (corresponds to 2 above)
3) large sequential access (3 above)
4) random size random access (4 above)
I guess I might concur that 1 may have features in common with any of 2-4,
depending on what exactly the DBMS is doing - small vs large tables,
small vs large updates, etc. Where 1 tends to be different, is in the
fact that the kernel buffer cache is usually bypassed in this case, so
the characteristic I/O patterns may be significantly different because
of the use of different caching and write gathering algorithms. I don't
know if there's a big enough difference to worry about, but I don't know
there isn't either. That's why I always recommend benchmarking several
different configs.
I still think 2-4 are sufficiently different, though - 2 and 3 are very
optimized for particular purposes, and 4 is less optimized, just designed
so that everything works as well as can be expected.
--
+------------------------------+--------------------------+
| Tim Walberg | Phone: 847-782-2472 |
| TERAbridge Technologies Corp | FAX: 847-623-1717 |
| 1375 Tri-State Parkway | [EMAIL PROTECTED] |
| Gurnee, IL 60031 | 800-SKY-TEL2 PIN 9353299 |
+------------------------------+--------------------------+
PGP signature