Does anyone have recommendations for hardware and/or OS to work with around 5TB datasets?
The data is for analysis, so there is virtually no inserting besides a big bulk load. Analysis involves full-database aggregations - mostly basic arithmetic and grouping. In addition, much smaller subsets of data would be pulled and stored to separate databases.
I have been working with datasets no bigger than around 30GB, and that (I'm afraid to admit) has been in MSSQL.