Another vote for R. huge (100s MB) tables of tab-delimited data on which I would like to do > some math (averaging, sigmas, simple arithmetic, etc) as well as some > sorting and rejecting
This is precisely what R is meant for. It was mentioned that it isn't so good at really large datasets, but a few hundreds of MB is only huge for Excel. It is also cross platform. -- David On 12 September 2012 15:32, Jacob Keller <[email protected]>wrote: > Dear List, > > since this probably comes up a lot in manipulation of pdb/reflection files > and so on, I was curious what people thought would be the best language for > the following: I have some huge (100s MB) tables of tab-delimited data on > which I would like to do some math (averaging, sigmas, simple arithmetic, > etc) as well as some sorting and rejecting. It can be done in Excel, but > this is exceedingly slow even in 64-bit, so I am looking to do it through > some scripting. Just as an example, a "sort" which takes >10 min in Excel > takes ~10 sec max with the unix command sort (seems crazy, no?). Any > suggestions? > > Thanks, and sorry for being off-topic, > > Jacob > > -- > ******************************************* > Jacob Pearson Keller > Northwestern University > Medical Scientist Training Program > email: [email protected] > ******************************************* >
