Anyone want to play? The goal is to find a small program that quickly computes some statistics over 45GB of log data on a 32-core box. Hadoop seems like a good candidate. Streaming? Pig? Java?

http://www.tbray.org/ongoing/When/200x/2008/05/01/Wide-Finder-2

Doug
