We are already using Thrift to move and store our log data and I'm looking onto how I could read the stored log data into MapReduce processes. This article http://www.lexemetech.com/2008/07/rpc-and-serialization-with-hadoop.html talks about using Thrift for the IO, but it doesn't say anything specific.
What's the current status of Thrift with Hadoop? Is there any documentation online or even some code in the SVN which I could look into? - Juho Mäkinen
