All,

I've written a plugin for Hadoop's MapReduce implementation that allows MapReduce jobs to be run over data stored in Tahoe. I've tested it on machines from Amazon's Elastic Compute Cloud (EC2), and I am currently gathering performance information.
I'm hoping to highlight the security issues of public cloud computing services and to spark new thinking about solutions. Tahoe enables the creation of discrete domains for compute consumers, making it easier for multiple departments within an organization, or multiple organizations on a community cloud, to do massive data analysis and share results.

More details and code can be found at http://hadoop-lafs.googlecode.com. The code will be licensed under the Apache open source license, and I hope it will be included in the Hadoop distribution.

Help, suggestions, comments, and questions are welcome.

- Aaron Cordova
_______________________________________________
tahoe-dev mailing list
http://allmydata.org/cgi-bin/mailman/listinfo/tahoe-dev
