Distributed RapidMiner

Michael Wurst Mon, 11 Feb 2008 11:54:38 -0800

Hi all,

I am co-developer of RapidMiner (formerly Yale,http://www.rapidminer.com). Isa Drost pointet the Mahout project out tome. We are currently working on a distributed version of RapidMiner thatwill run in a cluster. The focus is, however, very different from theone of Mahout. We are focusing on distributed databases and highlycomplex tasks, such as evolutionary computing for feature selection orparameter optimization. These task require to perform many training andevaluation cycles on variants of the same data mining task (e.g. usingdifferent feature sets). Therefore, we decided not to use map/reduce forthis kind of application but some more traditional methods ofdistributed data mining, task distribution and scheduling.

Creating a map/reduce based data mining library/system is sure highlyrelevant. I'm looking forward to the first results of Mahout! If youneed any assistance, let me know.


Cheers,
Michael

Distributed RapidMiner

Reply via email to