Dear Wiki user, You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.
The following page has been changed by Min Zhou: http://wiki.apache.org/hadoop/PoweredBy ------------------------------------------------------------------------------ * Run Naive Bayes classifiers in parallel over crawl data to discover event information * [http://code.google.com/p/redpoll/ Redpoll] - * Hardware: 35 nodes (2*4cpu 10TB disk 16G memory each) + * Hardware: 35 nodes (2*4cpu 10TB disk 16GB RAM each) * We intent to parallelize some traditional classification, clustering algorithms like Naive Bayes, K-Means, EM so that can deal with large-scale data sets. ''When applicable, please include details about your cluster hardware and size.''
