Author: srowen
Date: Wed Oct 20 10:57:43 2010
New Revision: 1025493
URL: http://svn.apache.org/viewvc?rev=1025493&view=rev
Log:
Add 0.4 release news to old and new sites
Modified:
mahout/site/new_website/index.html
mahout/site/src/documentation/content/xdocs/index.xml
Modified: mahout/site/new_website/index.html
URL:
http://svn.apache.org/viewvc/mahout/site/new_website/index.html?rev=1025493&r1=1025492&r2=1025493&view=diff
==============================================================================
--- mahout/site/new_website/index.html (original)
+++ mahout/site/new_website/index.html Wed Oct 20 10:57:43 2010
@@ -154,6 +154,36 @@
Interested in helping? See the Wiki or send us an email. Also
note, we are just getting off the ground, so please be patient as we get the
various infrastructure pieces in place.
<h2>Mahout News</h2>
+
+ <h3 class="boxed">21 October 2010 - Apache Mahout 0.4 released</h3>
+ <p>We are pleased to announce release 0.4 of Mahout. Virtually every
corner of the project has changed,
+ and significantly, since 0.3. Developers are invited to use and depend
on version 0.4 even as yet more
+ change is to be expected before the next release. Highlights include:
+ <ul>
+ <li>Model refactoring and CLI changes to improve integration and
consistency</li>
+ <li>New ClusterEvaluator and CDbwClusterEvaluator offer new ways
to evaluate clustering effectiveness</li>
+ <li>New Spectral Clustering and MinHash Clustering (still
experimental)</li>
+ <li>New VectorModelClassifier allows any set of clusters to be
used for classification</li>
+ <li>Map/Reduce job to compute the pairwise similarities of the
rows of a matrix using a customizable similarity measure</li>
+ <li>Map/Reduce job to compute the item-item-similarities for
item-based collaborative filtering</li>
+ <li>RecommenderJob has been evolved to a fully distributed
item-based recommender</li>
+ <li>Distributed Lanczos SVD implementation</li>
+ <li>More support for distributed operations on very large
matrices</li>
+ <li>Easier access to Mahout operations via the command line</li>
+ <li>New HMM based sequence classification from GSoC (currently as
sequential version only and still experimental)</li>
+ <li>Sequential logistic regression training framework</li>
+ <li>New SGD classifier</li>
+ <li>Experimental new type of NB classifier, and feature reduction
options for existing one</li>
+ <li>New vector encoding framework for high speed vectorization
without a pre-built dictionary</li>
+ <li>Additional elements of supervised model evaluation
framework</li>
+ <li>Promoted several pieces of old Colt framework to tested status
(QR decomposition, in particular)</li>
+ <li>Can now save random forests and use it to classify new
data</li>
+ <li>Many, many small fixes, improvements, refactorings and
cleanup</li>
+ </ul>
+ <p>Details on what's included can be found in the <a
href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12310751&styleName=Html&version=12314281">release
notes</a>.
+ Downloads are available from the <a
href="http://www.apache.org/dyn/closer.cgi/lucene/mahout/">Apache Mirrors</a>.
+ </p>
+
<h3 class="boxed">29 March 2010 - Google Summer Of Code
Projects</h3> <p>Its Summer of Code time again and ASF is accepting proposals
from students.
Mahout has a number of people willing to be mentors, so if you are a
student interested in working on machine learning algorithms using Hadoop or
Modified: mahout/site/src/documentation/content/xdocs/index.xml
URL:
http://svn.apache.org/viewvc/mahout/site/src/documentation/content/xdocs/index.xml?rev=1025493&r1=1025492&r2=1025493&view=diff
==============================================================================
--- mahout/site/src/documentation/content/xdocs/index.xml (original)
+++ mahout/site/src/documentation/content/xdocs/index.xml Wed Oct 20 10:57:43
2010
@@ -6,8 +6,8 @@
</title>
</header>
<body>
- <section id="Apache Lucene">
- <title>Apache Lucene Mahout</title>
+ <section id="Apache Mahout">
+ <title>Apache Mahout</title>
<p>Mahout's goal is to build scalable machine learning libraries. With
scalable we mean:
<ul><li>Scalable to reasonably large data sets. Our core algorithms
for clustering, classfication and batch based collaborative filtering are
implemented on top of Apache Hadoop using the map/reduce paradigm. However we
do not restrict contributions to Hadoop based implementations: Contributions
that run on a single node or on a non-Hadoop cluster are welcome as well. The
core libraries are highly optimized to allow for good performance also for
non-distributed algorithms.</li>
<li>Scalable to support your business case. Mahout is distributed
under a commercially friendly Apache Software license.</li>
@@ -38,6 +38,42 @@
</section>
<section>
+ <title>21 October 2010 - Apache Mahout 0.4 released</title>
+ <p>We are pleased to announce release 0.4 of Mahout. Virtually every
corner of the project has changed,
+ and significantly, since 0.3. Developers are invited to use and depend
on version 0.4 even as yet more
+ change is to be expected before the next release.</p>
+ <p>
+ Highlights include:
+ <ul>
+ <li>Model refactoring and CLI changes to improve integration and
consistency</li>
+ <li>New ClusterEvaluator and CDbwClusterEvaluator offer new ways
to evaluate clustering effectiveness</li>
+ <li>New Spectral Clustering and MinHash Clustering (still
experimental)</li>
+ <li>New VectorModelClassifier allows any set of clusters to be
used for classification</li>
+ <li>Map/Reduce job to compute the pairwise similarities of the
rows of a matrix using a customizable similarity measure</li>
+ <li>Map/Reduce job to compute the item-item-similarities for
item-based collaborative filtering</li>
+ <li>RecommenderJob has been evolved to a fully distributed
item-based recommender</li>
+ <li>Distributed Lanczos SVD implementation</li>
+ <li>More support for distributed operations on very large
matrices</li>
+ <li>Easier access to Mahout operations via the command line</li>
+ <li>New HMM based sequence classification from GSoC (currently as
sequential version only and still experimental)</li>
+ <li>Sequential logistic regression training framework</li>
+ <li>New SGD classifier</li>
+ <li>Experimental new type of NB classifier, and feature reduction
options for existing one</li>
+ <li>New vector encoding framework for high speed vectorization
without a pre-built dictionary</li>
+ <li>Additional elements of supervised model evaluation
framework</li>
+ <li>Promoted several pieces of old Colt framework to tested status
(QR decomposition, in particular)</li>
+ <li>Can now save random forests and use it to classify new
data</li>
+ <li>Many, many small fixes, improvements, refactorings and
cleanup</li>
+ </ul>
+ </p>
+ <p>Details on what's included can be found in the <a
href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12310751&styleName=Html&version=12314281">release
notes</a>.
+ </p>
+ <p>Downloads are available from the <a
href="http://www.apache.org/dyn/closer.cgi/lucene/mahout/">Apache Mirrors</a>.
+ </p>
+
+ </section>
+
+ <section>
<title>17 March 2010 - Apache Mahout 0.3 released</title>
<p>The Apache Lucene project is pleased to announce the release of
Apache Mahout 0.3.</p>
<p>