Some of the recommendation algorithms are less scalable, but the clustering
algorithms, frequent item set mining and most of the classification
algorithms are parallelized with good bounds on time and space or should be
relatively scalable single processor implementations.  There are relatively
scalable versions of recommendation algorithms that should exhibit fairly
good scaling even to billions of data points.

There are, however, relatively few users with really large data sets so far
so you will almost certain encounter some problems with some algorithms.
Your feedback would be of great interest and the result would likely be
fairly quick improvements.

If you can be more specific, I am sure you can get some better answers than
I just gave.

On Sun, Apr 25, 2010 at 10:00 PM, First Qaxy <qa...@yahoo.ca> wrote:

> Hi All,
> How scalable are Mahout's algorithms (clustering, classification, also
> recommendation) for very large datasets (>1bln points, or in other words
> have memory constraints)?
> Many thanks.
> -qf
>
>

Reply via email to