On Jun 23, 2011, at 7:21am, Sean Owen wrote:

> Colt? the mahout-math and mahout-collections code are based on that.
> 
> However FastMap isn't -- that's my own creation from the old collaborative
> filtering framework.
> 
> But recently there has been talk about switching all of this to use fastutil

We use the fastutil native set/map implementations in a number of projects, and 
they work well.

The issues we've run into are:

1. The full jar is pretty big - 12MB.

They've got versions of every nxm combination of native types for maps, as an 
example.

So we often wind up pulling out just the versions we need.

2. Wrapping these classes with Hadoop serialization can be slow (they support 
Java serialization)

Not sure if that would be a factor, but serializing a 20M entry map takes some 
time.

-- Ken

> On Thu, Jun 23, 2011 at 2:25 PM, Marko Ciric <[email protected]> wrote:
> 
>> How similar are Mahout collections (like FastMap) with Kolt (cern.kolt)?
>> 
>> 
>> --
>> Marko Ćirić
>> [email protected]
>> 

--------------------------
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom data mining solutions






Reply via email to