On Jun 23, 2011, at 7:21am, Sean Owen wrote: > Colt? the mahout-math and mahout-collections code are based on that. > > However FastMap isn't -- that's my own creation from the old collaborative > filtering framework. > > But recently there has been talk about switching all of this to use fastutil
We use the fastutil native set/map implementations in a number of projects, and they work well. The issues we've run into are: 1. The full jar is pretty big - 12MB. They've got versions of every nxm combination of native types for maps, as an example. So we often wind up pulling out just the versions we need. 2. Wrapping these classes with Hadoop serialization can be slow (they support Java serialization) Not sure if that would be a factor, but serializing a 20M entry map takes some time. -- Ken > On Thu, Jun 23, 2011 at 2:25 PM, Marko Ciric <[email protected]> wrote: > >> How similar are Mahout collections (like FastMap) with Kolt (cern.kolt)? >> >> >> -- >> Marko Ćirić >> [email protected] >> -------------------------- Ken Krugler +1 530-210-6378 http://bixolabs.com custom data mining solutions
