On Wed, Oct 20, 2010 at 1:04 AM, VanIngen, Erik (FIPS) < [email protected]> wrote:
> Good morning!
>
> I need to do cluster analysis on values like this:
> 1.814263985 -0.633923297
> 2.501153739 -0.559033358
> 2.408755862 -0.509902975
> 1.935495243 -0.330554484
> 0.728818279 -0.169024633
> -0.523861032 0.110392311
>
> I can use EuclideanIntegerPoint, but then I have to convert the values to
> integers and would lose precision. So my trick would be to multiply by
> 1000, cluster, and multiply the values by 0.001. Would that be a valid
> approach from a methodology point of view?
>

Numerically, this approach will often be a disaster. I wouldn't recommend it.

>
> Are there any plans to develop a EuclideanDoublePoint?
>

Apache Mahout has a bunch of clustering code that you could use. It isn't
limited to two dimensions, either, as an EDP might be.
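To make the precision concern concrete, here is a minimal sketch in plain Java of what the multiply-by-1000 round trip throws away. The class and method names (ScalingSketch, roundTrip, distance) are made up for illustration and are not part of Commons Math or Mahout; the two sample points are invented to sit closer together than the 0.001 grid.

```java
// Hypothetical illustration only; not the Commons Math or Mahout API.
class ScalingSketch {

    // Scale a value onto an integer grid and back, as in the
    // multiply-by-1000 / multiply-by-0.001 trick.
    static double roundTrip(double value, int scale) {
        return Math.round(value * scale) / (double) scale;
    }

    // Euclidean distance computed directly on double coordinates,
    // for any number of dimensions.
    static double distance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    public static void main(String[] args) {
        // First coordinate from the sample data in the question.
        double x = 1.814263985;
        System.out.println("lost by rounding: " + (x - roundTrip(x, 1000)));

        // Two invented points that differ by less than the grid spacing.
        double[] p = {0.1101, -0.1690};
        double[] q = {0.1104, -0.1693};
        double[] pGrid = {roundTrip(p[0], 1000), roundTrip(p[1], 1000)};
        double[] qGrid = {roundTrip(q[0], 1000), roundTrip(q[1], 1000)};

        System.out.println("distance on doubles: " + distance(p, q));
        System.out.println("distance on grid:    " + distance(pGrid, qGrid));
    }
}
```

Any structure finer than 1/scale disappears after the conversion, and a larger scale factor only moves the problem (and risks integer overflow for large values). Working on doubles directly avoids both.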
