On Wed, Oct 20, 2010 at 1:04 AM, VanIngen, Erik (FIPS) <
[email protected]> wrote:

> Good morning!
>
> I need to to cluster analysis on values like this:
> 1.814263985     -0.633923297
> 2.501153739     -0.559033358
> 2.408755862     -0.509902975
> 1.935495243     -0.330554484
> 0.728818279     -0.169024633
> -0.523861032    0.110392311
>
> I can use EuclideanIntegerPoint, but than I have to convert the values to
> integers and would loose precission. So my trick would be to multiply with
> 1000, cluster and multiply the values with 0.001. Would that be a valid
> approach from a methodology point of view?
>

Numerically, this approach will often be a disaster.  I wouldn't recommend
it.


>
> Are there any plans to develop a EuclideanDoublePoint?
>
> Apache Mahout has a bunch of clustering code that you could use.  It isn't
limited to two dimensions, either, as a EDP might be.

Reply via email to