I believe that vectorized attributes are stored as doubles in Mahout. Are some attributes treated as "nominal"? That is, for some attributes, is the distance function such that any two unequal values are at distance 1?
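To make the distinction concrete, here is a minimal sketch in plain Java. It is not Mahout code: the class and method names are hypothetical, and the codes 1.0, 2.0, 3.0 are just assumed stand-ins for three categories. It contrasts an ordinary numeric distance with the 0/1 distance I mean by "nominal".

```java
// Hypothetical illustration only; none of these names come from Mahout or Weka.
final class NominalDistanceSketch {

    // Numeric distance in one dimension: treats the codes as ordered numbers.
    static double numericDistance(double a, double b) {
        return Math.abs(a - b);
    }

    // Nominal (overlap) distance: 0 if the values are equal, 1 otherwise.
    static double nominalDistance(double a, double b) {
        return a == b ? 0.0 : 1.0;
    }

    public static void main(String[] args) {
        // Assumed integer-valued codes for three categories.
        double first = 1.0, second = 2.0, third = 3.0;

        System.out.println("numeric(1,2) = " + numericDistance(first, second));
        System.out.println("numeric(1,3) = " + numericDistance(first, third));
        System.out.println("nominal(1,2) = " + nominalDistance(first, second));
        System.out.println("nominal(1,3) = " + nominalDistance(first, third));
    }
}
```

Under the numeric distance, 1.0 is closer to 2.0 than to 3.0; under the nominal distance, every pair of unequal values is equally far apart.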
Looking at MapBackedARFFModel.java, I see that Weka nominal attributes get converted to integer-valued doubles (1.0, 2.0, 3.0, ...). Will a nominal with value 1.0 be closer to a nominal with value 2.0 than to one with value 3.0? Or is the distance between 1.0 and 3.0 also 1?

Thanks,
Don
