I am trying to run canopy clustering on vectors extracted from lucene index. I want to use CosineDistanceMeasure. How do I know what appropriate values to use for t1 and t2 distance threshold? I would assume that Cosine distance measure would return "distances" as a range from 0.0 to 1.0 but that seems not the case, so how do I know what the potential distance ranges are to pick t1 and t2 (other than many trial and errors)?
Thanks Bob
