I tracked the versions back to before the change to Writables were done. There is nothing significant change in the code.
Can you give me a small dataset 10 points maybe 5 dimensions. I can verify the trunk in Case? Robin On Wed, Feb 17, 2010 at 7:49 PM, Pallavi Palleti < pallavi.pall...@corp.aol.com> wrote: > I have a local version which I have submitted long back and I am using it > on real data and is not giving same point for all clusters. However, I > haven't tried with latest mahout code. I have kept my code to output data as > text so that it is easy for me to verify. However, current mahout code > outputs it as binary data (as sequencefile). So, it is difficult to verify. > > > Thanks > Pallavi > > Robin Anil wrote: > >> Have you verified the trunk code on some real data. I am getting same >> point >> for all clusters regardless of the distnce measure >> >> Robin >> >> >> >> On Wed, Feb 17, 2010 at 6:41 PM, Pallavi Palleti < >> pallavi.pall...@corp.aol.com> wrote: >> >> >> >>> Yes. It shouldn't be a problem. My point was that we are extending >>> numpoints as part of ClusterBase, though we are not using it in >>> SoftCluster. >>> Other that that, I don't see any issue w.r.t. functionality. >>> >>> >>> Thanks >>> Pallavi >>> >>> Robin Anil wrote: >>> >>> >>> >>>> In the impl of SoftClusters on writeOut it calculates the centroid and >>>> writes it and when read(in) it reads the centroid in to the center. >>>> >>>> In ClusterDumper it reads into the ClusterBase and does >>>> value.getCenter(); >>>> It should work normally right >>>> >>>> Robin >>>> >>>> >>>> >>>> On Wed, Feb 17, 2010 at 6:02 PM, Pallavi Palleti < >>>> pallavi.pall...@corp.aol.com> wrote: >>>> >>>> >>>> >>>> >>>> >>>>> Yes. But not the total number of points. So, the numpoints from >>>>> ClusterBase >>>>> will not be used in SoftCluster. numpoints is specific to Kmeans >>>>> similar >>>>> to >>>>> weightedpoint total for fuzzy kmeans. >>>>> >>>>> >>>>> Robin Anil wrote: >>>>> >>>>> >>>>> >>>>> >>>>> >>>>>> the center is still the averaged out centroid right? >>>>>> weightedtotalvector/totalprobWeight >>>>>> >>>>>> >>>>>> >>>>>> On Wed, Feb 17, 2010 at 5:10 PM, Pallavi Palleti < >>>>>> pallavi.pall...@corp.aol.com> wrote: >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>>> I haven't yet gone thru ClusterDumper. However, ClusterBase would be >>>>>>> having >>>>>>> number of points to average out (pointTotal/numPoints as per kmeans) >>>>>>> where >>>>>>> as SoftCluster will have weighted point total. So, I am wondering how >>>>>>> can >>>>>>> we >>>>>>> reuse ClusterBase here? >>>>>>> >>>>>>> >>>>>>> Thanks >>>>>>> Pallavi >>>>>>> >>>>>>> Robin Anil wrote: >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>>> yes. So that cluster dumper can print it out. >>>>>>>> >>>>>>>> On Wed, Feb 17, 2010 at 5:02 PM, Pallavi Palleti < >>>>>>>> pallavi.pall...@corp.aol.com> wrote: >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>>> Hi Robin, >>>>>>>>> >>>>>>>>> when you meant by reusing ClusterBase, are you planning to extend >>>>>>>>> ClusterBase in SoftCluster? For example, SoftCluster extends >>>>>>>>> ClusterBase? >>>>>>>>> >>>>>>>>> Thanks >>>>>>>>> Pallavi >>>>>>>>> >>>>>>>>> >>>>>>>>> Robin Anil wrote: >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>>> I have been trying to convert FuzzyKMeans SoftCluster(which should >>>>>>>>>> be >>>>>>>>>> ideally be named FuzzyKmeansCluster) to use the ClusterBase. >>>>>>>>>> >>>>>>>>>> I am getting* the same center* for all the clusters. To aid the >>>>>>>>>> conversion >>>>>>>>>> all i did was remove the center vector from the SoftCluster class >>>>>>>>>> and >>>>>>>>>> reuse >>>>>>>>>> the same from the ClusterBase. These are essentially making no >>>>>>>>>> change >>>>>>>>>> in >>>>>>>>>> the >>>>>>>>>> tests which passes correctly. >>>>>>>>>> >>>>>>>>>> So I am questioning whether the implementation keeps the average >>>>>>>>>> center >>>>>>>>>> at >>>>>>>>>> all ? Anyone who has used FuzzyKMeans experiencing this? >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> Robin >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>> >>>> >>> >> >> >