See inline,
Jeff

-----Original Message-----
From: gaurav redkar [mailto:[email protected]]
Sent: Wednesday, November 09, 2011 4:09 AM
To: [email protected]
Subject: meanshift clustering

Hi,

I am unable to identify where the clusterPoints() function in the MeanShiftCanopyClusterer.java file is called during the execution of the Meanshift job.

[jeff] That method is not called except by a unit test, TestMeanShift.testClustererReferenceImplementation.

What I need to know is where the files in the clusteredPoints and clusters-* directories are written when we run the job on Hadoop.

[jeff] Those directories will be created within the --output directory which you specify for your job.

buildclustersMR() creates the clusters-* directory for each iteration, but I am unable to locate the code which actually writes the part-r-* files.

[jeff] The code which writes the part-r-* files is Hadoop code which is called within MeanShiftCanopyReducer.reduce (line 55).

Any suggestions?

Thanks
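
To check where the files ended up after a run, a minimal sketch along these lines may help. It is not part of Mahout: the class name OutputLayout and the method listPartFiles are hypothetical, and it assumes the --output directory has been copied to (or lives on) the local filesystem rather than HDFS. It walks the clusters-* and clusteredPoints directories and lists any part-* files it finds.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, not a Mahout class: inspects a job's output directory.
public class OutputLayout {

    // Returns the part-* files found under every clusters-* directory and
    // under clusteredPoints, as sorted paths relative to the output directory.
    public static List<String> listPartFiles(Path output) throws IOException {
        List<String> found = new ArrayList<>();
        // The glob group matches either "clusters-<n>" or "clusteredPoints".
        try (DirectoryStream<Path> dirs =
                 Files.newDirectoryStream(output, "{clusters-*,clusteredPoints}")) {
            for (Path dir : dirs) {
                if (!Files.isDirectory(dir)) {
                    continue;
                }
                // "part-*" is deliberately loose so it matches part-r-* as well
                // as any other part files a job may have produced.
                try (DirectoryStream<Path> parts =
                         Files.newDirectoryStream(dir, "part-*")) {
                    for (Path part : parts) {
                        found.add(output.relativize(part).toString());
                    }
                }
            }
        }
        Collections.sort(found);
        return found;
    }

    public static void main(String[] args) throws IOException {
        // Assumption: the first argument is the local copy of the --output directory.
        Path output = Paths.get(args.length > 0 ? args[0] : "output");
        for (String p : listPartFiles(output)) {
            System.out.println(p);
        }
    }
}
```

Running it with the local path of the output directory as the only argument prints one relative path per part file, grouped by directory.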
