say column a has 1000 entries, maxPref=700 rowSampleRate = Math.min(maxObservationsPerRow, observationsPerRow) / observationsPerRow; we get rowSampleRate =0.0 ( not 0.7) do we totally skip this column or sample column entries with .7 probalility (roughly get 700 entries)
On Tue, Aug 13, 2013 at 11:58 AM, Ted Dunning <[email protected]> wrote: > Why do you think this? > > > On Tue, Aug 13, 2013 at 11:56 AM, sam wu <[email protected]> wrote: > > > Mahout 0.9 snapshot > > > > RowSimilarityJob.java , sampleDown method > > ----line 291 or 300 > > > > double rowSampleRate = Math.min(maxObservationsPerRow, > observationsPerRow) > > / observationsPerRow; > > > > return either 0.0 or 1.0, not fraction. needs (double) casting <---- > ???? > > > > > > BR > > > > Sam > > >
