Ouch. Sorry... your original posting made it sound like you *wanted* it to be 0.0 or 1.0.
This is a bug. Can you file a JIRA?

On Tue, Aug 13, 2013 at 12:04 PM, sam wu <[email protected]> wrote:

> say column a has 1000 entries, maxPref=700
> rowSampleRate = Math.min(maxObservationsPerRow, observationsPerRow) /
> observationsPerRow;
> we get rowSampleRate =0.0 ( not 0.7)
> do we totally skip this column or sample column entries with .7 probalility
> (roughly get 700 entries)
>
> On Tue, Aug 13, 2013 at 11:58 AM, Ted Dunning <[email protected]> wrote:
>
> > Why do you think this?
> >
> > On Tue, Aug 13, 2013 at 11:56 AM, sam wu <[email protected]> wrote:
> >
> > > Mahout 0.9 snapshot
> > >
> > > RowSimilarityJob.java , sampleDown method
> > > ----line 291 or 300
> > >
> > > double rowSampleRate = Math.min(maxObservationsPerRow, observationsPerRow)
> > > / observationsPerRow;
> > >
> > > return either 0.0 or 1.0, not fraction. needs (double) casting <---- ????
> > >
> > > BR
> > >
> > > Sam
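
For the JIRA, a small standalone reproduction of the truncation might help. This is only a sketch: it assumes both maxObservationsPerRow and observationsPerRow are ints (the thread does not show their declarations), and the class and method names are made up for illustration, not taken from RowSimilarityJob.

```java
public class RowSampleRateDemo {

    // Same shape as the expression quoted in the thread. With int operands
    // (an assumption), the division truncates before widening to double.
    static double truncatedRate(int maxObservationsPerRow, int observationsPerRow) {
        return Math.min(maxObservationsPerRow, observationsPerRow) / observationsPerRow;
    }

    // One possible fix: cast the numerator to double so the division
    // is carried out in floating point.
    static double fractionalRate(int maxObservationsPerRow, int observationsPerRow) {
        return (double) Math.min(maxObservationsPerRow, observationsPerRow) / observationsPerRow;
    }

    public static void main(String[] args) {
        // The example from the thread: 1000 entries, cap of 700.
        System.out.println(truncatedRate(700, 1000));   // prints 0.0
        System.out.println(fractionalRate(700, 1000));  // prints 0.7
        // Rows at or under the cap are unaffected either way.
        System.out.println(truncatedRate(700, 500));    // prints 1.0
        System.out.println(fractionalRate(700, 500));   // prints 1.0
    }
}
```

With the cast in place, a row with 1000 observations and a cap of 700 gets a rate of 0.7, which is what the sampling presumably intends, rather than 0.0.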
