Ouch.

Sorry... your original posting made it sound like you *wanted* it to be 0.0
or 1.0.

This is a bug.  Can you file a JIRA?
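
For the JIRA, here is a minimal sketch of what appears to be going on. It assumes that both maxObservationsPerRow and observationsPerRow are ints, which matches the 0.0-or-1.0 behavior you describe; I have not checked the declarations. The class and method names are made up for illustration and are not Mahout code. With two int operands the division truncates before the result is widened to double, so casting one operand first should give the fraction.

```java
public class SampleRateDemo {

    // Mirrors the reported expression: int / int truncates toward zero,
    // and only then is the result widened to double.
    static double truncatedRate(int maxObservationsPerRow, int observationsPerRow) {
        return Math.min(maxObservationsPerRow, observationsPerRow) / observationsPerRow;
    }

    // Casting one operand to double forces floating-point division.
    static double fractionalRate(int maxObservationsPerRow, int observationsPerRow) {
        return (double) Math.min(maxObservationsPerRow, observationsPerRow) / observationsPerRow;
    }

    public static void main(String[] args) {
        // The 1000-entry, max-700 case from this thread.
        System.out.println(truncatedRate(700, 1000));  // prints 0.0
        System.out.println(fractionalRate(700, 1000)); // prints 0.7
    }
}
```

When the row has no more observations than the maximum, both versions return 1.0, which is why the truncated form only ever yields 0.0 or 1.0.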


On Tue, Aug 13, 2013 at 12:04 PM, sam wu <[email protected]> wrote:

> Say column a has 1000 entries and maxPref=700.
> rowSampleRate = Math.min(maxObservationsPerRow, observationsPerRow) /
> observationsPerRow;
> We get rowSampleRate = 0.0 (not 0.7).
> Do we totally skip this column, or sample column entries with 0.7 probability
> (roughly 700 entries)?
>
>
>
>
> On Tue, Aug 13, 2013 at 11:58 AM, Ted Dunning <[email protected]>
> wrote:
>
> > Why do you think this?
> >
> >
> > On Tue, Aug 13, 2013 at 11:56 AM, sam wu <[email protected]> wrote:
> >
> > > Mahout 0.9 snapshot
> > >
> > > RowSimilarityJob.java, sampleDown method
> > > ----line 291 or 300
> > >
> > >  double rowSampleRate = Math.min(maxObservationsPerRow, observationsPerRow)
> > >      / observationsPerRow;
> > >
> > > This returns either 0.0 or 1.0, not a fraction. Needs a (double) cast  <---- ????
> > >
> > >
> > > BR
> > >
> > > Sam
> > >
> >
>
