> On 2011-12-13 13:08:20, Ted Dunning wrote:
> > /trunk/core/src/main/java/org/apache/mahout/math/hadoop/DistributedRowMatrix.java,
> >  line 199
> > <https://reviews.apache.org/r/3147/diff/5/?file=64279#file64279line199>
> >
> >     I would really rather use standard terminology here.
> >     
> >     A mean row is a row that is that average of all others, but a row mean 
> > would mean an average of the elements a single row.  The plural form, row 
> > means, indicates the means of all rows.  What you are computing are the 
> > means of every column.
> >     
> >     In contrast, R, Octave and Matlab all use columnMeans as the name of 
> > the function being implemented here.

Sure. In Matlab/Octave I'm used to mean(A,1) (takes the mean across the 1st 
dimension, ie. across rows, but done per column). I'll change this to 
colMeans(), which seems to be clearer.


> On 2011-12-13 13:08:20, Ted Dunning wrote:
> > /trunk/core/src/main/java/org/apache/mahout/math/hadoop/MatrixRowMeanJob.java,
> >  lines 129-132
> > <https://reviews.apache.org/r/3147/diff/5/?file=64280#file64280line129>
> >
> >     There are lots of lines with trailing white space.  Isn't this easily 
> > suppressed?

I can use sed, or perhaps there's a better way?


- Raphael


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/3147/#review3874
-----------------------------------------------------------


On 2011-12-13 04:46:47, Raphael Cendrillon wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/3147/
> -----------------------------------------------------------
> 
> (Updated 2011-12-13 04:46:47)
> 
> 
> Review request for mahout, lancenorskog and Dmitriy Lyubimov.
> 
> 
> Summary
> -------
> 
> Here's a patch with a simple job to calculate the row mean (column-wise 
> mean). One outstanding issue is the combiner, this requires a wrtiable class 
> IntVectorTupleWritable, where the Int stores the number of rows, and the 
> Vector stores the column-wise sum.
> 
> 
> This addresses bug MAHOUT-923.
>     https://issues.apache.org/jira/browse/MAHOUT-923
> 
> 
> Diffs
> -----
> 
>   
> /trunk/core/src/main/java/org/apache/mahout/math/hadoop/DistributedRowMatrix.java
>  1213474 
>   
> /trunk/core/src/main/java/org/apache/mahout/math/hadoop/MatrixRowMeanJob.java 
> PRE-CREATION 
>   
> /trunk/core/src/test/java/org/apache/mahout/math/hadoop/TestDistributedRowMatrix.java
>  1213474 
> 
> Diff: https://reviews.apache.org/r/3147/diff
> 
> 
> Testing
> -------
> 
> Junit test
> 
> 
> Thanks,
> 
> Raphael
> 
>

Reply via email to