I think that generally looks good except that I disagree with the
scalable = hadoop-based assertion. I think that scalable means that we
handle big problems that are difficult with other tools. Hadoop is one
key method for this and we use it a lot. In other cases we use other
methods. One example is how taste handles delivery of realtime recs.
Another is where clever algorithms like Pegasos or sgd which give
large scale results on single machines.
Sent from my iPhone
On May 10, 2010, at 4:19 PM, Sean Owen <[email protected]> wrote:
Our first board report as a top-level project is due quite shortly --
snuck up on me.
I need your feedback and edits on our report by end-of-day Wednesday
if you please:
https://svn.apache.org/repos/asf/lucene/mahout/pmc/board-reports/2010/board-report-may.txt
Feel free to edit this directly, or post comments here and I'll take
care of it.
Sean