Re: Reg. Netflix Prize Apache Mahout GSoC Application

Sean Owen Sun, 04 Apr 2010 02:40:09 -0700

Reusing code is fine, in principle. The code you mention, however,
will not help you much. It is non-distributed and has nothing to do
with Hadoop. You might reuse a bit of code to parse the input files,
that's about it.


Which data store are you referring to... if I understand right, you
are implementing an algorithm on Hadoop. You would definitely not
implement anything to load into memory, and I think you want to work
with HDFS and Amazon S3, not Hbase.

On Sun, Apr 4, 2010 at 9:29 AM, Sisir Koppaka <sisir.kopp...@gmail.com> wrote:
> Firstly, I am expecting to reuse the code at
> *org.apache.mahout.cf.taste.example.netflix
> *and have mentioned so in my proposal. Please let me know if this is OK, or
> if you foresee any problems doing this. Secondly, I will implement a
> HBase-based datastore as well as a InMemory-based one, but is the
> InMemory-based one a pre-requisite for the HBase-based one to be used?
> (Eventually everything has to go to memory, so is this being done elsewhere
> or does the InMemory datastore do it?)

Re: Reg. Netflix Prize Apache Mahout GSoC Application

Reply via email to