Use R.  Mahout is major over-kill for this problem.  You can always
transition later.

I am not saying your problem isn't difficult or that it isn't valuable to
solve.  Just that the virtue that Mahout brings (scale) isn't the virtue you
need (models sooner with least effort).


Got it. I've started reading on R today.


I'm a Debian Developer and I noticed Mahout is not in Debian. If I'm able
to wrap my head around everything and get it working I would love to
contribute back and package it.


We would love it if you did.  Mahout is fast moving and trunk will be
significantly more useful for most people for a while yet.  How does that
affect packaging for debian?


Debian has 3 distributions: stable, testing, unstable. A new stable gets released every 18 months or so. I've seen packages following trunk, like ruby1.9, so in theory it should be OK.

I want look into this when I start using mahout, it would help me too having it packaged. Right now I'm trying to get the hang of R though.


Offtopic: I can't find examples about how to implement my setup with partial queries. In either mahout or R.

I can train with "age", "interest1" ... "interestN", "demographic1", ..."demographicX" and when querying I could ask with "age", "interest1" .. "interestM" where M could be bigger or smaller than N.

I could break them into multiple rows, but it would result fake results. Someone interested in Books + Math could yield results, but just Math wouldn't.

Do you guys know anyone that offers consulting services at a reasonable price to help with modelling?

-r.

Reply via email to