Run a single mahout job using 2 data sources: I am implementing a system where we have nightly analytics data on user preferences (living in a hadoop cluster), and real time user behaviour (like adding items to a cart), updated more frequently in HBase. I'd like to be able to run a mahout job and provide recommendations based on both sets of data. (I understand the latencies involved and am fine with it.)
Is this possible? If yes, can you provide some pointers on how? Thank you. Thanks, Vani
