Hi Namit, The current Naive Bayes implementation is based on MapReduce and therefore dependant on Hadoop. You could run mahout trainnb and mahout testnb scripts locally by setting the environment variable MAHOUT_LOCAL=true.
This will keep everything on your local filesystem and prevent Mahout from attempting to run in cluster mode. But Hadoop is required. Andy > Date: Mon, 26 May 2014 10:22:18 +0530 > Subject: Re: Using existing model to train again > From: [email protected] > To: [email protected] > > Hi Subbu, > > I was too working with Naive Bayes. I wanted to know whether it is possible > to run *Naive Bayes without Hadoop* in Mahout or is it necessary to use > Hadoop. > > Thanks > Namit > > > On Mon, May 26, 2014 at 10:18 AM, Kasi Subrahmanyam > <[email protected]>wrote: > > > Hi team, > > I have trained a model in naive Bayes using training data of 1 million > > records. Now I have another 1 million records . Can I add this new training > > data to the existing model and train it again to get a new model instead of > > passing all the 2 million records at once to get a model. > > > > Thanks, > > Subbu > >
