Re: How to incorporate the new data in the MLlib-NaiveBayes model along with predicting?

2014-10-29 Thread Chris Fregly
jira created with comments/references to this discussion:
https://issues.apache.org/jira/browse/SPARK-4144



On Tue, Aug 19, 2014 at 4:47 PM, Xiangrui Meng men...@gmail.com wrote:

 No. Please create one but it won't be able to catch the v1.1 train.
 -Xiangrui

 On Tue, Aug 19, 2014 at 4:22 PM, Chris Fregly ch...@fregly.com wrote:
  this would be awesome.  did a jira get created for this?  I searched, but
  didn't find one.
 
  thanks!
 
  -chris
 
 
  On Tue, Jul 8, 2014 at 1:30 PM, Rahul Bhojwani 
 rahulbhojwani2...@gmail.com
  wrote:
 
  Thanks a lot Xiangrui. This will help.
 
 
  On Wed, Jul 9, 2014 at 1:34 AM, Xiangrui Meng men...@gmail.com wrote:
 
  Hi Rahul,
 
  We plan to add online model updates with Spark Streaming, perhaps in
  v1.1, starting with linear methods. Please open a JIRA for Naive
  Bayes. For Naive Bayes, we need to update the priors and conditional
  probabilities, which means we should also remember the number of
  observations for the updates.
 
  Best,
  Xiangrui
 
  On Tue, Jul 8, 2014 at 7:35 AM, Rahul Bhojwani
  rahulbhojwani2...@gmail.com wrote:
   Hi,
  
   I am using the MLlib Naive Bayes for a text classification problem. I
   have
   very less amount of training data. And then the data will be coming
   continuously and I need to classify it as either A or B. I am
 training
   the
   MLlib Naive Bayes model using the training data but next time when
 data
   comes, I want to predict its class and then incorporate that also in
   the
   model for next time prediction of new data(I think that is obvious).
  
   So I am not able to figure out what is the way to do that using MLlib
   Naive
   Bayes. Is it that I have to train the model on the whole data every
   time new
   data comes in??
  
   Thanks in Advance!
   --
   Rahul K Bhojwani
   3rd Year B.Tech
   Computer Science and Engineering
   National Institute of Technology, Karnataka
 
 
 
 
  --
  Rahul K Bhojwani
  3rd Year B.Tech
  Computer Science and Engineering
  National Institute of Technology, Karnataka
 
 



Re: How to incorporate the new data in the MLlib-NaiveBayes model along with predicting?

2014-08-19 Thread Chris Fregly
this would be awesome.  did a jira get created for this?  I searched, but
didn't find one.

thanks!

-chris


On Tue, Jul 8, 2014 at 1:30 PM, Rahul Bhojwani rahulbhojwani2...@gmail.com
wrote:

 Thanks a lot Xiangrui. This will help.


 On Wed, Jul 9, 2014 at 1:34 AM, Xiangrui Meng men...@gmail.com wrote:

 Hi Rahul,

 We plan to add online model updates with Spark Streaming, perhaps in
 v1.1, starting with linear methods. Please open a JIRA for Naive
 Bayes. For Naive Bayes, we need to update the priors and conditional
 probabilities, which means we should also remember the number of
 observations for the updates.

 Best,
 Xiangrui

 On Tue, Jul 8, 2014 at 7:35 AM, Rahul Bhojwani
 rahulbhojwani2...@gmail.com wrote:
  Hi,
 
  I am using the MLlib Naive Bayes for a text classification problem. I
 have
  very less amount of training data. And then the data will be coming
  continuously and I need to classify it as either A or B. I am training
 the
  MLlib Naive Bayes model using the training data but next time when data
  comes, I want to predict its class and then incorporate that also in the
  model for next time prediction of new data(I think that is obvious).
 
  So I am not able to figure out what is the way to do that using MLlib
 Naive
  Bayes. Is it that I have to train the model on the whole data every
 time new
  data comes in??
 
  Thanks in Advance!
  --
  Rahul K Bhojwani
  3rd Year B.Tech
  Computer Science and Engineering
  National Institute of Technology, Karnataka




 --
 Rahul K Bhojwani
 3rd Year B.Tech
 Computer Science and Engineering
 National Institute of Technology, Karnataka



Re: How to incorporate the new data in the MLlib-NaiveBayes model along with predicting?

2014-08-19 Thread Xiangrui Meng
No. Please create one but it won't be able to catch the v1.1 train. -Xiangrui

On Tue, Aug 19, 2014 at 4:22 PM, Chris Fregly ch...@fregly.com wrote:
 this would be awesome.  did a jira get created for this?  I searched, but
 didn't find one.

 thanks!

 -chris


 On Tue, Jul 8, 2014 at 1:30 PM, Rahul Bhojwani rahulbhojwani2...@gmail.com
 wrote:

 Thanks a lot Xiangrui. This will help.


 On Wed, Jul 9, 2014 at 1:34 AM, Xiangrui Meng men...@gmail.com wrote:

 Hi Rahul,

 We plan to add online model updates with Spark Streaming, perhaps in
 v1.1, starting with linear methods. Please open a JIRA for Naive
 Bayes. For Naive Bayes, we need to update the priors and conditional
 probabilities, which means we should also remember the number of
 observations for the updates.

 Best,
 Xiangrui

 On Tue, Jul 8, 2014 at 7:35 AM, Rahul Bhojwani
 rahulbhojwani2...@gmail.com wrote:
  Hi,
 
  I am using the MLlib Naive Bayes for a text classification problem. I
  have
  very less amount of training data. And then the data will be coming
  continuously and I need to classify it as either A or B. I am training
  the
  MLlib Naive Bayes model using the training data but next time when data
  comes, I want to predict its class and then incorporate that also in
  the
  model for next time prediction of new data(I think that is obvious).
 
  So I am not able to figure out what is the way to do that using MLlib
  Naive
  Bayes. Is it that I have to train the model on the whole data every
  time new
  data comes in??
 
  Thanks in Advance!
  --
  Rahul K Bhojwani
  3rd Year B.Tech
  Computer Science and Engineering
  National Institute of Technology, Karnataka




 --
 Rahul K Bhojwani
 3rd Year B.Tech
 Computer Science and Engineering
 National Institute of Technology, Karnataka



-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org



Re: How to incorporate the new data in the MLlib-NaiveBayes model along with predicting?

2014-07-08 Thread Xiangrui Meng
Hi Rahul,

We plan to add online model updates with Spark Streaming, perhaps in
v1.1, starting with linear methods. Please open a JIRA for Naive
Bayes. For Naive Bayes, we need to update the priors and conditional
probabilities, which means we should also remember the number of
observations for the updates.

Best,
Xiangrui

On Tue, Jul 8, 2014 at 7:35 AM, Rahul Bhojwani
rahulbhojwani2...@gmail.com wrote:
 Hi,

 I am using the MLlib Naive Bayes for a text classification problem. I have
 very less amount of training data. And then the data will be coming
 continuously and I need to classify it as either A or B. I am training the
 MLlib Naive Bayes model using the training data but next time when data
 comes, I want to predict its class and then incorporate that also in the
 model for next time prediction of new data(I think that is obvious).

 So I am not able to figure out what is the way to do that using MLlib Naive
 Bayes. Is it that I have to train the model on the whole data every time new
 data comes in??

 Thanks in Advance!
 --
 Rahul K Bhojwani
 3rd Year B.Tech
 Computer Science and Engineering
 National Institute of Technology, Karnataka


Re: How to incorporate the new data in the MLlib-NaiveBayes model along with predicting?

2014-07-08 Thread Rahul Bhojwani
Thanks a lot Xiangrui. This will help.


On Wed, Jul 9, 2014 at 1:34 AM, Xiangrui Meng men...@gmail.com wrote:

 Hi Rahul,

 We plan to add online model updates with Spark Streaming, perhaps in
 v1.1, starting with linear methods. Please open a JIRA for Naive
 Bayes. For Naive Bayes, we need to update the priors and conditional
 probabilities, which means we should also remember the number of
 observations for the updates.

 Best,
 Xiangrui

 On Tue, Jul 8, 2014 at 7:35 AM, Rahul Bhojwani
 rahulbhojwani2...@gmail.com wrote:
  Hi,
 
  I am using the MLlib Naive Bayes for a text classification problem. I
 have
  very less amount of training data. And then the data will be coming
  continuously and I need to classify it as either A or B. I am training
 the
  MLlib Naive Bayes model using the training data but next time when data
  comes, I want to predict its class and then incorporate that also in the
  model for next time prediction of new data(I think that is obvious).
 
  So I am not able to figure out what is the way to do that using MLlib
 Naive
  Bayes. Is it that I have to train the model on the whole data every time
 new
  data comes in??
 
  Thanks in Advance!
  --
  Rahul K Bhojwani
  3rd Year B.Tech
  Computer Science and Engineering
  National Institute of Technology, Karnataka




-- 
Rahul K Bhojwani
3rd Year B.Tech
Computer Science and Engineering
National Institute of Technology, Karnataka