My understanding is if one feature is not presented in the input data(e.g. x==0), then we also need to include its probability, which is the second term (1 - P_xy)*(1 - x)
--- [Visit Topic](https://discuss.mxnet.io/t/naive-bayes/5155/4) or reply to this email to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.mxnet.io/email/unsubscribe/b3813602a87944d8d73b251e08a27007f99a7af043a04632e719daa4ebf97a21).
