yunfengzhou-hub opened a new pull request #21:
URL: https://github.com/apache/flink-ml/pull/21


   ## What is the purpose of the change
   This PR adds an implementation of the Naive Bayes algorithm to Flink ML. The 
implementation follows those in Alibaba Alink and Apache Spark, but it is built 
on Flink ML's framework and the Flink DataStream API.
   
   This PR also adds classes that convert strings to numeric indices. Naive 
Bayes needs this conversion so that it only has to deal with numeric input 
data.
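
   To illustrate the idea of string indexing, here is a minimal plain-Java 
sketch. It is not the MultiStringIndexer API from this PR: the class name 
`StringIndexerSketch`, the methods `fit` and `transform`, and the choice to 
assign indices in order of first appearance are all assumptions made for 
illustration.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not the MultiStringIndexer class added by this PR.
final class StringIndexerSketch {

    /** Assigns each distinct string an index in order of first appearance (an assumption). */
    public static Map<String, Integer> fit(List<String> values) {
        Map<String, Integer> index = new LinkedHashMap<>();
        for (String v : values) {
            // index.size() is evaluated before insertion, so new keys get 0, 1, 2, ...
            index.putIfAbsent(v, index.size());
        }
        return index;
    }

    /** Replaces each string with its numeric index; values not seen during fit are rejected. */
    public static double[] transform(Map<String, Integer> index, List<String> values) {
        double[] out = new double[values.size()];
        for (int i = 0; i < values.size(); i++) {
            Integer idx = index.get(values.get(i));
            if (idx == null) {
                throw new IllegalArgumentException("Unseen value: " + values.get(i));
            }
            out[i] = idx;
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> column = Arrays.asList("sunny", "rainy", "sunny", "overcast");
        Map<String, Integer> index = fit(column);
        System.out.println(index);
        System.out.println(Arrays.toString(transform(index, column)));
    }
}
```

   Rejecting unseen values is one possible policy; an indexer could also map 
them to a reserved index or skip the row.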
   
   ## Brief change log
   This PR adds the public classes NaiveBayes and MultiStringIndexer. Users can 
use NaiveBayes for training and inference with the Naive Bayes algorithm, and 
MultiStringIndexer to convert strings into indices.
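
   For readers unfamiliar with the algorithm, the following plain-Java sketch 
shows training and inference for a categorical Naive Bayes model over indexed 
features. It does not use Flink and is not the NaiveBayes class from this PR: 
the class `NaiveBayesSketch`, its `fit` and `predict` methods, and the use of 
additive (Laplace) smoothing are assumptions made for illustration.

```java
// Hypothetical illustration only; not the NaiveBayes class added by this PR.
final class NaiveBayesSketch {
    private final double[] logPrior;           // logPrior[c] = log P(label = c)
    private final double[][][] logLikelihood;  // [c][j][v] = log P(feature j = v | label = c)

    private NaiveBayesSketch(double[] logPrior, double[][][] logLikelihood) {
        this.logPrior = logPrior;
        this.logLikelihood = logLikelihood;
    }

    /**
     * Trains on indexed data. features[i][j] must lie in [0, numValues[j]) and
     * labels[i] in [0, numLabels). The smoothing term is additive (Laplace) smoothing.
     */
    public static NaiveBayesSketch fit(
            int[][] features, int[] labels, int numLabels, int[] numValues, double smoothing) {
        int n = labels.length;
        int d = numValues.length;
        double[] labelCount = new double[numLabels];
        double[][][] table = new double[numLabels][d][];
        for (int c = 0; c < numLabels; c++) {
            for (int j = 0; j < d; j++) {
                table[c][j] = new double[numValues[j]];
            }
        }
        // Count label occurrences and (label, feature, value) co-occurrences.
        for (int i = 0; i < n; i++) {
            labelCount[labels[i]]++;
            for (int j = 0; j < d; j++) {
                table[labels[i]][j][features[i][j]]++;
            }
        }
        // Turn the counts into smoothed log-probabilities in place.
        double[] logPrior = new double[numLabels];
        for (int c = 0; c < numLabels; c++) {
            logPrior[c] = Math.log((labelCount[c] + smoothing) / (n + smoothing * numLabels));
            for (int j = 0; j < d; j++) {
                for (int v = 0; v < numValues[j]; v++) {
                    table[c][j][v] = Math.log(
                            (table[c][j][v] + smoothing)
                                    / (labelCount[c] + smoothing * numValues[j]));
                }
            }
        }
        return new NaiveBayesSketch(logPrior, table);
    }

    /** Returns the label with the highest log-posterior score for one indexed sample. */
    public int predict(int[] x) {
        int best = 0;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (int c = 0; c < logPrior.length; c++) {
            double score = logPrior[c];
            for (int j = 0; j < x.length; j++) {
                score += logLikelihood[c][j][x[j]];
            }
            if (score > bestScore) {
                bestScore = score;
                best = c;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Two indexed features per sample; the labels depend only on the first feature.
        int[][] features = {{0, 0}, {0, 1}, {1, 0}, {1, 1}};
        int[] labels = {1, 1, 0, 0};
        NaiveBayesSketch model = fit(features, labels, 2, new int[] {2, 2}, 1.0);
        System.out.println(model.predict(new int[] {0, 0}));
        System.out.println(model.predict(new int[] {1, 1}));
    }
}
```

   Working in log space avoids underflow when many feature probabilities are 
multiplied, and the smoothing term keeps unseen feature values from producing 
a zero probability.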
   
   ## Verifying this change
   The changes are tested by unit tests in MultiStringIndexerTest and 
NaiveBayesTest.
   
   ## Does this pull request potentially affect one of the following parts:
   Dependencies (does it add or upgrade a dependency): (yes)
   The public API, i.e., is any changed class annotated with @Public(Evolving): 
(yes)
   
   ## Documentation
   Does this pull request introduce a new feature? (yes)
   If yes, how is the feature documented? (Javadoc)



