yunfengzhou-hub opened a new pull request #21: URL: https://github.com/apache/flink-ml/pull/21
## What is the purpose of the change

This PR adds an implementation of the Naive Bayes algorithm to Flink ML. The implementation follows those in Alibaba Alink and Apache Spark, but it is built on Flink ML's framework and the Flink DataStream API.

This PR also adds classes that convert strings to numeric indices. Naive Bayes needs this conversion so that it only has to handle numeric input data.

## Brief change log

This PR adds the public classes NaiveBayes and MultiStringIndexer. Users can use NaiveBayes for training and inference with the algorithm of the same name, and MultiStringIndexer to convert strings into indices.

## Verifying this change

The changes are covered by unit tests in MultiStringIndexerTest and NaiveBayesTest.

## Does this pull request potentially affect one of the following parts:

- Dependencies (does it add or upgrade a dependency): (yes)
- The public API, i.e., is any changed class annotated with @Public(Evolving): (yes)

## Documentation

- Does this pull request introduce a new feature? (yes)
- If yes, how is the feature documented? (Java doc)
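
For readers unfamiliar with the two pieces described in this PR, the following is a minimal plain-Java sketch of the general idea: strings are first mapped to integer indices, and a categorical Naive Bayes classifier then works only on those indices. It is an illustration under stated assumptions, not the code in this PR. The class `NaiveBayesSketch`, its methods `buildIndex` and `predict`, the add-one smoothing, and the first-appearance index ordering are all hypothetical choices made for this example; the actual NaiveBayes and MultiStringIndexer classes use Flink ML's framework and may differ in every one of these details.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Illustrative only: plain Java, not the Flink ML classes added by this PR. */
public class NaiveBayesSketch {

    /** Maps each distinct string to an index, in order of first appearance (an assumption). */
    public static Map<String, Integer> buildIndex(List<String> values) {
        Map<String, Integer> index = new LinkedHashMap<>();
        for (String v : values) {
            index.putIfAbsent(v, index.size());
        }
        return index;
    }

    /**
     * Predicts a label index for x using categorical Naive Bayes with add-one smoothing.
     * features[i][j] is the indexed value of feature j in sample i; numValues[j] is the
     * number of distinct values of feature j.
     */
    public static int predict(
            int[][] features, int[] labels, int numLabels, int[] numValues, int[] x) {
        int best = -1;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (int label = 0; label < numLabels; label++) {
            int labelCount = 0;
            int[] matches = new int[x.length];
            for (int i = 0; i < labels.length; i++) {
                if (labels[i] != label) {
                    continue;
                }
                labelCount++;
                for (int j = 0; j < x.length; j++) {
                    if (features[i][j] == x[j]) {
                        matches[j]++;
                    }
                }
            }
            // Log prior plus the sum of log conditional probabilities, both smoothed.
            double score = Math.log((labelCount + 1.0) / (labels.length + numLabels));
            for (int j = 0; j < x.length; j++) {
                score += Math.log((matches[j] + 1.0) / (labelCount + numValues[j]));
            }
            if (score > bestScore) {
                bestScore = score;
                best = label;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<String> weather = Arrays.asList("sunny", "rainy", "sunny", "rainy");
        List<String> outcome = Arrays.asList("play", "stay", "play", "stay");
        Map<String, Integer> weatherIndex = buildIndex(weather);
        Map<String, Integer> outcomeIndex = buildIndex(outcome);

        // Convert the string columns into numeric arrays before training.
        int[][] features = new int[weather.size()][1];
        int[] labels = new int[outcome.size()];
        for (int i = 0; i < weather.size(); i++) {
            features[i][0] = weatherIndex.get(weather.get(i));
            labels[i] = outcomeIndex.get(outcome.get(i));
        }

        int[] query = {weatherIndex.get("sunny")};
        int predicted = predict(
                features, labels, outcomeIndex.size(), new int[] {weatherIndex.size()}, query);
        System.out.println("Predicted label index: " + predicted);
    }
}
```

The sketch recomputes counts on every call to keep it short; a real implementation would normally compute the counts once during training and store them in a model.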