yunfengzhou-hub opened a new pull request #70:
URL: https://github.com/apache/flink-ml/pull/70


   ## What is the purpose of the change
   This PR adds an Estimator and a Transformer for the Streaming KMeans operator.
   
   Compared with the existing KMeans operator, Streaming KMeans can train a 
KMeans model continuously from an unbounded stream of training data. The 
corresponding Model operator also supports updating its model data dynamically 
from a DataStream.
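   
   As a rough illustration of continuous training, the sketch below applies 
one mini-batch update with a decay factor, using a common "forgetful" rule: 
each centroid's accumulated weight is scaled down by the decay factor before 
the new batch is merged in. This is an assumption about the general technique, 
not the update rule implemented in this PR; the class `DecayedCentroidUpdate` 
and its method names are hypothetical.

```java
/** Hypothetical helper for illustration only; not a class from this PR. */
final class DecayedCentroidUpdate {

    /**
     * Applies one mini-batch update in place.
     *
     * @param centroids current centroids, one row per cluster
     * @param weights accumulated weight (effective point count) per cluster
     * @param batch points of the new mini-batch
     * @param decayFactor in [0, 1]; 1 keeps all history, 0 forgets it entirely
     */
    static void update(
            double[][] centroids, double[] weights, double[][] batch, double decayFactor) {
        int k = centroids.length;
        int dim = centroids[0].length;
        double[][] sums = new double[k][dim];
        int[] counts = new int[k];

        // Assign every point of the batch to its nearest centroid.
        for (double[] point : batch) {
            int nearest = nearestCentroid(centroids, point);
            counts[nearest]++;
            for (int d = 0; d < dim; d++) {
                sums[nearest][d] += point[d];
            }
        }

        // Merge the batch into each centroid, discounting the old weight.
        for (int i = 0; i < k; i++) {
            double decayedWeight = weights[i] * decayFactor;
            double newWeight = decayedWeight + counts[i];
            weights[i] = newWeight;
            if (newWeight > 0.0) {
                for (int d = 0; d < dim; d++) {
                    centroids[i][d] = (centroids[i][d] * decayedWeight + sums[i][d]) / newWeight;
                }
            }
        }
    }

    /** Returns the index of the centroid with the smallest squared Euclidean distance. */
    private static int nearestCentroid(double[][] centroids, double[] point) {
        int best = 0;
        double bestDist = Double.POSITIVE_INFINITY;
        for (int i = 0; i < centroids.length; i++) {
            double dist = 0.0;
            for (int d = 0; d < point.length; d++) {
                double diff = centroids[i][d] - point[d];
                dist += diff * diff;
            }
            if (dist < bestDist) {
                bestDist = dist;
                best = i;
            }
        }
        return best;
    }
}
```

   With a decay factor of 1 every past point counts as much as a new one; 
smaller values let the centroids follow a drifting stream more quickly.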
   
   This PR also adds the simple infrastructure needed to test online 
algorithms, which makes it possible to control the order in which training 
data and prediction data are consumed.
   
   ## Brief change log
   - Adds the `StreamingKMeans`, `StreamingKMeansModel` and 
`StreamingKMeansParams` classes to support the Streaming KMeans algorithm. Also 
adds the `StreamingKMeansTest` class to test these classes.
   - Adds `HasBatchStrategy` and `HasDecayFactor` interfaces to represent 
corresponding parameters for online algorithms.
   - Adds `MockBlockingQueueSinkFunction`, `MockBlockingQueueSourceFunction` 
and `TestBlockingQueueManager` to control the rate at which streams are 
consumed in test cases for online algorithms.
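   
   The idea of pacing streams with blocking queues can be sketched in plain 
Java, without any Flink API. In the sketch below, a test feeds one training 
value, blocks until the updated model is emitted, and only then feeds 
prediction data, so the order of consumption is deterministic. The class 
`BlockingQueueOrderingSketch` and its toy running-mean "model" are hypothetical 
and are not the `MockBlockingQueue*` classes added by this PR.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

/** Hypothetical illustration only; not part of this PR. */
final class BlockingQueueOrderingSketch {
    private final BlockingQueue<Double> trainIn = new LinkedBlockingQueue<>();
    private final BlockingQueue<Double> modelOut = new LinkedBlockingQueue<>();
    private final BlockingQueue<Double> predictIn = new LinkedBlockingQueue<>();
    private final BlockingQueue<Double> predictOut = new LinkedBlockingQueue<>();

    // Toy model: the running mean of all training values seen so far.
    private volatile double mean = 0.0;
    private long count = 0; // only touched by the training thread

    private interface Step {
        void run() throws InterruptedException;
    }

    /** Runs the given step forever on a daemon thread until interrupted. */
    private static void daemonLoop(Step step) {
        Thread t = new Thread(() -> {
            try {
                while (true) {
                    step.run();
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        t.setDaemon(true);
        t.start();
    }

    /** Starts one thread per "stream": training and prediction. */
    void start() {
        daemonLoop(() -> {
            double x = trainIn.take(); // blocks until the test feeds data
            count++;
            mean += (x - mean) / count;
            modelOut.put(mean); // signals that the model was updated
        });
        daemonLoop(() -> {
            double x = predictIn.take();
            predictOut.put(x - mean); // deviation from the current model
        });
    }

    /** Feeds one training value and blocks until the updated model is emitted. */
    double train(double x) {
        try {
            trainIn.put(x);
            return modelOut.take();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
    }

    /** Feeds one prediction value and blocks until its result is emitted. */
    double predict(double x) {
        try {
            predictIn.put(x);
            return predictOut.take();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
    }
}
```

   Because `train` returns only after the model update has been observed, a 
subsequent `predict` call is guaranteed to see that update.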
   
   ## Does this pull request potentially affect one of the following parts:
   - Dependencies (does it add or upgrade a dependency): (no)
   - The public API, i.e., is any changed class annotated with 
`@Public(Evolving)`: (no)
   - Does this pull request introduce a new feature? (yes)
   - If yes, how is the feature documented? (Java doc)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.


