yunfengzhou-hub opened a new pull request, #178:
URL: https://github.com/apache/flink-ml/pull/178

   ## What is the purpose of the change
   
   This PR adds python tests for the public APIs of Flink ML algorithms.
   
   ## Brief change log
   
   Adds test cases for the following APIs for all existing offline algorithms 
in Flink ML. The tests added in this PR are as follows.
   
   | Name of Algorithm         | setModelData() | getModelData()  | Save/load |
   | ------------------------- | -------------- | --------------- | --------- |
   | KNN                       | O              | O               | --        |
   | LinearSVC                 | O              | O               | O         |
   | LogisticRegression        | --             | --              | --        |
   | NaiveBayes                | O              | X (FLINK-30124) | --        |
   | AgglomerativeClustering   | -              | -               | --        |
   | KMeans                    | O              | X (FLINK-30122) | --        |
   | BinaryClassification      | -              | -               | --        |
   | Binarizer                 | -              | -               | --        |
   | Bucketizer                | -              | -               | --        |
   | DCT                       | -              | -               | --        |
   | ELementWiseProduct        | -              | -               | --        |
   | FeatureHasher             | -              | -               | --        |
   | HashingTF                 | -              | -               | --        |
   | IDF                       | O              | X (FLINK-29477) | --        |
   | Imputer                   | O              | X (FLINK-30124) | --        |
   | Interaction               | -              | -               | --        |
   | KbinsDiscretizer          | O              | X (FLINK-30122) | --        |
   | MaxAbsScaler              | O              | O               | O         |
   | MinMaxScaler              | O              | O               | O         |
   | NGram                     | -              | -               | --        |
   | Normalizer                | -              | -               | --        |
   | OneHotEncoder             | O              | O               | O         |
   | PolynomialExpansion       | -              | -               | --        |
   | RandomSplitter            | -              | -               | --        |
   | RegexTokenizer            | -              | -               | --        |
   | RobustScaler              | O              | O               | --        |
   | StandardScaler            | O              | O               | O         |
   | StringIndexer             | O              | X (FLINK-30122) | O         |
   | IndexToStringModel        | -              | X (FLINK-30122) | O         |
   | Tokenizer                 | -              | -               | --        |
   | VarianceThresholdSelector | O              | X (FLINK-29477) | --        |
   | VectorAssembler           | -              | -               | --        |
   | VectorIndexer             | O              | X (FLINK-30124) | --        |
   | VectorSlicer              | -              | -               | --        |
   | LinearRegression          | --             | --              | --        |
   | ChiSqTest                 | -              | -               | --        |
   
   The marks in the table above have the following meanings:
   
   - `-`: The algorithm does not need to test this API. For example, a 
`Transformer` or `AlgoOperator` do not need to test `getModelData()` and 
`setModelData()`.
   - `--`: The algorithm needs to test this API, and the test has already been 
added in previous commits.
   - `O`: The algorithm needs to test this API, and the test is added in this 
PR.
   - `X`: The algorithm needs to test this API, but this API fails with an 
exception. The exceptions have all been recorded by Jira tickets as in the 
parentheses.
   
   ## Does this pull request potentially affect one of the following parts:
   
     - Dependencies (does it add or upgrade a dependency): (no)
     - The public API, i.e., is any changed class annotated with 
`@Public(Evolving)`: (no)
   
   ## Documentation
   
     - Does this pull request introduce a new feature? (no)
     - If yes, how is the feature documented? (not applicable)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to