This is an automated email from the ASF dual-hosted git repository.
lindong pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/flink-ml.git
from f0c1c7a [FLINK-26801] Support duplicate checkpoint aborted messages
new ba77607 [FLINK-27877] Improve performance for StringIndexer
new 341df45 [FLINK-27877] Reduce the length of the operator chain for
generating input table
new 58fce03 [FLINK-27877] Add benchmark configuration for StringIndexer,
StandardScaler and Bucketizer
new 966cedd [FLINK-27877] Enable object reuse in flink-ml-bench
The 4 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
.../org/apache/flink/ml/benchmark/Benchmark.java | 1 +
.../common/DenseVectorArrayGenerator.java | 114 +++--------
.../datagenerator/common/DenseVectorGenerator.java | 103 +++-------
.../datagenerator/common/DoubleGenerator.java | 58 ++++++
.../datagenerator/common/InputTableGenerator.java | 66 +++++++
.../common/LabeledPointWithWeightGenerator.java | 134 ++++---------
.../common/RandomStringGenerator.java | 76 +++++++
.../datagenerator/common/RowGenerator.java | 77 ++++++++
...ns-benchmark.json => bucketizer-benchmark.json} | 41 ++--
...enchmark.json => standardscaler-benchmark.json} | 30 +--
...benchmark.json => stringindexer-benchmark.json} | 34 ++--
.../flink/ml/benchmark/DataGeneratorTest.java | 96 +++++++--
.../ml/feature/stringindexer/StringIndexer.java | 219 +++++++++++----------
13 files changed, 647 insertions(+), 402 deletions(-)
create mode 100644
flink-ml-benchmark/src/main/java/org/apache/flink/ml/benchmark/datagenerator/common/DoubleGenerator.java
create mode 100644
flink-ml-benchmark/src/main/java/org/apache/flink/ml/benchmark/datagenerator/common/InputTableGenerator.java
create mode 100644
flink-ml-benchmark/src/main/java/org/apache/flink/ml/benchmark/datagenerator/common/RandomStringGenerator.java
create mode 100644
flink-ml-benchmark/src/main/java/org/apache/flink/ml/benchmark/datagenerator/common/RowGenerator.java
copy flink-ml-benchmark/src/main/resources/{kmeans-benchmark.json =>
bucketizer-benchmark.json} (65%)
copy flink-ml-benchmark/src/main/resources/{kmeans-benchmark.json =>
standardscaler-benchmark.json} (73%)
copy flink-ml-benchmark/src/main/resources/{kmeans-benchmark.json =>
stringindexer-benchmark.json} (68%)