GitHub user smurakozi opened a pull request:
https://github.com/apache/spark/pull/20235
[SPARK-22887][ML][TESTS][WIP] ML test for StructuredStreaming: spark.ml.fpm
## What changes were proposed in this pull request?
Convert the FPGrowth tests so that they also exercise the code with Structured
Streaming, using the ML testing infrastructure implemented in SPARK-22882.
Note: this is a WIP. The test with Array[Byte] is not yet working because of
datatype issues (Array[Byte] vs. Binary).
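The Array[Byte] vs. Binary mismatch can be illustrated with a small sketch. It is not taken from the patch and may not be the exact cause of the failing test. It only shows, with the public Spark SQL API, that a column built from Array[Byte] values gets the type BinaryType, while a column built from Array[Int] values gets an ArrayType. The object name `ByteArraySchemaSketch` and its helper methods are hypothetical, and a local SparkSession is assumed.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.types.DataType

// Hypothetical helper object, for illustration only (not part of the PR).
object ByteArraySchemaSketch {

  // Type Spark infers for an "items" column built from Array[Int] values.
  def intItemsType(spark: SparkSession): DataType = {
    import spark.implicits._
    Seq(Array(1, 2), Array(1, 3)).toDF("items").schema("items").dataType
  }

  // Type Spark infers for an "items" column built from Array[Byte] values.
  // The built-in encoder maps Array[Byte] to BinaryType, not to an array type.
  def byteItemsType(spark: SparkSession): DataType = {
    import spark.implicits._
    Seq(Array[Byte](1, 2), Array[Byte](1, 3)).toDF("items").schema("items").dataType
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("byte-array-schema-sketch")
      .getOrCreate()
    try {
      println(s"Array[Int]  -> ${intItemsType(spark)}")
      println(s"Array[Byte] -> ${byteItemsType(spark)}")
    } finally {
      spark.stop()
    }
  }
}
```

Since FPGrowth expects its items column to be an array type, a test that builds its input from Array[Byte] would presumably need the column cast or constructed differently. That is an assumption about the fix, not something stated in this pull request.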
## How was this patch tested?
N/A
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/smurakozi/spark SPARK-22887
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20235.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20235
----
commit 331129556003bcf6e4bab6559e80e46ac0858706
Author: Sandor Murakozi <smurakozi@...>
Date: 2018-01-05T12:41:53Z
test 'FPGrowthModel setMinConfidence should affect rules generation and
transform' is converted to use testTransformer
commit 93aff2c999eee4a88f7f4a3c32d6c7b601a918ac
Author: Sandor Murakozi <smurakozi@...>
Date: 2018-01-08T13:14:38Z
Test 'FPGrowth fit and transform with different data types' works with
streaming, except for Byte
commit 8b0b00070a21bd47537a7c3ad580e2af38a481bd
Author: Sandor Murakozi <smurakozi@...>
Date: 2018-01-11T11:28:46Z
All tests use testTransformer.
Test with Array[Byte] is missing.
commit af61845ab6acfa82c4411bce3ab4a20afebd0aa3
Author: Sandor Murakozi <smurakozi@...>
Date: 2018-01-11T11:49:27Z
Unintentional changes in 93aff2c999 are reverted
----