Hi,
Some of the test suites in SystemDS use external data filesĀ that are
stored in the test package along with the test files. I have observed
that there are some test files that use the dataset residing in another
test package such as the Iris dataset is being used by gmm and
gmmPredict testsĀ and it is stored inside the transform test package. The
same is the case for the Salary dataset that is used by different test
files.
In my opinion, it would be effective if we store all datasets inside the
resource folder so that the existing datasets are available up-front and
could be reused instead of introducing a new dataset every now and then
and it will also simplify the referencing of the datasets across test
suites.
--br,
Shafaq Siddiqi
- Refactoring datasets in SystemDS Shafaq Siddiqi
-