ilovemesomeramen opened a new pull request #1203: URL: https://github.com/apache/systemds/pull/1203
This PR restructures the Transformation Encoders to operate on a per column basis. `Encoder` 's are now `ColumnEncoder`'s which are grouped in a `MultiColumnEncoder`. Since `EncoderOmit` and `EncoderMVImpute` are deprecated they remain unchanged and Legacy Support was added until they are fazed out in upcoming releases. The changes where also applied to the existing Spark and Federated implementations, although functional these changes where only retrofitted which means there is room for optimization in those implementations in the future. There is still a small issue regarding the LineageCache in a federated environment. Specifically the test: `testHomesRecodeDummycodeIDsCSVLineage` in `TransformFederatedEncodeApplyTest` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
