creating consistency is great, and very welcome. However, the concrete proposal is not ideal as it does not follow the common convention of capital letters for matrices, small letters for vectors and scalars. In papers you often see capital bold for matrices, small bold for vectors, and small regular for scalars. This convention really helps to quickly understand the characteristics of matrix-vector multiplications etc.

So I would recommend to change the algorithms like multiLogReg and l2svm where this was missed initially. Right now, multiLogReg takes a vector (should be y) and internally does one-hot encoding to get Y (reassigning one Y variable is easy to get missed but completely changes shapes and characteristics).

Regards,
Matthias

On 5/16/2022 6:43 PM, Janardhan Pulivarthi wrote:
Thanks a lot, for starting this much needed discussion.

Best Regards,
Janardhan

On Mon, May 16, 2022 at 8:04 PM Baunsgaard, Sebastian
<[email protected]> wrote:

Hi All,


I would like to suggest that we change all builtin functions to use capital 
letters for single letter arguments.

Functions currently use small and large letters a randomly,

and it is confusing when changing algorithm you have to sometimes change "Y" to 
"y".


examples that would break this:

https://github.com/apache/systemds/blob/main/scripts/builtin/arima.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/cvlm.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/deepWalk.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/getAccuracy.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/gridSearch.dml

(All Img builtin)

https://github.com/apache/systemds/blob/main/scripts/builtin/kmeans.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/lasso.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/lm.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/lmCG.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/lmDS.dml

(All outlier)

etc..


I would also like to suggest that we change all instances where we call X and Y 
for X_Train or Y_Train to just X and Y.


example that break:

https://github.com/apache/systemds/blob/main/scripts/builtin/hyperband.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/knn.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/lasso.dml

https://github.com/apache/systemds/blob/main/scripts/builtin/sherlock.dml



All verbose booleans should be changed to "verbose"


https://github.com/apache/systemds/blob/main/scripts/builtin/matrixProfile.dml



best regards

Sebastian

Reply via email to