[ https://issues.apache.org/jira/browse/MADLIB-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16762162#comment-16762162 ]
Rahul Iyer commented on MADLIB-1299: ------------------------------------ @ [~pratikpawar] By baseline reference, do you mean the reference for dummy encoding of a categorical independent variable? If yes, then MADlib expects that the data has been encoded before passing it to the regression methods. See [Encoding Categorical Variables|http://madlib.apache.org/docs/latest/group__grp__encode__categorical.html] for the procedure to dummy encode. The {{value_to_drop}} parameter provides the functionality for user-defined reference level. > Baseline referencing for independent variables > ---------------------------------------------- > > Key: MADLIB-1299 > URL: https://issues.apache.org/jira/browse/MADLIB-1299 > Project: Apache MADlib > Issue Type: Improvement > Components: Module: Linear Regression, Module: Logistic Regression, > Module: Multinomial Logistic Regression > Reporter: Pratik > Assignee: Rahul Iyer > Priority: Minor > > When we perform regression between dep vs independent variables currently > function does not support control over baseline reference for the independent > variable. > I am not sure if Madlib does it internally with 1.15.1 release. If that is > the case it would be awesome to have documentation and get the return value > of the baseline variable. > [https://stackoverflow.com/questions/54523454/linear-regression-baseline-referencing-madlib] > > R way to do it - > https://stackoverflow.com/questions/3872070/how-to-force-r-to-use-a-specified-factor-level-as-reference-in-a-regression?noredirect=1&lq=1 > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)