Jingyi Mei created MADLIB-1213:
----------------------------------

             Summary: Support logical expression for minibatch preprocessor
                 Key: MADLIB-1213
                 URL: https://issues.apache.org/jira/browse/MADLIB-1213
             Project: Apache MADlib
          Issue Type: Improvement
          Components: Module: Utilities
            Reporter: Jingyi Mei
             Fix For: v1.14


The minibatch preprocessor currently does not support all logical expressions 
for independent and dependent variables.
 # The independent varname does not support any logical expression.
 # The dependent varname supports logical expressions only on numerical columns. 
For example, 'length > 1' is a valid expression, but creating an alias for it 
is not supported.

This is the only form of expression that is currently supported:
{code}
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input', 
'minibatch_preprocessing_out',  'y > 10',  'x1,x2', 4);
{code}

Not supported (aliasing the expression, or comparing a non-numerical column):
{code}
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input', 
'minibatch_preprocessing_out',  'y > 10 as foo',  'x1,x2', 4);
{code}

{code}
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input', 
'minibatch_preprocessing_out',  'y=''F''',  'x1,x2', 4);
{code}
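
One possible workaround for the alias case, offered only as a sketch: compute 
the expression into a named column first and pass that column name as the 
dependent varname. The table name minibatch_preprocessing_input_foo is 
hypothetical, the input table is assumed to have columns x1, x2 and y as in the 
examples above, and this has not been verified against the preprocessor.
{code}
-- Hypothetical staging table: the logical expression becomes a named column.
CREATE TABLE minibatch_preprocessing_input_foo AS
SELECT x1, x2, (y > 10) AS foo
FROM minibatch_preprocessing_input;

-- Pass the plain column name instead of 'y > 10 as foo'.
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input_foo', 
'minibatch_preprocessing_out',  'foo',  'x1,x2', 4);
{code}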

Open Questions:
1. Do we need to support logical expressions for both the independent and 
dependent varname?
2. If yes, to what extent?
3. Should the user be allowed to create an alias for logical expressions?
4. Other modules might only partially support logical expressions. 
Should we find out which ones?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
