[ 
https://issues.apache.org/jira/browse/MADLIB-943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Frank McQuillan updated MADLIB-943:
-----------------------------------
    Description: 
Story

As a data scientist, I want to be able to define multiple symbols per row for 
pattern matching.

See
http://madlib.incubator.apache.org/docs/latest/group__grp__path.html
for a description of what a symbol is.

Currently in 1.9, a given row can only match one symbol. If a row matches 
multiple symbols, the symbol that comes first in the symbol definition list 
will take precedence.   This story is about all matching symbols on a row being 
used.

Acceptance

The attached data set and query should should produce the following output:



There are symbol matches for:
Gender=Female
Region=East
Household Size=3

  was:
Story

As a data scientist, I want to be able to define multiple symbols per row for 
pattern matching.

See
http://madlib.incubator.apache.org/docs/latest/group__grp__path.html
for a description of what a symbol is.

Currently in 1.9, a given row can only match one symbol. If a row matches 
multiple symbols, the symbol that comes first in the symbol definition list 
will take precedence.   This story is about all matching symbols on a row being 
used.

Acceptance

The attached data set and query should should produce the following output:

Event Timestamp User ID Age Group       Income Group    Gender  Region  
Household Size  Click Event     Purchase Event  Revenue Margin
4/16/12 5:59    102871  3       4       Female  Central 5       1       0       
0       0
4/16/12 9:35    102871  3       4       Female  Central 5       1       0       
0       0

There are symbol matches for:
Gender=Female
Region=Central
Click Event=1


> Path - multiple symbol matches per row
> --------------------------------------
>
>                 Key: MADLIB-943
>                 URL: https://issues.apache.org/jira/browse/MADLIB-943
>             Project: Apache MADlib
>          Issue Type: New Feature
>          Components: Module: Utilities
>            Reporter: Frank McQuillan
>             Fix For: v1.9.1
>
>         Attachments: Ecommerce data set for path test 3.csv, 
> path-multi-symbol-per-row.ipynb
>
>
> Story
> As a data scientist, I want to be able to define multiple symbols per row for 
> pattern matching.
> See
> http://madlib.incubator.apache.org/docs/latest/group__grp__path.html
> for a description of what a symbol is.
> Currently in 1.9, a given row can only match one symbol. If a row matches 
> multiple symbols, the symbol that comes first in the symbol definition list 
> will take precedence.   This story is about all matching symbols on a row 
> being used.
> Acceptance
> The attached data set and query should should produce the following output:
> There are symbol matches for:
> Gender=Female
> Region=East
> Household Size=3



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to