matches clause problem with chars that have special meaning in dk.brics - #, @ 
..
---------------------------------------------------------------------------------

                 Key: PIG-1770
                 URL: https://issues.apache.org/jira/browse/PIG-1770
             Project: Pig
          Issue Type: Bug
          Components: impl
    Affects Versions: 0.8.0
            Reporter: Thejas M Nair
            Assignee: Thejas M Nair
             Fix For: 0.8.0


When special chars #, @ , and the 'optional' patterns described here - 
http://www.brics.dk/automaton/doc/dk/brics/automaton/RegExp.html#RegExp%28java.lang.String%29
 are used , the regex match fails to work. 

This is related to  PIG-965.

Example and workaround are as follows -

{code}
grunt> cat t.txt                           
asd#asdf
zxcasdf
2#asdf

grunt> l = load 't.txt' as (a : chararray);
grunt> f = filter l by (a matches '.*#.*');
grunt> dump f; 
-- No output, though two rows are expected.

--As a workaround, add a \ to escape the # . This regex is valid even in 0.7 , 
and it will be even after this bug is fixed (its valid java regex, which has 
same meaning as above regex).
grunt> f = filter l by (a matches '.*\\#.*');
grunt> dump f; 
asd#asdf
2#asdf
{code}


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to