Saad Patel created PIG-3619:
-------------------------------

             Summary: Provide XPath function
                 Key: PIG-3619
                 URL: https://issues.apache.org/jira/browse/PIG-3619
             Project: Pig
          Issue Type: Improvement
          Components: piggybank
            Reporter: Saad Patel


Xml is often loaded using XMLLoader with a record boundary tag as one of the 
parameters. A common use case is to then extract data from those records. XPath 
would allow those extractions to be done very easily. I'm  proposing a patch 
that adds simple XPath support as a UDF.

Example usage of this the XPath UDF would be:

{code}
extractions = FOREACH xmlrecords GENERATE XPath(record, 'book/author'), 
XPath(record, 'book/title');
{code}



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

Reply via email to