John Cavanaugh created PIG-4355:
-----------------------------------
Summary: Piggybank: XPath cant handle namespace in xpath, nor can
it return more than one match
Key: PIG-4355
URL: https://issues.apache.org/jira/browse/PIG-4355
Project: Pig
Issue Type: Bug
Components: piggybank
Affects Versions: 0.14.0
Reporter: John Cavanaugh
If you pass an xpath that contains a namespace the XPath UDF will always fail
to match.
It would be better to either silently remove the namespace or provide a
parameter that will remove it.
The reason it is desirable to ignore xpath's with namespaces is that many xml
tools when selecting an xpath provide the namespace. It makes cutting &
pasting into a pig script painful if you need to manually remove it.
Additionally XPath only returns the *first* match. It is often desirable to
return all matches and allow for a flattening to process multiple records. An
XPathAll would be useful to have.
A patch is available as a git pullrequest at
https://github.com/apache/pig/pull/14
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)