Hi

Does anyone out there store and process XML data with Hadoop?  I've seen 
some threads about XML RecordReaders and people who use 
StreamXmlRecordReader to do splits.  But has anyone implemented a query 
framework that uses the Hadoop layer to query the XML from within their 
map/reduce jobs?

Specifically, has anyone executed an XQuery or XPath expression within a 
Hadoop job to find something in XML stored in Hadoop?
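
To make the question concrete, this is roughly the per-record step I have in 
mind.  It is only a sketch: it uses the JDK's javax.xml.xpath API and no 
Hadoop classes, the class and method names are made up for illustration, and 
it assumes each record arrives as one complete, well-formed XML fragment 
(for example, from an XML record reader).

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: the work a map() call might do for one XML record.
class XPathRecordSketch {

    // Evaluates an XPath expression against a single XML record and
    // returns the text content of every matching node.
    static List<String> evaluate(String xmlRecord, String expression)
            throws XPathExpressionException {
        XPath xpath = XPathFactory.newInstance().newXPath();
        NodeList nodes = (NodeList) xpath.evaluate(
                expression,
                new InputSource(new StringReader(xmlRecord)),
                XPathConstants.NODESET);

        List<String> matches = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            matches.add(nodes.item(i).getTextContent());
        }
        return matches;
    }

    public static void main(String[] args) throws Exception {
        // Assumed sample record; in a real job this would come from the split.
        String record = "<book><title>A</title><title>B</title></book>";
        for (String match : evaluate(record, "/book/title")) {
            System.out.println(match);
        }
    }
}
```

In a real job I would expect something like this to run inside the mapper, 
emitting each match as an output value, but I don't know whether that is how 
people actually do it.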

I can't find any samples, or anyone else out there who uses XML data rather 
than traditional log text data.

Are there any use cases for using Hadoop to work with XML and then query 
that XML in a distributed manner?

Thanks.



      
