None that I know of-- I had to do the same thing to parse some XML data in a couple of chapters of the Spark book we were writing. Would obviously love to have that in crunch-core.
J On Sun, Jan 25, 2015 at 5:20 AM, Christian Tzolov < christian.tzo...@gmail.com> wrote: > Hi there, > > Recently I had to ingest some Xml formatted data. I couldn't find related > topic in the mailing lists so i've implemented a Crunch XmlSource ( > https://github.com/tzolov/crunch-xmlsource) reusing the Mahout's > XmlInputFormat/XmlRecordReader implementations. > > Are there any alternative approaches? > > Apologies if this topic has been discussed already! > > Cheers, > Chris > -- Director of Data Science Cloudera <http://www.cloudera.com> Twitter: @josh_wills <http://twitter.com/josh_wills>