Hi there, Recently I had to ingest some Xml formatted data. I couldn't find related topic in the mailing lists so i've implemented a Crunch XmlSource ( https://github.com/tzolov/crunch-xmlsource) reusing the Mahout's XmlInputFormat/XmlRecordReader implementations.
Are there any alternative approaches? Apologies if this topic has been discussed already! Cheers, Chris