[CODE4LIB] convert MODS XML into CSV or tab-delimted text

2014-04-22 Thread English, Eben
Hello, Does anyone out there have an XSL stylesheet to transform MODS XML into a CSV or tab-delimited text file? Even if it's highly localized to your own institution/project, it would probably still be useful. Thanks in advance, Eben English Web Services Developer Boston Public Library 700

Re: [CODE4LIB] convert MODS XML into CSV or tab-delimted text

2014-04-22 Thread Kyle Banerjee
Given that you'll most likely have to deal with elements that are missing and/or repeat variable amounts of times, conditional mappings, and data that needs to be transformed, it may be easier to use a string parsing routine to do what you need. kyle On Tue, Apr 22, 2014 at 11:35 AM, English,

Re: [CODE4LIB] convert MODS XML into CSV or tab-delimted text

2014-04-22 Thread Cary Gordon
The easiest approach to converting XML to flat formats is to use a tool like Oxygen or Altova MapForce. Unfortunately, MapForce Enterprise — the version you would need — runs around $1k and Oxygen XML Editor is also pricey, although you might qualify for a discount. You can also open XML files

Re: [CODE4LIB] convert MODS XML into CSV or tab-delimted text

2014-04-22 Thread John Palmer
Try XML Spy. It has the capability to export http://manual.altova.com/XMLSpy/spyenterprise/index.html?exporttotextfiles.htmand should also work for XSL. On Tue, Apr 22, 2014 at 1:37 PM, Cary Gordon listu...@chillco.com wrote: The easiest approach to converting XML to flat formats is to use a

Re: [CODE4LIB] convert MODS XML into CSV or tab-delimted text

2014-04-22 Thread Simon Brown
There is also the XML package for R with tools for parsing XML which are all completely free. It even has a function called xmlToDataFrame to convert an XML document to a data frame which can then be written to .csv This function can be used to extract data from an XML document (or sub-document)

Re: [CODE4LIB] convert MODS XML into CSV or tab-delimted text

2014-04-22 Thread Bryan Baldus
On Tuesday, April 22, 2014 1:36 PM, Eben English wrote: Does anyone out there have an XSL stylesheet to transform MODS XML into a CSV or tab-delimited text file? Even if it's highly localized to your own institution/project, it would probably still be useful. I'm not sure how well it would

Re: [CODE4LIB] convert MODS XML into CSV or tab-delimted text

2014-04-22 Thread davesgonechina
LoC has XSLT stylesheets to convert MODS to DC, HTML, and MARCXML. http://www.loc.gov/standards/mods/mods-conversions.html There are also XML to CSV XSLT scripts out here, and there's this app which I tested on a MODS 3.0 record and it didn't look too bad: