I have a challenge that I would like some feedback on.
I will regularly download a series of data files from the web, all in XML
format. The format of each file is known in advance but differs from file to
file. I then plan to extract selected data items ("elements"?) from each data
file, apply some light formatting and save the desired parts of each original
file as a formatted CSV file for later import into a database.
Since the plan is to use a bash shell script that fetches the files with curl,
I have begun looking at external XML parsers that I can call from my script.
Ideally I would specify which elements I want, get the data back in some kind
of bash data structure, and finally format and save it as CSV files.
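
To make the intended flow concrete, here is a rough sketch of the kind of
helper I imagine calling from the bash script. It is only an illustration and
rests on several assumptions: python3 is installed (not the default on
CentOS 7), and the XML layout, the record path "./item", the field names "id"
and "name", and the function name xml_to_csv are all made up for the example.
It uses only the Python standard library; a dedicated command-line XML tool
may well be a better fit.

```python
#!/usr/bin/env python3
"""Hypothetical helper: read XML on stdin, write selected fields as CSV."""
import csv
import io
import sys
import xml.etree.ElementTree as ET

# Assumed layout, purely for illustration:
#   <items><item><id>1</id><name>Widget</name></item>...</items>
RECORD_PATH = "./item"   # hypothetical path to each record element
FIELDS = ["id", "name"]  # hypothetical child elements to export


def xml_to_csv(xml_data, record_path=RECORD_PATH, fields=FIELDS):
    """Return CSV text with a header row and one row per matching record."""
    root = ET.fromstring(xml_data)
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(fields)
    for record in root.findall(record_path):
        # A missing or empty child element becomes an empty CSV field;
        # stripping surrounding whitespace stands in for "light formatting".
        row = [(record.findtext(name, default="") or "").strip()
               for name in fields]
        writer.writerow(row)
    return out.getvalue()


if __name__ == "__main__":
    # Read raw bytes so the parser can honour the encoding declared in the XML.
    sys.stdout.write(xml_to_csv(sys.stdin.buffer.read()))
```

From the bash script this could be called as, for example,
curl -sS "$url" | python3 xml2csv.py > out.csv
where $url and the file name xml2csv.py are placeholders. The csv module takes
care of quoting fields that contain commas or quotes, which I would rather not
do by hand in bash.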
There seem to be a number of XML parsers available, but perhaps someone on the
list can recommend one that would suit my needs best? I should add that I am
running CentOS 7.
Thank you.
_______________________________________________
CentOS mailing list
[email protected]
https://lists.centos.org/mailman/listinfo/centos