I have a challenge I am interested in getting feedback on.

I will on a regular basis download a series of data files from the web where 
the data is in XML-format. The format is known in advance but is different 
between the various data files. I then plan to extract the various data items 
("elements?") from each data file, do some light formatting and then save 
desired parts of each original data file as a formatted CSV-file for later 
importing into a database.

As the plan is to use a bash shell script using curl to get the files, I have 
begun looking at external XML parsers that I can call from my script, perhaps 
specify which elements I want, get the data back in some kind of bash data 
structure and finally format and save as CSV-files.

There seems to be a number of XML parsers available but perhaps someone on the 
list has a recommendation for which one might suit my needs best? I should add 
that I am running CentOS 7.

Thank you.

_______________________________________________
CentOS mailing list
CentOS@centos.org
https://lists.centos.org/mailman/listinfo/centos

Reply via email to