Re: [CODE4LIB] TIFF Metadata to XML?
Also, see FITS (http://code.google.com/p/fits/).

FITS is an open source Java toolset we wrote that wraps JHOVE, ExifTool, and several other format analysis tools and produces a single XML output stream. It also includes a crosswalk to MIX XML as an optional output.

Date: Mon, 18 Jul 2011 10:40:04 -0400
From: Edward M. Corrado ecorr...@ecorrado.us
Subject: Re: TIFF Metadata to XML?

Thanks for all the suggestions. I now have multiple ways to get an XML file... now I only need to figure out which fields map to what.

Edward

On Mon, Jul 18, 2011 at 9:57 AM, Dave Rice d...@avpreserve.com wrote:

Try exiftool with the -X flag to get RDF XML output.

Dave Rice
avpreserve.com

On Jul 18, 2011, at 9:18 AM, Edward M. Corrado wrote:

Hello All,

Before I re-invent the wheel or try many different programs, does anyone have a suggestion on a good way to extract embedded metadata added by cameras and (more importantly) photo-editing programs such as Photoshop from TIFF files and save it as XML? I have 60k photos that have metadata including keywords, descriptions, creator, and other fields embedded in them, and I need to extract the metadata so I can load them into our digital archive.

Right now, after looking at a few tools and doing a number of Google searches, I haven't found anything that seems to do what I want. As of now I am leaning towards extracting the metadata using exiv2 and creating a script (shell, Perl, whatever) to put the fields I need into a pseudo-Dublin Core XML format. I say pseudo because I have a few fields that are not Dublin Core. I am assuming there is a better way. (Although part of me thinks it might be easier to do that than exporting to XML and using XSLT to transform the file, since I might need to do a lot of cleanup of the data regardless.)

Anyway, before I go any further, does anyone have any thoughts/ideas/suggestions?

Edward

-- End of CODE4LIB Digest - 15 Jul 2011 to 18 Jul 2011 (#2011-171) ***
Re: [CODE4LIB] TIFF Metadata to XML?
On Jul 19, 2011, at 10:34 AM, Stern, Randall wrote:

Also, see FITS (http://code.google.com/p/fits/). FITS is an open source Java toolset we wrote that wraps JHOVE, ExifTool, and several other format analysis tools and produces a single XML output stream. It also includes a crosswalk to MIX XML as an optional output.

Really? You named a tool that deals with image data 'FITS'?

You do realize there's actually a 30+ year old image standard called FITS: http://fits.gsfc.nasa.gov/ (which has its own metadata standard, just to make things even more interesting)

-Joe
Re: [CODE4LIB] TIFF Metadata to XML?
On Jul 19, 2011, at 11:03 AM, Joe Hourcle wrote:

Really? You named a tool that deals with image data 'FITS'? You do realize there's actually a 30+ year old image standard called FITS: http://fits.gsfc.nasa.gov/ (which has its own metadata standard, just to make things even more interesting)

This appears to be a known issue in their tracker: http://code.google.com/p/fits/issues/detail?id=10. Medium priority.

Also, it's worth comparing FITS' output to the output of exiftool -X. IIRC FITS uses a low level of verbosity with their ExifTool integration, though this may not be noticeable with some formats.

Dave Rice
avpreserve.com
Re: [CODE4LIB] TIFF Metadata to XML?
Edward,

JHOVE (1) should be able to do this, and I believe you can pass the included shell script a directory and have it extract data for everything it finds and can parse inside.

-Jon
Re: [CODE4LIB] TIFF Metadata to XML?
JHOVE2 (www.jhove2.org) will work as well.

Sheila

From: Code for Libraries [CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jon Stroop [jstr...@princeton.edu]
Sent: Monday, July 18, 2011 9:23 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] TIFF Metadata to XML?

Edward,

JHOVE (1) should be able to do this, and I believe you can pass the included shell script a directory and have it extract data for everything it finds and can parse inside.

-Jon
Re: [CODE4LIB] TIFF Metadata to XML?
Hello, Jon --

Should have added -- thanks for the pointer to JHOVE/JHOVE2 -- There are still some modules in JHOVE for which there is not yet one in JHOVE2 (though coming to a Bitbucket repository near you soon!!) and vice versa -- but for TIFF -- folks might prefer using the later code.

Best,
Sheila

From: Jon Stroop [jstr...@princeton.edu]
Sent: Monday, July 18, 2011 9:41 AM
To: Sheila M. Morrissey
Subject: Re: [CODE4LIB] TIFF Metadata to XML?

Oops! I wasn't trying to specify a version of JHOVE, I meant to add a footnote with a link and forgot. For what it's worth, I was going to link to JHOVE2 :-) .

Hope all is well with you,
Jon

On 07/18/2011 09:36 AM, Sheila M. Morrissey wrote:

JHOVE2 (www.jhove2.org) will work as well.

Sheila
Re: [CODE4LIB] TIFF Metadata to XML?
Exiftool [1] and trusty ImageMagick [2] will work. With ImageMagick it is as easy as:

    convert image.tiff image.xmp

Members of the Visual Resources Association (VRA) have been working on/with embedded metadata for a few years now. There may be something more to glean from the working group's wiki [3].

Cordially,
Kevin

[1] http://www.sno.phy.queensu.ca/~phil/exiftool/
[2] http://www.imagemagick.org/script/index.php
[3] http://metadatadeluxe.pbworks.com/w/page/20792238/FrontPage
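A note on the ImageMagick route above: the .xmp file it writes should be an XMP packet, which is RDF/XML, so any Dublin Core fields in it can be read with a standard XML parser. What follows is a minimal Python sketch, not a tested pipeline. The sample packet is hand-written for illustration (it is not output from a real file), and read_dc_fields is a hypothetical helper name; only the rdf and dc namespace URIs are standard.

```python
import xml.etree.ElementTree as ET

# Standard namespace URIs used inside XMP packets.
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"

# Hand-written sample packet (an assumption, not real tool output).
SAMPLE_XMP = """<x:xmpmeta xmlns:x="adobe:ns:meta/">
 <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
  <rdf:Description rdf:about="" xmlns:dc="http://purl.org/dc/elements/1.1/">
   <dc:creator><rdf:Seq><rdf:li>Jane Photographer</rdf:li></rdf:Seq></dc:creator>
   <dc:subject><rdf:Bag><rdf:li>campus</rdf:li><rdf:li>library</rdf:li></rdf:Bag></dc:subject>
   <dc:description><rdf:Alt><rdf:li xml:lang="x-default">Front entrance</rdf:li></rdf:Alt></dc:description>
  </rdf:Description>
 </rdf:RDF>
</x:xmpmeta>"""


def read_dc_fields(xmp_text):
    """Return {dc element name: [values]} for dc:* child elements in an XMP packet."""
    root = ET.fromstring(xmp_text)
    fields = {}
    for desc in root.iter("{%s}Description" % RDF_NS):
        for child in desc:
            # Skip everything that is not in the Dublin Core namespace.
            if not child.tag.startswith("{%s}" % DC_NS):
                continue
            name = child.tag.split("}", 1)[1]
            # Multi-valued fields are wrapped in rdf:Seq / rdf:Bag / rdf:Alt lists.
            values = [li.text.strip()
                      for li in child.iter("{%s}li" % RDF_NS)
                      if li.text and li.text.strip()]
            # Fall back to plain text content for simple (unwrapped) values.
            if not values and child.text and child.text.strip():
                values = [child.text.strip()]
            fields.setdefault(name, []).extend(values)
    return fields


if __name__ == "__main__":
    print(read_dc_fields(SAMPLE_XMP))
```

This only looks at child elements; XMP also allows simple properties to be written as attributes on rdf:Description, and fields outside the dc namespace (IPTC, Photoshop) are ignored here, so real files would likely need more handling.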
Re: [CODE4LIB] TIFF Metadata to XML?
I haven't (yet) used it myself, but Exiv2 ( http://www.exiv2.org ) supports reading and writing XMP, EXIF and IPTC metadata from a large number of file formats.

-Joe
Re: [CODE4LIB] TIFF Metadata to XML?
Try exiftool with the -X flag to get RDF XML output.

Dave Rice
avpreserve.com
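For a collection the size mentioned in the original question, the -X suggestion could be wrapped in a small batch script. This is a rough Python sketch under several assumptions: exiftool is installed and on the PATH, the helper names (find_tiffs, exiftool_command, sidecar_path, export_all) are hypothetical, and the one-XML-file-per-image naming is just one possible layout. Only the -X flag itself comes from this thread.

```python
import subprocess
from pathlib import Path


def find_tiffs(root):
    """Return all .tif/.tiff files under root, sorted for a reproducible order."""
    exts = {".tif", ".tiff"}
    return sorted(p for p in Path(root).rglob("*")
                  if p.is_file() and p.suffix.lower() in exts)


def exiftool_command(tiff_path):
    """Build the command line: exiftool -X writes RDF/XML to standard output."""
    return ["exiftool", "-X", str(tiff_path)]


def sidecar_path(tiff_path, out_dir):
    """Hypothetical naming scheme: photo.tif -> <out_dir>/photo.xml."""
    return Path(out_dir) / (Path(tiff_path).stem + ".xml")


def export_all(root, out_dir):
    """Run exiftool once per TIFF and save its XML output next to a matching name."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    for tiff in find_tiffs(root):
        result = subprocess.run(exiftool_command(tiff),
                                capture_output=True, check=True)
        sidecar_path(tiff, out_dir).write_bytes(result.stdout)
```

Two caveats with this layout: files with the same name in different subdirectories would overwrite each other's output, and starting one process per image will be slow for tens of thousands of files, so it may be worth checking whether the tool can be run over a whole directory instead.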
Re: [CODE4LIB] TIFF Metadata to XML?
Thanks for all the suggestions. I now have multiple ways to get an XML file... now I only need to figure out which fields map to what.

Edward

On Mon, Jul 18, 2011 at 9:57 AM, Dave Rice d...@avpreserve.com wrote:

Try exiftool with the -X flag to get RDF XML output.

Dave Rice
avpreserve.com
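On the "which fields map to what" question, one way to keep the mapping explicit is a small crosswalk table that sends each extracted tag either to a Dublin Core element or to a local element for the fields that are not Dublin Core. The Python sketch below is only an illustration: the tag names on the left of CROSSWALK, the choice of targets, the local namespace URI, and the record wrapper element are all assumptions, not a recommended mapping.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"
LOCAL_NS = "http://example.org/local-fields/"  # hypothetical namespace for non-DC fields

# Illustrative crosswalk: extracted tag name -> (namespace, element name).
CROSSWALK = {
    "Creator": (DC_NS, "creator"),
    "Artist": (DC_NS, "creator"),
    "Subject": (DC_NS, "subject"),
    "Keywords": (DC_NS, "subject"),
    "Description": (DC_NS, "description"),
    "ImageDescription": (DC_NS, "description"),
    "Software": (LOCAL_NS, "software"),
}


def to_pseudo_dc(extracted):
    """Build a pseudo-Dublin Core record from {tag name: [values]}.

    Unmapped tags are skipped, and a value that reaches the same target
    element twice (for example from both Artist and Creator) is kept once.
    """
    ET.register_namespace("dc", DC_NS)
    ET.register_namespace("local", LOCAL_NS)
    record = ET.Element("record")
    seen = set()
    for tag, values in extracted.items():
        target = CROSSWALK.get(tag)
        if target is None:
            continue
        ns, name = target
        for value in values:
            value = value.strip()
            key = (ns, name, value)
            if not value or key in seen:
                continue
            seen.add(key)
            ET.SubElement(record, "{%s}%s" % (ns, name)).text = value
    return record


if __name__ == "__main__":
    sample = {"Artist": ["Jane"], "Keywords": ["campus", "library"]}
    print(ET.tostring(to_pseudo_dc(sample), encoding="unicode"))
```

Keeping the table separate from the code means the data cleanup can happen in one place, and the mapping can be revised without touching the extraction step.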
Re: [CODE4LIB] TIFF Metadata to XML?
Guess it depends on whether they actually followed any kind of standard in encoding the data in the TIFF files.

http://www.metadataworkinggroup.com/pdf/mwg_guidance.pdf

Genny Engel
Internet Librarian
Sonoma County Library
gen...@sonoma.lib.ca.us
www.sonomalibrary.org
707 545-0831 x581

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Edward M. Corrado
Sent: Monday, July 18, 2011 7:40 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] TIFF Metadata to XML?

Thanks for all the suggestions. I now have multiple ways to get an XML file... now I only need to figure out which fields map to what.

Edward