Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-19 Thread Stern, Randall
Also, see FITS (http://code.google.com/p/fits/)

FITS is an open source Java toolset we wrote that wraps JHOVE, ExifTool,
and several other format analysis tools and produces a single XML output
stream. It also includes a crosswalk to MIX XML as an optional output.

Date:Mon, 18 Jul 2011 10:40:04 -0400
From:Edward M. Corrado ecorr...@ecorrado.us
Subject: Re: TIFF Metadata to XML?

Thanks for all the suggestions. I now have multiple ways to get an XML
file... now I only need to figure out which fields map to what.

Edward

On Mon, Jul 18, 2011 at 9:57 AM, Dave Rice d...@avpreserve.com wrote:
 Try exiftool with the -X flag to get RDF XML output.
 Dave Rice
 avpreserve.com

 On Jul 18, 2011, at 9:18 AM, Edward M. Corrado wrote:

 Hello All,

 Before I re-invent the wheel or try many different programs, does
 anyone have a suggestion on a good way to extract embedded metadata
 added by cameras and (more importantly) photo-editing programs such
 as Photoshop from TIFF files and save it as XML? I have 60k
 photos that have metadata including keywords, descriptions, creator,
 and other fields embedded in them, and I need to extract the metadata
 so I can load them into our digital archive.

 Right now, after looking at a few tools and doing a number of
 Google searches, I haven't found anything that seems to do what I
 want. As of now I am leaning towards extracting the metadata using
 exiv2 and creating a script (shell, perl, whatever) to put the fields
 I need into a pseudo-Dublin Core XML format. I say pseudo because I
 have a few fields that are not Dublin Core. I am assuming there is a
 better way. (Although part of me thinks it might be easier to do that
 than exporting to XML and using XSLT to transform the file, since I
 might need to do a lot of cleanup of the data regardless.)

 Anyway, before I go any further, does anyone have any 
 thoughts/ideas/suggestions?

 Edward




Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-19 Thread Joe Hourcle
On Jul 19, 2011, at 10:34 AM, Stern, Randall wrote:

 Also, see FITS (http://code.google.com/p/fits/)
 
 FITS is an open source java toolset we wrote that wraps JHOVE, ExifTool,
 and several other format analysis tools and produces a single XML output
 stream. It also includes a crosswalk to MIX XML as an optional output.


Really?  You named a tool that deals with image data 'FITS' ?

You do realize there's actually a 30+ year old image standard called FITS:

http://fits.gsfc.nasa.gov/

(which has its own metadata standard, just to make things even more 
interesting)

-Joe


Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-19 Thread Dave Rice
On Jul 19, 2011, at 11:03 AM, Joe Hourcle wrote:

 Really?  You named a tool that deals with image data 'FITS' ?
 
 You do realize there's actually a 30+ year old image standard called FITS:
 
   http://fits.gsfc.nasa.gov/
 
 (which has its own metadata standard, just to make things even more 
 interesting)

This appears to be a known issue in their tracker:
http://code.google.com/p/fits/issues/detail?id=10. Medium priority.

Also, it's worth comparing FITS's output to the output of exiftool -X. IIRC
FITS uses a low level of verbosity with their ExifTool integration, though this
may not be noticeable with some formats.

Dave Rice
avpreserve.com


Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-18 Thread Jon Stroop

Edward,
JHOVE (1)  should be able to do this, and I believe you can pass the 
included shell script a directory and have it extract data for 
everything it finds and can parse inside.

-Jon



Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-18 Thread Sheila M. Morrissey
JHOVE2 (www.jhove2.org) will work as well.
Sheila




Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-18 Thread Sheila M. Morrissey
Hello, Jon --
Should have added -- thanks for the pointer to JHOVE/JHOVE2 --
There are still some modules in JHOVE for which there is not yet one in JHOVE2
(though coming to a Bitbucket repository near you soon!!) and vice versa -- but
for TIFF -- folks might prefer using the later code.
Best,
Sheila

From: Jon Stroop [jstr...@princeton.edu]
Sent: Monday, July 18, 2011 9:41 AM
To: Sheila M. Morrissey
Subject: Re: [CODE4LIB] TIFF Metadata to XML?
Oops!  I wasn't trying to specify a version of JHOVE, I meant to add a
footnote with a link and forgot. For what it's worth, I was going to
link to JHOVE2 :-) .
Hope all is well with you,
Jon


Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-18 Thread Ford, Kevin
Exiftool [1] and trusty ImageMagick [2] will work.  With ImageMagick it is as 
easy as:

convert image.tiff image.xmp

Members of the Visual Resources Association (VRA) have been working on/with 
embedded metadata for a few years now.  There may be something more to glean 
from the working group's wiki [3].

Cordially,

Kevin


[1] http://www.sno.phy.queensu.ca/~phil/exiftool/
[2] http://www.imagemagick.org/script/index.php
[3] http://metadatadeluxe.pbworks.com/w/page/20792238/FrontPage
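
An XMP sidecar such as the one ImageMagick writes is itself RDF/XML, and XMP already uses the Dublin Core namespace for fields like creator, subject and description. The following is a rough Python sketch of pulling those values out, assuming the packet parses as well-formed XML. The function name and the sample packet are invented for illustration; real packets may carry nested structures and other namespaces (photoshop:, Iptc4xmpCore:) that this does not handle.

```python
import xml.etree.ElementTree as ET

RDF = "{http://www.w3.org/1999/02/22-rdf-syntax-ns#}"
DC = "{http://purl.org/dc/elements/1.1/}"


def dublin_core_from_xmp(xmp_text):
    """Collect dc:* properties from an XMP packet as {name: [values]}."""
    root = ET.fromstring(xmp_text)
    found = {}
    for description in root.iter(RDF + "Description"):
        # Simple properties may be serialized as attributes on rdf:Description.
        for attr, value in description.attrib.items():
            if attr.startswith(DC):
                found.setdefault(attr[len(DC):], []).append(value)
        for prop in description:
            if not prop.tag.startswith(DC):
                continue  # non-Dublin Core properties need their own handling
            # Most dc values are rdf:Bag, rdf:Seq or rdf:Alt lists of rdf:li.
            values = [li.text for li in prop.iter(RDF + "li") if li.text]
            if not values and prop.text and prop.text.strip():
                values = [prop.text.strip()]
            if values:
                found.setdefault(prop.tag[len(DC):], []).extend(values)
    return found


# Hand-written sample packet (hypothetical content, real XMP namespaces).
SAMPLE_XMP = """<x:xmpmeta xmlns:x="adobe:ns:meta/">
 <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
  <rdf:Description rdf:about=""
      xmlns:dc="http://purl.org/dc/elements/1.1/" dc:format="image/tiff">
   <dc:creator><rdf:Seq><rdf:li>Jane Doe</rdf:li></rdf:Seq></dc:creator>
   <dc:subject><rdf:Bag><rdf:li>campus</rdf:li><rdf:li>library</rdf:li></rdf:Bag></dc:subject>
   <dc:description><rdf:Alt><rdf:li xml:lang="x-default">Front entrance</rdf:li></rdf:Alt></dc:description>
  </rdf:Description>
 </rdf:RDF>
</x:xmpmeta>"""

if __name__ == "__main__":
    print(dublin_core_from_xmp(SAMPLE_XMP))
```

Fields that live only in EXIF or IPTC blocks, rather than in the XMP packet, would not show up this way.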
 



Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-18 Thread Joe Hourcle

I haven't (yet) used it myself, but Exiv2 ( http://www.exiv2.org )
supports reading and writing XMP, EXIF and IPTC metadata from
a large number of file formats.

-Joe


Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-18 Thread Dave Rice
Try exiftool with the -X flag to get RDF XML output.
Dave Rice
avpreserve.com
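
For a large batch of files, the exiftool suggestion above could be scripted roughly as follows. This is a minimal sketch, not a tested workflow: the function names and the "photos" and "metadata.xml" names are made up for illustration, and it assumes exiftool is installed and on the PATH. The flags used are -X (RDF/XML output), -r (recurse into subdirectories) and -ext (restrict by file extension).

```python
import subprocess
from pathlib import Path


def build_exiftool_command(root, extensions=("tif", "tiff")):
    """Return the argv list for a recursive `exiftool -X` run over root."""
    command = ["exiftool", "-X", "-r"]
    for ext in extensions:
        # -ext limits processing to files with the given extension
        command += ["-ext", ext]
    command.append(str(root))
    return command


def dump_metadata(root, output_file):
    """Run exiftool and save its RDF/XML output to output_file."""
    result = subprocess.run(
        build_exiftool_command(root),
        capture_output=True,
        check=False,  # exiftool can exit non-zero when only some files fail
    )
    Path(output_file).write_bytes(result.stdout)
    return result.returncode


if __name__ == "__main__":
    # "photos" is a placeholder directory name.
    print(" ".join(build_exiftool_command("photos")))
```

Calling dump_metadata("photos", "metadata.xml") would then leave a single XML file to inspect or transform.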



Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-18 Thread Edward M. Corrado
Thanks for all the suggestions. I now have multiple ways to get an
XML file... now I only need to figure out which fields map to what.

Edward
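
One possible way to approach the field-mapping question is a small crosswalk table applied to the exiftool -X output. The Python sketch below is only an illustration under stated assumptions: the mapping is a guess at a reasonable starting point, not an agreed standard; the sample XML is hand-written in the general shape of exiftool's RDF/XML, with placeholder namespace URIs; and matching is done on the tag's local name only, so the actual namespace URIs do not matter.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

# Illustrative crosswalk only: ExifTool tag name -> Dublin Core element.
CROSSWALK = {
    "Title": "title",
    "Creator": "creator",
    "Artist": "creator",
    "Subject": "subject",
    "Keywords": "subject",
    "Description": "description",
    "ImageDescription": "description",
}


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def field_values(element):
    """Return the values of one field, flattening rdf:Bag/Seq/Alt lists."""
    items = [li.text.strip() for li in element.iter(f"{{{RDF_NS}}}li") if li.text]
    if items:
        return items
    text = (element.text or "").strip()
    return [text] if text else []


def crosswalk(rdf_xml):
    """Map each rdf:Description (one per image) to a dict of DC fields."""
    records = {}
    root = ET.fromstring(rdf_xml)
    for description in root.findall(f"{{{RDF_NS}}}Description"):
        record = {}
        for field in description:
            dc_name = CROSSWALK.get(local_name(field.tag))
            if dc_name is None:
                continue  # unmapped fields are skipped, not guessed at
            for value in field_values(field):
                # The same value often appears in EXIF, IPTC and XMP; keep one copy.
                if value not in record.setdefault(dc_name, []):
                    record[dc_name].append(value)
        records[description.get(f"{{{RDF_NS}}}about")] = record
    return records


# Hand-written sample; the example.org namespace URIs are placeholders.
SAMPLE = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:IFD0="http://example.org/IFD0/" xmlns:XMP-dc="http://example.org/XMP-dc/">
 <rdf:Description rdf:about="image.tif">
  <IFD0:Artist>Jane Doe</IFD0:Artist>
  <XMP-dc:Creator>Jane Doe</XMP-dc:Creator>
  <XMP-dc:Subject><rdf:Bag><rdf:li>campus</rdf:li><rdf:li>library</rdf:li></rdf:Bag></XMP-dc:Subject>
  <IFD0:Software>Adobe Photoshop</IFD0:Software>
 </rdf:Description>
</rdf:RDF>"""

if __name__ == "__main__":
    print(crosswalk(SAMPLE))
```

Keeping the crosswalk as plain data makes it easy to add the non-Dublin Core fields later, and any cleanup of values can happen in one place before the records are written out.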




Re: [CODE4LIB] TIFF Metadata to XML?

2011-07-18 Thread Genny Engel
Guess it depends on whether they actually followed any kind of standard in 
encoding the data in the TIFF files.

http://www.metadataworkinggroup.com/pdf/mwg_guidance.pdf


Genny Engel
Internet Librarian
Sonoma County Library
gen...@sonoma.lib.ca.us
www.sonomalibrary.org
707 545-0831 x581

