Hi Shea, 

There are heaps of tools that can assist you. You've already been pointed 
towards the excellent ExifTool in previous threads. The command-line version 
is very easy to work with, and I have made a few different "tools" that pull 
out or change exif data where required. It is a very versatile tool that 
handles many other metadata types on top of exif data (MS Office files, ID3, 
etc.).
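
As a rough illustration of changing a tag from a script (a sketch only, not 
one of the "tools" mentioned above): it assumes the exiftool executable is on 
your PATH, and the helper names, the Artist tag and the file name are made up 
for the example.

```python
import subprocess


def build_write_command(path, tag, value, keep_backup=True):
    """Build an ExifTool command line that sets one tag on one file.

    Hypothetical helper: ExifTool writes a tag with -TAG=VALUE.
    """
    cmd = ["exiftool", f"-{tag}={value}"]
    if not keep_backup:
        # By default ExifTool keeps a copy of the file named <file>_original;
        # -overwrite_original suppresses that backup.
        cmd.append("-overwrite_original")
    cmd.append(path)
    return cmd


def set_tag(path, tag, value, keep_backup=True):
    """Run the command built above; raises CalledProcessError on failure."""
    subprocess.run(build_write_command(path, tag, value, keep_backup), check=True)
```

Calling set_tag("photo.jpg", "Artist", "Jane Doe") would run 
exiftool "-Artist=Jane Doe" photo.jpg and leave photo.jpg_original behind as 
a backup.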

Other candidate tools are:

Apache Tika - http://tika.apache.org/ - I use this quite a bit in testing, and 
for wrangling various text-based objects.

JHOVE - http://sourceforge.net/projects/jhove/ - this will pull out all the 
exif data in one lump that you can then do things with. We use it in the 
Rosetta validation stack, and it forms one of the processes that we use to 
automatically extract and capture exif data from supported image files. 

All these tools will give you a structured object (CSV, XML etc) that you can 
use to seed a next step process, e.g. ingest into a CMS or repository. 
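
To make that concrete, here is a minimal sketch of the extract-then-seed 
step using ExifTool's JSON output. It assumes exiftool is on your PATH; the 
function names and the choice of fields are hypothetical, and the CSV you 
would actually need depends on what your CMS or repository expects at ingest.

```python
import csv
import io
import json
import subprocess


def run_exiftool(paths):
    """Run ExifTool on the given files and return its JSON output as text.

    Assumes the `exiftool` executable is on PATH; -json requests JSON output,
    which is a list with one object per file.
    """
    result = subprocess.run(
        ["exiftool", "-json", *paths],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def records_to_csv(json_text, fields):
    """Flatten ExifTool's JSON into CSV text, one row per image.

    SourceFile is always the first column so each row can be related back
    to its image; tags missing from a file become empty cells.
    """
    columns = ["SourceFile", *fields]
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=columns, lineterminator="\n")
    writer.writeheader()
    for record in json.loads(json_text):
        writer.writerow({name: record.get(name, "") for name in columns})
    return out.getvalue()
```

Something like records_to_csv(run_exiftool(["a.jpg", "b.jpg"]), ["Make", 
"Model"]) then gives you a small CSV keyed on the file path, which is the 
piece that lets the ingest step match metadata to images.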

J  
   

-----Original Message-----
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of 
Swauger,Shea
Sent: Wednesday, 18 December 2013 10:37 a.m.
To: CODE4LIB@LISTSERV.ND.EDU
Subject: [CODE4LIB] Automated Embedded Metadata Extraction in Photographs: 
Possible or Pipedream?

Hi all,

I'm wondering if there is a systematic method that can extract metadata 
embedded in digital photographs and then ingest that metadata into a CMS and 
relate them to their corresponding images. We currently use DigiTool, if that 
makes a difference.

Thanks!

Shea Swauger
Data Management Librarian
Colorado State University
