[CODE4LIB] long-term preservation of digital files

2014-01-16 Thread Kathryn Frederick (Library)
Hi,
I'm trying to develop a process for long-term preservation of the files we're 
creating though our digitization projects. My current plan is to bag groups of 
files using Bagger. Each bag would include all versions of the file (generally 
TIFF, JPEG, PDF and .txt transcript), a file of technical metadata (generated 
using exiftool), and .xml and marc files of descriptive metadata. Bagger will 
generate the checksums and create a file manifest.  Our IT department is 
providing 8TB of Amazon S3 storage and have set up an AWS storage gateway. The 
storage will be dedicated to these files and access will strictly limited. I'm 
planning to regularly audit what's been stored but haven't decided on a tool to 
do that. Any recommendations? Is there anything else I should consider doing?
Thanks in advance for any advice!
Kathryn


[CODE4LIB] archiving web pages

2014-01-14 Thread Kathryn Frederick (Library)
Hi,
I'm trying to develop a strategy for preserving issues our school's online 
newspaper. Creating a WARC file of the content seems straightforward, but how 
will that content fair long-term? Also, how is the WARC served to an end-user? 
Is there some other method I should look at?
Thanks in advance for any advice!
Kathryn


Re: [CODE4LIB] archiving web pages

2014-01-14 Thread Kathryn Frederick (Library)
Thanks for the thoughtful responses. We've been actively digitizing our print 
paper (which ceased publication in 2011) and I was thinking of this as an 
extension of that effort. Right now, I think capturing a monthly WARC file of 
the site is definitely a good idea no matter what. But beyond that, as Kyle 
pointed out, it's not really the web site I'm after but the content. I'd like 
to present this content alongside print issues in our IR (currently ContentDM). 
In one sense, I can see doing a weekly capture of the site which would equate 
to an issue in the old format. But, I could also do a PDF of the content. A PDF 
makes sense to me in the context of a collection that is largely print-based 
and gets at what I want (keyword searchable content, authors, dates), but is it 
disingenuous to fundamentally alter the format? Plus there's the work 
involved... This may be a question for archivists, but I'm not one so would 
appreciate any additional thoughts from this group. 

On Tue, Jan 14, 2014 at 10:48 AM, Kathryn Frederick (Library) 
kfred...@skidmore.edu wrote:

 Hi,
 I'm trying to develop a strategy for preserving issues our school's online
 newspaper. Creating a WARC file of the content seems straightforward, but
 how will that content fair long-term? Also, how is the WARC served to an
 end-user? Is there some other method I should look at?
 Thanks in advance for any advice!
 Kathryn



[CODE4LIB] LITA Forum Proposal Submission Deadline February 17th

2012-02-06 Thread Kathryn Frederick (Library)
The 2012 LITA National Forum Committee seeks proposals for high quality 
pre-conferences, concurrent sessions and poster sessions for the 15th annual 
LITA National Forum to be held in Columbus, Ohio, October 4-7, 2012.
The 2012 theme is: New World of Data: Discover. Connect. Remix.
Due Date for proposals: February 17, 2012.
The committee is especially interested in presentations highlighting projects 
that are experimental or involve risk-taking.
Find additional information and submit a proposal at: 
http://connect.ala.org/node/164487.


[CODE4LIB] AquaBrowser Libraries Group

2009-10-21 Thread Kathryn Frederick
Please excuse cross-posting.

I've set up an AquaBrowser Google Group to share tips and post
questions. If your library uses AquaBrowser, please consider joining.
This group is restricted, email me at kfred...@skidmore.edu and I'll
send you an invite. 

Licensed sites only, please.

Thanks,
Kathryn