[CODE4LIB] Job: Lead Software Engineer at OCLC

2014-01-16 Thread jobs
Lead Software Engineer OCLC Dublin Township We have an immediate opening for a Lead Software Engineer to design and develop software solutions in a Hadoop cluster environment, with strong focus on support for Digital Repositories of historic documents, photographs, media, and Institutional Reposit

Re: [CODE4LIB] Creating pdfs from images and their text

2014-01-16 Thread Daron Dierkes
I don't think I can answer your question, but we have a similar problem. I'm not sure about all OCR programs, but the version of Tesseract I've seen in Islandora creates two files: one is the .txt file you would expect, and the other is an hOCR file with very interesting markup linking words in t

[CODE4LIB] Job: Technology Fellowship at Harvard Art Museums

2014-01-16 Thread jobs
Technology Fellowship Harvard Art Museums Cambridge Technology Fellowship at the Harvard Art Museums This is a 12 month fellowship, with the possibility of renewal for a second year. Technology is an integral part of museums in the 21st century. As such, the Harvard Art Museums, with its e

Re: [CODE4LIB] [code4libcon] Code4Lib 2014 Registration is now open!

2014-01-16 Thread Emily Lynema
Becky, Thought folks on both lists might be interested in a numbers update. We had 286 registrations as of yesterday. Pretty amazing how quickly that came in! Don't have numbers for today yet, but I assume the rate will drop off quickly. Emily On Thursday, January 16, 2014, Becky Yoose wrote:

[CODE4LIB] Job: Systems Integration Librarian at State Archives of North Carolina

2014-01-16 Thread jobs
Systems Integration Librarian State Archives of North Carolina Raleigh Job Class Title: Library Professional Working Title: Systems Integration Librarian Position Number: 60083357 Department: Dept of Cultural Resources Recruitment Range: $33,361 - $59,000 Salary Grade / Salary Grade Equivale

Re: [CODE4LIB] long-term preservation of digital files

2014-01-16 Thread Edward Iglesias
A colleague and I wrote up how we did it a while back in code4lib journal http://journal.code4lib.org/articles/4468 We used JHOVE in addition to bagit which was probably overkill. Edward Iglesias On Thu, Jan 16, 2014 at 11:57 AM, Kari R Smith wrote: > Kathryn, > Bagger provides for validatin

[CODE4LIB] Creating pdfs from images and their text

2014-01-16 Thread Padraic Stack
Hi folks, I have a number of typescript / manuscript images on which it is quite time consuming to run OCR. (Or more accurately it is quite time consuming to correct the OCR). For some of these I have text files containing accurate transcriptions. In other cases I have TEI files with these t

Re: [CODE4LIB] long-term preservation of digital files

2014-01-16 Thread Kari R Smith
Kathryn, Bagger provides for validating stored Bags. You might need to write a script to run that as a Batch. Also check out the AVPreserve tool Fixity, which is a fixity management / monitoring tool. Deciding on the appropriate schedule will be important if you're using the Amazon cloud for
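
A minimal sketch of the kind of batch script mentioned above, for illustration only. It does not call Bagger; it is a standard-library stand-in that re-hashes each payload file and compares the result against the bag's payload manifest (`manifest-<algorithm>.txt`, as laid out by the BagIt specification). The function names `verify_bag` and `verify_all`, the one-bag-per-subdirectory layout, and the `sha256` default are assumptions, and this is not a full BagIt validation (tag manifests and Payload-Oxum are not checked).

```python
import hashlib
from pathlib import Path


def verify_bag(bag_dir, algorithm="sha256"):
    """Return payload paths that are missing or whose checksum no longer matches.

    Assumes the bag contains manifest-<algorithm>.txt with lines of the form
    "<checksum> <relative/path>".
    """
    bag = Path(bag_dir)
    manifest = bag / f"manifest-{algorithm}.txt"
    problems = []
    for line in manifest.read_text(encoding="utf-8").splitlines():
        if not line.strip():
            continue  # skip blank lines
        expected, rel_path = line.split(None, 1)
        rel_path = rel_path.strip()
        target = bag / rel_path
        if not target.is_file():
            problems.append(rel_path)  # file listed in the manifest is gone
            continue
        digest = hashlib.new(algorithm)
        with target.open("rb") as fh:
            # Read in 1 MiB chunks so large TIFFs are not loaded into memory at once.
            for chunk in iter(lambda: fh.read(1 << 20), b""):
                digest.update(chunk)
        if digest.hexdigest() != expected.lower():
            problems.append(rel_path)  # content changed since bagging
    return problems


def verify_all(root, algorithm="sha256"):
    """Check every immediate subdirectory of root that contains a bagit.txt."""
    return {
        marker.parent.name: verify_bag(marker.parent, algorithm)
        for marker in sorted(Path(root).glob("*/bagit.txt"))
    }
```

Calling `verify_all("/path/to/bags")` (a hypothetical path) returns a dict mapping each bag's directory name to its list of problem files; an empty list means every file in the payload manifest still matches. Pass `algorithm="md5"` if the bags were created with MD5 manifests.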

[CODE4LIB] long-term preservation of digital files

2014-01-16 Thread Kathryn Frederick (Library)
Hi, I'm trying to develop a process for long-term preservation of the files we're creating through our digitization projects. My current plan is to bag groups of files using Bagger. Each bag would include all versions of the file (generally TIFF, JPEG, PDF and .txt transcript), a file of technica

[CODE4LIB] AVPreserve releases Fixity v0.3

2014-01-16 Thread Bert Lyons
On behalf of AVPreserve: Version 0.3 of Fixity, the free fixity monitoring tool developed by AVPreserve, has been officially released for download on AVPreserve's Tools page and via GitHub. Fixity creates a mani

Re: [CODE4LIB] archiving web pages

2014-01-16 Thread Kari R Smith
As an archivist I would suggest that rather than thinking up all the possible requirements, check with your archives staff, your institutional records policy, and your archives collections policy to find out what their actual requirements are. Having the full digital content as it was displayed

Re: [CODE4LIB] Code4Lib 2014 Registration is now open!

2014-01-16 Thread Becky Yoose
And reply-all fail on my part. Meant to only send the message to the code4libcon group. Sorry, everyone... --- > > Colleagues, > > I am happy to announce that the Code4Lib 2014 General Re

Re: [CODE4LIB] Code4Lib 2014 Registration is now open!

2014-01-16 Thread Becky Yoose
It looks like the system didn't crash, so congratulations to all on surviving the rush! What's the count so far? On Wednesday, January 15, 2014 11:00:03 AM UTC-6, Tim McGeary wrote: > > Colleagues, > > I am happy to announce that the Code4Lib 2014 General Registration is now > open: > https://www.