Re: [CODE4LIB] Image de-duping and file identification

2013-03-20 Thread Dave Caroline
I had a project to de duplicate many images and other files too. I wrote a little ditty in PHP but the idea can by used in any language. I have a set of tables in MySQL. give the utility a set of root directories to test and compare trawl the filestems for filename location and size and store in

Re: [CODE4LIB] Image de-duping and file identification

2013-03-20 Thread chris fitzpatrick
inline: compose-unknown-contact.jpg

[CODE4LIB] Job: International Community Manager at Open Knowledge Foundation

2013-03-20 Thread jobs
**About the role** Someone highly articulate, enthusiastic and energetic who is willing to travel. While familiarity with email, blogs and Twitter is desirable, no specific technical knowledge is required. Being able to learn quickly, converse intelligently and evangelise convincingly are more

Re: [CODE4LIB] Image de-duping and file identification

2013-03-20 Thread Kyle Banerjee
On Wed, Mar 20, 2013 at 2:22 AM, chris fitzpatrick chrisfitz...@gmail.comwrote: Anyone please correct me if this is wrong. A md5/sha1 file hash would also not get any image derivatives, like crops or they added text or tweaked the contrast or photoshopped their cat into the shot... If you

[CODE4LIB] AdaCamp in San Francisco, 8-9 June 2013

2013-03-20 Thread Roy Tennant
My colleague Merrilee Proffitt asked me to post this to Code4LIb, as she is going to apply to attend this event and she would love see other tech-savvy library women at this event. Roy AdaCamp[1] is an Ada Initiative event focused on increasing women’s participation in open technology and