Re: [CODE4LIB] Looking for two coders to help with discoverability of videos

2013-11-12 Thread Edward Summers
Hi Kelley, 

Thanks for posting this. When I began work on jobs.code4lib.org I was hoping it 
would encourage people to post short term contracts. The thought being that it 
may be easier for some institutions to find money for projects than full-time 
staff, and it could encourage more open source collaboration between 
organizations, similar to what the Hydra Project are doing.

So, I added your post to jobs.code4lib.org [1]. Ordinarily the person who 
publishes a job posting is the only one who can edit it. But if you would like 
to make any changes to it please let me know and I’ll make you the editor.

Incidentally I was curious about your decision to hire two programmers to do 
what appears to be a very similar task. Was your intent to have two 
implementations to compare to see which you liked better? Were the two 
developers supposed to work together or separately?

//Ed

[1] http://jobs.code4lib.org/job/10658/

On Nov 11, 2013, at 10:58 PM, Kelley McGrath kell...@uoregon.edu wrote:

 I have a small amount of money to work with and am looking for two people to 
 help with extracting data from MARC records as described below. This is part 
 of a larger project to develop a FRBR-based data store and discovery 
 interface for moving images. Our previous work includes a consideration of 
 the feasibility of the project from a cataloging perspective 
 (http://www.olacinc.org/drupal/?q=node/27), a prototype end-user interface 
 (https://blazing-sunset-24.heroku.com/, 
 https://blazing-sunset-24.heroku.com/page/about) and a web form to 
 crowdsource the parsing of movie credits (http://olac-annotator.org/#/about).
 Planned work period: six months beginning around the second week of December 
 (I can be somewhat flexible on the dates if you want to wait and start after 
 the New Year)
 Payment: flat sum of $2500 upon completion of the work
 
 Required skills and knowledge:
 
  *   Familiarity with the MARC 21 bibliographic format
  *   Familiarity with Natural Language Processing concepts (or willingness to 
 learn)
  *   Experience with Java, Python, and/or Ruby programming languages
 
 Description of work: Use language and text processing tools and provided 
 strategies to write code to extract and normalize data in existing MARC 
 bibliographic records for moving images. Refine code based on feedback from 
 analysis of results obtained with a sample dataset.
 
 Data to be extracted:
 Tasks for Position 1:
 Titles (including the main title of the video, uniform titles, variant 
 titles, series titles, television program titles and titles of contents)
 Authors and titles of related works on which an adaptation is based
 Duration
 Color
 Sound vs. silent
 Tasks for Position 2:
 Format (DVD, VHS, film, online, etc.)
 Original language
 Country of production
 Aspect ratio
 Flag for whether a record represents multiple works or not
 We have already done some work with dates, names and roles and have a 
 framework to work in. I have the basic logic for the data extraction 
 processes, but expect to need some iteration to refine these strategies.
 
 To apply please send me an email at kelleym@uoregon explaining why you are 
 interested in this project, what relevant experience you would bring and any 
 other reasons why I should hire you. If you have a preference for position 1 
 or 2, let me know (it's not necessary to have a preference). The deadline for 
 applications is Monday, December 2, 2013. Let me know if you have any 
 questions.
 
 Thank you for your consideration.
 
 Kelley
 
 PS In the near future, I will also be looking for someone to help with work 
 clustering based on title, name, date and identifier data from MARC records. 
 This will not involve any direct interaction with MARC.
 
 
 Kelley McGrath
 Metadata Management Librarian
 University of Oregon Libraries
 541-346-8232
 kell...@uoregon.edu


Re: [CODE4LIB] Looking for two coders to help with discoverability of videos

2013-11-12 Thread Al Matthews
+1 for what I know of Avalon Media service

--
Al Matthews

Software Developer, Digital Services Unit
Atlanta University Center, Robert W. Woodruff Library
email: amatth...@auctr.edu; office: 1 404 978 2057





On 11/12/13 8:21 AM, Edward Summers e...@pobox.com wrote:

Hi Kelley,

Thanks for posting this. When I began work on jobs.code4lib.org I was
hoping it would encourage people to post short term contracts. The
thought being that it may be easier for some institutions to find money
for projects than full-time staff, and it could encourage more open
source collaboration between organizations, similar to what the Hydra
Project are doing.

So, I added your post to jobs.code4lib.org [1]. Ordinarily the person who
publishes a job posting is the only one who can edit it. But if you would
like to make any changes to it please let me know and I’ll make you the
editor.

Incidentally I was curious about your decision to hire two programmers to
do what appears to be a very similar task. Was your intent to have two
implementations to compare to see which you liked better? Were the two
developers supposed to work together or separately?

//Ed

[1] http://jobs.code4lib.org/job/10658/

On Nov 11, 2013, at 10:58 PM, Kelley McGrath kell...@uoregon.edu wrote:

 I have a small amount of money to work with and am looking for two
people to help with extracting data from MARC records as described
below. This is part of a larger project to develop a FRBR-based data
store and discovery interface for moving images. Our previous work
includes a consideration of the feasibility of the project from a
cataloging perspective (http://www.olacinc.org/drupal/?q=node/27), a
prototype end-user interface (https://blazing-sunset-24.heroku.com/,
https://blazing-sunset-24.heroku.com/page/about) and a web form to
crowdsource the parsing of movie credits
(http://olac-annotator.org/#/about).
 Planned work period: six months beginning around the second week of
December (I can be somewhat flexible on the dates if you want to wait
and start after the New Year)
 Payment: flat sum of $2500 upon completion of the work

 Required skills and knowledge:

  *   Familiarity with the MARC 21 bibliographic format
  *   Familiarity with Natural Language Processing concepts (or
willingness to learn)
  *   Experience with Java, Python, and/or Ruby programming languages

 Description of work: Use language and text processing tools and
provided strategies to write code to extract and normalize data in
existing MARC bibliographic records for moving images. Refine code based
on feedback from analysis of results obtained with a sample dataset.

 Data to be extracted:
 Tasks for Position 1:
 Titles (including the main title of the video, uniform titles, variant
titles, series titles, television program titles and titles of contents)
 Authors and titles of related works on which an adaptation is based
 Duration
 Color
 Sound vs. silent
 Tasks for Position 2:
 Format (DVD, VHS, film, online, etc.)
 Original language
 Country of production
 Aspect ratio
 Flag for whether a record represents multiple works or not
 We have already done some work with dates, names and roles and have a
framework to work in. I have the basic logic for the data extraction
processes, but expect to need some iteration to refine these strategies.

 To apply please send me an email at kelleym@uoregon explaining why you
are interested in this project, what relevant experience you would bring
and any other reasons why I should hire you. If you have a preference
for position 1 or 2, let me know (it's not necessary to have a
preference). The deadline for applications is Monday, December 2, 2013.
Let me know if you have any questions.

 Thank you for your consideration.

 Kelley

 PS In the near future, I will also be looking for someone to help with
work clustering based on title, name, date and identifier data from MARC
records. This will not involve any direct interaction with MARC.


 Kelley McGrath
 Metadata Management Librarian
 University of Oregon Libraries
 541-346-8232
 kell...@uoregon.edu


Re: [CODE4LIB] mass convert jpeg to pdf

2013-11-12 Thread raffaele messuti
Roy Tennant wrote:
 Throwing in my two cents on the IIP Image Server. I've been using it on my
 photos web site[0] for a while now and it works great. I was also happy to
 see that there is a version that supports the International Image
 Interoperability Framework (IIIF) API [1], which I was introduced to at DLF
 by Tom Cramer and company. That would make you compliant with the Mirador
 multi-windowing tool that he mentioned. Sounds like a win-win to me.

+1

iipsrv is extremely fast and easy to deploy (a single static fcgi binary).

i've learned now its iiif compliance, and i've just tried the branch
https://github.com/ruven/iipsrv/tree/iiif

works great, even if a bit undocumented, i looked the code to understand
the url:
http://{SERVER}/iipsrv.fcgi?iiif={IMAGE}.tif/full/full/0/native.jpg

but iipsrv serves only jp2 or tiff images.
are you aware of other decoding modules?
https://github.com/ruven/iipsrv/blob/master/README#L181

bye


--
raffaele, @atomotic


[CODE4LIB] Job: SOFTWARE ENGINEER SR. at Emory University

2013-11-12 Thread jobs
SOFTWARE ENGINEER SR.
Emory University
Atlanta

SOFTWARE ENGINEER SR.

  
JOB DESCRIPTION: Identifies, designs, develops, implements, and revises
software applications to meet business needs. Supports software applications
and associated operating systems. Programs, analyzes and writes
specifications. Devises solutions to system problems. Develops and tests
applications; makes revisions to improve functionality. Develops and analyzes
the effectiveness of new applications and test procedures. Writes and edits
reports to provide recommendations, conclusions and other data. Performs
related responsibilities as required.

MINIMUM QUALIFICATIONS: A bachelor's degree in computer science, math,
engineering or a related field and three years of related experience in
programming and software systems or an equivalent combination of education,
training and experience. Knowledge of software development in a research
institution context preferred.

  
QUALIFICATIONS required by the library in addition to the minimum required
qualifications of the University:

  
• Experience with Python/Django and/or with Java development technologies such
as Spring and GWT

• Experience with JavaScript, AJAX, and DOM manipulation

• Experience with web standards like REST for seamless integration between
complex systems

• Experience with relational databases

• Experience collaborating on software via version control

• Fundamental Linux skills

• Familiarity and desire to work with agile methods

  
Additional preferred qualifications include:

  
• Experience with continuous integration/deployment

• Experience with SOA and enterprise systems integrations

• Experience building and optimizing Solr/Lucene indexes

• Experience with Fedora Commons or other repository systems

• Experience with non-relational databases (e.g. eXist)

• Contributions to Open Source Projects and participation in developer
communities

• Experience working in an academic environment

• Experience with Metadata (e.g. MODS, Dublin Core, TEI, EAD, DDI) and
Semantic web standards.

  
ADDITIONAL JOB DETAILS: Works as part of an agile team to design, implement
and revise software applications that meet library and digital scholarship
needs. Acts as the technical lead for one or more code bases. Mentors other
developers with respect to development best-practices and standards.
Participates in the development and extension of the overall Emory University
Libraries and Emory Center for Digital Scholarship development environment and
architecture. Leads the evaluation of emerging technologies and promotes their
usage. Collaborates with stakeholders in the development of the user stories
that comprise the product backlog that defines the scope of a given
development effort.

  
HOW TO APPLY: Applications /resumes must be submitted online through [http://w
ww.hr.emory.edu/careers/index.html](http://www.hr.emory.edu/careers/index.html
) and looking for job posting #40461BR.

  



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/10659/


Re: [CODE4LIB] mass convert jpeg to pdf

2013-11-12 Thread Andrew Hankinson
Just thought I might plug some software we're developing to solve the book 
image navigation misery that Kyle mentions.

http://ddmal.music.mcgill.ca/diva/

and a demo:

http://ddmal.music.mcgill.ca/newdiva/demo/single.html

We developed it because we were frustrated with the image gallery paradigm 
for book image viewing, and wanted something more like Google Books' viewer, 
but with access to the highest resolution possible. We also were frustrated 
with having to download large PDFs to just view a couple pages.

Diva uses IIP on the back-end to serve out image tiles, so you're only ever 
downloading the part of the image that's viewable -- the rest is auto-loaded as 
the user scrolls. 

We've used it to display a manuscript that's ~80GB (total), with each image 
around 200MB.

http://coltrane.music.mcgill.ca/salzinnes/experiments/diva-cci-tif/

It's also got a couple other neat features, like in-browser 
brightness/contrast/rotation adjustments via canvas. (Click the little gear 
icon in the top left of each page image).

Cheers,
-Andrew

On 2013-11-08, at 4:22 PM, Kyle Banerjee kyle.baner...@gmail.com wrote:

 It is sad to me that converting to PDF for viewing off the Web seems like
 the answer. Isn’t there a tiling viewer (like Leaflet) that could be used
 to render jpeg derivatives of the original tif files in Omeka?
 
 
 This should be pretty easy. But the issue with tiling is that the nav
 process is miserable for all but the shortest books. Most of the people who
 want to download want are looking for jpegs rather than source tiffs and
 one pdf instead of a bunch of tiffs (which is good since each one is
 typically over 100MB). Of course there are people who want the real deal,
 but that's actually a much less common use case.
 
 As Karen observes, downloading and viewing serve different use cases so of
 course we will provide both. IIP Image Server looks intriguing. But most of
 our users who want the full res stuff really just want to download the
 source tiffs which will be made available.
 
 kyle


Re: [CODE4LIB] mapping LCSH from book records to shNNNN codes?

2013-11-12 Thread Trail, Nate
Dan,

You can look up an LCSH term for it's URI using:
http://id.loc.gov/authorities/subjects/label/[known lcsh term]

You are right that not all precoordinated strings are in LCSH, but you can try 
each one separately:

http://id.loc.gov/authorities/subjects/label/Social%20sciences
http://id.loc.gov/authorities/subjects/label/Statistical%20methods
http://id.loc.gov/authorities/subjects/label/Data%20processing

Don't forget url encoding. With content negotiation you can get other formats 
than the html (see the bottom of the page), or if you just want the uri, ask 
for headers only.

Nate

---
Nate Trail
---
LS/TECH/NDMSO
Library of Congress
202-707-2193
n...@loc.gov




-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Dan
Sent: Monday, November 11, 2013 2:34 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: [CODE4LIB] mapping LCSH from book records to sh codes?

Hi, I'm interested in mapping the LCSH topics described in Library of Congress 
book records to the predefined headings LCSH thesaurus defined at 
http://id.loc.gov.

For example, the MODS version of the LoC record for isbn 1606238760 contains 
this LCSH info:

  subject authority=lcsh
topicSocial sciences/topic
topicStatistical methods/topic
topicData processing/topic
  /subject

... which maps neatly to sh2010113695 (Social sciences--Statistical 
methods--Data processing) (see
http://id.loc.gov/search/?q=Social+sciences--Statistical+methods--Data+processingq=)

However, not all topics map to a single sh code (multiple sh's with the 
same description), and some topics have no sh code at id.loc.gov (e.g. 
Discourse analysis--Research, even though Discourse analysis exists)

Is there a way to retrieve the sh codes directly for a book record? I'm 
currently using the LoC SRU (search/retrieval via URL) service, but it only 
serves verbose, textual topics that must be mapped separately to a sh 
code (when possible)

Thanks,
Dan


Re: [CODE4LIB] mass convert jpeg to pdf

2013-11-12 Thread Al Matthews
Nice.

--
Al Matthews

Software Developer, Digital Services Unit
Atlanta University Center, Robert W. Woodruff Library
email: amatth...@auctr.edu; office: 1 404 978 2057





On 11/12/13 9:59 AM, Andrew Hankinson andrew.hankin...@gmail.com wrote:

Just thought I might plug some software we're developing to solve the
book image navigation misery that Kyle mentions.

http://ddmal.music.mcgill.ca/diva/

and a demo:

http://ddmal.music.mcgill.ca/newdiva/demo/single.html

We developed it because we were frustrated with the image gallery
paradigm for book image viewing, and wanted something more like Google
Books' viewer, but with access to the highest resolution possible. We
also were frustrated with having to download large PDFs to just view a
couple pages.

Diva uses IIP on the back-end to serve out image tiles, so you're only
ever downloading the part of the image that's viewable -- the rest is
auto-loaded as the user scrolls.

We've used it to display a manuscript that's ~80GB (total), with each
image around 200MB.

http://coltrane.music.mcgill.ca/salzinnes/experiments/diva-cci-tif/

It's also got a couple other neat features, like in-browser
brightness/contrast/rotation adjustments via canvas. (Click the little
gear icon in the top left of each page image).

Cheers,
-Andrew

On 2013-11-08, at 4:22 PM, Kyle Banerjee kyle.baner...@gmail.com wrote:

 It is sad to me that converting to PDF for viewing off the Web seems
like
 the answer. Isn’t there a tiling viewer (like Leaflet) that could be
used
 to render jpeg derivatives of the original tif files in Omeka?


 This should be pretty easy. But the issue with tiling is that the nav
 process is miserable for all but the shortest books. Most of the people
who
 want to download want are looking for jpegs rather than source tiffs and
 one pdf instead of a bunch of tiffs (which is good since each one is
 typically over 100MB). Of course there are people who want the real
deal,
 but that's actually a much less common use case.

 As Karen observes, downloading and viewing serve different use cases so
of
 course we will provide both. IIP Image Server looks intriguing. But
most of
 our users who want the full res stuff really just want to download the
 source tiffs which will be made available.

 kyle


Re: [CODE4LIB] Announcing Code4Lib 2014 keynote speakers

2013-11-12 Thread Bohyun Kim
This is just so awesome! :)

Big thx to the Code4Lib 2014 Keynote Speakers Committee!

~Bohyun




From: Code for Libraries [CODE4LIB@LISTSERV.ND.EDU] on behalf of Jason Casden 
[jmcas...@ncsu.edu]
Sent: Tuesday, November 12, 2013 1:05 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: [CODE4LIB] Announcing Code4Lib 2014 keynote speakers

The Code4Lib 2014 Keynote Speakers Committee is excited to announce the
selection by open vote of this year's keynote speakers: Valerie Aurora and
Sumana Harihareswara.

Valerie Aurora is the founder of the Ada Initiative, a non-profit
organization that seeks to increase women's participation in the free
culture movement, open source technology, and open source culture. Aurora
is also known within the Linux community for advocating new developments in
filesystems in Linux, including ChunkFS and the Union file system. In 2012,
Aurora, and Ada Initiative co-founder Mary Gardiner, were named two of the
most influential people in computer security by SC Magazine. In 2013, she
won the O'Reilly Open Source Award. At Valerie's request, her keynote will
be in the form of an interview, which Roy Tennant has volunteered to
conduct. Questions from the Code4Lib community will be solicited, so please
be thinking about what you would like to ask her.

Sumana Harihareswara works as the Engineering Community Manager at the
Wikimedia Foundation. She has worked at Collabora, GNOME,
QuestionCopyright.org, Fog Creek Software, Behavior, and Salon.com, and
contributed to the MediaWiki, AltLaw, Empathy, Miro, and Zeitgeist open
source projects. She has been editor and release organizer for GNOME
Journal and is a blogger at Geek Feminism. Sumana has keynoted Open Source
Bridge in 2012 in addition to presentations at Open Source Bridge in 2010
and 2011, and at Foo Camp in 2010, and is the Google Summer of Code
administrator for MediaWiki. She holds an MS in technology management from
Columbia University and a BA in political science from the University of
California at Berkeley. She is also a regular in #libtechwomen IRC channel.

This year's speakers bring incredible expertise in topics that are highly
relevant to our community. We had an incredible slate of nominees, many of
whom will surely be nominated again.

Thanks for voting!
The Code4Lib 2014 Keynote Speakers Committee


Re: [CODE4LIB] mass convert jpeg to pdf

2013-11-12 Thread Art W Rhyno
 I’m similarly curious to hear if other people have done annotation with 
zoomable interfaces before. 

I have been trying OpenLayers' stock functions for this kind of thing for 
OurDigitalWorld, there is an example here [1]. Leaflet probably does this 
well too. The mapping tools do seem to have some slick drawing functions 
[2] though I have only used polygons.

art
---
1. 
http://tiles.uwindsor.ca/ink/cecil/focus/swoda/pictures/assumption/09_1915/1
2. http://openlayers.org/dev/examples/draw-feature.html


[CODE4LIB] Call for Proposals: Open Repositories Conference, June 9-13, Helsinki

2013-11-12 Thread Michael J. Giarlo
The Ninth International Conference on Open Repositories, OR2014, will be
held 9-13 June 2014 in Helsinki, Finland. The organizers are pleased to
invite you to contribute to the program. This year's conference theme is:

*Towards Repository Ecosystems*

Repository systems are but one part of the ecosystem in 21st century
research, and it is increasingly clear that no single repository will serve
as the sole resource for its community. How can repositories best be
positioned to offer complementary services in a network that includes
research data management systems, institutional and discipline
repositories, publishers, and the open Web? When should service providers
build to fill identified niches, and where should they connect with related
services?  How might these networks offer services to support organizations
that lack the resources to build their own, or researchers seeking
to optimize their domain workflows?

Examining how repositories best integrate into the holistic research flow;
exploring ties between domain-specific repositories and institutional
repositories; and understanding durable content strategies outside of
traditional repository environments are the central themes of the Open
Repositories 2014 conference. We welcome proposals on these themes, but
also on the theoretical, practical, organizational or administrative topics
related to digital repositories. We're particularly interested in hearing
about:

* Unconventional approaches to repository-like services
* Interconnection between publishers and repositories
* Researcher-centered design for scholarly workflows
* Adaptations to support curation lifecycle management, e.g., for research
data
* Real-world scalability and performance stories: working at web-scale,
with big data for global usage
* Requirements for holding restricted or classified data in repositories
* Infrastructure to accommodate national and international mandates for
data management and open access
* Positioning repositories closer to (local, consortial, or cloud-based)
cyberinfrastructure for data processing
* Leveraging connections to external services including:
  * Remote identifier services (e.g., DOI, ORCID)
  * (Re-)using repository data/metadata in new and unexpected ways,
including integrated discovery
  * Scholarly social media services, such as for annotation, review,
comment, reputation, citation, and altmetrics
  * CRIS and research management systems
  * Digital preservation tools, services  infrastructure
* Community and sustainability in an open world


KEY DATES

• 3 February 2014: Deadline for submissions
 • 4 April 2014: Submitters notified of acceptance to general conference
 • 17 April 2014: Submitters notified of acceptance to interest groups
 • 9-13 June 2014: OR2014 conference


SUBMISSION PROCESS

*Conference Papers and Panels*
We welcome proposals that are at least two pages and no more than four
pages in length for presentations or panels that deal with digital
repositories and repository services. Abstracts of accepted papers will be
made available through the conference’s web site, and later they and
associated materials will be made available in a repository intended for
current and future OR content. In general, sessions are an hour and a half
long with three papers per session; panels may take an entire
session. Relevant papers unsuccessful in the main track will automatically
be considered for inclusion, as appropriate, as an Interest Group
presentation.

*Interest Group Presentations*
One to two-page proposals for presentations or panels that focus on use of
one of the major repository platforms (DSpace, ePrints, and Fedora) are
invited from developers, researchers, repository managers, administrators
and practitioners describing novel experiences or developments in the
construction and use of repositories involving issues specific to these
technical platforms.

*24x7 Presentation Proposals*
We welcome one- to two-page proposals for 7 minute presentations comprising
no more than 24 slides. Similar to Pecha Kuchas or Lightning Talks, these
24x7 presentations will be grouped into blocks based on conference themes,
with each block followed by a moderated discussion / question and answer
session involving the audience and whole block of presenters. This format
will provide conference goers with a fast-paced survey of like work across
many institutions, and presenters the chance to disseminate their work in
more depth and context than a traditional poster.

*Repository Rants 24x7 Block*. One block of 24x7's at OR14 will revolve
around repository rants: brief exposés that challenge the conventional
wisdom or practice, and highlight what the repository community is  doing
that is misguided, or perhaps just missing altogether. The top proposals
will be incorporated into a track meant to provoke unconventional
approaches to repository services.

*Posters, Demos and Developer  How-To's*
We invite developers, researchers, repository managers, 

Re: [CODE4LIB] Charlotte, NC Code4Lib Meeting

2013-11-12 Thread Ethan Gruber
I'm in Virginia and might attend said meeting, even if I can't help
organize.
On Nov 12, 2013 6:35 PM, Riley Childs ri...@tfsgeo.com wrote:

 Is anyone in Charlotte, NC (and surrounding areas) interested in starting a
 Code4Lib meeting?
 Just kind of asking :{D!
 *Riley Childs*
 *Library Technology Manager at Charlotte United Christian Academy
 http://cucawarriors.com/*
 *Head Programmer/Manager at Open Library Management Projec
 http://openlibman.sf.net/t http://openlibman.sourceforge.net/*
 *Cisco Certified Entry Network Technician *
 _

 *Phone: +1 (704) 497-2086*
 *email: ri...@tfsgeo.com ri...@tfsgeo.com*
 *Twitter: @RowdyChildren http://twitter.com/rowdychildren*



Re: [CODE4LIB] Charlotte, NC Code4Lib Meeting

2013-11-12 Thread Riley Childs
I can secure a meeting space at one of the public library branches, so
really there isn't much organizing, I would want it to be more in the
style of a 2600 meeting (no real point person) so if you could (maybe)
bring some friends, I can bring some and there, we have our selves a
meeting!

*Riley Childs*
*Library Technology Manager at Charlotte United Christian Academy
http://cucawarriors.com/*
*Head Programmer/Manager at Open Library Management Projec
http://openlibman.sf.net/t http://openlibman.sourceforge.net/*
*Cisco Certified Entry Level Technician *
_

*Phone: +1 (704) 497-2086*
*email: ri...@tfsgeo.com ri...@tfsgeo.com*
*email: ri...@rileychilds.net ri...@rileychilds.net*
*Twitter: @RowdyChildren http://twitter.com/rowdychildren*




On Tue, Nov 12, 2013 at 7:16 PM, Ethan Gruber ewg4x...@gmail.com wrote:

 I'm in Virginia and might attend said meeting, even if I can't help
 organize.
 On Nov 12, 2013 6:35 PM, Riley Childs ri...@tfsgeo.com wrote:

  Is anyone in Charlotte, NC (and surrounding areas) interested in
 starting a
  Code4Lib meeting?
  Just kind of asking :{D!
  *Riley Childs*
  *Library Technology Manager at Charlotte United Christian Academy
  http://cucawarriors.com/*
  *Head Programmer/Manager at Open Library Management Projec
  http://openlibman.sf.net/t http://openlibman.sourceforge.net/*
  *Cisco Certified Entry Network Technician *
  _
 
  *Phone: +1 (704) 497-2086*
  *email: ri...@tfsgeo.com ri...@tfsgeo.com*
  *Twitter: @RowdyChildren http://twitter.com/rowdychildren*
 



Re: [CODE4LIB] We should use HTTPS on code4lib.org

2013-11-12 Thread Riley Childs
also just a very off topic topic:
what if a trusted CA issued a *.* cert? for those of you who don't know,
that would be valid everywhere (even if the session was hjacked) but again,
very off topic, back to the topic at hand :D

*Riley Childs*
*Library Technology Manager at Charlotte United Christian Academy
http://cucawarriors.com/*
*Head Programmer/Manager at Open Library Management Projec
http://openlibman.sf.net/t http://openlibman.sourceforge.net/*
*Cisco Certified Entry Level Technician *
_

*Phone: +1 (704) 497-2086*
*email: ri...@tfsgeo.com ri...@tfsgeo.com*
*email: ri...@rileychilds.net ri...@rileychilds.net*
*Twitter: @RowdyChildren http://twitter.com/rowdychildren*




On Tue, Nov 12, 2013 at 7:56 PM, Riley Childs ri...@tfsgeo.com wrote:

 Is there a donate button somewhere? the only hurdle I see now is finding
 some to maintain the cert, and coming up with the money, maybe we could put
 a check box on the conference sign up form, like chip in $10 for a SSL
 Cert?
 Also, once again I ask how do you normally take this sort of poll deal? I
 would assume it would just be a roll call (like I vote yes in a series of
 emails)
 Once again my recommendation for a cert provider is DigiCert, they will
 cover both the wiki and site (plus *.code4lib.org) for about $475 a year
 (or they have a single cert for $159)

 *Riley Childs*
 *Library Technology Manager at Charlotte United Christian Academy
 http://cucawarriors.com/*
 *Head Programmer/Manager at Open Library Management Projec
 http://openlibman.sf.net/t http://openlibman.sourceforge.net/*
 *Cisco Certified Entry Level Technician *
 _

 *Phone: +1 (704) 497-2086 %2B1%20%28704%29%20497-2086*
 *email: ri...@tfsgeo.com ri...@tfsgeo.com*
 *email: ri...@rileychilds.net ri...@rileychilds.net*
 *Twitter: @RowdyChildren http://twitter.com/rowdychildren*




 On Tue, Nov 12, 2013 at 7:28 PM, Simon Spero sesunc...@gmail.com wrote:

 On Mon, Nov 4, 2013 at 1:45 PM, Ethan Gruber ewg4x...@gmail.com wrote:

  NSA broke it already


 SSL was born into lossage.  After Netscape decided to go it alone, the
 first version they came back with used RC4... with the same symmetric key
 in both directions...  At EIT I did a Proof of Concept attack using the
 initial lack of binding between DNS name and X.500 certificate (this was
 funded on the DARPA MADE project grant).

 All this was done at a time when the guestimate of a ~1 Public Key
 Operation per second.

 On a late 2011 macbook pro ( Intel(R) Core(TM) i7-2760QM CPU @ 2.40GHz )

 openssl speed -multi 8 rsa2048 gives a throughput of 3124.2
 signatures.second, and 97561.0 verifications.

 For Symmetric AES, the same hardware gives the throughput listed below.

 The 'numbers' are in 1000s of bytes per second processed.

 type  16 bytes 64 bytes256 bytes   1024 bytes   8192
 bytes

 aes-128 cbc 427093.88k   451648.30k   460755.99k   462780.42k
 459068.76k

 aes-192 cbc 352143.17k   368399.83k   370499.48k   371674.11k
 371816.40k

 aes-256 cbc 299224.85k   309780.08k   301863.34k   286403.36k
 286261.25k
 In other words:  the cpu cost ain't not thang.

 There is an recurrent cost for a server certificate, but I'm sure that
 this
 could be obtained from the usual suspects (Mellon, OCLC, Kilgour, or
 Stanford).  Somebody has to responsible for renewing certificates before
 they expire (same sort of work as making sure the DNS domains don't
 expire).

 Simon





Re: [CODE4LIB] Charlotte, NC Code4Lib Meeting

2013-11-12 Thread Kevin S. Clarke
I'd be interested. I'm in Boone... not too far a drive. :)

Kevin
On Nov 12, 2013 6:35 PM, Riley Childs ri...@tfsgeo.com wrote:

 Is anyone in Charlotte, NC (and surrounding areas) interested in starting a
 Code4Lib meeting?
 Just kind of asking :{D!
 *Riley Childs*
 *Library Technology Manager at Charlotte United Christian Academy
 http://cucawarriors.com/*
 *Head Programmer/Manager at Open Library Management Projec
 http://openlibman.sf.net/t http://openlibman.sourceforge.net/*
 *Cisco Certified Entry Network Technician *
 _

 *Phone: +1 (704) 497-2086*
 *email: ri...@tfsgeo.com ri...@tfsgeo.com*
 *Twitter: @RowdyChildren http://twitter.com/rowdychildren*



Re: [CODE4LIB] Looking for two coders to help with discoverability of videos

2013-11-12 Thread Heather Claxton
Hi Kelley,

I might be able to help in your search.   I'm in the process of starting a
website that connects academic researchers with volunteer software
developers.  I'm looking for people to post programming projects on the
website once it's launched in late January.   I realize that may be a
little late for you, but perhaps the project you mentioned in your PS
(clustering based on title, name, date ect.) would be perfect?  The
one caveat is that the website is targeting software developers who wish to
volunteer.   Anyway, if you're interested in posting, please send me an
e-mail  at  sciencesolved2...@gmail.comI would greatly appreciate it.
Oh and of course it would be free to post  :)  Best of luck in your
hiring process,

Heather Claxton-Douglas


On Mon, Nov 11, 2013 at 9:58 PM, Kelley McGrath kell...@uoregon.edu wrote:

 I have a small amount of money to work with and am looking for two people
 to help with extracting data from MARC records as described below. This is
 part of a larger project to develop a FRBR-based data store and discovery
 interface for moving images. Our previous work includes a consideration of
 the feasibility of the project from a cataloging perspective (
 http://www.olacinc.org/drupal/?q=node/27), a prototype end-user interface
 (https://blazing-sunset-24.heroku.com/,
 https://blazing-sunset-24.heroku.com/page/about) and a web form to
 crowdsource the parsing of movie credits (
 http://olac-annotator.org/#/about).
 Planned work period: six months beginning around the second week of
 December (I can be somewhat flexible on the dates if you want to wait and
 start after the New Year)
 Payment: flat sum of $2500 upon completion of the work

 Required skills and knowledge:

   *   Familiarity with the MARC 21 bibliographic format
   *   Familiarity with Natural Language Processing concepts (or
 willingness to learn)
   *   Experience with Java, Python, and/or Ruby programming languages

 Description of work: Use language and text processing tools and provided
 strategies to write code to extract and normalize data in existing MARC
 bibliographic records for moving images. Refine code based on feedback from
 analysis of results obtained with a sample dataset.

 Data to be extracted:
 Tasks for Position 1:
 Titles (including the main title of the video, uniform titles, variant
 titles, series titles, television program titles and titles of contents)
 Authors and titles of related works on which an adaptation is based
 Duration
 Color
 Sound vs. silent
 Tasks for Position 2:
 Format (DVD, VHS, film, online, etc.)
 Original language
 Country of production
 Aspect ratio
 Flag for whether a record represents multiple works or not
 We have already done some work with dates, names and roles and have a
 framework to work in. I have the basic logic for the data extraction
 processes, but expect to need some iteration to refine these strategies.

 To apply please send me an email at kelleym@uoregon explaining why you
 are interested in this project, what relevant experience you would bring
 and any other reasons why I should hire you. If you have a preference for
 position 1 or 2, let me know (it's not necessary to have a preference). The
 deadline for applications is Monday, December 2, 2013. Let me know if you
 have any questions.

 Thank you for your consideration.

 Kelley

 PS In the near future, I will also be looking for someone to help with
 work clustering based on title, name, date and identifier data from MARC
 records. This will not involve any direct interaction with MARC.


 Kelley McGrath
 Metadata Management Librarian
 University of Oregon Libraries
 541-346-8232
 kell...@uoregon.edu