Re: [CODE4LIB] Anyone working with iPython?

2013-12-19 Thread Corey A Harper
The slickest thing about iPython notebooks is the easy of publishing them
on github (or elsewhere), then sharing the results with the notebook viewer
here: http://nbviewer.ipython.org/ This community could easily use this as
an alternative (or compliment) to gist for sharing small chunks of code.
This has the added benefit of sharing the _output_ of said code alongside
the source. Also, that visual environment is a must when learning highly
graph & chart dependent things like matplotlib & even scikit learn.

The iPython notebook also simplifies shelling: just prefix your line with a
bang (!) and system command lines are right there.

I just finished a "Practical Data Science" course for grad school last
night, and we used iPython heavily throughout the course, both as the
platform for the professors lecture notes, and for doing our homework
assignments.

-Corey


On Thu, Dec 19, 2013 at 6:03 PM, Sarason,Christian wrote:

> +1 for ipython ‹ it was an easy transition from my MATLAB shell
> programming (for scientific problems) to ipython and the various and
> sundry wonderful part of python overall for scientific programming.  In
> fact, I became so used to the ipython console, when I go to the regular
> python shell now I miss all the goodies (amazing how used to tab
> completion you get used toŠ :D )
>
> Cheers
> Christian
>
> On 12/19/13, 2:55 PM, "Sam Kome"  wrote:
>
> >iPython is the only console to bother with IMHO, regardless of what chore
> >I'm doing.  I've noodled with the Notebooks and they're wonderful but I
> >am time and attention challenged and haven't progressed far.
> >
> >Eric Matthes uses iPython notebooks to teach programming and has set out
> >some excellent resources:
> >
> >https://github.com/ehmatthes/intro_programming
> >
> >$.02
> >SK
> >
> >-Original Message-
> >From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
> >Roy Tennant
> >Sent: Thursday, December 19, 2013 9:49 AM
> >To: CODE4LIB@LISTSERV.ND.EDU
> >Subject: [CODE4LIB] Anyone working with iPython?
> >
> >Our Wikipedian in Residence, Max Klein brought iPython [1] to my
> >attention recently and even in just the little exploration I've done with
> >it so far I'm quite impressed. Although you could call it "interactive
> >Python" that doesn't begin to put across the full range of capabilities,
> >as when I first heard that I thought "Great, a Python shell where you
> >enter a command, hit the return, and it executes. Great. Just what I
> >need. NOT." But I was SO WRONG.
> >
> >It certainly can and does do that, but also so much more. You can enter
> >blocks of code that then execute. Those blocks don't even have to be
> >Python. They can be Ruby or Perl or bash. There are built-in functions of
> >various kinds that it (oddly) calls "magic". But perhaps the killer bit
> >is the idea of "Notebooks" that can capture all of your work in a way
> >that is also editable and completely web-ready. This last part is
> >probably difficult to understand until you experience it.
> >
> >Anyway, i was curious if others have been working with it and if so, what
> >they are using it for. I can think of all kinds of things I might want to
> >do with it, but hearing from others can inspire me further, I'm sure.
> >Thanks,
> >Roy
> >
> >[1] http://ipython.org/
>



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] transforming marc to rdf

2013-12-05 Thread Corey A Harper
With apologies to Eric & to others from the LiAM project, I feel like I
want to jump in here with a little more context.

Eric, or Aaron, or Anne, please feel free to correct any of what I say
below.

I agree with the points made and concerns raised by both Ross & Mark --
most significantly, that a sustainable infrastructure for linked archival
metadata is not going to come from an XSLT stylesheet. However, I also see
tremendous value in what Eric is putting together here.

The prospectus for the LiAM project, which is the context for Eric's
questions, is about developing guiding principles and educational tools for
the archival community to better understand, prepare for, and contribute to
the kind of infrastructure both Ross & Mark are talking about:
http://sites.tufts.edu/liam/deliverables/prospectus-for-linked-archival-metadata-a-guidebook/

While I agree that converting legacy data in EAD & MARC formats to RDF is
not the approach this work will take in the future, I also believe that
these are formats that the archival community is very familiar with, and
XSLT is a tool that many archivists work with regularly. A workflow for
that community to experiment is a laudable goal.

In short, I think we need approaches that illustrate the potential of
linked data in archives, to highlight some of the shortcomings in our
current metadata management frameworks, to help archivists be in a position
to get their metadata ready for what Mark is describing in the context of
ArchivesSpace (e.g. please use id attributes in c tags!!), and to have a
more complete picture of why doing so is of some value.

Sorry for the long message, and I hope that the context is helpful.

Regards,
-Corey



On Thu, Dec 5, 2013 at 11:33 AM, Mark A. Matienzo
wrote:

> On Thu, Dec 5, 2013 at 11:26 AM, Eric Lease Morgan  wrote:
>
> >
> > Good question! At the very least, these applications (ArchivesSpace,
> > Archivists’ Toolkit, etc.) can regularly and systematically export their
> > data as EAD, and the EAD can be made available as linked data. It would
> be
> > ideal if the applications where to natively make their metadata available
> > as linked data, but exporting their content as EAD is a functional
> stopgap
> > solution. —Eric Morgan
> >
>
> Wouldn't it make more sense, especially with a system like ArchivesSpace,
> which provides a backend HTTP API and a public UI, to publish linked data
> directly instead of adding yet another stopgap?
>
> Mark
>
> --
> Mark A. Matienzo 
> Director of Technology, Digital Public Library of America
>



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] transforming marc to rdf

2013-12-04 Thread Corey A Harper
Eric,

Have you had a look at Ed Chamberlain's work on COMET:
https://github.com/edchamberlain/COMET

It's been a while since I've run this, but if I remember correctly, it was
fairly "easy-to-use".

Also, though much older, I seem to remember the Simile MARC "RDFizer" being
a pretty straightforward one to run:
http://simile.mit.edu/wiki/MARC/MODS_RDFizer

MODS aficionados will point to some problems with some of it's choices for
representing that data, but still a good starting point (IMO).

Hope that helps,
-Corey



On Wed, Dec 4, 2013 at 9:59 PM, Eric Lease Morgan  wrote:

> I have to eat some crow, and I hope somebody here can give me some advice
> for transforming MARC to RDF.
>
> I am in the midst of writing a book describing the benefits of linked data
> for archives. Archival metadata usually comes in two flavors: EAD and MARC.
> I found a nifty XSL stylesheet from the Archives Hub (that’s in the United
> Kingdom) transforming EAD to RDF/XML. [1] With a bit of customization I
> think it could be used quite well for just about anybody with EAD files. I
> have retained a resulting RDF/XML file online. [2]
>
> Converting MARC to RDF has been more problematic. There are various tools
> enabling me to convert my original MARC into MARCXML and/or MODS. After
> that I can reportably use a few tools to convert to RDF:
>
>   * MARC21slim2RDFDC.xsl [3] - functions, but even for
> my tastes the resulting RDF is too vanilla. [4]
>
>   * modsrdf.xsl [5] - optimal, but when I use my
> transformation engine (Saxon), I do not get XML
> but rather plain text
>
>   * BIBFRAME Tools [6] - sports nice ontologies, but
> the online tools won’t scale for large operations
>
> In short, I have discovered nothing that is “easy-to-use”. Can you provide
> me with any other links allowing me to convert MARC to serialized RDF?
>
> [1] ead2rdf.xsl - http://data.archiveshub.ac.uk/xslt/ead2rdf.xsl
> [2] transformed EAD file - http://infomotions.com/tmp/una-ano.rdf
> [3] MARC21slim2RDFDC.xsl -
> http://www.loc.gov/standards/marcxml/xslt/MARC21slim2RDFDC.xsl
> [4] vanilla RDF - http://infomotions.com/tmp/pamphlets.rdf
> [5] modsrdf.xsl -
> http://www.loc.gov/standards/mods/modsrdf/xsl-files/modsrdf.xsl
> [6] BIBFRAME Tools - http://bibframe.org/tools/transform/start
>
> —
> Eric Lease Morgan
>



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] Follow-up to my c4l13 lightning talk (emotion, interactive fiction, and linked data)

2013-02-18 Thread Corey A Harper
Mark,

Thank you so much for this. Both your talk, and this essay, are
amazing. I feel like there's 4-6 months worth of material to explore &
contemplate in your post, and marvel at how clearly you've been able
to articulate the last 4-6 months of your own thinking.

I was tempted to open my response with "anarchivist++", partly as an
allusion to your point about "protological control", and partly to
point out that in our own community here we have a form of that as
well, though unlike facebook's "like", it is both owned by & beholden
to _us_... I'm not sure why I think that makes a difference, but I do.

Like Karen, I can say that your words have shone a light on something
that I've also tried to understand. I hear you speaking to what I've
tried to describe as an opportunity to merge an archive's or a
library's narrative with the narratives of users, scholars,
researchers & other interested parties who engage both the resources
in our collections & the ideas, people & organizations those resource
describe. As I read your post, I realize how much the slides in my own
talk about narrative and about context are derivative of the
conversations you & I have had on many occasions.

I think you've hit on something extremely important about the emerging
changes in scholarly publication, and publication in general, and how
they relate to the resources in library, archive, and museum
collections. The relationship between annotation, research,
publishing, conversation, and narrative... I've also been thinking
about that a lot, and now realize one of the missing pieces is
emotion.

Looking forward to talking about this more,
-Corey


On Mon, Feb 18, 2013 at 10:15 AM, Mark A. Matienzo
 wrote:
> I want to thank the code4lib community for the opportunity to present
> my lightning talk [0] at the conference last week. As I could tell
> from the positive feedback I got in person and via email and Twitter,
> there wasn't enough time to unpack all my ideas in 5 minutes.
> Accordingly, I wrote up a blog post to expand some of the ideas and
> give them a better context [1].
>
> If you're curious or have ideas I'd love to have your feedback. I know
> I owe several of you emails - I'll get back to you soon!
>
> xo,
> Mark
>
> [0] http://matienzo.org/storage/2013/2013Feb-code4lib-lightning-talk
> [1] 
> http://matienzo.org/blog/2013/emotion-archives-interactive-fiction-linked-data/



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] Choosing fora. was: Proliferation of Code4Lib Channels

2012-12-03 Thread Corey A Harper
FWIW, the folks at Harvard's Library Innovation Lab have an instance
of the software behind HackerNews specifically for library stuff.
http://news.librarycloud.org/

It's been pretty pretty low traffic, but the infrastructure's there
and I'm sure we'd be welcome.

Also, lobst.er's looks like it might be my next destination when hn's
signal/noise ratio gets unbearable Wish someone would fork the hn
android app so that it'll work with arbitrary arc-based fora...

-Corey

On Mon, Dec 3, 2012 at 10:43 AM, Mark A. Matienzo
 wrote:
> On Mon, Dec 3, 2012 at 11:34 AM, Shaun Ellis  wrote:
>> I only suggested and created the subreddit because it's easy to set up and
>> requires very little maintenance.  I, for one, am open to suggestions for
>> tools with similar functionality, so long as they don't require too much
>> maintenance.
>>
>> Looking at the Hacker News source code... anyone know Arc? :)
>
> For what it's worth, https://lobste.rs (an improved, invite-only
> Hacker News-clone) has its source up on Github - it's a Rails app:
> https://github.com/jcs/lobsters
>
> Mark



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


[CODE4LIB]

2012-11-27 Thread Corey A Harper
I did back-of-envelope math last year, based on the attendees list,
and my calculations showed that 54 out of 244 attendees were female,
so about 22%. This # is surely off as there were about 25 names that I
was unable to put a gender with. I counted these as male to get a
conservative estimate.

I believe this to be an increase from previous years, or perhaps
comparable to 2011. I'd guess all 3 percentages (attendees, proposals,
presenters) have been steadily increasing at pace since 2006. We can
probably estimate that the 2012 conf was 22% women, 2013 proposers
were 16% women, and presenters will be 12% women.

It would be interesting to do a longitudinal study of all 3 numbers
and some nifty data vis alongside results of the survey being
discussed. In addition to increasingly all 3 numbers, our goal should
also be reducing the (albeit slight) discrepancy across the ratios.

-Corey

On Tue, Nov 27, 2012 at 9:31 AM, Bohyun Kim  wrote:
> By any chance, do we have the numbers of the previous code4lib conference 
> attendees by the female/male ratio?
>
> ~Bohyun
> By any chance, do we have the numbers of the previous code4lib conference 
> attendees by the female/male ratio?

> 
> From: Code for Libraries [CODE4LIB@LISTSERV.ND.EDU] on behalf of Ross Singer 
> [rossfsin...@gmail.com]
> Sent: Tuesday, November 27, 2012 10:20 AM
> To: CODE4LIB@LISTSERV.ND.EDU
> Subject: [CODE4LIB]
>
> On Nov 27, 2012, at 10:03 AM, Chad Nelson  wrote:
>
>> Rosalyn,
>>
>> If we are only 17% women, when we are subset of the broader Library
>> community, which is majority women, then we are doing something wrong. And
>> that deeper question, what do we need to do to encourage more women to
>> participate in the community, to make the community as a whole appealing
>> and safe, is the question I am really asking.
>>
>
> I'm not entirely sure I agree with this.  The issue is less about where the 
> number is now than where it's going (and how quickly).
>
> Is our (completely hypothetical) 17% up from 2006 (or whenever), when 
> Code4lib started?  If so, then I'm less inclined to panic about the 
> statistics and just continue working towards making the community amenable to 
> more groups.
>
> If it has plateaued or regressed, then, yes, we need to be extremely 
> concerned.
>
> -Ross.
>
>> Chad
>>
>>
>> On Tue, Nov 27, 2012 at 9:57 AM, Rosalyn Metz  wrote:
>>
>>> I think first we would need to do a survey of how many women are in the
>>> community.  if it turns out that this community is only 17% women then
>>> we're on target.  who knows, maybe we're actually 10% women and we're way
>>> above target.  in which case the real question might be "how do we get more
>>> women in tech."
>>>
>>>
>>> On Tue, Nov 27, 2012 at 9:11 AM, Chad Nelson 
>>> wrote:
>>>
>>>> Ooops. Hit the wrong key.
>>>>
>>>> So, about our presenters...
>>>>
>>>> Is it a problem that only 4 of our 33 presenters are women? Or that only
>>> 16
>>>> of 95 proposers were women?
>>>>
>>>> Is there something this community needs to do to encourage more women to
>>>> feel like they can and should speak / propose sessions?
>>>>
>>>



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread Corey A Harper
Karen,

I think there's a useful distinction here. Ed can correct me if I'm
wrong, but I suspect he was not actually suggesting that Git itself be
the user-interface to a github-for-data type service, but rather that
such a service can be built *on top* of an infrastructure component
like GitHub.

I agree that there's a barrier to use if we just plunk a bunch of our
bib data in GitHub and call it done, but the version control model and
implementation there could definitely provide a good bit of the
library-version-control stack down below the UI layer.

Cheers,
-Corey

On Mon, Aug 27, 2012 at 8:49 AM, Karen Coyle  wrote:
> Actually, Ed, this would not only make for a good blog post (please, so it
> doesn't get lost in email space), but I would love to see a discussion of
> what kind of revision control would work:
>
> 1) for libraries (git is gawdawful nerdy)
> 2) for linked data
>
> kc
> p.s. the Ramsay book is now showing on Open Library, and the subtitle is
> correct... perhaps because the record is from the LC MARC service :-)
> http://openlibrary.org/works/OL16528530W/Reading_machines
>
>
> On 8/26/12 6:32 PM, Ed Summers wrote:
>>
>> Thanks for sharing this bit of detective work. I noticed something
>> similar fairly recently myself [1], but didn't discover as plausible
>> of a scenario for what had happened as you did. I imagine others have
>> noticed this network effect before as well.
>>
>> On Tue, Aug 21, 2012 at 11:42 AM, Lars Aronsson  wrote:
>>>
>>> And sure enough, there it is,
>>> http://clio.cul.columbia.edu:7018/vwebv/holdingsInfo?bibId=1439352
>>> But will my error report to Worldcat find its way back
>>> to CLIO? Or if I report the error to Columbia University,
>>> will the correction propagate to Google, Hathi and Worldcat?
>>> (Columbia asks me for a student ID when I want to give
>>> feedback, so that removes this option for me.)
>>
>> I realize this probably will sound flippant (or overly grandiose), but
>> innovating solutions to this problem, where there isn't necessarily
>> one metadata master that everyone is slaved to seems to be one of the
>> more important and interesting problems that our sector faces.
>>
>> When Columbia University can become the source of a bibliographic
>> record for Google Books, HathiTrust and OpenLibrary, etc how does this
>> change the hub and spoke workflows (with OCLC as the hub) that we are
>> more familiar with? I think this topic is what's at the heart of the
>> discussions about a "github-for-data" [2,3], since decentralized
>> version control systems [4] allow for the evolution of more organic,
>> push/pull, multimaster workflows...and platforms like Github make them
>> socially feasible, easy and fun.
>>
>> I also think Linked Library Data, where bibliographic descriptions are
>> REST enabled Web resources identified with URLs, and patterns such as
>> webhooks [5] make it easy to trigger update events could be part of an
>> answer. Feed technologies like Atom, RSS and the work being done on
>> ResourceSync also seem important technologies for us to use to allow
>> people to poll for changes [6]. And being able to say where you have
>> obtained data from, possibly using something like the W3C Provenance
>> vocabulary [7] also seems like an important part of the puzzle.
>>
>> I'm sure there are other (and perhaps better) creative analogies or
>> tools that could help solve this problem. I think you're probably
>> right that we are starting to see the errors more now that more
>> library data is becoming part of the visible Web via projects like
>> GoogleBooks, HathiTrust, OpenLibrary and other enterprising libraries
>> that design their catalogs to be crawlable and indexable by search
>> engines.
>>
>> But I think it's more fun to think about (and hack on) what grassroots
>> things we could be doing to help these new bibliographic data
>> workflows to grow and flourish than to get piled under by the errors,
>> and a sense of futility...
>>
>> Or it might make for a good article or dissertation topic :-)
>>
>> //Ed
>>
>> [1] http://inkdroid.org/journal/2011/12/25/genealogy-of-a-typo/
>> [2] http://www.informationdiet.com/blog/read/we-need-a-github-for-data
>> [3] http://sunlightlabs.com/blog/2010/we-dont-need-a-github-for-data/
>> [4] http://en.wikipedia.org/wiki/Distributed_revision_control
>> [5] https://help.github.com/articles/post-receive-hooks
>> [6] http://www.niso.org/workrooms/resourcesync/
>> [7] http://www.w3.org/TR/prov-primer/
>
>
> --
> Karen Coyle
> kco...@kcoyle.net http://kcoyle.net
> ph: 1-510-540-7596
> m: 1-510-435-8234
> skype: kcoylenet



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


[CODE4LIB] Fwd: "Learning Linked Data" - comments sought

2012-06-17 Thread Corey A Harper
 descriptions of things and descriptions of the vocabularies used to describe
  those things are all considered "data," so many of the basic tools for
  editing, mapping, converting, and extracting data may be adapted for
  different types of data.

-- Visualization [5]: Linked Data is conceptually diagrammatic in nature, and
  graphical tools can help the learner explore the statistical, spatial, or
  temporal characteristics of datasets by visualizing webs of data at various
  levels of granularity or by plotting the data to maps or timelines.

-- Implementing a Linked Data Application [6]: Simply learning how to
interpret and
  manipulate Linked Data could stop with the topics outlined above.  The extent
  to which a language-lab-like platform for learning Linked Data
should encompass
  tools for building real applications poses questions of scope on which the
  project would appreciate input.

The project envisions the platform as a basis for the development of course
modules by people involved in both formal and informal learning environments,
so comments about the usefulness of such a platform for particular scenarios
would be especially welcome.

The comments received will be incorporated into a revised document and final
report to be published in September 2012. This report will be used as the basis
for a subsequent IMLS project proposal, to be submitted in early 2013, for
implementing the platform specified.

The partners of the Learning Linked Data Project are the University of
Washington, Kent State University, the University of North Carolina, JES &
Company, and 3 Round Stones, Inc. The project lead and contact person is Mike
Crandall of the University of Washington.

[1] http://www.imls.gov/news/national_leadership_grant_announcement.aspx#WA
[2] http://lld.ischool.uw.edu/wp/learning/understanding-linked-data/
[3] http://lld.ischool.uw.edu/wp/learning/searching-and-querying-linked-data/
[4] http://lld.ischool.uw.edu/wp/learning/creating-and-manipulating-rdf-data/
[5] http://lld.ischool.uw.edu/wp/learning/visualization/
[6] 
http://lld.ischool.uw.edu/wp/learning/implementing-a-linked-data-application/
[7] http://lld.ischool.uw.edu/wp/glossary/

--
Tom Baker 
Learning Linked Data
   Wiki: http://wiki.dublincore.org/index.php/Learning_Linked_Data
   List: http://dublincore.org/pipermail/learninglinkeddata/

--
Tom Baker 


-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


[CODE4LIB] Crowd-sourcing c4l12 video QA

2012-02-12 Thread Corey A Harper
Hey all,

Just a friendly request that folks viewing the current ad-hoc archives
of the livestream [1] help me out with quality assurance.

As a lot of you know, I have a secondary video set of variable quality
recorded to DV tapes. I'll be spending some of tomorrow (2/13)
capturing those to file at UW. I don't have time to watch all the
archived footage, but I know a lot of you have been viewing &/or
looking for specific talks / segments.

To this end, I've put an editable copy of the schedule on the wiki. If
you've watched something and know it's good, let me know. If you've
watched something and noted that the stream dropped or the quality
bites let me know. If you looked for and couldn't find something,
definitely let me know. I can't promise a perfect archive, but I hope
you'll all want to help me make it as good as it can be.

You can either edit in the wiki directly, or email me, or hit me up on
twitter (@chrpr), or I'll try to lurk in channel sometimes the next
few weeks.

This way, over the next few of weeks, I can cobble together the best
possible footages, load them to the Internet Archive as well as
re-load them and tag them on Livestream, and link them from the
conference schedule.

It'll probably take me a few weeks to get all of this done, and feel
free to let me know if you have any suggestions or want to help in any
way.

Thanks,
-Corey

[1]http://www.livestream.com/code4lib/
[2]http://wiki.code4lib.org/index.php/Crowdsourced_Video_QA
-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] code4lib 2012 streaming

2012-02-07 Thread Corey A Harper
Dear Remote code4libbers,

The livestream is now up and running at:
http://www.livestream.com/code4lib

Currently playing ambient sounds of breakfast, but opening sessions
begin at 9am.

Ping me on email or twitter / irc (chrpr) if you notice anything problematic.

-Corey

On Wed, Feb 1, 2012 at 8:48 AM, Corey A Harper  wrote:
> Dear All,
>
> I'll be managing our attempts to ensure code4lib 2012 is streamed. The
> plan is to stream all plenary portions of the conference via
> livestream, and I'll post the channel link to IRC, Twitter & on this
> list before the event begins. If all goes well, we'll have a stream
> for the following (PST) times:
>  * Tues: 9am-12pm, 1pm-2.40, 4-5.20
>  * Wed: 9am-12pm, 1pm-2.20, 3.50-5.15
>  * Thu: 9am-12pm
>
> The streaming committee has some concerns about the equipment we have
> access to, so if there is anyone in the community who would volunteer
> a digital camcorder with a firewire known to be compatible with
> Livestream, we would be in your debt. (Which means I would buy you
> beer from time to time throughout the conference...)
>
> Alternately, I have leads on rental equipment, so please let me know
> (offlist) if virtual attendees would be willing to donate toward the
> stream or if onsite attendees would be willing to make a donation at
> the door. :)
>
> Thanks in advance. I will post a link to the livestream channel no
> later than Monday.
>
> Best,
> -Corey
>
> On Mon, Jan 30, 2012 at 11:21 AM, Julia Bauder  wrote:
>> Speaking of video streaming, is there any information yet about the
>> streaming? E.g., what will be streamed, and where will the links to the
>> stream appear?
>>
>> Julia (who is also eagerly awaiting her streaming + IRC Code4Lib fix)
>>
>> On Mon, Jan 30, 2012 at 9:50 AM, Ranti Junus  wrote:
>>
>>> Hello All,
>>>
>>>
>>> For those who might not realize it, the code4lib 2012 schedule is up.
>>> http://code4lib.org/conference/2012/schedule
>>>
>>> Once the conference is over, we'll work on adding the links to the
>>> presentations. Better yet, those of you who do the presentation can
>>> add the link to your own presentation (slides, screencast, code
>>> examples, etc.) You'd need to register for an account first, if you
>>> haven't done that.
>>>
>>> Have a great time, everyone! I'm looking forward to watch the video
>>> streaming and participate in the #code4lib IRC.
>>>
>>>
>>> thanks,
>>> ranti on behalf of code4lib 2012 program committee
>>>
>>> --
>>> Bulk mail.  Postage paid.
>>>
>
>
>
> --
> Corey A Harper
> Metadata Services Librarian
> New York University Libraries
> 20 Cooper Square, 3rd Floor
> New York, NY 10003-7112
> 212.998.2479
> corey.har...@nyu.edu



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] code4lib 2012 streaming

2012-02-01 Thread Corey A Harper
By the way, if anyone has any experience with streaming video with
livestream, or recording video for archiving, or is interested in
trying their hand at pointing a camera at really smart people while
they talk, please consider signing up to join the video committee:
http://wiki.code4lib.org/index.php/During_the_Conference_Volunteers#Video_Committee

On Wed, Feb 1, 2012 at 1:19 PM, Corey A Harper  wrote:
> Yes. The camera's being provided by UW will work for that, and there
> will be a second camera running to grab a version to process in the
> weeks following the conference & post to the internet archive as done
> in previous years.
>
> I decided on two separate cameras because of feedback from Access 2012
> indicating that if the livestream dropped, the archival copy that can
> be pulled out later is also lost. Therefore, I'm erring on the side of
> redundant capture.
>
> -Corey
>
> On Wed, Feb 1, 2012 at 1:17 PM, Jonathan Rochkind  wrote:
>> Is the video also being recorded for putting up on the web later?
>>
>>
>> On 2/1/2012 11:48 AM, Corey A Harper wrote:
>>>
>>> Dear All,
>>>
>>> I'll be managing our attempts to ensure code4lib 2012 is streamed. The
>>> plan is to stream all plenary portions of the conference via
>>> livestream, and I'll post the channel link to IRC, Twitter&  on this
>>>
>>> list before the event begins. If all goes well, we'll have a stream
>>> for the following (PST) times:
>>>  * Tues: 9am-12pm, 1pm-2.40, 4-5.20
>>>  * Wed: 9am-12pm, 1pm-2.20, 3.50-5.15
>>>  * Thu: 9am-12pm
>>>
>>> The streaming committee has some concerns about the equipment we have
>>> access to, so if there is anyone in the community who would volunteer
>>> a digital camcorder with a firewire known to be compatible with
>>> Livestream, we would be in your debt. (Which means I would buy you
>>> beer from time to time throughout the conference...)
>>>
>>> Alternately, I have leads on rental equipment, so please let me know
>>> (offlist) if virtual attendees would be willing to donate toward the
>>> stream or if onsite attendees would be willing to make a donation at
>>> the door. :)
>>>
>>> Thanks in advance. I will post a link to the livestream channel no
>>> later than Monday.
>>>
>>> Best,
>>> -Corey
>>>
>>> On Mon, Jan 30, 2012 at 11:21 AM, Julia Bauder
>>>  wrote:
>>>>
>>>> Speaking of video streaming, is there any information yet about the
>>>> streaming? E.g., what will be streamed, and where will the links to the
>>>> stream appear?
>>>>
>>>> Julia (who is also eagerly awaiting her streaming + IRC Code4Lib fix)
>>>>
>>>> On Mon, Jan 30, 2012 at 9:50 AM, Ranti Junus
>>>>  wrote:
>>>>
>>>>> Hello All,
>>>>>
>>>>>
>>>>> For those who might not realize it, the code4lib 2012 schedule is up.
>>>>> http://code4lib.org/conference/2012/schedule
>>>>>
>>>>> Once the conference is over, we'll work on adding the links to the
>>>>> presentations. Better yet, those of you who do the presentation can
>>>>> add the link to your own presentation (slides, screencast, code
>>>>> examples, etc.) You'd need to register for an account first, if you
>>>>> haven't done that.
>>>>>
>>>>> Have a great time, everyone! I'm looking forward to watch the video
>>>>> streaming and participate in the #code4lib IRC.
>>>>>
>>>>>
>>>>> thanks,
>>>>> ranti on behalf of code4lib 2012 program committee
>>>>>
>>>>> --
>>>>> Bulk mail.  Postage paid.
>>>>>
>>>
>>>
>>
>
>
>
> --
> Corey A Harper
> Metadata Services Librarian
> New York University Libraries
> 20 Cooper Square, 3rd Floor
> New York, NY 10003-7112
> 212.998.2479
> corey.har...@nyu.edu



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] code4lib 2012 streaming

2012-02-01 Thread Corey A Harper
Yes. The camera's being provided by UW will work for that, and there
will be a second camera running to grab a version to process in the
weeks following the conference & post to the internet archive as done
in previous years.

I decided on two separate cameras because of feedback from Access 2012
indicating that if the livestream dropped, the archival copy that can
be pulled out later is also lost. Therefore, I'm erring on the side of
redundant capture.

-Corey

On Wed, Feb 1, 2012 at 1:17 PM, Jonathan Rochkind  wrote:
> Is the video also being recorded for putting up on the web later?
>
>
> On 2/1/2012 11:48 AM, Corey A Harper wrote:
>>
>> Dear All,
>>
>> I'll be managing our attempts to ensure code4lib 2012 is streamed. The
>> plan is to stream all plenary portions of the conference via
>> livestream, and I'll post the channel link to IRC, Twitter&  on this
>>
>> list before the event begins. If all goes well, we'll have a stream
>> for the following (PST) times:
>>  * Tues: 9am-12pm, 1pm-2.40, 4-5.20
>>  * Wed: 9am-12pm, 1pm-2.20, 3.50-5.15
>>  * Thu: 9am-12pm
>>
>> The streaming committee has some concerns about the equipment we have
>> access to, so if there is anyone in the community who would volunteer
>> a digital camcorder with a firewire known to be compatible with
>> Livestream, we would be in your debt. (Which means I would buy you
>> beer from time to time throughout the conference...)
>>
>> Alternately, I have leads on rental equipment, so please let me know
>> (offlist) if virtual attendees would be willing to donate toward the
>> stream or if onsite attendees would be willing to make a donation at
>> the door. :)
>>
>> Thanks in advance. I will post a link to the livestream channel no
>> later than Monday.
>>
>> Best,
>> -Corey
>>
>> On Mon, Jan 30, 2012 at 11:21 AM, Julia Bauder
>>  wrote:
>>>
>>> Speaking of video streaming, is there any information yet about the
>>> streaming? E.g., what will be streamed, and where will the links to the
>>> stream appear?
>>>
>>> Julia (who is also eagerly awaiting her streaming + IRC Code4Lib fix)
>>>
>>> On Mon, Jan 30, 2012 at 9:50 AM, Ranti Junus
>>>  wrote:
>>>
>>>> Hello All,
>>>>
>>>>
>>>> For those who might not realize it, the code4lib 2012 schedule is up.
>>>> http://code4lib.org/conference/2012/schedule
>>>>
>>>> Once the conference is over, we'll work on adding the links to the
>>>> presentations. Better yet, those of you who do the presentation can
>>>> add the link to your own presentation (slides, screencast, code
>>>> examples, etc.) You'd need to register for an account first, if you
>>>> haven't done that.
>>>>
>>>> Have a great time, everyone! I'm looking forward to watch the video
>>>> streaming and participate in the #code4lib IRC.
>>>>
>>>>
>>>> thanks,
>>>> ranti on behalf of code4lib 2012 program committee
>>>>
>>>> --
>>>> Bulk mail.  Postage paid.
>>>>
>>
>>
>



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] jobs.code4lib.org

2012-02-01 Thread Corey A Harper
I should repost the reply I sent on the c4lcon list here:

Hey Ed,

Thanks for posting this summary here. It's really cool to see a
description of how this is working.

I think this is a pretty good example of how a library data mgt
interface of the future might work:
* Grab some free text describing a thing;
* Try to clean it up, extract important concepts / themes topics
* Reconcile against some sort lod-lam friendly controlled vocabulary/ies
* Offer "cataloger" types an interface to accept / reject / refine
those mappings as well as the text of the "metadata" itself.

I would go to a breakout session about this.

Best,
-Corey

On Wed, Feb 1, 2012 at 9:46 AM, Cynthia Ng  wrote:
> Just a quick 2 cents. I only found out about the feed by reading this
> conversation. I think it would be great to make the RSS link a little
> more obvious from the front page.
>
> -Cynthia
>
> On Wed, Feb 1, 2012 at 8:22 AM, Michael J. Giarlo
>  wrote:
>> I smell a potential breakout session.
>>
>> -Mike
>>
>> P.S. No, really, jokers, that's what I smell.
>> On Jan 31, 2012 11:30 PM, "Ed Summers"  wrote:
>>
>>> I guess it's rarely a good idea to respond to your own post, but I
>>> forgot to add that when a job is "published" on jobs.code4lib.org it
>>> will show up in the site's Atom feed [1]. The feed should be usable by
>>> your feed reader of choice, and could also be useful if you want to
>>> syndicate the jobs elsewhere.
>>>
>>> //Ed
>>>
>>> [1] http://jobs.code4lib.org/feed/
>>>
>>> PS. It was kind of fun to finally use the "tag" link relation to mark
>>> up the job tags in the feed with Freebase URLs. For example:
>>>
>>> 
>>>        ...
>>>        >> href="http://www.freebase.com/view/en/unix"; type="text/html" />
>>>        >> href="http://www.freebase.com/experimental/topic/standard/en/unix";
>>> type="application/json" />
>>>        >> href="http://rdf.freebase.com/rdf/en.unix"; type="application/rdf+xml"
>>> />
>>> 
>>>



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


[CODE4LIB] code4lib 2012 streaming

2012-02-01 Thread Corey A Harper
Dear All,

I'll be managing our attempts to ensure code4lib 2012 is streamed. The
plan is to stream all plenary portions of the conference via
livestream, and I'll post the channel link to IRC, Twitter & on this
list before the event begins. If all goes well, we'll have a stream
for the following (PST) times:
 * Tues: 9am-12pm, 1pm-2.40, 4-5.20
 * Wed: 9am-12pm, 1pm-2.20, 3.50-5.15
 * Thu: 9am-12pm

The streaming committee has some concerns about the equipment we have
access to, so if there is anyone in the community who would volunteer
a digital camcorder with a firewire known to be compatible with
Livestream, we would be in your debt. (Which means I would buy you
beer from time to time throughout the conference...)

Alternately, I have leads on rental equipment, so please let me know
(offlist) if virtual attendees would be willing to donate toward the
stream or if onsite attendees would be willing to make a donation at
the door. :)

Thanks in advance. I will post a link to the livestream channel no
later than Monday.

Best,
-Corey

On Mon, Jan 30, 2012 at 11:21 AM, Julia Bauder  wrote:
> Speaking of video streaming, is there any information yet about the
> streaming? E.g., what will be streamed, and where will the links to the
> stream appear?
>
> Julia (who is also eagerly awaiting her streaming + IRC Code4Lib fix)
>
> On Mon, Jan 30, 2012 at 9:50 AM, Ranti Junus  wrote:
>
>> Hello All,
>>
>>
>> For those who might not realize it, the code4lib 2012 schedule is up.
>> http://code4lib.org/conference/2012/schedule
>>
>> Once the conference is over, we'll work on adding the links to the
>> presentations. Better yet, those of you who do the presentation can
>> add the link to your own presentation (slides, screencast, code
>> examples, etc.) You'd need to register for an account first, if you
>> haven't done that.
>>
>> Have a great time, everyone! I'm looking forward to watch the video
>> streaming and participate in the #code4lib IRC.
>>
>>
>> thanks,
>> ranti on behalf of code4lib 2012 program committee
>>
>> --
>> Bulk mail.  Postage paid.
>>



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] My crazed idea about dealing with registration limitations

2011-12-22 Thread Corey A Harper
Cary,

Good to know about your extensive experience w/ streaming.

If you'll be in Seattle, would you be willing to add your name to the
"Video Committee" listing?
http://wiki.code4lib.org/index.php/2012_committees_sign-up_page#Video_Committee

Having people who actually know what they're doing involved in this
effort *this* year will help ensure that we're actually able to pull
it off as effectively as IU did...

Thanks,
-Corey


On Thu, Dec 22, 2011 at 10:42 AM, Cary Gordon  wrote:
> This is definitely doable, and potentially effective for a single
> track conference.
>
> I have been doing streaming as a volunteer for eight years and it
> keeps getting easier.
>
> Cary
>
> On Thu, Dec 22, 2011 at 7:33 AM, Wilfred Drew  wrote:
>> Here is another crazy idea; stream the event live for those who can't get 
>> registered for the pace to face version and provide a lower registration fee 
>> for them.
>>
>>
>> -
>> Wilfred (Bill) Drew, M.S., B.S., A.S.
>> Assistant Professor
>> Librarian, Systems and Tech Services/Electronic Resources/Serials
>> Tompkins Cortland Community College  (TC3) Library:
>> http://www.tc3.edu/library/
>> Dryden, N.Y. 13053-0139
>> Follow the library: http://twitter.com/TC3Library
>> E-mail: dr...@tc3.edu
>> Phone: 607-844-8222 ext.4406
>> SKYPE/Twitter:BillDrew4
>> SMS/TXT Me: 6072182217
>> Website: http://BillTheLibrarian.com
>> StrengthsQuest Strengths: Ideation, Input, Learner, Command, Analytical
>> http://www.facebook.com/billdrew
>> "One thing about eBooks that most people haven't thought much is that eBooks 
>> are the very first thing that we're all able to have as much as we want 
>> other than air." -- Michael Hart, Project Gutenberg
>> Please consider the environment before printing this e-mail or document.
>
>
>
> --
> Cary Gordon
> The Cherry Hill Company
> http://chillco.com



-- 
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


[CODE4LIB] ALA Linked Library Data Interest Group - 6/26, 10:30-12

2011-06-13 Thread Corey A Harper

*** With apologies for cross-posting ***

The first official meeting of the new LITA/ALCTS Linked Library Data
Interest Group (LLD-IG) will take place from 10:30-12 on Sunday, June
26 in Convention Center Room 265. The agenda is below, and online at
http://wikis.ala.org/lita/index.php/Linkeddata

We can also make time for a limited number of lightning talk style
presentations of no longer than 5 minutes if you have projects or
topics that you'd like to share. If you'd like to give a brief talk,
please contact the interest group chairs.

Agenda:
1) Report on LOD-LAM meeting
2) W3C LLD Recommendations
3) What this SIG will do... sub-groups? other activities? list of lld
projects? training? wiki discussion? Our next meeting?
4) Short reports from anyone present: LLD activities, upcoming
conferences/meetings, ideas... anything
5) Challenge for next time: someone to commit to bring something to show

Minutes of previous "informal" LLD-IG meetings from ALA-2010 in
Washington DC, Dublin Core 2010 in Pittsburgh, and ALA-MW-2011 in San
Diego are online at:
http://wikis.ala.org/lita/index.php/Linkeddata

Thanks,
Karen Coyle and Corey A Harper
ALCTS/LITA LLD-IG Co-Chairs
corey.har...@nyu.edu
kco...@kcoyle.net

--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] If you were starting over, what would you learn and how would you do it?

2011-05-06 Thread Corey A Harper
That may apply to this profession, too. I've often been convinced that 
someone in the c4l community *was* Batman.


On 5/6/2011 4:47 PM, Nate Vack wrote:

On Fri, May 6, 2011 at 2:07 PM, Ceci Land  wrote:


How would you choose to develop your skills from "baby" level to something 
useful to the profession?


I'd pretty much follow the plot of "Batman Begins" as closely as possible.

Wait, useful to *this* profession?

-n


--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] If you were starting over, what would you learn and how would you do it?

2011-05-06 Thread Corey A Harper

As others have just said, a portfolio may or may not matter.

The polished product isn't the issue, though. It's skills, principles 
and knowledge you pick up while experimenting.


Whether or not you share a spare-time project with a potential employer, 
having spent a bit of time on your own learning about this stuff will 
make you a stronger candidate & a better interviewee. You'll be able to 
speak that much more confidently and persuasively about a broader range 
of technologies, which will almost certainly help you in the job market.


-Corey

On 5/6/2011 4:06 PM, Ceci Land wrote:

I like this.  Maybe it's because it's what I was already thinking about doing.  
I have 3 project ideas twirling around in my head at the moment.  I can't do 
them at work, but perhaps the systems department could give me a dataset to 
play around with in my spare time.  I already have a good dataset for one of 
the projects that I harvested via OAI-PMH.

Do these spare-time projects get any respect from the "real world" when it comes time to 
apply for a job? particularly if you focus on really making it as polished as possible (within the 
limitations of a non-work environment)?  I remember building my own darkroom as a teenager and doing 
B&W and color slide and print processing. (yes, I still love the smell of D76 and stop bath.  I can 
bring up the smell purely from memory :)  ).  I did manage to work for a while in photography because 
of my original personal investment of time and energy into it as a hobby.  I'm just concerned that the 
things may not work that way any more.  Life was not only slower paced back then, but having an exact 
skill match wasn't required to get a foot in the door.   Plus, I'm no Mozart so it's not likely that 
I'll come up with something uber creative or so nifty that it's used by a community at large.  But I do 
good technical work.  I tinker...I make things "go".

Thanks for the advice.  I'm going to start playing with the projects I have in mind.  One 
is already done as a JSP, but I think I'll convert it to something else and "clean 
up" the compromises I had to make to get it done in a limited time.

Ceci


   >>>  On 5/6/2011 at 2:31 PM, in 
message, Devon  
wrote:

My answer to this question changes every time it gets asked.

These days, my thinking is that focusing on skills/tools is backwards.
Instead, focus on a problems and solutions. Pick something you want to
do, then do it. Figure it all out on the way. If you don't know where
to start, build and deploy a simple website. Try a solution. If it
doesn't work, try a different solution. Keep trying. Don't be afraid
to toss all your work away and start over. Make the website more
complex as you go. Add a database. Switch the whole thing to jQuery.
Then switch to something else. Just keep going.

/dev



--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] If you were starting over, what would you learn and how would you do it?

2011-05-06 Thread Corey A Harper
uld really challenge your 
programming, web development etc. skills?

I see many people on this list with very strong skills, but in the job world, I 
don't see many 2nd string/entry level jobs that would allow someone to hone 
their skills to the level I often see here.  I've been thinking that I should 
focus on further developing my abilities in: HTML/CSS of course, XML, XSLT, 
PHP, and MySQL (because they're all readily available for someone to play with 
despite not being employed in a systems department).  It seems that anything I 
can learn about metadata transformations/crosswalks and RDF would be useful 
too.  I also find some classification theories very compelling (ok, I admit 
that colon classification really got my attention in my first MLIS class) and 
found myself drawn to potentially being interested in taxonomies and controlled 
vocabulary.  I know nothing about Drupal, but I wonder if I should include in 
my smorgasbord.  How much is too much and where you y'all recommend I put my 
energy?

Any advice is greatly appreciated.  The more specific the better.  :)
Thx!



--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] Semantic web introduction to tools

2011-03-24 Thread Corey A Harper

Karen, Cindy, et al.,

I stumbled upon and bookmarked this list a few days ago:
http://spqr.cerch.kcl.ac.uk/?page_id=94

It seems pretty comprehensive. Might be worth a perusal, and possibly a 
link off the w3c-lld page as well.


Best,
-Corey

On 3/23/2011 11:15 PM, Karen Coyle wrote:

Sorry, I don't remember if I have already posted this, but there is a
growing list of tools at:

http://www.w3.org/2005/Incubator/lld/wiki/Tools

Most of these are free/non-commercial. If you know of others, let me
know or post to the public-...@w3.org list and we'll get it added on.

kc

Quoting Cindy Harper :


This article came in via email this morning - it may be the kind of
pointers
I needed to read about open-source tools to get started using the SW.


*Computerworld First Look*


<http://cwonline.computerworld.com/t/7258117/240182/237524/0/?0fc84754=Y2hhcnBlckBtYWlsLmNvbGdhdGUuZWR1&x=9633e82f>

--

*Semantic Web: Tools you can
use*<http://cwonline.computerworld.com/t/7258117/240182/376767/0/>
Standards, tools, platforms, prewritten components and services are
available to help make semantic deployments less time-consuming, less
technically complex and (somewhat) less costly. *Read
More*<http://cwonline.computerworld.com/t/7258117/240182/376767/0/>

Cindy Harper, Systems Librarian
Colgate University Libraries
char...@colgate.edu
315-228-7363







--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] Excel file to Dublin Core?

2011-03-01 Thread Corey A Harper

Hi Edward,

Like Mike, I have some existing tools that do this in perl - though very 
hack-ish.


Basically, I set the Windows default field separator to pipe (because I 
hate comma separated data) and save the spreadsheet as .csv, though a 
pipe delimited one.


My perl then reads that in, and maps columns to DC elements and 
OAI-header elements and spits out an XML file per row of the spreadsheet.


I'd be happy to pop a version of one of these into git-hub if you're 
interested.


-corey

On 3/1/2011 3:59 PM, Edward M. Corrado wrote:

Hi Mike,

Yes, by Dublin Core, I mean OAI Dublin Core XML.

Edward

On Tue, Mar 1, 2011 at 3:34 PM, Michael J. Giarlo
  wrote:

Edward,

Because I already have some code lying around that does more or less
the same thing, I'd probably sling some Python using the xlrd library
(N.B. works on xls files but not xlsx files). �It'd look similar to
this method, perhaps a little simpler, though this method doesn't
write out a DC file:

� � https://github.com/MaxFisher/caps/blob/master/pilot/views.py#L87

By "Dublin Core," I assume you mean OAI Dublin Core XML?

-Mike


On Tue, Mar 1, 2011 at 14:53, Edward M. Corrado  wrote:

Hello all,

I have an excel file that I need to map to Dublin Core. I can think of
a number of ways to do this, but was wondering if anyone else who has
done it has a suggestion before I dust off my old sed/awk skills or
otherwise reinvent the wheel. I looked at Terry Reese's MarcEdit and I
probably can use that, but it looks like I'd have to intermediately
convert it to MARC. Either a windows-based program or *nix tool is
fine.

Edward





--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


[CODE4LIB] Linked Library Data @ ALA Midwinter

2010-12-22 Thread Corey A Harper

** With apologies for cross-posting **

Linked Library Data @ ALA Midwinter
Sunday, Jan. 9, 2011, 10:30-Noon
Marriott, Del Mar Room

This meeting will build on sessions from ALA 2010 Annual [1] and DC-2010 
in Pittsburgh [2], though content will not assume attendees were at 
either session. An ongoing LITA/ALCTS Linked Library Data Interest Group 
is in the works, and attendees will be able to shape the direction of 
that group and its activities.[3] [4]


Specific agenda topics will be driven by attendee interest. Some 
possible questions that can be discussed are:


- What library system functions can be enhanced by linked data?
- What role can library vendors play in an environment with open linking?
- For those working on linked data projects, what are the main 
challenges you are facing?

- Where can we coordinate activities to make it easier to create links?
- What tools and communication channels could ease the burden of 
producing or working with linked data?
- What role can we play in influencing standards development to support 
the linked data environment?


Corey Harper, Co-Chair
Karen Coyle, Co-Chair

[1]http://kcoyle.net/lld-ala/lld-ala.html
[2]http://kcoyle.net/lld-ala/pittsburgh2010.html
[3]Function Statement (Draft): To provide a forum for discussion of 
issues related to Linked Library Data and the role of library metadata 
in the Semantic Web. Goals include: raising awareness of Semantic Web 
technologies, such as the Resource Description Framework (RDF) and the 
use of URIs as identifiers within bibliographic descriptions; promoting 
research on linked data challenges, such as domain modeling and 
vocabulary selection and design; and informing the ongoing development 
of existing metadata standards for Libraries, Archives and Cultural 
Heritage Institutions.

[4]Some further reading linked from: http://kcoyle.net/lld-ala/

--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


[CODE4LIB] Invitation to participate in DC-2010 Linked Data Special Sessions

2010-10-13 Thread Corey A Harper

*** Apologies for cross-posting ***

DC2010 is a week away, and I hope many of you are planning to attend.

As co-convenor of the special sessions on linked data, I wanted to let 
you know what to expect from these meetings. While there's plenty of 
linked data related goodness throughout the conference program, the 
linked data special sessions are as follows:


* Thu, 10/21, 2-3:30: Domain Models
* Thu, 10/21, 4-5:30: Vocabulary Selection and Development
* Fri, 10/22, 4-5:30: Breakout sessions & Follow-up

Karen Coyle and I will be co-moderating the 1st two sessions, and one or 
both of us will be moderating the discussions for the third. The 3 
sessions will be run as a mini-un-conference within the larger space of 
DC2010.


Beginning during registration at 8am on Wed, 10/20, there will be a 
signup sheet for lightening talks for the 1st two sessions. If you'll be 
in attendance, and would like to give a short, 5 minute talk on either 
data models for your domain, or work that you're doing on crafting an 
RDF vocabulary for a particular project, we'd like to devote at least 
20-30 minutes for each of these two sessions to participation from 
attendees. The remainder of the sessions will be brainstorming sessions, 
similar to the session Karen and I ran at ALA in Washington this Summer: 
http://kcoyle.net/lld-ala.html


The third session will be a mix of wrap up, discussion, identification 
of next steps, and collection of comments and ideas to share with both 
the DCMI Architecture Forum and the W3C Library Linked Data incubator 
group. The former will be meeting concurrently with our third session on 
Friday, and the latter will be having a Face to Face meeting over the 
weekend. If there are potential linked data use cases emerging from 
these sessions, groups may want to work on writing them up for 
submission to the W3C group. Attendees interested in discussion of 
developing application profiles based on the data models and 
vocabularies discussed in sessions one & two may wish to attend the 
first half of "Application Profiles for Linked Data: Models & 
Requirements" from 2-3:30pm on 10/22. If there is sufficient interest, a 
group of us could split off to prepare a short report to present at part 
2 of that session.


Further information about DC-2010, including program descriptions, is 
available online at: http://dc-2010.org/.


Please let me know if you have any questions about this series of 
special sessions, and we look forward to seeing you at DC2010.


Regards,
-Corey A Harper
--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] SRU indexes for Aleph

2010-06-09 Thread Corey A Harper

Hi David, et al.,

Aleph does have an SRU/SRW interface, though it's really just a 
translator that accepts CQL queries and maps them to z39.50 queries. We 
haven't implemented at NYU, though I've been meaning to get it running 
to give our digital library folks a cleaner interface.


For Aleph customers, there's a file called "ALEPH SRU/SRW Server" in the 
ExL doc portal. It's listed under "Z39.50 SRU (Cross Version)" rather 
than under a specific Aleph version.


Best,
-Corey

Walker, David wrote:
Are you saying that Aleph has no native SRU 
capability and YAZ is the only SRU access to it?


Someone correct me if I am wrong, but I'm pretty sure Alpeh *doesn't* have an 
SRU interface.  There's no documentation for one on the Ex Libris site anyway.

It does have some web services, and it's possible the library you are accessing 
here, Ralph, has written an SRU wrapper around those.  YAZ Proxy in front of 
the Alpeh Z39 server also a possibility, as Ere mentioned.

Probably best to talk to the specific library here.  It's very likely this is 
their own creation.

--Dave

==
David Walker
Library Web Services Manager
California State University
http://xerxes.calstate.edu

From: Code for Libraries [code4...@listserv.nd.edu] On Behalf Of LeVan,Ralph 
[le...@oclc.org]
Sent: Wednesday, June 09, 2010 8:29 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] SRU indexes for Aleph

-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Ere 
Maijala
Sent: Wednesday, June 09, 2010 4:11 AM


it's probably a custom YAZ Proxy. This is as far as I know the default
mapping to Z39.50 (of course it could have been modified locally):


Really?  I find it hard to believe that the Index Data folks don't know how to 
make an Explain record.

Are you saying that Aleph has no native SRU capability and YAZ is the only SRU 
access to it?

Thanks!

Ralph


--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


[CODE4LIB] ALA annual program: Linked Library Data Un-Meeting

2010-06-08 Thread Corey A Harper

** With apologies for cross-posting **

Linked Library Data Un-Meeting @ ALA Annual
Friday, June 25, 2010, 9am-12pm
Hyatt Regency Washington on Capitol Hill, Regency A room

This informal gathering, co-sponsored by LITA Next Generation Catalogs 
IG and ALCTS CC:DA, will be focused on discovering possible upcoming 
activities in the emerging Linked Library Data landscape. The goal is to 
surface a handful of particularly challenging issues in modeling library 
data for inclusion in the Linked Data ecosystem  that can become the 
focus of future formal and informal activities.


The meeting will begin with brief introductory presentations by Karen 
Coyle on the DC-RDA data modeling project, and Corey Harper on current 
state and emerging needs for RDF descriptions personal and corporate 
names, including use of FoaF vocabularies and OCLC's VIAF-linking work.


Following these presentations will be breakout sessions on topics 
identified by the attendees. Come prepared to participate, 
barcamp-style, in some intense brainstorming. Lead a discussion on a 
topic of your choice.


If you are interested in giving a brief "lightening talk", time can be 
made for additional short presentations before or after the discussions. 
These can be signed up for in advance by contacting Corey Harper, or 
proposed at the start of the meeting via a sign-up sheet.


Hopefully the meeting will also serve as a good follow-on for attendees 
of Thursday's Linked Data Pre-Conference.


The organizers are working on a follow-up meeting to continue these 
conversations at DC2010, which will be held in Pittsburgh, PA, October 
20-22, 2010.


--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] MODS and DCTERMS

2010-05-03 Thread Corey A Harper
n 
focusing on working for the bulk of use cases.

But I realize this now sounds like an NGC4lib thread, and for that I apologize.  :-)  So, 
to keep it pragmatic: it sounds to me that people think doing something as 
"basic" as getting millions of records out of binary MARC format into something 
as lossy and unrefined as DCTERMS to expose them isn't a worthwhile effort?

MJ

NB: When Karen Coyle, Eric Morgan, and Roy Tennant all reply to your thread 
within half an hour of each other, you know you've hit the big time.  Time to 
retire young I think.


--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


Re: [CODE4LIB] it's cool to hate on OpenURL (was: Twitter annotations...)

2010-04-30 Thread Corey A Harper

Hi All,

Though hesitant to jump in here, I agree with Owen that the dead ends 
aren't a standards issue. The bloat of the standard is, as is the lack 
of a standardized response format, but the dead ends have to do with bad 
metadata being coded into open-URLs and with breakdowns in the 
connection between content aggregators/providers and knowledge base 
maintainers.


Work in this area isn't completely stagnant, though. The joint NISO/UK 
Serials Group's "Knowledge Bases And Related Tools working group" is 
looking towards solutions to exactly these problems.

http://www.uksg.org/kbart

Their initial report on best practice for content providers and KB 
maintainers is worth a look.


-Corey

Owen Stephens wrote:

Dead ends from OpenURL enabled hyperlinks aren't a result of the standard
though, but rather an aspect of both the problem they are trying to solve,
and the conceptual way they try to do this.

I'd content these dead ends are an implementation issue - and despite this I
have to say that my experience on the ground is that feedback from library
users on the use of link resolvers is positive - much more so than many of
the other library systems I've been involved with.

What I do see as a problem is that this market seems to have essentially
stagnated, at least as far as I can see. I suspect the reasons for this are
complex, but it would be nice to see some more innovation in this area.

Owen

On Thu, Apr 29, 2010 at 6:14 PM, Ed Summers  wrote:


On Thu, Apr 29, 2010 at 12:08 PM, Eric Hellman  wrote:

Since this thread has turned into a discussion on OpenURL...

I have to say that during the OpenURL 1.0 standardization process, we

definitely had moments of despair. Today, I'm willing to derive satisfaction
from "it works" and overlook shortcomings. It might have been otherwise.

Personally, I've followed enough OpenURL enabled hyperlink dead ends
to contest "it works".

//Ed







--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


[CODE4LIB] Metadata Registries Functional Requirements Survey

2009-07-14 Thread Corey A Harper

Dear All,

Apologies for the cross-posting.

The following survey is being distributed on behalf of the Dublin Core
Metadata Initiative's Registries Task Group, the Joint Information
Steering Committe (JISC) and UKOLN.

The goal of the survey is to collect information from Metadata Registry
managers, developers and end-users to determine current practice in
registry development, collect information on registry content, and assess
the interoperability needs of the registry community.

The survey should take less than 20 minutes, and we hope that you will
complete it. The survey will remain open through Friday, July 31.

The link to the survey is:
http://www.surveymonkey.com/s.aspx?sm=UBi6N6R2r_2fdd9f5CSgPkkw_3d_3d

Thank you for your time.

Sincerely,
-Corey A Harper

--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey.har...@nyu.edu


[CODE4LIB] [Fwd: Vocabulary Mapping Framework]

2009-06-18 Thread Corey A Harper

code4libbers,

Interesting stuff afoot.  The press release linked below is well worth a 
read.  Apologies if you've already seen this


Cheers,
-Corey

 Original Message 
Subject:Vocabulary Mapping Framework
Date:   Thu, 18 Jun 2009 15:26:15 +0200
From:   Frodl, Christine
Reply-To:   DCMI Libraries Community 
To: dc-librar...@jiscmail.ac.uk




Dear list members,

I want to draw your attention on this information, sent out by Norman
Paskin today:

List members may be interested to know of the Vocabulary Mapping Framework.
The press release is available at
<_http://www.doi.org/news/VMF_project_announcement_090615.pdf_>.

Summary: A new initiative, the Vocabulary Mapping Framework (VMF), has
been announced by a consortium of partners. This will create an
extensive and authoritative mapping of vocabularies from nine major
content metadata standards, creating a downloadable tool to support
interoperability across communities. The mapping will also be extensible
to other standards. The work builds on the principles of
interoperability established in the indecs Content Model, and is an
expansion of the existing RDA/ONIX Framework into a comprehensive
vocabulary of resource relators and categories, which will be a superset
of those used in major standards from the publisher/producer, education
and bibliographic/heritage communities.

The International DOI Foundation, which fully endorses this work, will
provide a web hosting facility for the Framework as part of its
commitment to promoting the wider use of interoperable metadata, and
will use the vocabulary mapping wherever possible to support the
association of metadata with DOI names.

The IDF is organising an event in conjunction with the iPres2009 Digital
Preservation meeting (San Francisco, October) which will consider this
and wider issues:

<_http://www.doi.org/doi_presentations/members_meeting_2009/index.html_>.

Please pass on information or these links.

Dr. Norman Paskin
5, Linkside Avenue
Oxford
OX2 8HY
UK

Tel: (+44) 1865 559070
Mobile: (+44) 7710 327569
skype: npaskin
_www.tertius.ltd.uk_ 
_www.linkedin.com/in/normanpaskin_


--
Christine Frodl
German National Library
Office for Library Standards
Adickesallee 1
D-60322 Frankfurt am Main
Telefon: +49-69-1525-1404
Telefax: +49-69-1525-1010
_mailto:c.fr...@d-nb.de_
_http://www.d-nb.de_



--
Corey A Harper
Metadata Services Librarian
Bobst Library, B42-LL1
New York University
70 Washington Square South
New York, NY  10012
212.998.2479
corey.har...@nyu.edu


[CODE4LIB] Be a data librarian in New York

2008-10-24 Thread Corey A Harper

 *In case you missed the first mention, we are looking for a data
 librarian! Details below...
 *


 *New York* University Libraries


 *Data Service Librarian*

*Description*:

New York University is seeking an energetic, creative, and knowledgeable
librarian to select, acquire, manage, and deliver numeric and geospatial
data collections to support campus research and scholarship. The
incumbent in this new position will work closely with NYU librarians,
faculty technology specialists, and research computing staff to develop
a new Data Service Studio - combining library and statistical computing
services in the Bobst Library. The librarian will build numeric and
spatial data collections and facilitate access to additional data
resources across the sciences, engage in outreach to faculty and
students through subject and departmental liaison activities, design and
conduct data literacy instruction, and participate in reference and
consultation activities as a part of the Data Service Studio team.

This position will play an instrumental role in the library's efforts to
optimize resource discovery and delivery by identifying, evaluating,
manipulating, and enhancing metadata for numeric and spatial
collections. Reporting to the Data Service Coordinator, the Data Service
Librarian works to develop appropriate description for managing research
data collections; investigates new sources for metadata; keeps abreast
of new and evolving metadata standards such as the Data Documentation
Initiative (DDI) and Federal Geographic Data Committee (FGDC) standards.

The incumbent will participate in library-wide committees, activities,
and special projects, especially those involving new technologies and
data. The incumbent will develop and maintain awareness of data-centered
initiatives across the sciences, attending professional meetings,
workshops and conferences for training and continuing professional
development.

*Qualifications*: MLS from an ALA accredited institution. Second masters
or the equivalent required for tenure, preference for fields with
preparation in quantitative/spatial analysis. Familiarity with database
and cataloging practices and metadata schemas; experience with XML
preferred. Knowledge of public and proprietary resources for national
and international numeric and geospatial data. The candidate should be
professionally active and have knowledge of trends in data management
and description. Basic familiarity with software for statistical and
geospatial analysis (//e.g//. SAS, SPSS, Stata, R, GIS applications).
Additional requirements are strong written and verbal communication
skills, demonstrated ability to work independently and collaboratively
in a complex organization, and a creative, service-oriented approach to
problem solving.

*New York** University Libraries:* Library facilities at New York
University serve the school’s 40,000 students and faculty and contain
more than 4 million volumes. New York University is a member of the
Association of Research Libraries, the Research Libraries Group, the
Digital Library Federation; serves as the administrative headquarters of
the Research Library Association of South Manhattan, a consortium that
includes three academic institutions and the Interuniversity Consortium
for Political and Social Research (ICPSR).

* *

*Salary/Benefits*: Faculty status, attractive benefits package including
five weeks annual vacation. Salary commensurate with experience and
background.

*To Apply*: To ensure consideration, send resume and letter of
application, including the name, address, and telephone number of three
references to: Janet Koztowski, Human Resources Director, New York
University Libraries, 70 Washington Square South, New York, NY 10012 or
via email to [EMAIL PROTECTED] Resumes will be considered until the
position is filled. NYU is an Equal Opportunity/ Affirmative Action
Employer.


Re: [CODE4LIB] XML schema for describing software applications?

2007-07-17 Thread Corey A Harper

Sharon, Michael, & others --

Within the Dublin Core Metadata Initiative, the Tools Community is
working on a Tools Application Profile.  The AP is still in draft form,
and draws heavily upon the DOAP namespace referenced by Michael.
http://dublincore.org/groups/tools/map.shtml

At present, the Tools AP is largely focused on describing tools,
algorithms and software applications for working with metadata.

However, I think it is important that this work address the descriptive
requirements of applications and web-services for supporting digital
libraries, as well as being applicable to the needs of software
preservation repositories.

If anyone on this list has feedback on this draft proposal, I'd be happy
to share it with the DCMI Tools Community, and would encourage those
that are interested to join the DCMI Tools Mailing List, linked off of
the Community homepage:
http://dublincore.org/groups/tools/

-Corey

--


Michael J. Giarlo wrote:

Howdy Sharon,

Not sure if this is perfect, but you might check out DOAP (Description of a
Project).  Quoth Wikipedia:

"*DOAP* (Description Of A Project) is an attempt to make an RDF
schema<http://en.wikipedia.org/wiki/RDF_Schema>and
XML <http://en.wikipedia.org/wiki/Extensible_Markup_Language> vocabulary to
describe open-source <http://en.wikipedia.org/wiki/Open-source>
projects. It
was created and initially developed by Edd Dumbill to convey semantically
information associated with open-source software projects. It is currently
used in O'Reilly's <http://en.wikipedia.org/wiki/O%27Reilly_Media>
CodeZoo<http://www.codezoo.com/>and the Apache
Software
Foundation's<http://en.wikipedia.org/wiki/Apache_Software_Foundation>
project
page <http://projects.apache.org/>. There are currently generators,
validators <http://en.wikipedia.org/wiki/Validator>, viewers and converters
to enable more projects to be able to be included in the semantic
web<http://en.wikipedia.org/wiki/Semantic_web>
."

Good luck,

-Mike


On 7/17/07, Sharon Foster <[EMAIL PROTECTED]> wrote:


Please forgive the cross-posting.

For my final project in the class Digital Libraries, I am bringing
together a "bibliography" ("appliography"?) of open source software
applications and free web services that would be useful in the
construction of digital libraries. (How self-referential can you get?
;-)) I am looking for advice on finding, selecting, and using an
existing XML schema that would include syntax for the type of
application, target platforms, OSs, licensing, etc.

In searching Ask.com and Google, I zeroed in on OSD, the Open Software
Description Format, but I'm not finding a lot of new material about
it, and I can't recall ever reading about it. Is this in fact the
latest and greatest? Any advice would be greatly appreciated.

Thanks,
Sharon

--
Sharon M. Foster, B.S., J.D., 0.5 * (MLS)
F/OSS Evangelist
Cheshire Public Library
104 Main Street
Cheshire, CT  06410
http://www.cheshirelibrary.org
My library school portfolio: http://home.southernct.edu/~fosters4/

Any opinions expressed here are entirely my own.



--
Corey A Harper
Metadata Services Librarian
Bobst Library
New York University
70 Washington Square South
New York, NY  10012
212.998.2479
[EMAIL PROTECTED]


[CODE4LIB] Oregon: Digital Services Program Manager

2006-11-17 Thread Corey A Harper

With apologies for cross-posting.

~~~

The University of Oregon Libraries invites applications for the following:


  *Digital Services Program Manager*

The Orbis Cascade Alliance ( http://www.orbiscascade.org/) seeks a
collaborative and innovative professional for the position of Digital
Services Program Manager. The Alliance has a core membership of 33
academic institutions in Oregon and Washington and serves a broad group
of institutions throughout the Northwest. Building on a history of
success that includes electronic resources purchasing, the Summit union
catalog, courier service, and conference hosting, the Alliance is
pursuing an ambitious strategic agenda that includes research,
development, and implementation of digital library services. This
position will play a key leadership role in these efforts, working with
member institutions, consortia, and commercial vendors to develop and
support innovative services and next-generation library systems and
technologies. This position is expected to develop an advanced
understanding of the diverse needs of academic libraries and take an
innovative and entrepreneurial approach to predicting and meeting those
needs in a consortial environment.

*Qualifications*: /Required/: Master’s degree in library science or
applicable IT field from an accredited institution and three years
experience in relevant library projects; excellent analytical, oral and
written communication skills; demonstrated ability to establish
priorities in a dynamic environment; ability to work productively on
collaborative projects; ability to work independently and on a collegial
basis with a diverse group of individuals and institutions. The
successful candidate will support and enhance and enhance a diverse
learning and working environment.

*Salary & Benefits*: $50,000 minimum depending on prior professional
experience and education. The University of Oregon (fiscal agent for the
Alliance) offers a generous benefits package (see:
http://hr.uoregon.edu/benefits/).

*Deadline to apply*: *_Dec. 15, 2006_*.

*To Apply*: Send Word or pdf attachment via e-mail to Laine Stambaugh,
[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>. Be sure to include
cover letter, resume, and names, addresses, phone numbers and e-mail
addresses of four references. For complete details about the position,
about the Alliance, the UO and UO Libraries, and Oregon, see:
http://libweb.uoregon.edu/admnpers/digitalservices.html.

The University of Oregon is an Affirmative Action/Equal
Opportunity/ADA-compliant institution committed to cultural diversity.

Thank you.

Laine Stambaugh
Director, Library Human Resources
University of Oregon Libraries
Eugene, OR 97403-1299
(541) 346-1895 (voice)
[EMAIL PROTECTED]



--
Corey A Harper
Metadata Librarian - CMET Team Leader
Metadata and Digital Library Services
University of Oregon
541/346.1854
[EMAIL PROTECTED]