[CODE4LIB] Code4Lib 2014 pre-conferences: please indicate interest on wiki

2013-12-09 Thread Emily Lynema
Hi folks, We have [possibly?] a record number of pre-conference proposals up on the wiki -- 19 by my count. Yay for the awesomeness of you all! We haven't crunched the numbers on morning vs. afternoon scheduling and the number of rooms available quite yet, but it would be really, super duper

[CODE4LIB] Reminder - Code4Lib 2014 Conference Diversity Scholarship Application Due Dec 14

2013-12-09 Thread Bohyun Kim
*** Apologies for the cross-posting*** For the Code4Lib 2014 Conference, 9 scholarships have been sponsored to promote diversity. http://code4lib.org/conference/2013/scholarship CLIR/DLF has sponsored 5 scholarships, EBSCO has sponsored 2 scholarships, ProQuest has sponsored 1 full scholarship,

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Joshua Welker
Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Richard Sarvas
I've had good luck using both the Data::Faker and Text::Lorem Perl modules to generate large amounts (30k+ rows) of Archivists Toolkit test data. Other ports of Data::Faker would probably work just as well, though it needs a bit more code to more than generate more than name, address and

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Kevin S. Clarke
Nice! Are you going to put the code on GitHub (or some such place)? I'd be interested in tracking... Kevin On Mon, Dec 9, 2013 at 11:27 AM, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Joshua Welker
Sure. It's not a fancy reusable class or anything though--just a simple PHP script. I will try to put it up sometime today or tomorrow and will share a link. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent:

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Kevin S. Clarke
Cool! Thanks, Kevin On Mon, Dec 9, 2013 at 11:37 AM, Joshua Welker wel...@ucmo.edu wrote: Sure. It's not a fancy reusable class or anything though--just a simple PHP script. I will try to put it up sometime today or tomorrow and will share a link. Josh Welker -Original Message-

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Ben Companjen
Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Joshua Welker
It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Ben Companjen
Hi Josh, Before you start coding: http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD or XSD file. First try with the EAC XSD shows I need to try other options, but it's promising. (It's still an

Re: [CODE4LIB] Reminder - Code4Lib 2014 Conference Diversity Scholarship Application Due Dec 14

2013-12-09 Thread Bohyun Kim
*** Correcting the typos *** Please see this page: http://code4lib.org/node/491 (This was for the past 2013 conference. http://code4lib.org/conference/2013/scholarship. ) The application deadline is Dec. 13, 2013, 5pm EST. The scholarship committee will notify successful candidates the week of

[CODE4LIB] 2 interesting work opportunities in Chattanooga TN

2013-12-09 Thread Nate Hill
Hello friends, Want to work for the Mozilla Foundation and collaborate with our awesome public library here in Chattanooga? Check out these two opportunities. Documentation Design Coordinator, Mozilla Gigabit Community Fundhttps://careers.mozilla.org/en-US/position/obX7XfwW and Community

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Joshua Welker
I checked out the Eclipse option and was not able to get much use out of it. Maybe someone else will have better luck? It doesn't seem to align very well with a library use case. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben

[CODE4LIB] problem in old etd xml files

2013-12-09 Thread Robertson, Wendy C
Back in 1999-2002 a handful of our theses were submitted as a collection of xml files. We posted the files in our repository several years ago (we posted a zipped folder with all the files). At that time, if you opened front.xml you would be able to access the thesis. We have not touched the

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Ron Gilmour
DTDs and XML namespaces don't like each other very much. I think you're getting into trouble because your DTD doesn't allow the two namespace-declaring attributes on the thesis element. Try adding this to your DTD: !ATTLIST thesis xmlns:xhtml CDATA #FIXED 'http://www.w3.org/1999/xhtml'

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Jon Gorman
A lot of modern systems won't load entities (or will limit it somehow) because of the denial of service attack that is possible. Look for XML Entity Reference Denial of Service. I can't remember if Public declarations are treated any differently than System ones. (I would have suspected it to

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Peter Binkley
Years ago Bill Moen had a set of radioactive MARC records with unique tokens in all fields, to test Z39.50 retrieval. I don't know whether they were ever released anywhere, but I see the specs are here: http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/ Peter Peter Binkley Digital

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Roy Tennant
I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-) Roy On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.cawrote: Years ago Bill Moen had a set of radioactive MARC records with

[CODE4LIB] Job: Software Developer at University of Maryland, College Park

2013-12-09 Thread jobs
Software Developer University of Maryland, College Park College Park As the largest university library system in the Washington D.C.-Baltimore area, the University of Maryland Libraries serve more than 37,500 students and 4,200 faculty of the flagship College Park campus. The University of

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Robertson, Wendy C
Thanks. I'll see if this helps. I'm sure IE was used to view the files 4.5 years ago. I don't think I looked at them, but we had super employees (recent grads from library school) that worked with the files and I trust that they would have noticed problems. Fortunately we only have 7 of

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Tom Cramer
I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-) It's 42, but Z39.50 will accelerate the rate of decay. - T

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Frumkin, Jeremy
Finally, Z39.50 actually speeds up something -- jaf Sent from my iPad On Dec 9, 2013, at 4:59 PM, Tom Cramer tcra...@stanford.edu wrote: I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-)

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Jason Bengtson
What's killing you is the ampersands. When these were encoded they contained characters that hadn't been properly encoded as XML (mainly special linguistic characters and non-breaking spaces). Definitely replace your :stylesheet with -stylesheet, but then do a find and replace on all of your

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Joshua Welker
Evil ampersands! They have caused me hours of headaches in past XML projects... Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jason Bengtson Sent: Monday, December 09, 2013 4:35 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re:

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Jason Bengtson
Agreed. Best regards, Jason Bengtson, MLIS, MA Head of Library Computing and Information Systems Assistant Professor, Graduate College Department of Health Sciences Library and Information Management University of Oklahoma Health Sciences Center 405-271-2285, opt. 5 405-271-3297 (fax)

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Jason Bengtson
Of course, the fly in the ointment here is that you still lose all of your links, which you need for the document to work as intended. From what I can see the original construction of the document relied upon the sloppiness of parsers in some of the old browsers (hence the ampersands in html

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Bill Dueber
The sad thing is that the Library of Congress spent billions of dollars of taxpayer money building a safe storage facility in the stable caves under Dublin, OH, but now no one will let them bury them there. On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant roytenn...@gmail.com wrote: I can't help

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Jon Gorman
How did you fix the ampersands? I ask, because if you just did a simple text transform from to amp;, it would mask the problem of the entity escaping I think... Not at work, so I don't have a good example and the file is downloading very slowly here, so I'll try to do one from memory. There

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Roy Tennant
I ask you, would you want to work all day sitting on top of a huge pile of radioactive MARC records? I sure wouldn't... Roy On Mon, Dec 9, 2013 at 5:08 PM, Bill Dueber b...@dueber.com wrote: The sad thing is that the Library of Congress spent billions of dollars of taxpayer money building a

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread jason bengtson
For testing purposes I just nixed them. As I noted, to rework the file a person would probably want to use a more critical eye with find and replace. Totally doable. On Dec 9, 2013, at 7:37 PM, Jon Gorman jonathan.gor...@gmail.com wrote: How did you fix the ampersands? I ask, because if you

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread jason bengtson
Each ampersand was in a special character, which would have to be replaced with something. Most of them were in nonbreaking spaces or the enya in a particular name. On Dec 9, 2013, at 7:37 PM, Jon Gorman jonathan.gor...@gmail.com wrote: How did you fix the ampersands? I ask, because if you

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Roy Tennant
For my money, the text transform should look only for exact matches (e.g., aacute;, nbsp;, copy;) and replace them with their numeric counterparts. Roy On Mon, Dec 9, 2013 at 5:41 PM, jason bengtson j.bengtson...@gmail.comwrote: For testing purposes I just nixed them. As I noted, to rework the

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Pottinger, Hardy J.
Well it's not a web service, but it does make lots of fake metadata for batch loading into DSpace. I will just leave this here: https://github.com/hardyoyo/random_dspace_batch_metadata Thanks for the lead on the Faker gem! This was a fun diversion. I especially like the titles this script

Re: [CODE4LIB] transforming marc to rdf

2013-12-09 Thread Eric Lease Morgan
I have created an initial pile of RDF, mostly. I am in the process of experimenting with linked data for archives. My goal is to use existing (EAD and MARC) metadata to create RDF/XML, and then to expose this RDF/XML using linked data principles. Once I get that far I hope to slurp up the

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Brian Zelip
Not metadata, but still pretty fun - http://meettheipsums.com - some curated ipsums. Brian Zelip --- Graduate Assistant Scholarly Commons, University Library University of Illinois at Urbana-Champaign On Mon, Dec 9, 2013 at 9:14 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Well

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Kevin S. Clarke
I was telling Hardy earlier today we needed a metadata ipsum... it would pull random words from the full-text of the AACR2. Kevin On Mon, Dec 9, 2013 at 11:24 PM, Brian Zelip bze...@gmail.com wrote: Not metadata, but still pretty fun - http://meettheipsums.com - some curated ipsums.