Re: [CODE4LIB] date fields

2016-07-11 Thread Kyle Banerjee
Is the idea that this new field would be stored as MARC in the system (the ILS?). If so, the 9xx solution already suggested is probably the way to go if the 008 route suggested earlier won't work for you. Otherwise, you run a risk that some form of record maintenance will blow out all your

Re: [CODE4LIB] Formalizing Code4Lib?

2016-06-14 Thread Kyle Banerjee
On Tue, Jun 14, 2016 at 9:05 AM, Miles Fidelman wrote: > I'm rather surprised that nobody has suggested contacting: > - the American Library Association (particularly the LITA division) > - the Internet Archive > > Or... the Tides Foundation (tides.org in San

Re: [CODE4LIB] Formalizing Code4Lib?

2016-06-07 Thread Kyle Banerjee
On Tue, Jun 7, 2016 at 2:59 PM, Salazar, Christina < christina.sala...@csuci.edu> wrote: > Having gone to C4L in 2007 in Athens, when it was I think 150 people (ha! > Let's be honest: 145 men and 5 women) and then again in 2015 in Portland > and 2014 in Raleigh, the Code 4 Lib that once was is no

Re: [CODE4LIB] JPEG question

2016-05-25 Thread Kyle Banerjee
Is that 1524 dpi for Batch A a misprint? If not, that's very likely to be your problem -- I doubt that's what the vendor really scanned at. If you change the dpi values and try to reload, my guess is you'll get very different results. kyle On Wed, May 25, 2016 at 2:40 PM, Bernadette Houghton <

Re: [CODE4LIB] Anything Interesting Going on in Archival Metadata?

2016-05-24 Thread Kyle Banerjee
On Tue, May 24, 2016 at 6:57 AM, Matt Sherman wrote: > Is linked data even useful in a setting with extremely unique > materials? IMO, linked data is especially useful with unique materials because relationships are simultaneously more important and more

Re: [CODE4LIB] All URLs redirect to mod_rewrite error page

2016-05-09 Thread Kyle Banerjee
Howdy Justin, We don't have enough info to diagnose what's going on, but DB corruption strikes me a more likely cause of your headaches than OS or Apache issues. Based on your description, it sounds like Omeka thinks it is not properly installed -- i.e. mod_rewrite is probably fine. Before

Re: [CODE4LIB] using drupal for a document repository

2016-05-06 Thread Kyle Banerjee
> On May 6, 2016, at 8:37 AM, Joshua Klingbeil wrote: > > ...These, and other req-cons can help you to better understand what type of > investment should be considered for your project... However, going through > the process may help you to determine if it feels more like

Re: [CODE4LIB] Form fill from URL

2016-04-25 Thread Kyle Banerjee
On Fri, Apr 22, 2016 at 4:58 PM, Teague Allen wrote: > Hello collective, > > I've been given the opportunity to replace a much-detested PDF form used > to request cataloging for items by our researchers that are published > outside our organization. My hope is to create a

Re: [CODE4LIB] Good Database Software for a Digital Project?

2016-04-16 Thread Kyle Banerjee
On Sat, Apr 16, 2016 at 7:15 AM, Matt Sherman wrote: > Thanks for all the advice folks, this gives me a lot to look into. You all > have certainly made me table MySQL, so now to look into PostgreSQL, Solr, > XTF, and some of these other technologies to see what would

Re: [CODE4LIB] Good Database Software for a Digital Project?

2016-04-15 Thread Kyle Banerjee
On Fri, Apr 15, 2016 at 11:53 AM, Roy Tennant wrote: > In my experience, for a number of use cases, including possibly this one, > a database is overkill. Often, flat files in a directory system indexed by > something like Solr is plenty and you avoid the inevitable

Re: [CODE4LIB] authority work with isni

2016-04-15 Thread Kyle Banerjee
On Fri, Apr 15, 2016 at 2:16 AM, Eric Lease Morgan wrote: > ... > My questions are: > > * What remote authority databases are available programmatically? I > already know of one from the Library of Congress, VIAF, and probably > WorldCat Identities. Does ISNI support some sort

Re: [CODE4LIB] Google can give you answers, but librarians give you the right answers

2016-04-10 Thread Kyle Banerjee
On Fri, Apr 8, 2016 at 5:04 PM, Karen Coyle wrote: > The percentage of things that have decent LCSH assigned to them is >> small >> and shrinking for the simple reason that a fewer and fewer humans have to >> manage more resources. >> > > I'm not sure what you are saying

Re: [CODE4LIB] Software used in Panama Papers Analysis

2016-04-08 Thread Kyle Banerjee
On Fri, Apr 8, 2016 at 8:13 AM, Jenn C wrote: > I worked on a text mining project last semester where I had a bunch of > magazines with text that was totally unstructured (from IA). I would have > really liked to know how to work entity matching into such a project. Are > there

Re: [CODE4LIB] including data from static JSON file in Javascript

2016-04-06 Thread Kyle Banerjee
If all you want to do is load external json as a string, you can do it using syntax almost identical to what you suggest. Just change your data.json file so the content is var data = ' [include your json here, be sure to escape things properly]'; Then just load this file before your external

Re: [CODE4LIB] Google can give you answers, but librarians give you the right answers

2016-04-06 Thread Kyle Banerjee
On Wed, Apr 6, 2016 at 7:42 AM, Karen Coyle wrote: > ... Libraries "do" it, but our user interfaces ignore it (honestly, does > anyone NOT think that the whole BT/NT relationship in LCSH is completely > wasted in today's systems?). Google searches "work" best on proper nouns >

Re: [CODE4LIB] Google can give you answers, but librarians give you the right answers

2016-04-01 Thread Kyle Banerjee
On Thu, Mar 31, 2016 at 9:31 PM, Cornel Darden Jr. wrote: > > "Google can give you answers, but librarians give you the right answers." > > Is it me? Or is there something wrong with this statement? > There's nothing wrong with the statement. As is the case with all

Re: [CODE4LIB] Public Health Metadata

2016-03-20 Thread Kyle Banerjee
BTW, I hope you share the solution you decide to implement. Public health research goes on at a lot of institutions (including mine), and I'm always looking for ways to address weaknesses in our current practices/systems. kyle On Mon, Mar 14, 2016 at 11:43 AM, Jacob Ratliff

Re: [CODE4LIB] Do you use alt tags in your images for digital collections

2016-03-19 Thread Kyle Banerjee
On Thu, Mar 17, 2016 at 4:39 PM, Erica FINDLEY wrote: > Good evening, > > We are currently experiencing a dilemma with alt tags in our digital > collections. > > We would like to include alt tags to be in compliance with accessibility > guidelines. > > When looking at an item

Re: [CODE4LIB] Public Health Metadata

2016-03-15 Thread Kyle Banerjee
g at this point). > > The suggestions everyone has given so far are very helpful and are putting > me on the right track. My hope is to be able to use one of them, or at > least part(s) of them to get to where we need to go. > > Hopefully in a few months I will have a good

Re: [CODE4LIB] Public Health Metadata

2016-03-14 Thread Kyle Banerjee
Could you say a bit more about the documents you need to manage, the level of specificity you need, how they'll be used, and what process you envision to assign terms? If your documents are mostly clinical in nature, SNOMED strikes me a good choice, but if you want terminology that could take you

Re: [CODE4LIB] php and email

2016-02-26 Thread Kyle Banerjee
> > Our library has a website run on PHP. The university IT would not help to > set up email capability via Web. My question is, what are the options > there that I can add email notification capability to our website, and how? > > Our server is Windows 2008r2, PHP5.6, IIS 7.5. > Does

Re: [CODE4LIB] Listserv communication, was RE: Proposed Duty Officer

2016-02-26 Thread Kyle Banerjee
> You're also always going to have trouble with getting people to ask > questions, unless the concept of asking for help/guidance has been drilled > into them as not stupid, but constructive, for a very long time. I'm > talking life span. > Responses people expect are also a barrier to

Re: [CODE4LIB] [code4libcon] Proposed Duty Officer

2016-02-25 Thread Kyle Banerjee
On Wed, Feb 24, 2016 at 4:36 PM, Becky Yoose wrote: > Apologies for the short reply with my manager's hat firmly in place - > transparency is good, but there are times when a particular process or > discussion should not be public. Given the sensitive nature of some of the >

Re: [CODE4LIB] [code4libcon] Proposed Duty Officer

2016-02-24 Thread Kyle Banerjee
t. So I think it's > > > completely necessary to have an anonymous method of raising concerns, > if > > > you really want people to raise concerns with the conference > organizers. > > > > > > -Esmé > > > > > &g

Re: [CODE4LIB] [code4libcon] Proposed Duty Officer

2016-02-24 Thread Kyle Banerjee
> Feedback about proposed duty officers can be emailed to directly to me, > chadbnel...@gmail.com, or submitted via this anonymous form > . > It's unfortunate people feel a need to move discussions offline -- I interpret this as meaning some people are afraid of

Re: [CODE4LIB] Best way to handle non-US keyboard chars in URLs?

2016-02-21 Thread Kyle Banerjee
> > > 3) Who type un-shortened URLs any more? > > I'm looking for responses that solve this rather than dismiss Intuitive > URLs. > The question is what the use case you're trying to solve looks like. Is the goal typability because it's hand transcribed from a business card, knowing what the link

Re: [CODE4LIB] searching metadata vs searching content

2016-01-27 Thread Kyle Banerjee
A couple things come to mind. The first is that you'll need to experiment a bit to get the behavior that works for your situation and your users -- common solutions that work great in other environments may not work for you. The second is that if you haven't already, you should see if nested solr

Re: [CODE4LIB] oclc member code

2016-01-21 Thread Kyle Banerjee
Try something like this: http://www.worldcat.org/webservices/registry/lookup/Institutions/oclcSymbol/OHS?serviceLabel=enhancedContent Seems to me I messed with this sort of info some years back in an effort to gather info about libraries in my consortium and found so much redundant/outdated info

Re: [CODE4LIB] Anyone familiar with XSLT? Im stuck

2016-01-21 Thread Kyle Banerjee
> > For simple situations one might do without XSLT and stuff > XPath expressions for the content to grab into the command > line of utilities like xml_grep or xpath. In many cases, it's even easier to use string utilities, particularly if there's any chance the XML is not totally valid. If

Re: [CODE4LIB] Creating/maintaining metadata for intangible concepts

2016-01-08 Thread Kyle Banerjee
Hi Laura, You have the idea. There are a number of access points we'd like humans to add based on space/time/location/use/visual elements in the photos unrelated to the actual subject matter. There are a variety of approaches that could be taken, and I've received helpful ideas offline on how to

[CODE4LIB] Creating/maintaining metadata for intangible concepts

2016-01-07 Thread Kyle Banerjee
We are looking for ideas to help users search our collections for photos based on concepts (e.g. diversity) rather than the subject matter depicted in the photo. Since the high priority institutional objectives are often behind requests for these items, we'd really like a better solution than

Re: [CODE4LIB] Marc record creation and matching

2015-10-28 Thread Kyle Banerjee
On Wed, Oct 28, 2015 at 6:03 PM, Terry Reese wrote: > Honestly -- if this was me and I didn't have load table training (even if I > did) -- I would export the MARC records from my III system that I wanted to > overlay. I would create MARC records from the Excel sheets -- then

[CODE4LIB] Video playback issues

2015-09-18 Thread Kyle Banerjee
Howdy all, A number of researchers at our institution use devices that take time sequence photos and transmit the images to software that converts these to AVI. In general, it's pretty straightforward. However, we are encountering cases where the AVI files created on Macs don't play properly in

Re: [CODE4LIB] "coders for libraries"

2015-09-01 Thread Kyle Banerjee
I'm a little surprised that on a list populated by metadata geeks, no one has suggested that just the title (i.e. code4lib) be in the title element ;-) — Sent from Mailbox On Tue, Sep 1, 2015 at 4:51 PM, Tom Cramer wrote: > You can tell it’s a public library because

Re: [CODE4LIB] Protocol-relative URLs in MARC

2015-08-17 Thread Kyle Banerjee
Information in subfield u should be complete, but even if that weren't the case, it's important to consider how systems handle the information they're given. MARC is just a container, and just because the information is syntactically kosher does not mean it will be processed how you like. In the

Re: [CODE4LIB] Processing Circ data

2015-08-06 Thread Kyle Banerjee
On Wed, Aug 5, 2015 at 1:07 PM, Harper, Cynthia char...@vts.edu wrote: Hi all. What are you using to process circ data for ad-hoc queries. I usually extract csv or tab-delimited files - one row per item record, with identifying bib record data, then total checkouts over the given time

Re: [CODE4LIB] Looking for Ideas on Line Breaks in OCR Text

2015-08-04 Thread Kyle Banerjee
On Tue, Aug 4, 2015 at 6:09 AM, Matt Sherman matt.r.sher...@gmail.com wrote: I am on Windows machines, so I don't have quite the easy access to that useful command. Someone had earlier put the OCR in a doc file so I've been playing with that more than with the raw PDF OCR. Versions of the

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Kyle Banerjee
Y'all are doing this the hard way. Word allows regex replacements as well as format based criteria. For this particular use case: 1. Open the find/replace dialog (CTL+H) 2. In the Find what box, put (*) -- make sure the option for Use Wildcards is selected, and for the format, specify

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Kyle Banerjee
] Regex Question Thanks everyone, this really helps. I'll have to work out the italicized stuff, but this gets me much closer. On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee kyle.baner...@gmail.com wrote: Y'all are doing this the hard way. Word allows regex replacements as well as format based

Re: [CODE4LIB] Desiring Advice for Converting OCR Text into Metadata and/or a Database

2015-06-18 Thread Kyle Banerjee
How you want to preprocess and structure the data depends on what you hope to achieve. Can you say more about what you want the end product to look like? kyle On Thu, Jun 18, 2015 at 10:08 AM, Matt Sherman matt.r.sher...@gmail.com wrote: That is a pretty good summation of it yes. I appreciate

Re: [CODE4LIB] LC Cutter Generator - does this exist?

2015-05-12 Thread Kyle Banerjee
There's one built into the Cataloging Calculator. It's a javascript program I wrote 18 years ago for Netscape 4.0, but it still works and gets significant use. Since you're working server side, you'll probably just want to copy the method used rather than to use the code outright though anyone is

Re: [CODE4LIB] How to measure quality of a record

2015-05-06 Thread Kyle Banerjee
On May 6, 2015, at 7:08 AM, James Morley james.mor...@europeana.eu wrote: I think a key thing is to determine to what extent any definition of 'completeness' is actually a representation of 'quality'. As Peter says, making sure not just that metadata is present but then checking it

Re: [CODE4LIB] Mac OS 9 emulator

2015-04-23 Thread Kyle Banerjee
On Thu, Apr 23, 2015 at 10:20 AM, Schmitz Fuhrig, Lynda schmitzfuhr...@si.edu wrote: Thanks for the responses. We actually need to read media within it so Virtual Box would not work for us. Could you say a bit more about your use case? Some applications such as dealing with archival

Re: [CODE4LIB] Recommendations for places to advertise for a library systems guru?

2015-04-22 Thread Kyle Banerjee
On Wed, Apr 22, 2015 at 7:18 AM, Jack Hill jackh...@duke.edu wrote: I would also look at advertising through local technical user groups or meetings that touch on topics related to the job. This. Also might not hurt to consider LinkedIn -- results from there can surprise you. Whatever you

Re: [CODE4LIB] DSpace/Eprints vs Fedora

2015-04-10 Thread Kyle Banerjee
If your discovery strategy is predicated on having your scholarly IR harvested and presented to the world through a separate discovery tool and the vast bulk of your document views are coming from Google and Google Scholar users, does this lessen the 'compelling experience'

Re: [CODE4LIB] Amazon Glacier - tracking deposits

2015-04-09 Thread Kyle Banerjee
Howdy Sara, I've played around a bit with Glacier. It's a bit weird to work with, but tools keep on improving. The real question is what you hope to accomplish with it. As its name implies, it's designed for stuff that is basically frozen. When you take things out, you need to do so very slowly.

Re: [CODE4LIB] talking about digital collections vs electronic resources

2015-03-19 Thread Kyle Banerjee
On Wed, Mar 18, 2015 at 9:51 AM, Laura Krier laura.kr...@gmail.com wrote: I think too often we present our collections to students through the framework of our own workflows and functional handling of materials This. We also try too hard to convey distinctions that aren't important to users

[CODE4LIB] Job: Technology Director, Oregon Health Science University

2015-03-09 Thread Kyle Banerjee
Oregon Health Science University (OHSU) Library in Portland seeks a creative, dynamic, and innovative Technology Director. OHSU is the state's only comprehensive academic health center and is made up of the Schools of Dentistry, Medicine, and Nursing; College of Pharmacy; numerous centers and

Re: [CODE4LIB] Code4lib 2016 - tracks

2015-02-25 Thread Kyle Banerjee
On Mon, Feb 23, 2015 at 5:10 PM, Cary Gordon listu...@chillco.com wrote: If Code4LibCon changes, I will be disappointed, but I will still go. I think it's changed a great deal over the years. But all things must evolve to stay relevant. I do think it would be a shame if the content and

Re: [CODE4LIB] examples of displays for compound objects and metadata

2015-01-28 Thread Kyle Banerjee
The best way to display compound objects really depends on the nature of the compound objects. For example, the optimal display for a book stored as a compound object will be very different than an art object taken from various vantage points or a dataset. Likewise, whether you can get away with

Re: [CODE4LIB] Conference photography policy

2015-01-26 Thread Kyle Banerjee
On Mon, Jan 26, 2015 at 6:58 AM, Galen Charlton g...@esilibrary.com wrote: I would like to propose that C4L adopt a policy requiring that consent be explicitly given to be photographed or recorded, along the lines of a policy adopted by the Evergreen Project. [1] As a practical matter, this

Re: [CODE4LIB] Checksums for objects and not embedded metadata

2015-01-25 Thread Kyle Banerjee
On Sat, Jan 24, 2015 at 11:07 AM, Rosalyn Metz rosalynm...@gmail.com wrote: - How is your content packaged? - Are you talking about the SIPs or the AIPs or both? - Is your content in an instance of Fedora, a unix file structure, or something else? - Are you generating

Re: [CODE4LIB] wifi / network use policies

2015-01-23 Thread Kyle Banerjee
I haven't managed a network for years, but our approach was to provide a broad statement of what the network was for and to make it clear the network couldn't be used for malicious or illegal purposes. The CYA policy is a start but you'll still have to deal with problems such as people using the

[CODE4LIB] Checksums for objects and not embedded metadata

2015-01-23 Thread Kyle Banerjee
Howdy all, I've been toying with the idea of embedding DOI's in all our digital assets and possibly inserting/updating other metadata as well. However, doing this would alter checksums created using normal methods. Is there a practical/easy way to checksum only the objects themselves without the

Re: [CODE4LIB] linked data and open access

2014-12-23 Thread Kyle Banerjee
Well, that raises an important question -- whether an 'end user use', or other use, do people have examples of neat/important/useful things done with linked data in Europe, especially that would have been harder or less likely without the data being modelled/distributed as linked data? I'm

Re: [CODE4LIB] linked data and open access

2014-12-19 Thread Kyle Banerjee
On Fri, Dec 19, 2014 at 7:57 AM, Joe Hourcle onei...@grace.nascom.nasa.gov wrote: I can't comment on the linked data side of things so much, but in following all of the comments from the US's push for opening up access to federally funded research, I'd have to say that capitalism and

Re: [CODE4LIB] Easy Borrow or another way to automate search/request across multiple catalogs?

2014-12-15 Thread Kyle Banerjee
The answer depends on your objective. The quick answer to your question is that union catalogs are easier to maintain and generally work better than federated searches. Can you say a bit more about what catalogs need to be searched and what needs to happen? For example, do the catalogs in

Re: [CODE4LIB] Easy Borrow or another way to automate search/request across multiple catalogs?

2014-12-15 Thread Kyle Banerjee
On Mon, Dec 15, 2014 at 11:49 AM, Darylyne Provost dprov...@colby.edu wrote: Thanks so much for your reply. Our patrons currently must choose from our combined ILS CBBCat (III's Sierra), which we share with two other colleges; three consortial systems, NExpress, MaineInfoNet, and as of

[CODE4LIB] Scanned PDF to text

2014-12-09 Thread Kyle Banerjee
Howdy all, I've just started a project that involves harvesting large numbers of scanned PDF's and extracting information from the text from the OCR output. The process I've started with -- use imagemagick to convert to tiff and tesseract to pull out the OCR -- is more system intensive than I

Re: [CODE4LIB] Balancing security and privacy with EZproxy

2014-11-20 Thread Kyle Banerjee
will really hose your users. Getting that kind of thing cleared up takes time because most places aren't nearly as forgiving as libraries. kyle On Wed, Nov 19, 2014 at 8:47 PM, Dan Scott deni...@gmail.com wrote: On Wed, Nov 19, 2014 at 4:06 PM, Kyle Banerjee kyle.baner...@gmail.com wrote

Re: [CODE4LIB] Balancing security and privacy with EZproxy

2014-11-20 Thread Kyle Banerjee
monitor c4l, so I'm hoping some of them will weigh in. kyle On Thu, Nov 20, 2014 at 10:17 AM, Jonathan Rochkind rochk...@jhu.edu wrote: On 11/20/14 1:06 PM, Kyle Banerjee wrote: BTW, you can do some funky things with EZP that include conditional logic Can you say more about funky things you

Re: [CODE4LIB] Balancing security and privacy with EZproxy

2014-11-20 Thread Kyle Banerjee
are shared between many systems. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kyle Banerjee Sent: Thursday, November 20, 2014 12:07 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Balancing security and privacy

Re: [CODE4LIB] Balancing security and privacy with EZproxy

2014-11-19 Thread Kyle Banerjee
There are a number of technical approaches that could be used to identify which accounts have been compromised. But it's easier to just make the problem go away by setting usage limits so EZP locks the account out after it downloads too much. Alternatively, just block the Chinese IP's unless you

Re: [CODE4LIB] Stack Overflow

2014-11-04 Thread Kyle Banerjee
On Tue, Nov 4, 2014 at 7:34 AM, Schulkins, Joe joseph.schulk...@liverpool.ac.uk wrote: To be honest I absolutely hate the whole reputation and badge system for exactly the reasons you outline, but I can't deny that I do find the family of Stack Exchange sites extremely useful and by

Re: [CODE4LIB] MARC reporting engine

2014-11-03 Thread Kyle Banerjee
On Sun, Nov 2, 2014 at 6:29 PM, Stuart Yeates stuart.yea...@vuw.ac.nz wrote: Do any of these have built-in indexing? 800k records isn't going to fit in memory and if building my own MARC indexer is 'relatively straightforward' then you're a better coder than I am. Unless I'm missing

Re: [CODE4LIB] Metadata

2014-10-29 Thread Kyle Banerjee
On Oct 29, 2014, at 1:52 PM, Matthew Sherman matt.r.sher...@gmail.com wrote: That is a very vague question, would you care to elaborate a bit more? This. If we just mention standards we use, you'll get drowned in alphabet soup of acronyms. If you could say a few words about what you have

Re: [CODE4LIB] Why learn Unix?

2014-10-27 Thread Kyle Banerjee
On Oct 27, 2014, at 10:02 AM, Siobhain Rivera siori...@indiana.edu wrote: what do you think are reasons librarians need to know Unix, even if they aren't in particularly tech heavy jobs? The best reason is so that you can understand the problems you're working with as well as

Re: [CODE4LIB] Recommendations for image de-duping software?

2014-10-16 Thread Kyle Banerjee
Could you say something about the type of dup detection you need? Are we talking true duplicates, or possibly the same image in multiple formats, cropped, etc? Roughly how many images (thousands, tens of thousands, etc) and how big are they? Also, what did you try that did not meet your needs?

Re: [CODE4LIB] Requesting a Little IE Assistance

2014-10-13 Thread Kyle Banerjee
You could encode it quotable-printable or mess with content disposition http headers. But using these hacks or others mentioned on your data to accommodate this use case doesn't strike me a great idea since solutions like this don't age well. You might suggest to your supervisor to right click

Re: [CODE4LIB] Forwarding blog post: Apple, Android and NFC – how should libraries prepare? (RFID stuffs)

2014-10-07 Thread Kyle Banerjee
I think code4 lib is fine as it is, but I think we definitely need a professional organization for librarians that code. These talks of standards and guidelines may reflect such a need. I think LITA is awesome as well! But is there not a need for something else? Aside from the library

Re: [CODE4LIB] What is the real impact of SHA-256? - Updated

2014-10-03 Thread Kyle Banerjee
On Thu, Oct 2, 2014 at 3:47 PM, Simon Spero sesunc...@gmail.com wrote: Checksums can be kept separate (tripwire style). For JHU archiving, the use of MD5 would give false positives for duplicate detection. There is no reason to use a bad cryptographic hash. Use a fast hash, or use a safe

Re: [CODE4LIB] What is the real impact of SHA-256? - Updated

2014-10-03 Thread Kyle Banerjee
On Fri, Oct 3, 2014 at 7:26 AM, Charles Blair c...@uchicago.edu wrote: Look at slide 15 here: http://www.slideshare.net/DuraSpace/sds-cwebinar-1 I think we're worried about the cumulative effect over time of undetected errors (at least, I am). This slide shows that data loss via drive

Re: [CODE4LIB] Non-library job boards to advertise a developer position widely

2014-10-03 Thread Kyle Banerjee
Depending on customs in your area, it can make sense to post real jobs to Craigslist. kyle On Fri, Oct 3, 2014 at 11:57 AM, Francis Kayiwa kay...@pobox.com wrote: On 10/03/2014 02:52 PM, Kim, Bohyun wrote: Hi all, Which non-library job boards would be good to advertise a web developer job

Re: [CODE4LIB] Reconciling corporate names?

2014-09-29 Thread Kyle Banerjee
IMO, API isn't the best tool for this job. My inclination would be to just download the LCNAF data, normalize source and comparison data, and then compare via hash. That will be easier to write, and you'll be able to do thousands of comparisons per second. kyle On Mon, Sep 29, 2014 at 8:24 AM,

Re: [CODE4LIB] Reconciling corporate names?

2014-09-29 Thread Kyle Banerjee
On Mon, 29 Sep 2014, Kyle Banerjee wrote: KB IMO, API isn't the best tool for this job. My inclination would be to just KB download the LCNAF data, normalize source and comparison data, and then KB compare via hash. KB KB That will be easier to write, and you'll be able to do thousands

Re: [CODE4LIB] Reconciling corporate names?

2014-09-29 Thread Kyle Banerjee
The best way to handle them depends on what you want to do. You need to actually download the NAF files rather than countries or other small files as different kinds of data will be organized differently. Just don't try to read multigigabyte files in a text editor :) If you start with one of the

Re: [CODE4LIB] ruby-marc: how to sort fields after append?

2014-09-12 Thread Kyle Banerjee
On Fri, Sep 12, 2014 at 9:20 AM, Galen Charlton g...@esilibrary.com wrote: ... One caveat though -- at least in MARC21, re-sorting a MARC record strictly by tag number can be incorrect for certain fields... This is absolutely true. In addition to the fields you mention, 4XX, 7XX, and 8XX are

Re: [CODE4LIB] ruby-marc: how to sort fields after append?

2014-09-12 Thread Kyle Banerjee
On Fri, Sep 12, 2014 at 10:11 AM, Terry Reese ree...@gmail.com wrote: ... In fact, I wouldn't even resort the data to begin with ... Ding! Ding! Ding! And we have a winner for easiest and most practical solution. Any user display is either not going to display the control number being

Re: [CODE4LIB] Technology for Librarians / Libraries for Technologians

2014-09-04 Thread Kyle Banerjee
I know a lot gets said (here and elsewhere) about Technology for Librarians - important skills and standards, what's important/useful/trending/ignorable, and the like. But I'd love to start a discussion (or join one, if it already exists elsewhere) about the other side of things - the

Re: [CODE4LIB] Library Privacy, RIP (Was: Canvas Fingerprinting by AddThis)

2014-08-17 Thread Kyle Banerjee
:34 PM, Kyle Banerjee kyle.baner...@gmail.com wrote: On Fri, Aug 15, 2014 at 3:02 PM, Jason Bengtson j.bengtson...@gmail.com wrote: ... Generally speaking, I think surveillance is wretched stuff. But there is a point at which the hand wringing becomes a bit much. I agree

Re: [CODE4LIB] Hiring strategy for a library programmer with tight budget - thoughts?

2014-08-15 Thread Kyle Banerjee
I am in a situation in which a university has a set salary guideline for programmer position classifications and if I want to hire an entry-lever dev, the salary is too low to be competitive and if I want to hire a more experienced dev in a higher classification, the competitive salary amount

Re: [CODE4LIB] Library Privacy, RIP (Was: Canvas Fingerprinting by AddThis)

2014-08-15 Thread Kyle Banerjee
On Fri, Aug 15, 2014 at 3:02 PM, Jason Bengtson j.bengtson...@gmail.com wrote: ... Generally speaking, I think surveillance is wretched stuff. But there is a point at which the hand wringing becomes a bit much. I agree with Jon in that, while things are at a critical point, the technologies

Re: [CODE4LIB] Dewey code

2014-08-11 Thread Kyle Banerjee
We are a church with 1500 books we would like to put on our website, and thought we would use this workflow: 1. Create barcode from isbn number and print label. 2. Acquire Dewey number from Library of Congress via z39.50, and print that to a label. 3. Affix labels to the

Re: [CODE4LIB] Dewey code

2014-08-08 Thread Kyle Banerjee
Label printing practices vary by library. Just out of curiosity, why are you getting this information from a MARC file rather than the ILS? At many/most libraries, you'd need local Cuttering, item specific (e.g. volume/copy number), etc info not available in the bib record. kyle On Fri, Aug 8,

[CODE4LIB] Publishing large datasets

2014-07-23 Thread Kyle Banerjee
We've been facing increasing requests to help researchers publish datasets. There are many dimensions to this problem, but one of them is applying appropriate metadata and mounting them so they can be explored with a regular web browser or downloaded by expert users using specialized tools.

Re: [CODE4LIB] NCIP path on a Millennium server

2014-07-22 Thread Kyle Banerjee
AFAIK, Mil doesn't support NCIP. Rather, the library has to have purchased the III's DCB product. There is a project to allow Evergreen libraries to communicate with DCB via NCIP at https://github.com/iNCIPit It works and is used by a few libraries. This will contain information both connection

Re: [CODE4LIB] NCIP path on a Millennium server

2014-07-22 Thread Kyle Banerjee
will keep trying to resend the request. kyle On Tue, Jul 22, 2014 at 9:05 AM, Kyle Banerjee kyle.baner...@gmail.com wrote: AFAIK, Mil doesn't support NCIP. Rather, the library has to have purchased the III's DCB product. There is a project to allow Evergreen libraries to communicate with DCB via

Re: [CODE4LIB] net.fun

2014-07-14 Thread Kyle Banerjee
The only problem is that some people might have difficulty obtaining audio modems that could be made to work with their cell phones... On Mon, Jul 14, 2014 at 8:56 AM, Riley Childs ri...@tfsgeo.com wrote: I know I might be little youn but code4lib needs a bbs Riley Childs Student Asst.

[CODE4LIB] Job: Systems and Applications Librarian -- Walla Walla, WA

2014-07-02 Thread Kyle Banerjee
** Apologies for duplicate postings ** SYSTEMS APPLICATIONS LIBRARIAN Whitman College seeks a dynamic, creative and technically proficient individual for the position of Systems Applications Librarian who will help provide leadership as the Penrose Library transitions to an expanding digital

Re: [CODE4LIB] Excel to XML (for a Drupal Feeds import)

2014-06-16 Thread Kyle Banerjee
I'd just do this the old fashioned way. Awk is great for problems like this. For example, if your file is tab delimited, the following should work awk '{FS=\t}{if ($2 != ) question = $2;}{print $1,question,$3}'' yourfile In the example above, I just print the fields but you could easily encase

Re: [CODE4LIB] Jobs Digest

2014-06-04 Thread Kyle Banerjee
On Wed, Jun 4, 2014 at 1:55 PM, Eric Lease Morgan emor...@nd.edu wrote: C4L is not a democracy but an anarchy. Sometimes. We vote on conference locations. We vote on keynote talks. We vote for presentations. Everybody had multiple opportunities to voice their opinion. I think this vote

[CODE4LIB] Anonymizing address data

2014-06-02 Thread Kyle Banerjee
HIPPA compliant data cannot include personally identifiable information, a category which includes address. The safe harbor approach where geographic subdivisions smaller than states cannot be used frequently renders data useless. The expert determination method is always an option and

Re: [CODE4LIB] Job Interview : A Libcoder's Helpful Advices

2014-05-12 Thread Kyle Banerjee
On Mon, May 12, 2014 at 7:29 AM, Bigwood, David dbigw...@hou.usra.eduwrote: Asking questions is an essential part of the interview. You are interviewing them as well as them you. But, never ask questions that can be easily answered by browsing their website or common reference works. It

Re: [CODE4LIB] Job Interview : A Libcoder's Helpful Advices

2014-05-12 Thread Kyle Banerjee
On Mon, May 12, 2014 at 11:32 AM, Tom Johnson johnson.tom+code4...@gmail.com wrote: At the very least, if you're going to hire for personality traits, you need to do some very serious thinking about whether and why you think those traits will actually make the person more effective at

Re: [CODE4LIB] separate list for Jobs

2014-05-09 Thread Kyle Banerjee
I have filters set up, and find they just don't work reliably. OK, they work 9 times out of 10, but things always slip through. Imho, there are more people inconvenienced by having jobs on the list (setting up filters, filters not working, unable to filter digests, etc.) than there are

Re: [CODE4LIB] Is it time to invite zoia to join the mailing list?

2014-05-08 Thread Kyle Banerjee
Aside the issue that giving specific individuals or bots preferential treatment reverses progress made towards greater equality, I would be concerned about the quality of participation by anyone who needs an invitation to join. Besides, it sends a message to other bots that didn't get an

Re: [CODE4LIB] separate list for jobs

2014-05-06 Thread Kyle Banerjee
On Tue, May 6, 2014 at 9:59 AM, Richard Sarvas richard.sar...@lib.uconn.edu wrote: Not to be a jerk about this, but why is the answer always No? There seem to be more posts on this list relating to job openings than there are relating to code discussions. Are job postings a part why this list

Re: [CODE4LIB] barriers to open metadata?

2014-04-30 Thread Kyle Banerjee
Lack of demand, particularly since many catalogs contain a lot of garbage metadata and/or resources that others cannot access. Plus, the information goes stale quickly. Not that there's no use for this information, but not that many people are asking. Also, despite declarations to wanting to

Re: [CODE4LIB] convert MODS XML into CSV or tab-delimted text

2014-04-22 Thread Kyle Banerjee
Given that you'll most likely have to deal with elements that are missing and/or repeat variable amounts of times, conditional mappings, and data that needs to be transformed, it may be easier to use a string parsing routine to do what you need. kyle On Tue, Apr 22, 2014 at 11:35 AM, English,

Re: [CODE4LIB] distributed responsibility for web content

2014-04-18 Thread Kyle Banerjee
While 'letting chaos reign' might seem the best solution, we've found that it also presents unforeseen accessibility and general readability issues, e.g, entire pages of bolded or inappropriately colored text, not to mention making entire websites look like, well, crap! This is a serious

  1   2   3   4   >