Re: [CODE4LIB] Mac OS 9 emulator

2015-04-22 Thread Bryan Baldus
On Wednesday, April 22, 2015 1:54 PM, Matt Sherman wrote:
Why would you not just run an instance in Virtual Box?

Does Virtual Box support Mac OS 9 now? I haven't kept up on developments 
recently, but in the first few results in Googling virtualbox Mac os 9 
(without the quotes), it doesn't appear to be so ([1], [2]), though they are 
from 2008-2012.

SheepShaver [3] would be my suggestion (though as I said, it's been a few years 
since I've kept track of developments)

[1] http://www.emaculation.com/forum/viewtopic.php?f=1t=7528
[2] https://forums.virtualbox.org/viewtopic.php?f=1t=9635
[3] http://www.emaculation.com/doku.php/sheepshaver

I hope this helps,

Bryan Baldus
Senior Cataloger
Quality Books Inc.
The Best of America's Independent Presses
1-800-323-4241x402
bryan.bal...@quality-books.com


Re: [CODE4LIB] Anybody know a way to add a MARC tag on-mass to a file of MARC records

2014-08-28 Thread Bryan Baldus
On Thursday, August 28, 2014 1:27 PM, Ray Schwartz wrote:
Anybody know a way to add a MARC tag on-mass to a file of MARC records.  I 
need to add the tag 918 $a with the contents DELETE to each of the records.

What language are you using? For Perl, using MARC::Record, there is an example 
at [1].

[1] 
http://search.cpan.org/dist/MARC-Record/lib/MARC/Doc/Tutorial.pod#Adding_a_field

I hope this helps,

Bryan Baldus
bryan.bal...@quality-books.com
eij...@cpan.org
http://home.comcast.net/~eijabb/


Re: [CODE4LIB] convert MODS XML into CSV or tab-delimted text

2014-04-22 Thread Bryan Baldus
On Tuesday, April 22, 2014 1:36 PM, Eben English wrote:
Does anyone out there have an XSL stylesheet to transform MODS XML into a CSV 
or tab-delimited text file?
Even if it's highly localized to your own institution/project, it would 
probably still be useful.

I'm not sure how well it would work, but MarcEdit [1] has a MODS=MARC XML 
conversion option, and an option to Export Tab Delimited Records.

[1] http://marcedit.reeset.net/

I hope this helps,

Bryan Baldus
Senior Cataloger
Quality Books Inc.
The Best of America's Independent Presses
1-800-323-4241x402
bryan.bal...@quality-books.com
eij...@cpan.org
http://home.comcast.net/~eijabb/


Re: [CODE4LIB] Mapping LCSH to DDC

2013-12-11 Thread Bryan Baldus
Thanks everyone for your responses- especially for pointing out the classify 
web service, which was new to me.

Speaking of OCLC's Classify service, MarcEdit [1] includes a feature that makes 
use of Classify to generate call numbers.

[1] http://marcedit.reeset.net/

I hope this helps,

Bryan Baldus
Senior Cataloger
Quality Books Inc.
The Best of America's Independent Presses
1-800-323-4241x402
bryan.bal...@quality-books.com


Re: [CODE4LIB] Mapping LCSH to DDC

2013-12-10 Thread Bryan Baldus
On Tuesday, December 10, 2013 7:18 AM, Irina Arndt wrote:
we would like to add DDC classes to a bunch of MARC records, which contains 
only LoC Subject Headings. Does anybody know, if a mapping between LCSH and 
DDC is anywhere existent (and available)?
...
I'm thinking of a tool, where I can upload my list of subject headings and get 
back a list, where the matching Dewey classes have been added (but a 'simple' 
csv file with LCSH terms and DDC classes would be helpful as well- I am fully 
aware, that neither LCSH nor DDC are simple at all...) . Naïve idea...?

Classification Web offers a correlations feature between Dewey and the 1st 
LCSH, based on usage in LC's database (as well as correlations between LCC and 
LCSH, and DDC and LCC). It is of some use in helping the cataloger determine 
possible classifications or subject headings to use. Unfortunately, I don't 
believe ClassWeb is easily accessible by automated processes (even for 
subscribers). Even if it were, I doubt it is possible to automate a process of 
assigning Dewey based on 1st LCSH. As mentioned, the 1st LCSH and 
classification are generally supposed to be similar/linked, but that applies 
more to LCC/LCSH than DDC to LCSH, due to the way Dewey works. For example, 
ClassWeb correlation between LCSH Disease management (chosen while looking at 
Health, then Disease, then looking for an example showing a better variety of 
Deweys than the 1st 2) shows DDCs used by LC (counts of records in parentheses):

Disease management [Topical]
 362.1 (4)
 610.285 (1)
 615.1 (1)
 615.5071 (1)
 616.89142 (1)



That said, as Ed mentioned, given a large set of records for training, you 
should be able to develop something to help local catalogers determine possible 
Deweys record-by-record.

I hope this helps,

Bryan Baldus
Senior Cataloger
Quality Books Inc.
The Best of America's Independent Presses
1-800-323-4241x402
bryan.bal...@quality-books.com


Re: [CODE4LIB] a note on MARC8 to UTF8 transcoding: Character references

2013-11-05 Thread Bryan Baldus
So be warned, you may need to add this to your software too.

One of these that may cause problems in some systems (including the ones we 
use; hopefully our customers' systems deal with it more appropriately) is the 
character used in the middle of [1],  the Extended Roman alif character 
which was changed to #x02bc; in 2005 [2], though I only saw that code in 
bibliographic and authority records starting around April of 2013, about the 
same time as NAR n 79046204 was updated (I don't believe those were related 
events, though). Some systems aren't able to translate the #x02bc to the 
appropriate ' (apostrophe) character, making searching for things with that 
character more challenging.

[1] http://lccn.loc.gov/n79046204
[2] http://www.loc.gov/marc/marbi/2005/2005-05.html

Thank you,

Bryan Baldus
Senior Cataloger
Quality Books Inc.
The Best of America's Independent Presses
1-800-323-4241x402
bryan.bal...@quality-books.com


Re: [CODE4LIB] LOC Subject Headings API

2013-06-04 Thread Bryan Baldus
On Tuesday, June 04, 2013 6:31 PM, Joshua Welker [jwel...@sbuniv.edu] wrote:
I am building an auto-suggest feature into our library's search box, and I am 
wanting to include LOC subject headings in my suggestions list. Does anyone 
know of any web service that allows for automated harvesting of LOC Subject 
Headings? I am also looking for name authorities, for that matter. Any format 
will be acceptable to me: RDF, XML, JSON, HTML, CSV... I have spent a while 
Googling with no luck, but this seems like the sort of general-purpose thing 
that a lot of people would be interested in. I feel like I must be missing 
something. Any help is appreciated.

Have you seen http://id.loc.gov/ with bulk downloads in various formats at 
http://id.loc.gov/download/

I hope this helps,

Bryan Baldus
Senior Cataloger
Quality Books Inc.
The Best of America's Independent Presses
1-800-323-4241x402
bryan.bal...@quality-books.com
eij...@cpan.org
http://home.comcast.net/~eijabb/


Re: [CODE4LIB] Minimal bibliographic record as filename

2013-05-22 Thread Bryan Baldus
On Wednesday, May 22, 2013 8:01 AM, Joel Marchesoni wrote:
I like the idea of a master ISBN (one number to rule them all? Sorry, too 
easy) but I think failing that I'd stick with the Ebook's ISBN. Any search on 
it will give a user the title and author of the work. Plus, it gives you a 
unique number for each item.

Are publishers better about how they use ISBNs on electronic resources than 
they are on books? Because an ISBN, while theoretically unique when applied as 
intended, is certainly not always unique in practice (where publishers reuse 
ISBNs for subsequent editions, and even sometimes for completely different 
books). That said, if you are including an ISBN in the name (for resources 
which have ISBNs), then I'd agree that the one that applies to the e-book 
version(s) should be used rather than the one(s) that should apply to the print 
version(s). As far as I know, there is not such a thing like a main or 
master ISBN [1].

###

[1] Wednesday, May 22, 2013 02:30, Michael Lackhoff

Thank you for your time,

Bryan Baldus
Senior Cataloger
Quality Books Inc.
The Best of America's Independent Presses
1-800-323-4241x402
bryan.bal...@quality-books.com


Re: [CODE4LIB] Invitation to join us at Screening the Future 2012, USC, Los Angeles, May 21-23

2012-03-07 Thread Bryan Baldus
On Wednesday, March 07, 2012 11:59 AM, Win Shih [winyu...@usc.edu] wrote:
[attachment] Screening the Future 2012 - flyer.pdf (2 MB)

While I appreciate receiving announcements of programs to attend and other 
messages, I'm less enthousiastic about receiving large e-mail attachments. I'm 
no longer on dial-up (thankfully), but I do know people who are, and trying to 
retrieve e-mail for several hours is not a pleasant experience. Plus, since I 
try to save the majority of my mail in case I need it at some point in the 
future, I often run close to the maximum storage limit for my account, so a 
large attachment could easily put me over the limit, hindering receipt of 
additional e-mails. If the information is available in the body of the message 
as well as at a link provided in the body, why is it necessary to include a 
large attachment as well?

(p.s. I apologize for sounding harsh. In other words, I long for the days of 
plain-text-only discussion lists.)

Thank you,

Bryan Baldus
Cataloger
Quality Books Inc.
The Best of America's Independent Presses
1-800-323-4241x402
bryan.bal...@quality-books.com


Re: [CODE4LIB] MARCXML - What is it for?

2010-10-25 Thread Bryan Baldus
On  Monday, October 25, 2010 1:50 PM, Andrew Hankinson wrote:
- Documents can be validated for their well-formedness using these existing 
tools and a pre-defined schema (a validator for MARC would need to be 
custom-coded)

In Perl, MARC::Lint might be an example of such a validator (though I need to 
update it with the most recent MARC updates at some point soon). MarcEdit also 
includes a validator.

Bryan Baldus
bryan.bal...@quality-books.com
eij...@cpan.org
http://home.comcast.net/~eijabb/


Re: [CODE4LIB] simple,flexible ILS for a small library.

2010-09-28 Thread Bryan Baldus
On Monday, September 27, 2010 5:15 PM, Elliot ... [offonoffoffon...@gmail.com] 
wrote:
Some folks in the VuFind library suggested I ask here.  We are starting a 
small library and thinking of using VuFind as our online catalog.  As for the 
ILS we would like something small and simple (evergreen and others seem 
massive for the small amount of functionality we need), and especially 
something which is flexible enough to allow us to base our library on book 
sharing rather than an institutionally owned collection.

I didn't see Koha on your list (though I have almost no direct experience with 
it and so can't comment on its suitability for your needs). A discussion on 
what appears to have been the Vufind-tech mailing list [1] around Friday, March 
05, 2010 9:01 AM, indicates some work remained to be done to make Koha and 
VuFind work together.

[1] The original site had trouble coming up on my browser, so I was only able 
to read via Google's cached version: http://bit.ly/aFZQIU 

I hope this helps,

Bryan Baldus
bryan.bal...@quality-books.com
eij...@cpan.org
http://home.comcast.net/~eijabb/


[CODE4LIB] New MARC-related Perl scripts--440s, 008lang zxx

2009-07-29 Thread Bryan Baldus
I have posted 2 new Perl .pl programs to my website [1]. Both use a module, 
MARC::QBI::Misc, not currently available, but it is used only for file 
handling, so that section of the program should be revised according to local 
needs.

440to490-830.txt [2]: Converts 440 to 490-830 pairs. Retains subfield 'x' 
(ISSN) from 440; deletes all square brackets from 830, adds period at the end 
of the 830 if punctuation is not already present. If any quotes or apostrophes 
exist in the 440, a period is added before the quote(s) or apostrophe(s) in the 
830. I've only done limited testing, so results may not be 100% correct yet.

008langblanktozxx.txt [3]: Converts 008/35-37 from 3 blank spaces to zxx.

##

Not yet started, but next on my list:

-Convert 'c' in books 008/33 (comic strips) to 008/24-27 '6' (Comics/graphic 
novels); revise 008/33 to 0 or 1, depending on fiction vs. non-fiction.

-Convert 041 subfield 'b' to 'j' for videorecording records.

##

I welcome any comments or corrections.

[1] http://home.inwave.com/eija/
[2] 
http://home.inwave.com/eija/fullrecscripts/Cleanup_full_recs/440to490-830.txt
[3] 
http://home.inwave.com/eija/fullrecscripts/Cleanup_full_recs/008langblanktozxx.txt

Thank you for your time,

Bryan Baldus
bryan.bal...@quality-books.com
eij...@cpan.org
http://home.inwave.com/eija


Re: [CODE4LIB] LOC Authority Data

2008-09-23 Thread Bryan Baldus
On Tuesday, September 23, 2008 3:35 PM, Andrew Nagy wrote:
Hello - I am curious if anyone knows of a way to access the entire collection 
of authority records from the LOC.  It seems that the only way to access them 
know is one record at a time.  Feel free to email me off line if you are 
uncomfortable posting a response to the list.

One way (as you likely know) (official, expensive) is via The Library of 
Congress Cataloging Distribution Service:

http://www.loc.gov/cds/mds.html#lcaf:

LC Authority Files
Name Authorities
MARC records for personal, corporate, conference, and geographical name 
headings, uniform titles, and series established by LC and cooperating 
libraries under the National Coordinated Cataloging Operations (NACO) program. 
Names written in non-roman script appear in romanized form only. Available in 
MARC 21 and MARCXML formats.

2008 Subscription: Available weekly. Approximately 450,000 records; including 
250,000 new records.

2008 Price: $10,565

Retrospective: 1977-2007. 7,000,000 records. File size: 3,350 MB. Avg. record 
length: 479 bytes.

2008 Price: $10,675



Otherwise, as far as I am aware, the files that are available (for free) are 
less than current.

I hope this helps,

Bryan Baldus
Cataloger
Quality Books Inc.
The Best of America's Independent Presses
1-800-323-4241x402
[EMAIL PROTECTED]


Re: [CODE4LIB] alpha characters used for field names

2008-06-25 Thread Bryan Baldus
On Wed, Jun 25, 2008 at 2:20 PM, Eric Lease Morgan [EMAIL PROTECTED] wrote:
 Moreover, does something like Marc4J or MARC::Batch and friends deal
 with these alpha field names correctly?

I believe the Perl modules MARC::Batch/MARC::Record accept records with 
alphabetic characters as tags. Searching alpha tags (without quotes) at The 
Mail Archive for Perl4Lib [1] retrieves 6 messages in what appear to be two 
threads.

[1] http://www.mail-archive.com/[EMAIL PROTECTED]q=alpha+tags

I hope this helps,

Bryan Baldus
[EMAIL PROTECTED]
[EMAIL PROTECTED]
http://home.inwave.com/eija


Re: [CODE4LIB] Blacklight released - ruby on rails and solr open source OPAC

2008-01-28 Thread Bryan Baldus
On Monday, January 28, 2008 10:31 AM, Bess Sadler wrote:
Blacklight, an open source OPAC using ruby on rails and solr, has now been
released under an Apache 2.0 license.
...
The code4lib community has been very supportive of this project, so thank
you! I hope that some of you who have expressed interest in the past will
consider joining us as developers, testers, documenters, or just by making
suggestions for how to improve.

I like that you display the entire MARC record as part of the main display
once the user has selected the record.

Are there plans to add the ability to browse alphabetically, at least by
author, title, subject, if not also series/uniform title, call number, and
others?

If I know the name of an author, for example, Shakespeare, William,
1849-1931 (NAR no 92032318 ) and want to find works either by or about him
(and/or want to choose only one of those options), will it be possible to do
so? Without retrieving records for Shakespeare, William, 1564-1616 that are
not related to the former?

On the brief display, why do you display only Author:  Shakespeare,
william, (only $a) rather than Author:  Shakespeare, william, 1564-1616
or Author:  Shakespeare, william, 1849-1931 (the entire field, or perhaps
all but $e and $4)? Why display Wagner, alice m., rather than Wagner,
Alice M.,?

Are there plans to allow controlled headings to be clicked to conduct a
search? For example, if I happen upon a record

When searching in the Music tab, there are subject/genre/geographic options
that don't appear to be available on the Entire Catalog tab.

On the results, will it be possible to sort the results in various ways?

Thank you,

Bryan Baldus
Cataloger
Quality Books Inc.
1-800-323-4241x402
[EMAIL PROTECTED]
[EMAIL PROTECTED]
http://home.inwave.com/eija


[CODE4LIB] MARC::Lint and Errorchecks updated on CPAN

2007-10-04 Thread Bryan Baldus
(I apologize to Perl4Lib subscribers for this duplicate message.)

I've posted updated versions to CPAN of MARC::Lint (v. 1.43) and
MARC::Errorchecks (v. 1.13). I've also uploaded new versions of
MARC::Lintadditions (v. 1.12) and a stand-alone copy of
MARC-Lint-CodeData (v. 1.18) to my personal home page [1].

Note: MARC::Lintadditions is provided as a stand-alone module and
must be installed manually (copy the .pm to the MARC:: folder, next
to Lint, Record, Errorchecks, etc.). I still hope to integrate most
of its checks into MARC::Lint, but progress so far has been rather
slow due to other projects.

Other notes: The version of MARC::Lint::CodeData provided with Lint
and Errorchecks should be identical. I've experienced difficulty
installing both modules through PPM on Windows, perhaps due to
CodeData being included with both modules.

Changes for each appear below:

MARC::Lint:

1.43Wed October 3 19:36:00 CDT 2007

 [THINGS THAT MAY BREAK YOUR CODE]

 - Updated Lint::DATA section with Update No. 7 (Oct. 2006)

 - MARC::Lint is incompatibile with Business::ISBN versions
2.00-2.02_01.
 Business::ISBN versions below 2 and 2.02_02 or above should work.

 - Updated check_record's treatment of 880 fields. Now if the tagno
is
 880, check_record attempts to look at subfield 6 for the linked
tagno
 and uses that as the basis for the tagno to be checked.

 - Updated _check_article to account for 880, using subfield 6
linked
 tagno instead.
 - Updated _check_article to account for articles followed
parentheses,
 apostrophes and/or quotes. Also related bug fixes for counting
 punctuation around the article.

 - For subfield 6, it should always be the 1st subfield
according to MARC
 21 specifications, so check_245 has been updated to account
for subfield
 6 being 1st, rather than requiring subfield a to be 1st.

 - Added new test, test880and6.t for 880 field and for subfield 6.

 - Added TODO concerning subfield 9. This subfield is not officially
 allowed in MARC, since it is locally defined. Some way needs to be
made
 to allow messages/warnings about this subfield to be turned off.

 - Added TODO concerning subfield 8. This subfield could be the 1st
or
 2nd subfield, so the code that checks for the 1st few subfields
 (check_245, check_250) should take that into account.

 - Updated MARC::Lint::CodeData with most recent version.

###

MARC::Errorchecks:

Version 1.13: Updated Aug. 26, 2007. Released Oct. 3, 2007.

  -Uncommented valid MARC 21 leader values in %ldrbytes to remove
local practice. Libraries wishing to restrict leader values should
comment out individual bytes to enable errors when an unwanted value
is encountered.
  -Added ldrvalidate.t.pl and ldrvalidate.t tests.
  -Includes version 1.18 of MARC::Lint::CodeData.


###

MARC::Lintadditions:

Version 1.12: Updated Mar. 1-Aug 26, 2007. Released Oct. 3, 2007.

  -Updated check_042 with new code, ukblderived, from Technical Notice
for Aug. 13, 2007.
  -Updated check_042 with new code, scipio, from Technical Notice for
Mar. 1, 2007.
  -Updated check_xxx methods (check_250) to account for subfield '6'
as 1st subfield.


###

MARC::Lint::CodeData.pm:

Versions 1.15 to 1.18: Updated Feb. 28, 2007-Aug. 14, 2007.


  -Added new source codes from Technical Notice of Aug. 13, 2007.
  -Added new source codes from Technical Notice of July 13, 2007.
  -Added new source codes from Technical Notice of Apr. 5, 2007.
  -Added new country and geographic codes from Technical Notice of
Feb. 28, 2007.
  -Added 'yu ' to list of obsolete codes.

###

[1] http://home.inwave.com/eija/bryanmodules/

Please let me know of any problems, corrections, or suggestions.

Thank you for you assistance,

Bryan Baldus
[EMAIL PROTECTED]
[EMAIL PROTECTED]
http://home.inwave.com/eija