Re: [CODE4LIB] What happened to the code4lib blog?

2016-04-12 Thread Robert Haschart
Ralph, If you are looking for the spec that was detailed in that post, I cannot help you. If you are looking for a tool to actually perform the conversion and produce output that conforms to that spec it can be done via the Marc4J library. If you are interested I can give specific

Re: [CODE4LIB] Code4Lib 2015 Registration Update [spots still available]

2014-12-18 Thread Robert Haschart
the original reservation. <snark> I guess she was confused by all the other people with the same last name as me. </snark> It may be that the lower room availability is due to several similarly duplicated reservations. -Robert Haschart On 12/16/2014 10:59 PM, Tom Johnson wrote: The word from

Re: [CODE4LIB] MARC reporting engine

2014-11-03 Thread Robert Haschart
I was going to echo Erik Hatcher's recommendation of Solr and SolrMarc, since I'm the creator of SolrMarc. It does provide many of the same tools as are described in the toolset you linked to, but it is designed to write to Solr rather than to a SQL-style database. Solr may or may not be

Re: [CODE4LIB] MarcEdit Tasks power?

2014-06-26 Thread Robert Haschart
Jonathan, Using the as-yet-to-be-documented record editing capabilities of SolrMarc, I think you can do what you want. Create a file named editor338.properties in the same directory as SolrMarc.jar that contains: 338=true 338_0=and(subfieldmatches(a, online

[CODE4LIB] Fwd: FW: Lorem Ipsum metadata? Is there such a thing?

2013-12-10 Thread Robert Haschart
Forwarding a message for someone who's having trouble posting... Original Message __ From: Roland, Perry (pdr4h) Sent: Tuesday, December 10, 2013 11:41 AM To: Code for Libraries Subject: RE: Lorem Ipsum metadata? Is there such a thing? oXygen can generate sample XML files

Re: [CODE4LIB] Looking for two coders to help with discoverability of videos

2013-12-02 Thread Robert Haschart
but the custom extraction routines seem directly applicable to your goals, and may also provide a template that may make your other goals more easily achievable. -Robert Haschart On 12/2/2013 12:37 AM, Kelley McGrath wrote: I wanted to follow up on my previous post with a couple points. 1

Re: [CODE4LIB] ruby-marc api design feedback wanted

2013-11-20 Thread Robert Haschart
When I first started working on marc4j, it behaved as suggested here, i.e. it expected the records to be correctly formed in almost every respect and threw an exception when an error was encountered. It was done in a way that didn't even allow the processing to continue with the

Re: [CODE4LIB] anyone know how to properly do a marc4j release?

2013-11-13 Thread Robert Haschart
I believe that is one of the open issues for Marc4j. I do not know how to push a jar or a new version of a jar to a Maven repo. I believe Bill Dueber was looking into this just last month when he wrote the following to the Solrmarc list: I'm trying to get marc4j into maven central, and I

Re: [CODE4LIB] anyone know how to properly do a marc4j release?

2013-11-13 Thread Robert Haschart
pushing projects into Maven's central repo through Sonatype. Maven has a standard structure (that you don't have to use, but it makes things easier/more-Maven-ish). Would you want the project reorganized into that structure in the process? Kevin On Wed, Nov 13, 2013 at 5:15 PM, Robert

Re: [CODE4LIB] pdf2txt [tesseract]

2013-10-17 Thread Robert Haschart
On 10/17/2013 9:43 AM, Eric Lease Morgan wrote: On Oct 16, 2013, at 10:56 AM, Robert Haschart rh...@virginia.edu wrote: The abstract extraction routine I have been working on does use tesseract internally for doing OCR when it encounters a document that doesn't have usable full-text. I agree

Re: [CODE4LIB] pdf2txt

2013-10-16 Thread Robert Haschart
On 10/15/2013 12:25 PM, Eric Lease Morgan wrote: On Oct 14, 2013, at 4:49 PM, Robert Haschart rh...@virginia.edu wrote: For a limited period of time I am making publicly available a Web-based program called PDF2TXT -- http://bit.ly/1bJRyh8 Although based on some subsequent messages where you

Re: [CODE4LIB] pdf2txt

2013-10-14 Thread Robert Haschart
Eric, Very interesting. I have been working with some existing PDF utilities with a goal of automatically extracting the abstract from technical reports, articles and dissertations that are to be bulk uploaded to our institutional repository. I tried two of our documents through your

[CODE4LIB] Editing Code4lib Wiki

2013-02-06 Thread Robert Haschart
I have tried editing the Code4lib wiki several times, but keep getting a "you have not confirmed your e-mail address" message. I then go to the Preferences page and try to do so. I am told that a confirmation code is being mailed to me, but no mail ever seems to arrive. Does anybody have any

Re: [CODE4LIB] haititrust

2012-08-14 Thread Robert Haschart
Eric, These blog postings are interesting. Here at UVa we have added MARC records for publicly accessible items from Hathi Trust into our Solr-based online catalog, but we have made no attempt yet to link from the records drawn from our ILS that reference physical items on the shelves the

Re: [CODE4LIB] more on MARC char encoding: Now we're about ISO_2709 and MARC21

2012-04-19 Thread Robert Haschart
. -Robert Haschart

Re: [CODE4LIB] Project Gutenberg MARC

2012-03-16 Thread Robert Haschart
That's pretty cool. I just downloaded those records, tweaked my SolrMarc import specification, and added the 466 records to our Blacklight Solr index. Currently they are only in our dev index, but I plan to get the OK to add them to our production index sometime next week. -Bob Haschart