Note also that the utility program MarcEdit [1] has a merge capability. I don't think it uses a weighted match, but admit that I haven't played with it.
This is such a common problem that we really need a good, stand-alone, OS solution.

kc

[1] http://people.oregonstate.edu/~reeset/marcedit/

On 9/12/13 6:26 AM, Tom Morris wrote:
> On Tue, Sep 3, 2013 at 12:53 PM, Michael Beccaria
> <[email protected]> wrote:
>
>     I posted this awhile back to the ol-discuss listserve but didn't get
>     a response. ol-tech seems more appropriate.
>
>
> This list is more appropriate and it's where I answered your original
> question. Unfortunately, bcc'ing ol-discuss (to focus the discussion
> here) seems to have the effect of causing the message to not get forwarded
>
>     Anyone know the basics of getting started on this? I'm proficient in
>     coding but don't want to spend hours digging through piles of code
>     and framework documentation to find out this might not be possible
>     or there was an easier way.
>
>     Karen Coyle in the code4lib listserv pointed me in the direction of
>     the source code for the merge algorithms OL uses to de-dupe records
>
>     (https://github.com/openlibrary/openlibrary/tree/master/openlibrary/catalog/merge).
>     I'm interested in taking 2 sets of marc records and spitting out
>     either a report on similarity or a merged record set. I looked at
>     the code and the OL instructions but it isn't clear to me exactly
>     how the merge code fits in and whether it is possible to run it
>     independently of the overall system.
>
>     Anyone have any insight into this to point me in the right direction?
>
>
> Here's my original reply from the archive:
> http://www.mail-archive.com/[email protected]/msg01118.html
> Let us know if you have any follow-up questions.
>
> Tom
>
> [long .sig elided]
>
> [completely unrelated message elided]
>
>
> _______________________________________________
> Ol-tech mailing list
> [email protected]
> http://mail.archive.org/cgi-bin/mailman/listinfo/ol-tech
> To unsubscribe from this mailing list, send email to
> [email protected]
>

--
Karen Coyle
[email protected] http://kcoyle.net
ph: 1-510-540-7596
m: 1-510-435-8234
skype: kcoylenet
