Note also that the utility program MarcEdit [1] has a merge capability. 
I don't think it uses a weighted match, but admit that I haven't played 
with it.

This is such a common problem that we really need a good, stand-alone, 
OS solution.

kc
[1] http://people.oregonstate.edu/~reeset/marcedit/

On 9/12/13 6:26 AM, Tom Morris wrote:
> On Tue, Sep 3, 2013 at 12:53 PM, Michael Beccaria
> <[email protected] <mailto:[email protected]>> wrote:
>
>     I posted this awhile back to the ol-discuss listserve but didn't get
>     a response. ol-tech seems more appropriate.
>
>
> This list is more appropriate and it's where I answered your original
> question.  Unfortunately, bcc'ing ol-discuss (to focus the discussion
> here) seems to have the effect of causing the message to not get forwarded
>
>     Anyone know the basics of getting started on this? I'm proficient in
>     coding but don't want to spend hours digging through piles of code
>     and framework documentation to find out this might not be possible
>     or there was an easier way.
>
>     Karen Coyle in the code4lib listserv pointed me in the direction of
>     the source code for the merge algorithms OL uses to de-dupe records
>     
> (https://github.com/openlibrary/openlibrary/tree/master/openlibrary/catalog/merge).
>     I'm interested in taking 2 sets of marc records and spitting out
>     either a report on similarity or a merged record set. I looked at
>     the code and the OL instructions but it isn't clear to me exactly
>     how the merge code fits in and whether it is possible to run it
>     independently of the overall system.
>
>     Anyone have any insight into this to point me in the right direction?
>
>
> Here's my original reply from the archive:
> http://www.mail-archive.com/[email protected]/msg01118.html
> Let us know if you have any follow-up questions.
>
> Tom
>
> [long .sig elided]
>
> [completely unrelated message elided]
>
>
> _______________________________________________
> Ol-tech mailing list
> [email protected]
> http://mail.archive.org/cgi-bin/mailman/listinfo/ol-tech
> To unsubscribe from this mailing list, send email to 
> [email protected]
>

-- 
Karen Coyle
[email protected] http://kcoyle.net
ph: 1-510-540-7596
m: 1-510-435-8234
skype: kcoylenet
_______________________________________________
Ol-tech mailing list
[email protected]
http://mail.archive.org/cgi-bin/mailman/listinfo/ol-tech
To unsubscribe from this mailing list, send email to 
[email protected]

Reply via email to