[Koha] How to remove unwanted characters when importing MARC data?

Michael Kuhn Wed, 21 Jun 2017 09:56:08 -0700

Hi

Our library receives MARC data from EKZ (a German cataloging data provider) which includes two unwanted characters:


* a beginning "non-sorting character"
* an ending "non-sorting character"

These characters can't be seen in the OPAC or in the hit list of the staff client, but they do appear in the framework and also in the top line of the web browser. Here is an example of a file containing such characters: http://adminkuhn.ch/download/kuhn0000000


When opening the original .mrc file with vi these characters show as:

<98>The<9c> obsession

With "od -c" they show as:

302 230   T   h   e 302 234       o   b   s   e   s   s   i   o   n
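Read as UTF-8, the octal pairs 302 230 and 302 234 are the bytes C2 98 and C2 9C, i.e. the control characters U+0098 and U+009C. A minimal Python sketch to check this (the sample bytes are typed in from the od output above, not read from the file):

```python
import unicodedata

# Bytes as shown by "od -c": 302 230 = C2 98, 302 234 = C2 9C
sample = b"\xc2\x98The\xc2\x9c obsession"
text = sample.decode("utf-8")

# Collect the code points of all control characters (Unicode category "Cc")
controls = [f"U+{ord(ch):04X}" for ch in text if unicodedata.category(ch) == "Cc"]
print(controls)  # ['U+0098', 'U+009C']
```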

Of course these characters could be removed, e.g. with sed (but this will result in a wrong record length in MARC leader positions 0-4), and it also has to be done separately on the shell, outside of and before the regular import process. Another option would be software like MarcEdit.
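To illustrate why a plain byte replacement is not enough: besides the record length in the leader, the field lengths and offsets in the directory change as well. The following is only a sketch in Python of how one record could be cleaned and rebuilt. It assumes UTF-8 data and the usual MARC 21 directory layout (3-byte tag, 4-digit length, 5-digit start position); the function name strip_non_sort is made up for this example and is not part of Koha.

```python
NON_SORT = (b"\xc2\x98", b"\xc2\x9c")  # UTF-8 bytes for U+0098 and U+009C
FT, RT = b"\x1e", b"\x1d"              # field terminator, record terminator

def strip_non_sort(record: bytes) -> bytes:
    """Remove the non-sorting characters from one ISO 2709 record
    and recompute the directory and the lengths in the leader."""
    leader = record[:24]
    base = int(leader[12:17])            # base address of data
    directory = record[24:base - 1]      # without its closing field terminator

    entries = []
    body = b""
    for i in range(0, len(directory), 12):
        tag = directory[i:i + 3]
        length = int(directory[i + 3:i + 7])
        start = int(directory[i + 7:i + 12])
        field = record[base + start:base + start + length]  # includes FT
        for seq in NON_SORT:
            field = field.replace(seq, b"")
        # New directory entry: tag, new field length, new start position
        entries.append(tag + b"%04d%05d" % (len(field), len(body)))
        body += field

    new_dir = b"".join(entries) + FT
    new_base = 24 + len(new_dir)
    total = new_base + len(body) + len(RT)
    # Leader 0-4 = record length, 12-16 = base address; the rest is kept
    new_leader = b"%05d" % total + leader[5:12] + b"%05d" % new_base + leader[17:]
    return new_leader + new_dir + body + RT
```

A file with several records would have to be split into single records first (each record ends with the byte 0x1D) and the cleaned records written back in the same order. This still happens outside of Koha, so it does not answer the question below.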

Now the question is whether there is an EASY way to delete these unwanted characters within Koha, for example by using the MARC modification templates, which are used anyway when loading such data?


Best wishes: Michael
--
Geschäftsführer · Diplombibliothekar BBS, Informatiker eidg. Fachausweis
Admin Kuhn GmbH · Pappelstrasse 20 · 4123 Allschwil · Schweiz
T 0041 (0)61 261 55 61 · E m...@adminkuhn.ch · W www.adminkuhn.ch
_______________________________________________
Koha mailing list  http://koha-community.org
Koha@lists.katipo.co.nz
https://lists.katipo.co.nz/mailman/listinfo/koha
