Dear Vit,

On 22/03/13 08:24, Vít Tuček wrote:
Hello everyone,
  my employer decided to try to move his digital library to invenio and
I was tasked to facilitate that process. I kindly ask for your help in
assessing the work required to do so.

Our data are stored in TEI P5 XML and we already have XSL transforms
to extract MARC XML. We were thinking of the following workflow. (Of
course any ideas how to proceed in way better suited for Inveio are
appreciated.)

IMPORT: A daemon checks an assigned directory and if there is a zip
file it unzips it and runs the XSL transform on the resulting TEI XML
to produce the MARC XML. The MARC XML is then checked for sanity,
imported into the library and associated with the original TEI XML.
If you already have a MARC XML transformation, then your are nearly done :-) There are several ways to get content into Invenio, and I think which way you use depends mostly on how you want to do the migration. The central part of Invenio responsible for uploading metadata is BibUpload which takes a MARC XML file by default. Additionally we have BatchUploader which is basically a wrapper around BibUpload, which will monitor a directory and import MARC XML files. More details are available here http://invenio-demo.cern.ch/help/admin/bibupload-admin-guide

EXPORT: We would like to be able to export the TEI XML (or it's XSL
transforms) as collections from the web interface.

For this you would use BibFormat. You would add a new output format ( http://invenio-demo.cern.ch/help/admin/bibformat-admin-guide#addOutputFormat): Give it a Code and content-type (note, the code is important, there's special handling depending on the first letter, especially for x and h).

Then add a XSL format template for the output format that transforms MARC XML to TEI XML: http://invenio-demo.cern.ch/help/admin/bibformat-admin-guide#xslFormatTemplate

Here's an example of Dublin Core transformation:
Output format: http://invenio-software.org/repo/invenio/tree/modules/bibformat/etc/output_formats/XD.bfo?h=maint-1.1 Format template: http://invenio-software.org/repo/invenio/tree/modules/bibformat/etc/format_templates/OAI_DC.xsl?h=maint-1.1 The files live in etc/bibformat/format_templates and etc/bibformat/output_formats

Once you have the transformation, each record can be exported individually, or collective from the search page, as well as background job.

Best regards,
Lars

I installed Invenio locally and took a quick look around. I'm a little
bit lost in the documentation right now, but it seems that the import
is mostly a matter of proper configuration. As for export, so far it
seems to me that some scripting is needed.

Best regards,
                            Vit


--
Lars Holm Nielsen
Software Engineer

CERN, IT Department, Digital Library Technology Section
Office 513/1-014
Tel: +41 22 76 79182
Cel: +41 76 672 8927

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

Reply via email to