Hello Henning,

contrary to Alexander, I can easily find scenarios where ingesting
records from external sources is a good (or the best) strategy, and we
have done quite a bit here at UAB, from all kind of sources.  So, after
taking a look at your example, you may be interested in avoiding some of
the mistakes I've run into.

> Today I had a meeting with my manager, and we decided to use a simple
> plain text input format (instead Excel/CSV) that should be convertible
> with BibConvert, e.g.:
>
>       FILE    Literature/Educational/16527.epub
>       CAT     Educational;Math
>       TITLE   1001 задача для умственного счета       
>       AUT     Rachinskij, Sergej Aleksandrovich
>       DESC    A collection of mathematical puzzles    
>       LANG    rus     
>       YEAR    1899    
>       PUBL    Project Gutenberg
>       SRC     gutenberg.org
>       LIC     Project Gutenberg License
>       REFNO   16527
>       KEY     sadanye
>       KEY     puzzle
>
> Most key codes can appear several times, and we defined some more
> codes than shown here. (Of course most contents will be in Russian,
> but I wrote my documentation and samples in English.)

First of all, please, please, make sure that those records have an
unique identifier.  You'll find yourself loading and reloading them,
either on your test or your production server.  This unique identifier
will allow Invenio to replace the older version with the new one reusing
the same Invenio identifier.  You can find a similar thread here:

 http://www.mail-archive.com/[email protected]/msg00468.html

If you have different «sources», you can use a simple subfield $9 for
the source identifier and $a for the number.

The second lesson is that, if you end up using «simple» office
applications, I think that repeteable values are better solved with a
known character, like the semicolon you seem to use, rather than
multiple columns, because you can repeat the value zero or more times
without wasting fields.

And finally, I don't know how flexible is BibConvert, as I don't use it,
but if you feel confortable with Python, probably in the long run it
will pay to invest on it, as probably you'll have to fiddle with some
subtle cases where the flexibilty of a real programming language will
help you.

>> For a unified search you'd like to have I think at least
>> Invenio 1.2 (current head master, if Tibor managed to merge
>> authority based searches yet).
>
> Grmbl, I forked at v1.1.2.473-1ab71 and decoupled. But I must learn
> how to manage upstream changes in git anyway. At the moment, I set my
> personal server repository as origin, develop on my Mac and pull
> changes from the development web server. All my changes
> (e.g. configuration, web style, docs) are in a personal branch, so it
> should cause no problems to pull master.

You may take a look at guilt.  I did a brief introduction during last
year Invenio Users Meeting that I hope to expand this november.  The
slides are here

 http://ddd.uab.cat/record/93913

My 2 cents,

Ferran

Reply via email to