On 09.09.2011 13:46, Tibor Simko wrote:

Tibor,

2011-09-09 09:30:31 -->  EMERGENCY: tmp_idxWORD09F needs to repair 205 of
205 index records

It looks like you have tmp_idx* tables left behind from a previous
re-indexing job.  You can safely remove these by doing:

   $ echo "SHOW TABLES LIKE 'tmp%'" | /opt/invenio/bin/dbexec
   $ echo "DROP TABLE tmp_idxWORD09F" | /opt/invenio/bin/dbexec
   $ echo "DROP TABLE tmp_idxWORD09R" | /opt/invenio/bin/dbexec
   [...]

as usual you made my day :) This is exactly what happend, I removed the
tmp_ tables and now I'm well beyond the failing index. Thanks for the
pointer! :)

Probably those lines should go to the reindexing section of
bibindex-admin as well?

Curiously enough, my problem appears in a --reindex call.
I'd guess that this should recreate the indexes from scratch
anyway so any inconsistency should be resolved by this
automatically regardless if it existed before.

Yes, `bibindex -R' should well drop any temporary tables that may have
been left behind from former re-indexing jobs.  It seems that in your
case the tables were not dropped.  This may be because you are running
an older master branch of Invenio?  (prior to 2010-03-31?)

Actually, the box where I gave that a try is still on 1.0rc0 you on the
web while ago. We'll move on to a newer version soon. I was mainly just
testing some larger chunks of data on a fresh box to get some idea of
time required for our final migrations, initial loading and indexing and
the quality of some autogenerated records on one of our real boxes
instead of some small scale vm on my notebook. For what it's worth we'll
not be able to start out with a small setup and build up slowly, but our
initial installation will already be in a ballpark of some the 10^6
records. Would be nasty if that fails and we'd have to hotfix it.

BTW: on this test I found that by default external webpages seems to end
up in the full text index due to

       CFG_BIBINDEX_FULLTEXT_INDEX_LOCAL_FILES_ONLY = 0

It might be a safer default to set this to 1, initally. Actually, I
loaded some 30.000 records containing external URLs linking to journal
homepages... So the crashing indexer was actually a feature and
prevented me to index all of <place your favourite oligarchs here>. ;)

--

Kind regards,

Alexander Wagner
Subject Specialist
Central Library
52425 Juelich

mail : [email protected]
phone: +49 2461 61-1586
Fax  : +49 2461 61-6103
http://www.fz-juelich.de/zb/mitarbeiter/fachinformation#wagner


------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------
Forschungszentrum Juelich GmbH
52425 Juelich
Sitz der Gesellschaft: Juelich
Eingetragen im Handelsregister des Amtsgerichts Dueren Nr. HR B 3498
Vorsitzender des Aufsichtsrats: MinDirig Dr. Karl Eugen Huthmacher
Geschaeftsfuehrung: Prof. Dr. Achim Bachem (Vorsitzender),
Karsten Beneke (stellv. Vorsitzender), Prof. Dr.-Ing. Harald Bolt,
Prof. Dr. Sebastian M. Schmidt
------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------

Reply via email to