On 11.10.2012 09:22, Ferran Jorba wrote:

Hello Ferran!

915__:'StatID:(DE-HGF)0100' OR 915__:'StatID:(DE-HGF)0110' AND
9201_:'I:(DE-Juel1)ZB-20090406'

Actually, if these are exact patterns (full content of the tags, and not
substrings), replacing simple quotes with double quotes should give
faster search time:

915__:"StatID:(DE-HGF)0100" OR 915__:"StatID:(DE-HGF)0110" AND
9201_:"I:(DE-Juel1)ZB-20090406"

I fear I'd have to ask you for a fix concerning the braces in search
terms if they are contained in the "literal string" tags. (An

Maybe I'll take advantage of this thread so I can understand something
that never has been clear to me, but: which is the behaviour of the ':',
'(' and ')' characters as separators?  I mean, there are those two
Invenio config variables (CFG_BIBINDEX_CHARS_ALPHANUMERIC_SEPARATORS and
CFG_BIBINDEX_CHARS_PUNCTUATION) that maybe you can use to your
advangage:

  http://invenio-software.org/repo/invenio/tree/config/invenio.conf#n929

Ah. I'll give that a read. Thank you for the pointer! :) Still I feel
that an exact literal search should be possible. It's basically a "known
item search".

[...]
OTOH, I'm also intrigued why you don't create specific field names (with
or without indexes) with nice names for those tags searching if
searching them is critical to your installation.

These are already specific fields and "nice name". The tags mentioned
are all authority ids. The braces appear within them. For some context,
what I need to accomplish is indeed a search for a precisely known (set
of) item(s). The original example stemmed from a combined list of
publications that where either listed in JCR or Science Citation Index
expanded of the central library e.g. (We build it up as an URL, our
content management system fires it against invenio and gets back what it
needs to display.)

I'll give you an example for the Ids. Take authority records for people.
We have unique ids for each of employee (ie. all our identified
"internal authors"), and these ids are built up like

       P:(DE-Juel1)12345

"P:"         -> people,
                ie. we refer to a person here not to an institute
"(DE-Juel1)" -> issuing instance of this authority
                here FZJ library specified by it's ISIL
"12345"      -> unique number

However, you can well have something like

       I:(DE-Juel1)12345

Referring to an institute on campus. Or you can have

       P:(DE-82)12345

Referring to a person, but this authority is issued by DE-82, ie RWTH
Aachen, so it is not the same guy as the first one. Now, Aachen is one
of the next universities with which we additionally have a very close
(scientific) collaboration so we need to exchange records with them and
still need to tell both "John Smith" apart.

From the last example you see that the contents of the braces is
important if you imagine data exchange, which is one of the larger parts
of our ongoing projects (ie. if I introduced JuSER its always a synonym
for an instance of the same kind at DESY, GSI and RWTH Aachen).

The funny thing is, the braces that now cause some headache weren't our
invention, but specified in

       http://cdsweb.cern.ch/record/1376677

section 4.4, p.20. A given example there is

       institution:(SzGeCERN)xyz4321

We just shortend our ids to use I: instead of institution. And we were
not aware of the fact that exact search has a problem with the braces.

You only find it if you have a query that itself requires braces to
logically group several terms, like

       (InstA or InstB) and (TypeA or TypeB) and yearX and projectY

Say, all publications from the Institutes A and B which are either a
book or a contribution to a book that where published on grant no. 12345
in the year 2012.

HTH :)

--

Kind regards,

Alexander Wagner
Subject Specialist
Central Library
52425 Juelich

mail : [email protected]
phone: +49 2461 61-1586
Fax  : +49 2461 61-6103
www.fz-juelich.de/zb/DE/zb-fi


------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------
Forschungszentrum Juelich GmbH
52425 Juelich
Sitz der Gesellschaft: Juelich
Eingetragen im Handelsregister des Amtsgerichts Dueren Nr. HR B 3498
Vorsitzender des Aufsichtsrats: MinDir Dr. Karl Eugen Huthmacher
Geschaeftsfuehrung: Prof. Dr. Achim Bachem (Vorsitzender),
Karsten Beneke (stellv. Vorsitzender), Prof. Dr.-Ing. Harald Bolt,
Prof. Dr. Sebastian M. Schmidt
------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------

Kennen Sie schon unsere app? http://www.fz-juelich.de/app

Reply via email to