On 25.02.2014 21:15, Tibor Simko wrote:
Hi!
[...]
>> Which is then an exact match, right? So to get '' matches one would
use "*bla*", right?
No, actually, not an exact match, but a word pair match. Here is a
possibly clearer example. Consider the following record:
245 $a The Kreutzer Sonata
When users type:
245:'Kreutzer Sonata'
245:"Kreutzer Sonata"
So, if I get it, it checks the words and does substring like match if
the full words match. Right?
If I get it
245:"Sonata Kreutzer"
would not match. Right? (Any word in field kind of thing.)
What defines "end of word"? I think about this ID-thingy: how many words
are things like "P:(DE-Juel1)12345".
then the record won't be returned; people would have to type:
245:/reutzer son/
in order to get a substring match.
IMHO "*reutzer son*" would be an easier to remember syntax for mere
mortals. Does this work as well?
In summary:
+-----------------------+-------------------+--------------------+
| QUERY | CURRENT BEHAVIOUR | PROPOSED BEHAVIOUR |
+-----------------------+-------------------+--------------------+
| 245:'Kreutzer Sonata' | hit | hit |
| 245:"Kreutzer Sonata" | miss | hit |
I'm not sure about the hit here in the new version.
| 245:'reutzer son' | hit | miss |
| 245:"reutzer son" | miss | miss |
| 245:/reutzer son/ | hit | hit |
+-----------------------+-------------------+--------------------+
Note that proposed behaviour is already the case for some logical
indexes such as "title" in Invenio v1.1 release series and above.
I found that Invenio is doing fancy stuff in certain fields (author
seems to be very special...)
The
current RFC proposes to widen its scope to cover all indexes, including
physical MARC queries.
I agree that having the same behaviour in all fields would be desirable.
Still, but this is a feeling, I'm not sure that giving up "exact match"
type searches is a good idea.
[...]
if you map "sid:(DE-HGF)1" to the old 'sid:(DE-HGF)1' it matches also
"sid:(DE-HGF)11", which is wrong and not intended.
Nope, it would not be mapped that way, see above. The ID matching would
remain safe.
So word ends are white spaces? Or is it that "" does not use permutations?
--
Kind regards,
Alexander Wagner
Scientific Services / Scientific Publishing
Central Library
52425 Juelich
mail : [email protected]
phone: +49 2461 61-1586
Fax : +49 2461 61-6103
http://www.fz-juelich.de/zb/wp
------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------
Forschungszentrum Juelich GmbH
52425 Juelich
Sitz der Gesellschaft: Juelich
Eingetragen im Handelsregister des Amtsgerichts Dueren Nr. HR B 3498
Vorsitzender des Aufsichtsrats: MinDir Dr. Karl Eugen Huthmacher
Geschaeftsfuehrung: Prof. Dr. Achim Bachem (Vorsitzender),
Karsten Beneke (stellv. Vorsitzender), Prof. Dr.-Ing. Harald Bolt,
Prof. Dr. Sebastian M. Schmidt
------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------