https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
--- Comment #12 from David Cook ---
It would be interesting to look at this in an Elasticsearch context...
--
You are receiving this mail because:
You are watching all bug changes.
___
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
mathieu saby changed:
What|Removed |Added
CC||mathsaby...@gmail.com
--
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
--- Comment #11 from David Cook ---
Of course, some characters also have special significance in the CCL query
language as well, so even getting them to Zebra would be part of the first
step.
Not a simple problem by any stretch.
--
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
--- Comment #10 from David Cook ---
Oh I've had some fun playing with ICU...
chain.xml:
echo -n '.NET. test' | yaz-icu -c chain.xml
1 1 '.net'' '.NET''
2 1 'test' 'test'
--
Here we tokenize based on the line break
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
--- Comment #9 from David Cook ---
(In reply to Katrin Fischer from comment #3)
> Would be interesting to test with Elasticsearch.
I agree.
It looks like a bit of a known issue:
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
--- Comment #8 from David Cook ---
(In reply to David Cook from comment #5)
> With words in ICU, it
> looks like we might only be removing punctuation that is preceded by
> whitespace (which seems to imitate the example chain.xml
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
--- Comment #7 from David Cook ---
After some experimenting, it seems YAZ ICU will tokenize based on the "+"
without any normalization when using the "line" tokenize rule:
echo -n "C++" | yaz-icu -c chain.xml
1 1 'c+' 'C+'
2 1 '+'
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
--- Comment #6 from David Cook ---
Hmm nope...
echo "C++" | yaz-icu -c words-icu.xml
1 1 'c+' 'C+'
2 1 '+' '+'
echo "C#" | yaz-icu -c words-icu.xml
1 1 'c' 'C'
echo ".NET" | yaz-icu -c words-icu.xml
1 1 'net' 'NET'
--
You are
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
David Cook changed:
What|Removed |Added
CC||dc...@prosentient.com.au
---
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
--- Comment #4 from Marjorie Barry-Vila ---
Still valid with Zebra and ElasticSearch in 20.05
Marjorie
--
You are receiving this mail because:
You are watching all bug changes.
___
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
--- Comment #3 from Katrin Fischer ---
Would be interesting to test with Elasticsearch.
--
You are receiving this mail because:
You are watching all bug changes.
___
Koha-bugs mailing
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
Patrick Robitaille changed:
What|Removed |Added
CC|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
Marjorie Barry-Vila changed:
What|Removed |Added
CC|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
--- Comment #2 from Marjorie Barry-Vila ---
Still an issue in 16.11.
Marjorie
--
You are receiving this mail because:
You are watching all bug changes.
http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9729
Katrin Fischer katrin.fisc...@bsz-bw.de changed:
What|Removed |Added
CC|
15 matches
Mail list logo