https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Nick Clemens changed:
What|Removed |Added
See Also||https://bugs.koha-community
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Katrin Fischer changed:
What|Removed |Added
See Also|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #79 from David Cook ---
I have a library catalogue with French, English, and Arabic records. We're
using Zebra with ICU indexing, but it would be interesting to try out
Elasticsearch on that catalogue, especially with
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #78 from David Cook ---
An interesting case I bumped into recently was "Dewhurst's textbook of
obstetrics".
The library wants to be able to search for "Dewhursts textbook of obstetrics".
I think that I can understand
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #77 from David Cook ---
(In reply to Katrin Fischer from comment #76)
> I am not sure I follow that argument. We already have been ignoring ' with
> Zebra for many years and never got a bug filed about it. On the other
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #76 from Katrin Fischer ---
(In reply to David Cook from comment #75)
> (In reply to Janusz Kaczmarek from comment #74)
> > ... I mean the original apostrophe filter:
> >
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
David Cook changed:
What|Removed |Added
CC||dc...@prosentient.com.au
---
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #74 from Janusz Kaczmarek ---
... I mean the original apostrophe filter:
https://www.elastic.co/guide/en/elasticsearch/reference/8.7/analysis-apostrophe-tokenfilter.html
I.e. to have:
analyzer_standard:
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Janusz Kaczmarek changed:
What|Removed |Added
CC||janus...@gmail.com
---
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #72 from Katrin Fischer ---
I think Ere's patch would be a great start for French, and maybe we should bring
this bug back to just that and handle quotes separately?
As I said earlier, I think a lot of libraries have
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #71 from Nicolas Legrand ---
I was very happy with Ere's first proposal. Bug 14542 looks weirder to me.
--
You are receiving this mail because:
You are watching all bug changes.
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #70 from Ere Maijala ---
I find 14542 a bit scary, but perhaps I'm just paranoid. The problem with
changing apostrophes to spaces is that the elision is left dangling. In the
"l'avion" example you can then find the
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #69 from Katrin Fischer ---
(In reply to Ere Maijala from comment #68)
> (In reply to Katrin Fischer from comment #67)
> > (In reply to Katrin Fischer from comment #66)
> > > What would be the easiest option here to make
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #68 from Ere Maijala ---
(In reply to Katrin Fischer from comment #67)
> (In reply to Katrin Fischer from comment #66)
> > What would be the easiest option here to make everyone happy for now: Could
> > it be made a
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Katrin Fischer changed:
What|Removed |Added
See Also|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #67 from Katrin Fischer ---
(In reply to Katrin Fischer from comment #66)
> What would be the easiest option here to make everyone happy for now: Could
> it be made a configuration option somehow?
I just found bug 14542
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #66 from Katrin Fischer ---
What would be the easiest option here to make everyone happy for now: Could it
be made a configuration option somehow?
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Jennifer Millikan changed:
What|Removed |Added
CC||jmilli...@sja1840.org
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Simon Hohl changed:
What|Removed |Added
CC||simon.h...@dainst.org
---
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Janet McGowan changed:
What|Removed |Added
CC||janet.mcgowan@ptfs-europe.c
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
jmbroust changed:
What|Removed |Added
CC||jean-manuel.broust@univ-lyo
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Nicolas Legrand changed:
What|Removed |Added
Status|Needs Signoff |Failed QA
--- Comment
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Katrin Fischer changed:
What|Removed |Added
Status|Failed QA |Needs Signoff
--- Comment
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Katrin Fischer changed:
What|Removed |Added
Attachment #101307|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #61 from Katrin Fischer ---
(In reply to Ere Maijala from comment #60)
> I think the last patch "Allow to use multiple ES language analyzers" should
> be a separate issue. The fix for case-insensitivity in the second
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #60 from Ere Maijala ---
I think the last patch "Allow to use multiple ES language analyzers" should be
a separate issue. The fix for case-insensitivity in the second patch is valid
and necessary. I failed to even test
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Martin Renvoize changed:
What|Removed |Added
CC|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Katrin Fischer changed:
What|Removed |Added
CC|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #57 from Katrin Fischer ---
(In reply to Katrin Fischer from comment #56)
> +1 for default filtering. It's how it works now in the default and we
> haven't seen questions or complaints about this. To me it appears to be
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #56 from Katrin Fischer ---
+1 for default filtering. It's how it works now in the default and we haven't
seen questions or complaints about this. To me it appears to be the expected
behaviour.
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #55 from Ere Maijala ---
Indeed my proposal was just to add elision filtering to the default ES index
config without any code changes. The first patch still does that, but since
then there have been additional ideas
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Nick Clemens changed:
What|Removed |Added
Status|Signed Off |Failed QA
--- Comment #54
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Bouzid Fergani changed:
What|Removed |Added
Attachment #101310|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Bouzid Fergani changed:
What|Removed |Added
Attachment #101311|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Bouzid Fergani changed:
What|Removed |Added
CC||bouzid.ferg...@inlibro.com
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #52 from Bouzid Fergani ---
Created attachment 101310
-->
https://bugs.koha-community.org/bugzilla3/attachment.cgi?id=101310&action=edit
Bug 19482 - Add support for defining 'mandatory' mappings
To test:
1 - Apply patch
2 -
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #53 from Bouzid Fergani ---
Created attachment 101311
-->
https://bugs.koha-community.org/bugzilla3/attachment.cgi?id=101311&action=edit
Bug 19482: SCHEMA CHANGES _ DO NOT PUSH
Signed-off-by: Bouzid Fergani
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #51 from Bouzid Fergani ---
Created attachment 101309
-->
https://bugs.koha-community.org/bugzilla3/attachment.cgi?id=101309&action=edit
Bug 19482 - DB changes
Signed-off-by: Nicolas Legrand
Signed-off-by: Bouzid Fergani
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Bouzid Fergani changed:
What|Removed |Added
Attachment #98245|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Bouzid Fergani changed:
What|Removed |Added
Attachment #98246|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Bouzid Fergani changed:
What|Removed |Added
Attachment #98244|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Michal Denar changed:
What|Removed |Added
CC||blac...@gmail.com
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #47 from Michal Denar ---
Created attachment 98246
-->
https://bugs.koha-community.org/bugzilla3/attachment.cgi?id=98246&action=edit
Bug 21357: Allow to use multiple ES language analyzers
Test plan:
1. Apply this patch
2. If
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Michal Denar changed:
What|Removed |Added
Status|Needs Signoff |Signed Off
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #46 from Michal Denar ---
Created attachment 98245
-->
https://bugs.koha-community.org/bugzilla3/attachment.cgi?id=98245&action=edit
Bug 21357: Case-insensive articles to the elision filter
Test with :
GET index/_analyze
{
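The request body in the snippet above is cut off, and this sketch does not attempt to reconstruct it. As a rough illustration only, a body for the Elasticsearch `_analyze` API with an inline `elision` filter could be built like this. The helper name `build_analyze_body`, the article list and the sample text are assumptions made for the example, not taken from the patch.

```python
import json


def build_analyze_body(text, articles, case_insensitive=True):
    """Build a JSON body for the _analyze API using an inline elision filter.

    Hypothetical helper for illustration; not part of Koha or the patch.
    """
    return {
        "tokenizer": "standard",
        "filter": [
            {
                "type": "elision",
                # Articles to strip when followed by an apostrophe.
                "articles": list(articles),
                # When true, article matching ignores case ("L'" as well as "l'").
                "articles_case": case_insensitive,
            }
        ],
        "text": text,
    }


# Illustrative values only; a real test would use the index's own analyzer.
body = build_analyze_body("L'avion", ["l", "d"])
print(json.dumps(body, ensure_ascii=False, indent=2))
```

The printed JSON can be sent with `GET <index>/_analyze` from any HTTP client. Defining the filter inline keeps the experiment independent of whatever the index settings currently contain.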
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Michal Denar changed:
What|Removed |Added
Attachment #87901|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #44 from Nicolas Legrand ---
Julian, for non-Latin scripts we do a Latin transliteration, as all the ABES
partners in France are supposed to do. Look at
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #43 from Nick Clemens ---
(In reply to Ere Maijala from comment #42)
> At least with MARC 21 that's the language of the catalogued item, not the
> language of the metadata record, right? We've been through this with our
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #42 from Ere Maijala ---
At least with MARC 21 that's the language of the catalogued item, not the
language of the metadata record, right? We've been through this with our
discovery interface and had to give up trying to
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #41 from Julian Maurice ---
(In reply to Nick Clemens from comment #40)
> (In reply to Julian Maurice from comment #39)
>
> What about using a record field to determine the language of the record and
> applying
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #40 from Nick Clemens ---
(In reply to Julian Maurice from comment #39)
What about using a record field to determine the language of the record and
applying different analyser settings based on that? 008/35-37 or 041 in
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #39 from Julian Maurice ---
> because you'd need to avoid indexing into "wrong" fields which would
> require you to know what language the string to be indexed is in.
I don't think you have to avoid indexing into the
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #38 from Ere Maijala ---
(In reply to Julian Maurice from comment #37)
> Elision might not cause trouble (but what about names like "D'Amato"?).
> I'm thinking about the next step: stemming is very different from one
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #37 from Julian Maurice ---
(In reply to Katrin Fischer from comment #35)
> Hi Julian, does it mean it searches the different representations
> simultaneously?
Only one query to ES is needed, if that's what you mean by
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #36 from Ere Maijala ---
I can't really see the benefit since, as far as I can see, elision handling is
not prone to cause conflicts with other language analysis. Separating analysis
for different languages also won't
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #35 from Katrin Fischer ---
Hi Julian, does it mean it searches the different representations
simultaneously?
I am still not quite sure about 'French specific'.
It feels like every catalog containing some French
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Julian Maurice changed:
What|Removed |Added
Status|Failed QA |Needs Signoff
--- Comment
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Julian Maurice changed:
What|Removed |Added
CC|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #33 from Julian Maurice ---
Created attachment 98169
-->
https://bugs.koha-community.org/bugzilla3/attachment.cgi?id=98169&action=edit
Bug 21357: Allow to use multiple ES language analyzers
Test plan:
1. Apply this patch
2.
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #31 from Séverine Queune ---
(In reply to Fridolin SOMERS from comment #30)
> Ohhh playing with ES API with explain analyze :
> https://www.elastic.co/guide/en/elasticsearch/reference/current/
> _explain_analyze.html
>
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #30 from Fridolin SOMERS ---
Ohhh playing with ES API with explain analyze :
https://www.elastic.co/guide/en/elasticsearch/reference/current/_explain_analyze.html
I see that elision is only with the beginning of the
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Fridolin SOMERS changed:
What|Removed |Added
Attachment #87902|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #28 from Fridolin SOMERS ---
At last I'm going to work on this again.
I understand way more Elasticsearch now :D
I know now that we can use French elision on a non-French catalog, even with an
unwanted impact the
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #27 from Fridolin SOMERS ---
(In reply to Ere Maijala from comment #24)
> Created attachment 87902 [details] [review]
> Bug 21357: Add uppercase articles to the elision filter
I think the correct way is using :
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Séverine Queune changed:
What|Removed |Added
Status|Needs Signoff |Failed QA
--- Comment
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #25 from Ere Maijala ---
Right, thanks for pointing that out. I added uppercase versions for all
articles.
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Ere Maijala changed:
What|Removed |Added
Attachment #86670|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #24 from Ere Maijala ---
Created attachment 87902
-->
https://bugs.koha-community.org/bugzilla3/attachment.cgi?id=87902&action=edit
Bug 21357: Add uppercase articles to the elision filter
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Ere Maijala changed:
What|Removed |Added
Status|Signed Off |Needs Signoff
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #22 from Nicolas Legrand ---
This is a much-awaited feature for us. Note that it doesn't work for « L'avion », so
maybe we need something like this:
diff --git a/admin/searchengine/elasticsearch/index_config.yaml
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #21 from Björn Nylén ---
We signed off this bug as we think it's a good feature as is. We just recently
had a coworker noting this behaviour.
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Björn Nylén changed:
What|Removed |Added
CC||bjorn.ny...@ub.lu.se
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Björn Nylén changed:
What|Removed |Added
Attachment #79375|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Björn Nylén changed:
What|Removed |Added
Status|Needs Signoff |Signed Off
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Marjorie Barry-Vila changed:
What|Removed |Added
CC|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Nick Clemens changed:
What|Removed |Added
CC||n...@bywatersolutions.com
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Ere Maijala changed:
What|Removed |Added
Status|In Discussion |Needs Signoff
--- Comment
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #18 from Fridolin SOMERS ---
(In reply to Ere Maijala from comment #17)
> The apostrophe filter is for the other cases where an apostrophe is not a
> part of an elision. It's not stripped in the char filter like other
>
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #17 from Ere Maijala ---
The apostrophe filter is for the other cases where an apostrophe is not a part
of an elision. It's not stripped in the char filter like other punctuation so
that the elision filter can do its job
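The ordering point can be illustrated with a small Python model. This is a simplified sketch and not Elasticsearch's implementation: the article set is an illustrative subset, tokenizing is plain whitespace splitting, only the ASCII apostrophe is handled, and the final step that drops leftover apostrophes is an assumption about what a later filter might do.

```python
# Illustrative subset of French elided articles (an assumption for this sketch).
ARTICLES = {"l", "d", "j", "qu", "n", "s", "m", "t", "c"}


def strip_elision(token):
    """Drop a leading article plus apostrophe, e.g. "l'avion" -> "avion"."""
    head, sep, tail = token.partition("'")
    if sep and tail and head in ARTICLES:
        return tail
    return token


def analyze(text, strip_apostrophes_first):
    """Toy analysis chain: lowercase, optional early apostrophe removal, elision."""
    text = text.lower()
    if strip_apostrophes_first:
        # Models a char filter that turns apostrophes into spaces up front.
        text = text.replace("'", " ")
    tokens = [strip_elision(t) for t in text.split()]
    # Assumed later step: remove apostrophes that were not part of an elision.
    return [t.replace("'", "") for t in tokens]


print(analyze("l'avion", strip_apostrophes_first=True))
print(analyze("l'avion", strip_apostrophes_first=False))
print(analyze("Dewhurst's textbook", strip_apostrophes_first=False))
```

In this model, removing apostrophes before the elision step leaves a dangling `l` token, whereas keeping them until after elision yields only `avion`. A possessive such as "Dewhurst's" is untouched by elision and collapses to `dewhursts` in the last step. A name like "D'Amato" would lose its `d'` prefix here, which is the kind of side effect raised elsewhere in the thread.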
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Fridolin SOMERS changed:
What|Removed |Added
Attachment #79564|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #16 from Fridolin SOMERS ---
Ok I will propose the stemming configuration on wiki page.
This patch is OK, but for me the "apostrophe" filter is not needed.
I've checked with the analyze API on ES:
curl -X GET
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #15 from Katrin Fischer ---
> OK maybe we're weird, nonetheless I think most University libraries have
> books in a lot of different languages and not just one or two.
Exactly my point too - you are not weird!
Not
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #14 from Nicolas Legrand ---
Ay up friends,
We are a language library and we have more than 350 different languages and
lots of writing systems. Our interface is only in French (the language of our
country) or in
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #13 from Katrin Fischer ---
(In reply to Katrin Fischer from comment #12)
> Serious question: Why would we want stop words?
To explain: to me stop words seem to stem from a time where search would die
searching for
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #12 from Katrin Fischer ---
Serious question: Why would we want stop words?
___
Koha-bugs mailing
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #11 from Fridolin SOMERS ---
(In reply to Katrin Fischer from comment #10)
> I think ideally things should be as easy as possible. I think all of our
> libraries have materials in various languages - the Goethe institute
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #10 from Katrin Fischer ---
I think ideally things should be as easy as possible. I think all of our
libraries have materials in various languages - the Goethe institute libraries
being a prominent example. I don't want
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #9 from Fridolin SOMERS ---
(In reply to Katrin Fischer from comment #8)
> (In reply to Fridolin SOMERS from comment #6)
> > (In reply to Ere Maijala from comment #5)
> > > If it's decided this shouldn't be part of the
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #8 from Katrin Fischer ---
(In reply to Fridolin SOMERS from comment #6)
> (In reply to Ere Maijala from comment #5)
> > If it's decided this shouldn't be part of the default rules, we could
> > perhaps still include
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #7 from Fridolin SOMERS ---
Created attachment 79564
-->
https://bugs.koha-community.org/bugzilla3/attachment.cgi?id=79564&action=edit
french ES index config
We could add this in etc/elasticsearch/fr or etc/searchengine/fr ?
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #6 from Fridolin SOMERS ---
(In reply to Ere Maijala from comment #5)
> If it's decided this shouldn't be part of the default rules, we could
> perhaps still include these as a commented-out example, right?
I'd prefer
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
--- Comment #5 from Ere Maijala ---
If it's decided this shouldn't be part of the default rules, we could perhaps
still include these as a commented-out example, right?
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Katrin Fischer changed:
What|Removed |Added
CC||katrin.fisc...@bsz-bw.de
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Fridolin SOMERS changed:
What|Removed |Added
Status|Signed Off |In Discussion
---
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Séverine Queune changed:
What|Removed |Added
CC||nicolas.legr...@bulac.fr
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Séverine Queune changed:
What|Removed |Added
Attachment #78889|0 |1
is obsolete|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Séverine Queune changed:
What|Removed |Added
Status|Needs Signoff |Signed Off
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Fridolin SOMERS changed:
What|Removed |Added
CC||severine.que...@bulac.fr
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Fridolin SOMERS changed:
What|Removed |Added
CC|
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=21357
Ere Maijala changed:
What|Removed |Added
See Also||https://bugs.koha-community