On Tue, 22 Mar 2016 08:38:31 -0700, Debin, Infant Jerald (LNG-CON) <[email protected]> wrote:
> Hi Team,
>
> When we give the French ligature (Æ and æ) in our word query, it is not
> getting recognized as ligature and no results are returned.
>
> But when we use ligature (Œ and œ) in our word query, it is getting
> recognized as ligature and results are returned.
>
> Is there any limitation for Æ and æ French ligature in Marklogic?
>
> To make it work is there any workaround needs to be done?
>
> Thanks and Regards,
>
> Debin

An unstemmed search would not match 'æ' to 'ae' because those are different codepoints that are not related via a decomposition mapping (ditto for 'œ' and 'oe', for that matter).

A stemmed search may produce the same stem for words that have 'æ' vs 'ae', depending on the word or the language, and therefore produce a match, but it may not. Since French rarely uses the 'æ' ligature but uses 'œ' routinely, it is likely that the French stemmer routinely maps 'œ' to 'oe' in stems but does not do the same for 'æ'. It may also vary word by word, so it could be that for some (likely more common) words it does the mapping and for others it doesn't.

Your workaround is either to normalize your content and/or queries yourself, folding 'æ' to 'ae' explicitly (NFKD normalization alone leaves 'æ' unchanged, since there is no decomposition mapping for it), or to add the specific words to your custom dictionary.

//Mary
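
For illustration, here is a minimal Python sketch of the normalization workaround, assuming you can preprocess text outside MarkLogic before loading content or building queries. The names LIGATURE_FOLDS and fold_ligatures are hypothetical and not part of any MarkLogic API. It applies NFKD first and then folds the ligatures by hand, since Unicode defines no decomposition for 'æ' or 'œ'.

```python
import unicodedata

# Hypothetical folding table (an assumption for this sketch); extend it with
# whatever other characters your content needs folded.
LIGATURE_FOLDS = str.maketrans({
    "Æ": "AE",
    "æ": "ae",
    "Œ": "OE",
    "œ": "oe",
})


def fold_ligatures(text: str) -> str:
    """Apply NFKD, then fold the ligatures that NFKD leaves untouched."""
    decomposed = unicodedata.normalize("NFKD", text)
    return decomposed.translate(LIGATURE_FOLDS)


if __name__ == "__main__":
    # NFKD by itself does not change 'æ' or 'œ'.
    print(unicodedata.normalize("NFKD", "æ") == "æ")
    print(unicodedata.normalize("NFKD", "œ") == "œ")
    # The explicit fold maps them to their two-letter forms.
    print(fold_ligatures("Lætitia et sœur"))
```

Note that NFKD also splits accented letters into a base letter plus combining marks, so the same function has to be applied to both content and queries for the two sides to stay comparable.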
