Hi,
Sorry I forgot the most important : i use lucene 3.6.
Here is my code : tokenStream = new ElisionFilter(Version.LUCENE_36,
tokenStream);
I looked at the source code of ElisionFilter, and DEFAULT_ARTICLES
doesn't contain "d" and "c", in order to manage terms like /"d'une/" or
"/c'est"/.
A possible workaround would be to call this constructor
ElisionFilter(Version matchVersion, TokenStream input, Set<?> articles).
But i don't understand why this "d" and "c" are not present in default
articles.
Yann.
On 07/26/2012 03:52 AM, Jack Krupansky wrote:
The filter should work (remove the letter and apostrophe).
Could you supply an exact code fragment that shows the literal term,
the code invoking the filter, and the exact literal output?
And, which release of Lucene?
-- Jack Krupansky
-----Original Message----- From: yamo93
Sent: Wednesday, July 25, 2012 9:56 AM
To: [email protected]
Subject: Re: Question on ElisionFilter with d'
Thanks for replying,
The problem is that the filter don't remove d' (and c' too).
Shall i open an issue on jira ?
On 07/25/2012 04:36 PM, Ian Lea wrote:
I bet it's expected. From http://en.wikipedia.org/wiki/Elision_(French)
In written French, elision (both phonetic and orthographic) is
obligatory for the following words:
...
the preposition de
...
Le père d'Albert vient d'arriver.
So surely the removal of d' is correct.
--
Ian.
On Wed, Jul 25, 2012 at 2:01 PM, yamo93 <[email protected]> wrote:
Hello,
I'm using ElisionFilter to index french text.
The filter works but ignore the d letter followed by an apostrophe
(example:
d'une).
Is-it an expected behaviour or is it an issue ?
Regards,
Yann.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]