[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter
[ https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13552680#comment-13552680 ] Martijn van Groningen commented on LUCENE-3931: --- This makes sense to me. Adding d character to default ElisionFilter - Key: LUCENE-3931 URL: https://issues.apache.org/jira/browse/LUCENE-3931 Project: Lucene - Core Issue Type: Improvement Components: core/index Reporter: David Pilato Priority: Trivial As described in Wikipedia (http://fr.wikipedia.org/wiki/%C3%89lision), the d character is used in french as an elision character. E.g.: déclaration d'espèce So, it would be useful to have it as a default elision token. {code:title=ElisionFilter.java|borderStyle=solid} private static final CharArraySet DEFAULT_ARTICLES = CharArraySet.unmodifiableSet( new CharArraySet(Version.LUCENE_CURRENT, Arrays.asList( l, m, t, qu, n, s, j, d), true)); {code} HTH David. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter
[ https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13552686#comment-13552686 ] Tommaso Teofili commented on LUCENE-3931: - that's true for Italian as well. Adding d character to default ElisionFilter - Key: LUCENE-3931 URL: https://issues.apache.org/jira/browse/LUCENE-3931 Project: Lucene - Core Issue Type: Improvement Components: core/index Reporter: David Pilato Assignee: Martijn van Groningen Priority: Trivial As described in Wikipedia (http://fr.wikipedia.org/wiki/%C3%89lision), the d character is used in french as an elision character. E.g.: déclaration d'espèce So, it would be useful to have it as a default elision token. {code:title=ElisionFilter.java|borderStyle=solid} private static final CharArraySet DEFAULT_ARTICLES = CharArraySet.unmodifiableSet( new CharArraySet(Version.LUCENE_CURRENT, Arrays.asList( l, m, t, qu, n, s, j, d), true)); {code} HTH David. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter
[ https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13552687#comment-13552687 ] Steve Rowe commented on LUCENE-3931: Because ElisionFilter use is used by more than just French, the set of contractions was moved out of ElisionFilter (LUCENE-3884). The issue of missing French contractions has already been addressed, in LUCENE-4662. I didn't notice this issue - I would have resolved it when I resolved LUCENE-4662. So Martijn, unless there is some other reason to keep this issue open, I think it can be resolved as a duplicate. Adding d character to default ElisionFilter - Key: LUCENE-3931 URL: https://issues.apache.org/jira/browse/LUCENE-3931 Project: Lucene - Core Issue Type: Improvement Components: core/index Reporter: David Pilato Assignee: Martijn van Groningen Priority: Trivial As described in Wikipedia (http://fr.wikipedia.org/wiki/%C3%89lision), the d character is used in french as an elision character. E.g.: déclaration d'espèce So, it would be useful to have it as a default elision token. {code:title=ElisionFilter.java|borderStyle=solid} private static final CharArraySet DEFAULT_ARTICLES = CharArraySet.unmodifiableSet( new CharArraySet(Version.LUCENE_CURRENT, Arrays.asList( l, m, t, qu, n, s, j, d), true)); {code} HTH David. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter
[ https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13552692#comment-13552692 ] Steve Rowe commented on LUCENE-3931: bq. that's true for Italian as well. [ItalianAnalyzer|http://svn.apache.org/viewvc/lucene/dev/tags/lucene_solr_4_0_0/lucene/analysis/common/src/java/org/apache/lucene/analysis/it/ItalianAnalyzer.java?revision=1396952view=markup#l53] includes d in the list of contractions it gives to ElisionFilter. Adding d character to default ElisionFilter - Key: LUCENE-3931 URL: https://issues.apache.org/jira/browse/LUCENE-3931 Project: Lucene - Core Issue Type: Improvement Components: core/index Reporter: David Pilato Assignee: Martijn van Groningen Priority: Trivial As described in Wikipedia (http://fr.wikipedia.org/wiki/%C3%89lision), the d character is used in french as an elision character. E.g.: déclaration d'espèce So, it would be useful to have it as a default elision token. {code:title=ElisionFilter.java|borderStyle=solid} private static final CharArraySet DEFAULT_ARTICLES = CharArraySet.unmodifiableSet( new CharArraySet(Version.LUCENE_CURRENT, Arrays.asList( l, m, t, qu, n, s, j, d), true)); {code} HTH David. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter
[ https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13552729#comment-13552729 ] Tommaso Teofili commented on LUCENE-3931: - ok, thanks for clarifying Steve. Adding d character to default ElisionFilter - Key: LUCENE-3931 URL: https://issues.apache.org/jira/browse/LUCENE-3931 Project: Lucene - Core Issue Type: Improvement Components: core/index Reporter: David Pilato Assignee: Martijn van Groningen Priority: Trivial As described in Wikipedia (http://fr.wikipedia.org/wiki/%C3%89lision), the d character is used in french as an elision character. E.g.: déclaration d'espèce So, it would be useful to have it as a default elision token. {code:title=ElisionFilter.java|borderStyle=solid} private static final CharArraySet DEFAULT_ARTICLES = CharArraySet.unmodifiableSet( new CharArraySet(Version.LUCENE_CURRENT, Arrays.asList( l, m, t, qu, n, s, j, d), true)); {code} HTH David. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter
[ https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13552761#comment-13552761 ] Martijn van Groningen commented on LUCENE-3931: --- I see. I'll close it. Adding d character to default ElisionFilter - Key: LUCENE-3931 URL: https://issues.apache.org/jira/browse/LUCENE-3931 Project: Lucene - Core Issue Type: Improvement Components: core/index Reporter: David Pilato Assignee: Martijn van Groningen Priority: Trivial As described in Wikipedia (http://fr.wikipedia.org/wiki/%C3%89lision), the d character is used in french as an elision character. E.g.: déclaration d'espèce So, it would be useful to have it as a default elision token. {code:title=ElisionFilter.java|borderStyle=solid} private static final CharArraySet DEFAULT_ARTICLES = CharArraySet.unmodifiableSet( new CharArraySet(Version.LUCENE_CURRENT, Arrays.asList( l, m, t, qu, n, s, j, d), true)); {code} HTH David. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter
[ https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13552764#comment-13552764 ] David Pilato commented on LUCENE-3931: -- Thanks all! Adding d character to default ElisionFilter - Key: LUCENE-3931 URL: https://issues.apache.org/jira/browse/LUCENE-3931 Project: Lucene - Core Issue Type: Improvement Components: core/index Reporter: David Pilato Priority: Trivial As described in Wikipedia (http://fr.wikipedia.org/wiki/%C3%89lision), the d character is used in french as an elision character. E.g.: déclaration d'espèce So, it would be useful to have it as a default elision token. {code:title=ElisionFilter.java|borderStyle=solid} private static final CharArraySet DEFAULT_ARTICLES = CharArraySet.unmodifiableSet( new CharArraySet(Version.LUCENE_CURRENT, Arrays.asList( l, m, t, qu, n, s, j, d), true)); {code} HTH David. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org