[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter

2013-01-14 Thread Martijn van Groningen (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552680#comment-13552680
 ] 

Martijn van Groningen commented on LUCENE-3931:
---

This makes sense to me.

 Adding d character to default ElisionFilter
 -

 Key: LUCENE-3931
 URL: https://issues.apache.org/jira/browse/LUCENE-3931
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/index
Reporter: David Pilato
Priority: Trivial

 As described in Wikipedia (http://fr.wikipedia.org/wiki/%C3%89lision), the "d"
 character is used in French as an elision character.
 E.g.: déclaration d'espèce
 So it would be useful to have it as a default elision token.
 {code:title=ElisionFilter.java|borderStyle=solid}
 private static final CharArraySet DEFAULT_ARTICLES =
     CharArraySet.unmodifiableSet(
         new CharArraySet(Version.LUCENE_CURRENT, Arrays.asList(
             "l", "m", "t", "qu", "n", "s", "j", "d"), true));
 {code}
 HTH
 David.
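
The effect of the proposed change can be illustrated with a small standalone sketch. It does not use Lucene: `ElisionSketch` and `stripElision` are hypothetical names invented for this illustration, and the logic is only an approximation of what an elision filter does (drop a leading article plus apostrophe when the article is in the configured set). Matching is case-insensitive here, on the assumption that the `true` argument in the snippet above means "ignore case".

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

public class ElisionSketch {
    // Article list from the issue, without and with the proposed "d".
    static final Set<String> ARTICLES_WITHOUT_D =
        new HashSet<>(Arrays.asList("l", "m", "t", "qu", "n", "s", "j"));
    static final Set<String> ARTICLES_WITH_D =
        new HashSet<>(Arrays.asList("l", "m", "t", "qu", "n", "s", "j", "d"));

    // Hypothetical helper, not Lucene's code: removes a leading "<article>'"
    // from the token when the lowercased article is in the given set.
    static String stripElision(String token, Set<String> articles) {
        int apostrophe = token.indexOf('\'');
        if (apostrophe <= 0) {
            return token; // no apostrophe, or nothing in front of it
        }
        String prefix = token.substring(0, apostrophe).toLowerCase(Locale.ROOT);
        return articles.contains(prefix) ? token.substring(apostrophe + 1) : token;
    }

    public static void main(String[] args) {
        String token = "d'esp\u00e8ce"; // the example token from the issue
        // Without "d" in the set, the token is printed unchanged.
        System.out.println(stripElision(token, ARTICLES_WITHOUT_D));
        // With "d" in the set, the "d'" prefix is removed.
        System.out.println(stripElision(token, ARTICLES_WITH_D));
    }
}
```

Passing the article set as a parameter keeps the sketch independent of any one language, so a different set can be supplied for languages with other elided forms.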

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter

2013-01-14 Thread Tommaso Teofili (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552686#comment-13552686
 ] 

Tommaso Teofili commented on LUCENE-3931:
-

That's true for Italian as well.

 Adding d character to default ElisionFilter
 -

 Key: LUCENE-3931
 URL: https://issues.apache.org/jira/browse/LUCENE-3931
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/index
Reporter: David Pilato
Assignee: Martijn van Groningen
Priority: Trivial




[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter

2013-01-14 Thread Steve Rowe (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552687#comment-13552687
 ] 

Steve Rowe commented on LUCENE-3931:


Because ElisionFilter is used by more than just French, the set of 
contractions was moved out of ElisionFilter (LUCENE-3884).

The issue of missing French contractions has already been addressed, in 
LUCENE-4662.

I didn't notice this issue - I would have resolved it when I resolved 
LUCENE-4662.

So Martijn, unless there is some other reason to keep this issue open, I think 
it can be resolved as a duplicate.




[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter

2013-01-14 Thread Steve Rowe (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552692#comment-13552692
 ] 

Steve Rowe commented on LUCENE-3931:


bq. that's true for Italian as well.

[ItalianAnalyzer|http://svn.apache.org/viewvc/lucene/dev/tags/lucene_solr_4_0_0/lucene/analysis/common/src/java/org/apache/lucene/analysis/it/ItalianAnalyzer.java?revision=1396952&view=markup#l53]
 includes "d" in the list of contractions it gives to ElisionFilter.





[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter

2013-01-14 Thread Tommaso Teofili (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552729#comment-13552729
 ] 

Tommaso Teofili commented on LUCENE-3931:
-

OK, thanks for clarifying, Steve.




[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter

2013-01-14 Thread Martijn van Groningen (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552761#comment-13552761
 ] 

Martijn van Groningen commented on LUCENE-3931:
---

I see. I'll close it.




[jira] [Commented] (LUCENE-3931) Adding d character to default ElisionFilter

2013-01-14 Thread David Pilato (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552764#comment-13552764
 ] 

David Pilato commented on LUCENE-3931:
--

Thanks all!
