[jira] Resolved: (LUCENE-1099) Making Tokenizer.reset(Reader) public

2007-12-26 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen resolved LUCENE-1099. - Resolution: Fixed (http://www.nabble.com/Making-Tokenizer.reset%28Reader%29-public-tp14499883p14

[jira] Commented: (LUCENE-770) CfsExtractor tool

2007-12-26 Thread Daniel Naber (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554405 ] Daniel Naber commented on LUCENE-770: - Otis, I've used it just once and noticed the problem. I'm not sure how to

Re: [jira] Commented: (LUCENE-1068) Invalid behavior of StandardTokenizerImpl

2007-12-26 Thread Grant Ingersoll
Yeah, I think it can be made a separate issue. -Grant On Dec 23, 2007, at 2:36 AM, Shai Erera (JIRA) wrote: [ https://issues.apache.org/jira/browse/LUCENE-1068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554174 ] Shai Erera commented on LUCENE-1068: -

[jira] Created: (LUCENE-1100) StandardTokenizer incorrectly types certain values

2007-12-26 Thread Grant Ingersoll (JIRA)
StandardTokenizer incorrectly types certain values -- Key: LUCENE-1100 URL: https://issues.apache.org/jira/browse/LUCENE-1100 Project: Lucene - Java Issue Type: Bug Components: Analys

[jira] Created: (LUCENE-1101) Tokenizers should reset positionIncrement to 1 in their next(Token result)

2007-12-26 Thread Doron Cohen (JIRA)
Tokenizers should reset positionIncrement to 1 in their next(Token result) --- Key: LUCENE-1101 URL: https://issues.apache.org/jira/browse/LUCENE-1101 Project: Lucene - Java

[jira] Updated: (LUCENE-1101) Tokenizers should reset positionIncrement to 1 in their next(Token result)

2007-12-26 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-1101: Attachment: lucene-1101.patch Simple patch fixing this. Planning to commit this shortly if no objec

[jira] Commented: (LUCENE-1095) StopFilter should have option to incr positionIncrement after stop word

2007-12-26 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554472 ] Doron Cohen commented on LUCENE-1095: - OK, I separated the tokenizers fix to LUCENE-1101. Patch here stands alone

[jira] Commented: (LUCENE-1101) Tokenizers should reset positionIncrement to 1 in their next(Token result)

2007-12-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554473 ] Yonik Seeley commented on LUCENE-1101: -- Resetting tokens (including the position) is currently handled via Tok

[jira] Commented: (LUCENE-1101) Tokenizers should reset positionIncrement to 1 in their next(Token result)

2007-12-26 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554474 ] Doron Cohen commented on LUCENE-1101: - I think I checked that and found that Token.clear() is too strong to be i

[jira] Commented: (LUCENE-1101) Tokenizers should reset positionIncrement to 1 in their next(Token result)

2007-12-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554481 ] Yonik Seeley commented on LUCENE-1101: -- In what cases? The protocol we are currently using requires that the c

[jira] Commented: (LUCENE-1101) Tokenizers should reset positionIncrement to 1 in their next(Token result)

2007-12-26 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554482 ] Doron Cohen commented on LUCENE-1101: - Currently Token.clear() is used only for un-tokenized fields in DocmentsW

[jira] Updated: (LUCENE-1101) Tokenizers should reset positionIncrement to 1 in their next(Token result)

2007-12-26 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-1101: Attachment: lucene-1101.patch Updated patch - Tokenizers now calling clear() as suggested. This se

SinkTokenizer: next(Token) vs. next()

2007-12-26 Thread Doron Cohen
Working on LUCENE-1101, I checked whether SinkTokenizer.next(Token) should also call Token.clear(). (It shouldn't, because it ignores the input token.) However, I think that calls to next() would end up creating Tokens for nothing (by TokenStream.next()). May currently be an empty case (if all current u

site javadocs link broken

2007-12-26 Thread Doron Cohen
Is the site javadocs link broken, or is it just me? Doron

[jira] Commented: (LUCENE-1101) Tokenizers should reset positionIncrement to 1 in their next(Token result)

2007-12-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554520 ] Yonik Seeley commented on LUCENE-1101: -- {quote}Currently Token.clear() is used only for un-tokenized fields in

[jira] Commented: (LUCENE-1101) Tokenizers should reset positionIncrement to 1 in their next(Token result)

2007-12-26 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554533 ] Doron Cohen commented on LUCENE-1101: - {quote} I think it's used for both tokenized and un-tokenized see lin