[jira] Created: (LUCENE-1231) Column-stride fields (aka per-document Payloads)

2008-03-13 Thread Michael Busch (JIRA)
Column-stride fields (aka per-document Payloads) Key: LUCENE-1231 URL: https://issues.apache.org/jira/browse/LUCENE-1231 Project: Lucene - Java Issue Type: New Feature Components: In

[jira] Updated: (LUCENE-1230) Source release files missing the *.pom.template files

2008-03-13 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-1230: -- Attachment: lucene-1230.patch > Source release files missing the *.pom.template files > --

Hudson build is back to normal: Lucene-trunk #401

2008-03-13 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Lucene-trunk/401/changes

[jira] Created: (LUCENE-1230) Source release files missing the *.pom.template files

2008-03-13 Thread Michael Busch (JIRA)
Source release files missing the *.pom.template files - Key: LUCENE-1230 URL: https://issues.apache.org/jira/browse/LUCENE-1230 Project: Lucene - Java Issue Type: Bug Components:

[jira] Commented: (LUCENE-1202) Clover setup currently has some problems

2008-03-13 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578615#action_12578615 ] Hoss Man commented on LUCENE-1202: -- bq. Does the Clover license allow instrumenting non a

Re: Build failed in Hudson: Lucene-trunk #400

2008-03-13 Thread Chris Hostetter
: [gsingers] LUCENE-550: Added RAMDirectory alternative as a contrib. : Similar to MemoryIndex, but can hold more than one document ... : compile-core: : [mkdir] Created dir: http://hudson.zones.apache.org/hudson/job/Lucene-trunk/ws/trunk/build/contrib/instantiated/classes/java

[jira] Updated: (LUCENE-1229) NGramTokenFilter optimization in query phase

2008-03-13 Thread Hiroaki Kawai (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hiroaki Kawai updated LUCENE-1229: -- Attachment: NGramTokenFilter.patch Added a patch NGramTokenFilter.patch. This patch includes L

[jira] Created: (LUCENE-1229) NGramTokenFilter optimization in query phase

2008-03-13 Thread Hiroaki Kawai (JIRA)
NGramTokenFilter optimization in query phase Key: LUCENE-1229 URL: https://issues.apache.org/jira/browse/LUCENE-1229 Project: Lucene - Java Issue Type: Improvement Components: contrib/*

Build failed in Hudson: Lucene-trunk #400

2008-03-13 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Lucene-trunk/400/changes Changes: [buschmi] LUCENE-1226: Fixed IndexWriter.addIndexes(IndexReader[]) to commit successfully created compound files. [gsingers] LUCENE-550: put the comment in the wrong spot [gsingers] LUCENE-550: Added RAMDirectory

[jira] Updated: (LUCENE-1187) Things to be done now that Filter is independent from BitSet

2008-03-13 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1187: Attachment: ChainedFilterAndCachingFilterTest.patch Test that now fails with ChainedFilter. {code

[jira] Commented: (LUCENE-1228) IndexWriter.commit() does not update the index version

2008-03-13 Thread Ning Li (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578518#action_12578518 ] Ning Li commented on LUCENE-1228: - Does SegmentInfos really need both "version" and "gener

[jira] Resolved: (LUCENE-1226) IndexWriter.addIndexes(IndexReader[]) fails to create compound files

2008-03-13 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch resolved LUCENE-1226. --- Resolution: Fixed Lucene Fields: [New, Patch Available] (was: [Patch Available, New])

[jira] Commented: (LUCENE-1228) IndexWriter.commit() does not update the index version

2008-03-13 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578475#action_12578475 ] Doron Cohen commented on LUCENE-1228: - Oh good, less migration to do. Mmm.. so it is n

[jira] Updated: (LUCENE-1228) IndexWriter.commit() does not update the index version

2008-03-13 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-1228: Lucene Fields: [New, Patch Available] (was: [Patch Available, New]) Affects Version/s:

[jira] Commented: (LUCENE-1228) IndexWriter.commit() does not update the index version

2008-03-13 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578472#action_12578472 ] Michael McCandless commented on LUCENE-1228: Good catch Doron, thanks! This o

[jira] Updated: (LUCENE-1228) IndexWriter.commit() does not update the index version

2008-03-13 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-1228: Attachment: lucene-1228-commit-reopen.patch Patch with a fix + a test that demonstrates the bug. A

[jira] Created: (LUCENE-1228) IndexWriter.commit() does not update the index version

2008-03-13 Thread Doron Cohen (JIRA)
IndexWriter.commit() does not update the index version --- Key: LUCENE-1228 URL: https://issues.apache.org/jira/browse/LUCENE-1228 Project: Lucene - Java Issue Type: Bug Componen

Re: an API for synonym in Lucene-core

2008-03-13 Thread J. Delgado
Mathieu, Have you thought about incorporating a standard thesaurus format, and thus a standard format for query/index expansion? Here is the recommendation from NISO: http://www.niso.org/committees/MT-info.html Beyond synonyms, having the capabilities to specify the use of BT (broader terms or Hypernyms) or NT

Should Document.getFieldables really return null

2008-03-13 Thread Stefan Trcek
Hello The 'Document.getFieldables(String name)' is documented to return 'null' in some cases (and really does, see the code below). However, this imposes a penalty on the client, as code like this Document doc = hits.doc(i); for (Fieldable f: doc.getFieldables("somefield")) { Syst
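To illustrate the concern raised in this message: in Java, an enhanced for loop over a null array throws a NullPointerException, so a caller has to guard every such loop. The sketch below is a minimal, self-contained illustration of one client-side workaround. It does not use Lucene itself; `lookup` is a hypothetical stand-in for a method that returns null when nothing matches, and `emptyIfNull` is an assumed helper name, not part of any Lucene API.

```java
public class NullSafeFields {

    /** Returns the given array unchanged, or an empty array when it is null. */
    public static String[] emptyIfNull(String[] values) {
        return values == null ? new String[0] : values;
    }

    /**
     * Hypothetical stand-in for a lookup that returns null when no value
     * exists under the given name (the behavior the message describes).
     */
    public static String[] lookup(String name) {
        return "title".equals(name) ? new String[] {"Lucene in Action"} : null;
    }

    public static void main(String[] args) {
        // Without the wrapper, this loop would throw a NullPointerException,
        // because lookup("somefield") returns null.
        for (String value : emptyIfNull(lookup("somefield"))) {
            System.out.println(value);
        }

        // A name that does exist iterates normally.
        for (String value : emptyIfNull(lookup("title"))) {
            System.out.println(value);
        }
    }
}
```

Returning an empty array from the lookup itself would remove the need for the wrapper at every call site, which appears to be the question the message is asking.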

[jira] Updated: (LUCENE-1227) NGramTokenizer to handle more than 1024 chars

2008-03-13 Thread Hiroaki Kawai (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hiroaki Kawai updated LUCENE-1227: -- Attachment: NGramTokenizer.patch Bugfix: I had made a mistake in the char array addressing. >

[jira] Resolved: (LUCENE-550) InstantiatedIndex - faster but memory consuming index

2008-03-13 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved LUCENE-550. Resolution: Fixed Committed revision 636745. Thanks Karl! > InstantiatedIndex - faster bu

[jira] Assigned: (LUCENE-1227) NGramTokenizer to handle more than 1024 chars

2008-03-13 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned LUCENE-1227: --- Assignee: Grant Ingersoll > NGramTokenizer to handle more than 1024 chars >

[jira] Updated: (LUCENE-1227) NGramTokenizer to handle more than 1024 chars

2008-03-13 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated LUCENE-1227: Priority: Minor (was: Major) Lucene Fields: [New, Patch Available] (was: [Patch

[jira] Assigned: (LUCENE-1224) NGramTokenFilter creates bad TokenStream

2008-03-13 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned LUCENE-1224: --- Assignee: Grant Ingersoll > NGramTokenFilter creates bad TokenStream > -

[jira] Commented: (LUCENE-1219) support array/offset/ length setters for Field with binary data

2008-03-13 Thread Eks Dev (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578253#action_12578253 ] Eks Dev commented on LUCENE-1219: - >>Eks can you see if the changes look OK? Thanks. It lo

[jira] Updated: (LUCENE-1224) NGramTokenFilter creates bad TokenStream

2008-03-13 Thread Hiroaki Kawai (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hiroaki Kawai updated LUCENE-1224: -- Attachment: NGramTokenFilter.patch Modified to set the correct start/end offset value in Token pro

[jira] Updated: (LUCENE-1029) Illegal character replacements in ISOLatin1AccentFilter

2008-03-13 Thread Hiroaki Kawai (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hiroaki Kawai updated LUCENE-1029: -- Attachment: ISOLatin1AccentFilter-by-Collator.patch Wrote a patch that uses java.text.Collator.

Re: an API for synonym in Lucene-core

2008-03-13 Thread Mathieu Lecarme
I'll slice my contrib in small parts Synonyms 1) Synonym (Token + a weight) 2) Synonym provider from OO.o thesaurus 3) SynonymTokenFilter 4) Query expander which applies a filter (and a boost) on each of its TermQuery 5) a Synonym filter for the query expander 6) to be efficient, Synonym can be excl
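As a rough illustration of items 1, 2, and 4 in the list above, the sketch below models a synonym as a term plus a weight and expands a query term into boosted clauses. It is not the contribution's actual code: the class and method names (`Synonym`, `SynonymProvider`, `expand`) are hypothetical, plain strings stand in for Lucene's Token and TermQuery, and the `term^boost` output only mirrors the boost notation of Lucene's query syntax.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class SynonymExpansionSketch {

    /** Item 1: a synonym is a term plus a weight (hypothetical class). */
    public static final class Synonym {
        public final String term;
        public final float weight;

        public Synonym(String term, float weight) {
            this.term = term;
            this.weight = weight;
        }
    }

    /** Item 2: a source of synonyms, e.g. backed by a thesaurus file (hypothetical interface). */
    public interface SynonymProvider {
        List<Synonym> synonymsFor(String term);
    }

    /**
     * Item 4: expand one query term into the original term followed by
     * its synonyms, each written as "term^weight" so the weight acts as a boost.
     */
    public static List<String> expand(String term, SynonymProvider provider) {
        List<String> clauses = new ArrayList<String>();
        clauses.add(term);
        for (Synonym synonym : provider.synonymsFor(term)) {
            clauses.add(synonym.term + "^" + synonym.weight);
        }
        return clauses;
    }

    public static void main(String[] args) {
        // A toy provider that knows a single synonym for every term.
        SynonymProvider provider =
            term -> Collections.singletonList(new Synonym("car", 0.5f));
        System.out.println(expand("automobile", provider));
    }
}
```

Keeping the provider behind an interface would let the OO.o thesaurus source in item 2 be swapped for another source without touching the expander.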

[jira] Commented: (LUCENE-1219) support array/offset/ length setters for Field with binary data

2008-03-13 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578199#action_12578199 ] Michael McCandless commented on LUCENE-1219: Alas, I'm not really happy with i

[jira] Updated: (LUCENE-1219) support array/offset/ length setters for Field with binary data

2008-03-13 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1219: --- Attachment: LUCENE-1219.take2.patch OK I updated the patch: * Added a ctor to Fie

[jira] Commented: (LUCENE-1029) Illegal character replacements in ISOLatin1AccentFilter

2008-03-13 Thread Hiroaki Kawai (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578195#action_12578195 ] Hiroaki Kawai commented on LUCENE-1029: --- I'd like to comment that we have another to

Re: Build failed in Hudson: Lucene-trunk #399

2008-03-13 Thread Michael McCandless
Looks like Oracle finally shut off the old download location from Sleepycat. It's been moved to a new location. I'll commit a fix shortly. Mike On Mar 12, 2008, at 10:12 PM, Apache Hudson Server wrote: See http://hudson.zones.apache.org/hudson/job/Lucene-trunk/399/changes Changes: [m

[jira] Commented: (LUCENE-1227) NGramTokenizer to handle more than 1024 chars

2008-03-13 Thread Hiroaki Kawai (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578182#action_12578182 ] Hiroaki Kawai commented on LUCENE-1227: --- LUCENE-1227's NGramTokenizer.patch will als

[jira] Updated: (LUCENE-1227) NGramTokenizer to handle more than 1024 chars

2008-03-13 Thread Hiroaki Kawai (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hiroaki Kawai updated LUCENE-1227: -- Attachment: NGramTokenizer.patch > NGramTokenizer to handle more than 1024 chars > ---

[jira] Created: (LUCENE-1227) NGramTokenizer to handle more than 1024 chars

2008-03-13 Thread Hiroaki Kawai (JIRA)
NGramTokenizer to handle more than 1024 chars - Key: LUCENE-1227 URL: https://issues.apache.org/jira/browse/LUCENE-1227 Project: Lucene - Java Issue Type: Improvement Components: contrib/

[jira] Commented: (LUCENE-1161) Punctuation handling in StandardTokenizer (and WikipediaTokenizer)

2008-03-13 Thread Hiroaki Kawai (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578179#action_12578179 ] Hiroaki Kawai commented on LUCENE-1161: --- I think WhitespaceTokenizer + WordDelimiter