[jira] Updated: (LUCENE-559) Turkish Analyzer for Lucene

2006-06-15 Thread Emre Bayram (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-559?page=all ] Emre Bayram updated LUCENE-559: --- Attachment: IndexFiles.java > Turkish Analyzer for Lucene > --- > > Key: LUCENE-559 > URL: http://issues.apache.org/jira

[jira] Updated: (LUCENE-559) Turkish Analyzer for Lucene

2006-06-15 Thread Emre Bayram (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-559?page=all ] Emre Bayram updated LUCENE-559: --- Attachment: SearchFiles.java > Turkish Analyzer for Lucene > --- > > Key: LUCENE-559 > URL: http://issues.apache.org/jir

[jira] Resolved: (LUCENE-304) [PATCH] add boost feature to MultiFieldQueryParser

2006-06-15 Thread Otis Gospodnetic (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-304?page=all ] Otis Gospodnetic resolved LUCENE-304: - Resolution: Duplicate Assign To: (was: Lucene Developers) I think this is a "duplicate" of LUCENE-544 . > [PATCH] add boost feature to

[jira] Resolved: (LUCENE-601) RAMDirectory implements Serializable

2006-06-15 Thread Otis Gospodnetic (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-601?page=all ] Otis Gospodnetic resolved LUCENE-601: - Resolution: Fixed Applied, thanks Karl. > RAMDirectory implements Serializable > > > Key: LUCENE-6

[jira] Updated: (LUCENE-602) [PATCH] Filtering tokens for position and term vector storage

2006-06-15 Thread Chuck Williams (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-602?page=all ] Chuck Williams updated LUCENE-602: -- Attachment: TokenSelectorAllWithParallelWriter.patch TokenSelectorAllWithParallelWriter.patch contains ParallelWriter as well (LUCENE-600) as it is also af

[jira] Updated: (LUCENE-602) [PATCH] Filtering tokens for position and term vector storage

2006-06-15 Thread Chuck Williams (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-602?page=all ] Chuck Williams updated LUCENE-602: -- Attachment: TokenSelectorSoloAll.patch TokenSelectorSoloAll.patch applies against today's svn head. It only requires Java 1.4. > [PATCH] Filtering toke

[jira] Created: (LUCENE-602) [PATCH] Filtering tokens for position and term vector storage

2006-06-15 Thread Chuck Williams (JIRA)
[PATCH] Filtering tokens for position and term vector storage - Key: LUCENE-602 URL: http://issues.apache.org/jira/browse/LUCENE-602 Project: Lucene - Java Type: New Feature Components: Index Versio

Re: Refactored FuzzyTermEnum

2006-06-15 Thread Otis Gospodnetic
I'm still waiting for my employer to send in the CCLA. :( But I've got a cool name for this thing - "lube" - contrib/lube. Otis - Original Message From: Marvin Humphrey <[EMAIL PROTECTED]> To: java-dev@lucene.apache.org Sent: Wednesday, June 14, 2006 3:51:52 AM Subject: Re: Refactored F

Re: Java 1.5 was [jira] Updated: (LUCENE-600) ParallelWriter companion to ParallelReader

2006-06-15 Thread Otis Gospodnetic
I agree and completely understand Chuck. I'm waiting for my employer to sign and fax the CCLA for some search benchmarking code I wrote, and it uses Java 1.5 stuff. It would only be a contrib piece, not core, so it's less of a problem, but... Grant: how to poll users? How about this: http:

Re: Seeking feedback on LUCENE-557

2006-06-15 Thread Yonik Seeley
On 6/15/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: 1) Modifying existing search tests to implicitly check explain LUCENE-557-modify-existing-tests.patch modifies (almost) every core test I could find that used an IndexSearcher to use a modified IndexSearcher that implicitly tests explanation

Seeking feedback on LUCENE-557

2006-06-15 Thread Chris Hostetter
Regarding "search vs explain - score discrepancies"... http://issues.apache.org/jira/browse/LUCENE-557 ...I got sidetracked by a pesky anoyance called "work" for a little while, but I'm looking at this again, and I'd like to commit some varient of the patches I added to that bug -- the notabl

[jira] Updated: (LUCENE-259) HTML Parser doesn't decode character references in attributes

2006-06-15 Thread Daniel Naber (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-259?page=all ] Daniel Naber updated LUCENE-259: Bugzilla Id: (was: 30621) Assign To: (was: Lucene Developers) Priority: Minor (was: Major) Decrease priority because this affects the demo o

[jira] Commented: (LUCENE-101) Selecting a language-specific analyzer according to a locale.

2006-06-15 Thread Daniel Naber (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-101?page=comments#action_12416410 ] Daniel Naber commented on LUCENE-101: - The URL from the original report doesn't work anymore, I think it refers to this post: http://mail-archives.apache.org/mod_mbox/luc

[jira] Commented: (LUCENE-559) Turkish Analyzer for Lucene

2006-06-15 Thread Daniel Naber (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-559?page=comments#action_12416407 ] Daniel Naber commented on LUCENE-559: - Thanks for your contribution. Could you write some unit tests for your classes, similar to the existing tests for other languages?

[jira] Commented: (LUCENE-562) Allow Unstored AND Unindexed Fields as in 1.4

2006-06-15 Thread Daniel Naber (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-562?page=comments#action_12416415 ] Daniel Naber commented on LUCENE-562: - I think this should be closed as "won't fix". You could either write your wrapper class or just use an indexed or stored field that

[jira] Updated: (LUCENE-525) A standard Lucene install that works for simple web sites

2006-06-15 Thread Daniel Naber (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-525?page=all ] Daniel Naber updated LUCENE-525: Priority: Minor (was: Major) decrease priority > A standard Lucene install that works for simple web sites > -

[jira] Updated: (LUCENE-590) Demo HTML parser gives incorrect summaries when title is repeated as a heading

2006-06-15 Thread Daniel Naber (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-590?page=all ] Daniel Naber updated LUCENE-590: Description: If you have an html document where the title is repeated as a heading at the top of the document, the HTMLParser will return the title as the summ

[jira] Resolved: (LUCENE-196) [PATCH] Added support for segmented field data files and cached directories

2006-06-15 Thread Otis Gospodnetic (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-196?page=all ] Otis Gospodnetic resolved LUCENE-196: - Resolution: Duplicate Assign To: (was: Lucene Developers) Thanks Christian. I think LUCENE-545 provided the solution to selective field

Re: Bugs

2006-06-15 Thread Otis Gospodnetic
Grant - there are a ton of good patches from Paul in JIRA. There was also that one from a person at IBM, I think, the one that improved performance of interleaved add/delete operations. I think that one would be good to apply before it gets stale. Other than that - yeah, go by votes, I'd say,

[jira] Updated: (LUCENE-601) RAMDirectory implements Serializable

2006-06-15 Thread Karl Wettin (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-601?page=all ] Karl Wettin updated LUCENE-601: --- Attachment: serializable_RAMDirectory.diff > RAMDirectory implements Serializable > > > Key: LUCENE-601 > U

[jira] Created: (LUCENE-601) RAMDirectory implements Serializable

2006-06-15 Thread Karl Wettin (JIRA)
RAMDirectory implements Serializable Key: LUCENE-601 URL: http://issues.apache.org/jira/browse/LUCENE-601 Project: Lucene - Java Type: Improvement Components: Store Versions: 2.0.0 Reporter: Karl Wettin Priori

Re: Bugs

2006-06-15 Thread Chris Hostetter
: > unfortunately, many people don't think to search "resolved" or "closed" : > bugs for similar problems. : It's kind of ironic that people working on a search engine wouldn't : think to search first! :-) Human nature, I guess... well, yeah ... but i suspect peoples natural dendency when searc

Re: Bugs

2006-06-15 Thread Paul Elschot
On Thursday 15 June 2006 13:56, Grant Ingersoll wrote: > Hey Chris, ... > > > If you would like a second pair of eyes on anything, just let me know. Some sandboxes were mentioned, where's the playground? Regards, Paul Elschot --

Re: Bugs

2006-06-15 Thread Grant Ingersoll
Hey Chris, All good points. See some more of my thoughts below... Chris Hostetter wrote: : them are listed as increased priority. There are also a number of bugs : that date back as far as 2002 which I highly doubt are all that useful : at this point unless someone wants to patch a specific b

Re: Scoring

2006-06-15 Thread Nadav Har'El
One interesting thing to talk about is when you need to create a new Query subclass, and how to do it. For example, let's say you want something between a BooleanQuery and a PhraseQuery, which matches documents with some of the query words in them (like the normal BooleanQuery), but giving more sc

Re: Refactored FuzzyTermEnum

2006-06-15 Thread karl wettin
On Thu, 2006-06-15 at 11:42 +0200, karl wettin wrote: > > 2. Standard RAMDirectory: Oups, I got the headers wrong here. Again: With your fixes: 1000 fuzzy~0.1, 121 per min. 1000 fuzzy~0.2, 190 per min. 1000 fuzzy~0.3, 342 per min. 1000 fuzzy~0.4, 456 per min. 1000 fuzzy~0.5, 578 per min. 1000 f

Re: Refactored FuzzyTermEnum

2006-06-15 Thread karl wettin
On Tue, 2006-06-13 at 15:14 -0400, Bob Carpenter wrote: > I refactored the org.apache.lucene.search.FuzzyTermEnum Nice! > Does anyone have regression/performance test harnesses? I have a quite unscientific test I ran. 500 documents in corpus. One index reader. A new Index searcher for each query

Re: Bugs

2006-06-15 Thread Chris Hostetter
: them are listed as increased priority. There are also a number of bugs : that date back as far as 2002 which I highly doubt are all that useful : at this point unless someone wants to patch a specific branch. : : So, I guess I am wondering where I can be the most helpful? I would : like to pr