[jira] Commented: (LUCENE-794) Beginnings of a span based highlighter

2007-02-23 Thread Mark Harwood (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475289 ] Mark Harwood commented on LUCENE-794: - Hi Mark, I found a little time to look at the span Highlighter the other n

[jira] Commented: (LUCENE-811) Public API inconsistency

2007-02-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475311 ] Michael McCandless commented on LUCENE-811: --- > I think making some methods public is fine, so long as the c

[jira] Commented: (LUCENE-794) Beginnings of a span based highlighter

2007-02-23 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475316 ] Mark Miller commented on LUCENE-794: Yeah the patch should take care of all of that...I would have started with a

[jira] Commented: (LUCENE-811) Public API inconsistency

2007-02-23 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475418 ] Doug Cutting commented on LUCENE-811: - > for a package-private final class, marking un-inherited > methods/field

Re: [jira] Field constructor, avoiding String.intern()

2007-02-23 Thread James Kennedy
In our case, we're trying to optimize document() retrieval and we found that disabling the String interning in the Field constructor improved performance dramatically. I agree that interning should be an option on the constructor. For document retrieval, at least for a small of amount of fields, t

Re: [jira] Field constructor, avoiding String.intern()

2007-02-23 Thread robert engels
I don't think it is just the performance gain of equals() where intern () matters. It also reduces memory consumption dramatically when working with large collections of documents in memory - although this could also be done with constants, there is nothing in Java to enforce it (thus the

Re: [jira] Field constructor, avoiding String.intern()

2007-02-23 Thread James Kennedy
True. However, in the case where you are processing Documents one at a time and discarding them (e.g. We use hitCollector to process all documents from a search), or memory is not an issue, it would be nice to have the ability to disable the interning for performance sake. Robert Engels wrote

Deprecating IndexReader's delete methods?

2007-02-23 Thread Otis Gospodnetic
Hi, Now that we have Document deletion via IndexWriter, is the plan to deprecate the delete methods in IndexReader? If so, how come we didn't do that for 2.1? Sorry if I missed this discussion, if there was one. Thanks, Otis -

Re: Deprecating IndexReader's delete methods?

2007-02-23 Thread karl wettin
23 feb 2007 kl. 19.35 skrev Otis Gospodnetic: Now that we have Document deletion via IndexWriter, is the plan to deprecate the delete methods in IndexReader? If so, how come we didn't do that for 2.1? Sorry if I missed this discussion, if there was one. If I got it all right, then the

Re: Deprecating IndexReader's delete methods?

2007-02-23 Thread Yonik Seeley
On 2/23/07, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: Now that we have Document deletion via IndexWriter, is the plan to deprecate the delete methods in IndexReader? If so, how come we didn't do that for 2.1? Sorry if I missed this discussion, if there was one. IndexWriter deletion is sig

Re: [jira] Field constructor, avoiding String.intern()

2007-02-23 Thread Mike Klaas
On 2/23/07, James Kennedy <[EMAIL PROTECTED]> wrote: In our case, we're trying to optimize document() retrieval and we found that disabling the String interning in the Field constructor improved performance dramatically. I agree that interning should be an option on the constructor. Out of cur

Re: Deprecating IndexReader's delete methods?

2007-02-23 Thread Mark Miller
Is it more efficient to use IndexWriter to delete or does the change not bring a speed improvement? On 2/23/07, Yonik Seeley <[EMAIL PROTECTED]> wrote: On 2/23/07, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Now that we have Document deletion via IndexWriter, is the plan to deprecate the dele

Re: [jira] Field constructor, avoiding String.intern()

2007-02-23 Thread Doug Cutting
James Kennedy wrote: True. However, in the case where you are processing Documents one at a time and discarding them (e.g. We use hitCollector to process all documents from a search), or memory is not an issue, it would be nice to have the ability to disable the interning for performance sake.

[jira] Commented: (LUCENE-811) Public API inconsistency

2007-02-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475436 ] Michael McCandless commented on LUCENE-811: --- Ahhh, OK, I see. I like that style (declaring fields/methods

Re: Deprecating IndexReader's delete methods?

2007-02-23 Thread Doron Cohen
[EMAIL PROTECTED] wrote on 23/02/2007 10:43:13: > On 2/23/07, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > > Now that we have Document deletion via IndexWriter, is the plan to > deprecate the delete methods in IndexReader? If so, how come we > didn't do that for 2.1? Sorry if I missed this disc

Re: Deprecating IndexReader's delete methods?

2007-02-23 Thread Doug Cutting
Mark Miller wrote: Is it more efficient to use IndexWriter to delete or does the change not bring a speed improvement? It's no faster. It's primarily meant to simplify updates. Doug - To unsubscribe, e-mail: [EMAIL PROTECTED

[jira] Commented: (LUCENE-811) Public API inconsistency

2007-02-23 Thread Daniel John Debrunner (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475439 ] Daniel John Debrunner commented on LUCENE-811: -- Doesn't making such methods public mean they can be call

Re: [jira] Commented: (LUCENE-811) Public API inconsistency

2007-02-23 Thread robert engels
worrying about reflection "security" in a library is pointless. any real security needs to be accomplished using server isolation, and/or data access/encryption On Feb 23, 2007, at 1:11 PM, Daniel John Debrunner (JIRA) wrote: [ https://issues.apache.org/jira/browse/LUCENE-811? page=c

[jira] Assigned: (LUCENE-800) Incorrect parsing by QueryParser.parse() when it encounters backslashes (always eats one backslash.)

2007-02-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch reassigned LUCENE-800: Assignee: Michael Busch > Incorrect parsing by QueryParser.parse() when it encounters backs

[jira] Commented: (LUCENE-800) Incorrect parsing by QueryParser.parse() when it encounters backslashes (always eats one backslash.)

2007-02-23 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475452 ] Doron Cohen commented on LUCENE-800: Michael, I've been looking into this and think I made some progress. Are you

[jira] Updated: (LUCENE-811) Public API inconsistency

2007-02-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-811: -- Attachment: LUCENE-811-take2.patch OK fixed the patch to leave some fields/methods publ

[jira] Commented: (LUCENE-800) Incorrect parsing by QueryParser.parse() when it encounters backslashes (always eats one backslash.)

2007-02-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475457 ] Michael Busch commented on LUCENE-800: -- Hi Dilip, the backslash is the escape character in Lucene's queryparser

[jira] Assigned: (LUCENE-372) Unmatched right parentheses truncates query

2007-02-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch reassigned LUCENE-372: Assignee: Michael Busch > Unmatched right parentheses truncates query > ---

[jira] Commented: (LUCENE-800) Incorrect parsing by QueryParser.parse() when it encounters backslashes (always eats one backslash.)

2007-02-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475461 ] Michael Busch commented on LUCENE-800: -- Doron, the problem here is that a backslash is a valid TERM_CHAR and an

[jira] Assigned: (LUCENE-813) leading wildcard's don't work with trailing wildcard

2007-02-23 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen reassigned LUCENE-813: -- Assignee: Doron Cohen > leading wildcard's don't work with trailing wildcard >

Re: Lucene 2.1, soon

2007-02-23 Thread Grant Ingersoll
OK, I created https://svn.apache.org/repos/asf/lucene/java/dist and have committed the KEYS file from the last release. I also updated http://wiki.apache.org/jakarta-lucene/ReleaseTodo to reference the url. On Feb 21, 2007, at 12:38 PM, Doug Cutting wrote: Chris Hostetter wrote: : since

[jira] Commented: (LUCENE-800) Incorrect parsing by QueryParser.parse() when it encounters backslashes (always eats one backslash.)

2007-02-23 Thread Dilip Nimkar (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475513 ] Dilip Nimkar commented on LUCENE-800: - In my test code, I took care of the difference between \ as the Java escap

Re: commited docs vs wiki -- was: Re: [jira] Commented: (LUCENE-805) New Lucene Demo

2007-02-23 Thread Grant Ingersoll
I'll ask on infrastructure if there is a way to take a snapshot of the Wiki as HTML for release purposes. If we can do that, then I think we could move more to the Wiki. One solution, would be to have a simple script that calls wget (or some crawler) and downloads all of the wiki. It wou

[jira] Commented: (LUCENE-778) Allow overriding a Document

2007-02-23 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475526 ] Hoss Man commented on LUCENE-778: - >From email... http://www.nabble.com/-jira--Created%3A-%28LUCENE-778%29-Allow-ove

Re: [jira] Field constructor, avoiding String.intern()

2007-02-23 Thread Chris Hostetter
: Accessing documents from a hit-collector is not advised. It is : generally best to compose queries and filters to reduce the number of : matches. When that's not feasible, a hit collector that uses a : FieldCache to filter by or collect field values is much faster than : accessing documents.

[jira] Commented: (LUCENE-800) Incorrect parsing by QueryParser.parse() when it encounters backslashes (always eats one backslash.)

2007-02-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475529 ] Michael Busch commented on LUCENE-800: -- Dilip, are you using Lucene 1.9? The problem you are referring to (a se

[jira] Updated: (LUCENE-800) Incorrect parsing by QueryParser.parse() when it encounters backslashes (always eats one backslash.)

2007-02-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-800: - Priority: Minor (was: Major) just lowering the severity to minor > Incorrect parsing by QueryPa

Re: [jira] Field constructor, avoiding String.intern()

2007-02-23 Thread James Kennedy
Roughly search time of 10,000,000 documents (3 fields each) was cut in half. However, keep in mind that we're using slightly modified lucene document retrieval code. Using a HitCollector to aggregate search results. Mike Klaas wrote: > > On 2/23/07, James Kennedy <[EMAIL PROTECTED]> wrote: >>

Re: [jira] Field constructor, avoiding String.intern()

2007-02-23 Thread Wolfgang Hoschek
On Feb 23, 2007, at 10:28 AM, James Kennedy wrote: True. However, in the case where you are processing Documents one at a time and discarding them (e.g. We use hitCollector to process all documents from a search), or memory is not an issue, it would be nice to have the ability to disabl

[jira] Commented: (LUCENE-813) leading wildcard's don't work with trailing wildcard

2007-02-23 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475540 ] Doron Cohen commented on LUCENE-813: I fixed the PREFIX definition in QueryParser.jj, changed: (<_TERM_S

RE: commited docs vs wiki -- was: Re: [jira] Commented: (LUCENE-805) New Lucene Demo

2007-02-23 Thread Steven Parkes
All wiki changes. -Original Message- From: Grant Ingersoll [mailto:[EMAIL PROTECTED] Sent: Friday, February 23, 2007 2:04 PM To: java-dev@lucene.apache.org Subject: Re: commited docs vs wiki -- was: Re: [jira] Commented: (LUCENE-805) New Lucene Demo I'll ask on infrastructure if there is

[jira] Commented: (LUCENE-813) leading wildcard's don't work with trailing wildcard

2007-02-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475573 ] Michael Busch commented on LUCENE-813: -- > Another tehnical issue that came up is line endings - compiling on XP,

[jira] Updated: (LUCENE-813) leading wildcard's don't work with trailing wildcard

2007-02-23 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-813: --- Attachment: 813.fix.lead.wildcard.patch Attached 813.fix.lead.wildcard.patch fixes this by modifying

[jira] Updated: (LUCENE-813) leading wildcard's don't work with trailing wildcard

2007-02-23 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-813: --- Attachment: (was: 813.fix.lead.wildcard.patch) > leading wildcard's don't work with trailing wild

[jira] Updated: (LUCENE-813) leading wildcard's don't work with trailing wildcard

2007-02-23 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-813: --- Attachment: 813.fix.lead.wildcard.patch Reattaching (forgot to grant the license in the first shot...

[jira] Updated: (LUCENE-813) leading wildcard's don't work with trailing wildcard

2007-02-23 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-813: --- Lucene Fields: [New, Patch Available] (was: [New]) > leading wildcard's don't work with trailing wil

[jira] Created: (LUCENE-814) javacc on Win32 (cygwin) creates wrong line endings - fix them with 'ant replace'

2007-02-23 Thread Doron Cohen (JIRA)
javacc on Win32 (cygwin) creates wrong line endings - fix them with 'ant replace' - Key: LUCENE-814 URL: https://issues.apache.org/jira/browse/LUCENE-814 Project: Lucene

[jira] Updated: (LUCENE-814) javacc on Win32 (cygwin) creates wrong line endings - fix them with 'ant replace'

2007-02-23 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-814: --- Attachment: 814.javacc.line.ends.patch > javacc on Win32 (cygwin) creates wrong line endings - fix th

[jira] Commented: (LUCENE-813) leading wildcard's don't work with trailing wildcard

2007-02-23 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475599 ] Doron Cohen commented on LUCENE-813: I added a fix for this in build.xml - http://issues.apache.org/jira/browse

[jira] Commented: (LUCENE-814) javacc on Win32 (cygwin) creates wrong line endings - fix them with 'ant replace'

2007-02-23 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12475600 ] Hoss Man commented on LUCENE-814: - a few comments in no particular order... 1) there is a core ant task called that

Re: commited docs vs wiki -- was: Re: [jira] Commented: (LUCENE-805) New Lucene Demo

2007-02-23 Thread Chris Hostetter
: think we could move more to the Wiki. One solution, would be to have : a simple script that calls wget (or some crawler) and downloads all : of the wiki. It would, however, be better if the wiki supported yeah .. that's a fairly crude approach that would result in a lot of the useless navigat

Re: commited docs vs wiki -- was: Re: [jira] Commented: (LUCENE-805) New Lucene Demo

2007-02-23 Thread Chris Hostetter
: hhh this looks promising... : : http://cwiki.apache.org/confluence/spaces/exportspace.action?key=labs Hmmm... i found a lot of interesting documentation directly on point here... http://cwiki.apache.org/confluence/display/CWIKI/Index in a nutshell: * cwiki has permissions s