[jira] Created: (LUCENE-1375) add IndexCommit.getTimestamp method

2008-09-04 Thread Michael McCandless (JIRA)
add IndexCommit.getTimestamp method --- Key: LUCENE-1375 URL: https://issues.apache.org/jira/browse/LUCENE-1375 Project: Lucene - Java Issue Type: Improvement Components: Index Affects Versions:

[jira] Commented: (LUCENE-1375) add IndexCommit.getTimestamp method

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628290#action_12628290 ] Michael McCandless commented on LUCENE-1375: Changes are trivial; I plan to co

[jira] Resolved: (LUCENE-1375) add IndexCommit.getTimestamp method

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1375. Resolution: Fixed Committed revision 691952. > add IndexCommit.getTimestamp metho

[jira] Updated: (LUCENE-1376) sometimes if a BG merge hits an exception, optimize() will fail to forward the exception

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1376: --- Attachment: LUCENE-1376.patch Attached patch. I will commit shortly. > sometimes i

[jira] Created: (LUCENE-1376) sometimes if a BG merge hits an exception, optimize() will fail to forward the exception

2008-09-04 Thread Michael McCandless (JIRA)
sometimes if a BG merge hits an exception, optimize() will fail to forward the exception Key: LUCENE-1376 URL: https://issues.apache.org/jira/browse/LUCENE-1376

[jira] Assigned: (LUCENE-1369) Eliminate unnecessary uses of Hashtable and Vector

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-1369: -- Assignee: Michael McCandless > Eliminate unnecessary uses of Hashtable and Vec

[jira] Commented: (LUCENE-1369) Eliminate unnecessary uses of Hashtable and Vector

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628306#action_12628306 ] Michael McCandless commented on LUCENE-1369: This patch looks good, thanks DM!

[jira] Commented: (LUCENE-1126) Simplify StandardTokenizer JFlex grammar

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628308#action_12628308 ] Michael McCandless commented on LUCENE-1126: {quote} One solution would be to

[jira] Resolved: (LUCENE-1376) sometimes if a BG merge hits an exception, optimize() will fail to forward the exception

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1376. Resolution: Fixed Committed revision 691964. > sometimes if a BG merge hits an ex

Re: [jira] Commented: (LUCENE-1320) ShingleMatrixFilter, a three dimensional permutating shingle filter

2008-09-04 Thread Karl Wettin
Right, but that's sort of a hassle :) I'll see what I can do. 4 sep 2008 kl. 04.36 skrev Grant Ingersoll: Or just remove the generics, right? On Sep 3, 2008, at 5:09 PM, Karl Wettin (JIRA) wrote: [ https://issues.apache.org/jira/browse/LUCENE-1320?page=com.atlassian.jira.plugin.system.i

Re: Is the COMPANY rule in StandardTokenizer valid?

2008-09-04 Thread Grant Ingersoll
On Sep 4, 2008, at 2:43 AM, Shai Erera wrote: Hi The COMPANY rule in StandardTokenizer is defined like this: // Company names like AT&T and [EMAIL PROTECTED] COMPANY= {ALPHA} ("&"|"@") {ALPHA} While this works perfect for AT&T and [EMAIL PROTECTED], it doesn't work well for strings li

[jira] Commented: (LUCENE-1372) Proposal: introduce more sensible sorting when a doc has multiple values for a term

2008-09-04 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628331#action_12628331 ] Mark Miller commented on LUCENE-1372: - Hey Paul, I agree that your patch is more intu

Re: Realtime Search for Social Networks Collaboration

2008-09-04 Thread Yonik Seeley
On Wed, Sep 3, 2008 at 6:50 PM, Jason Rutherglen <[EMAIL PROTECTED]> wrote: > I also think it's got a > lot of things now which makes integration difficult to do properly. I agree, and that's why the major bump in version number rather than minor - we recognize that some features will need some am

Re: Realtime Search for Social Networks Collaboration

2008-09-04 Thread Jason Rutherglen
Hi Yonik, I found the basic integration with SOLR and Ocean to be fairly straightforward, the https://issues.apache.org/jira/browse/SOLR-567 patch is key to that. SOLR just needs an optimistic concurrency update handler and most of the functionality would work. I guess the problem would be, remo

[jira] Commented: (LUCENE-1344) Make the Lucene jar an OSGi bundle

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628348#action_12628348 ] Michael McCandless commented on LUCENE-1344: Nicolas, does this mean we need

[jira] Commented: (LUCENE-1131) Add numDeletedDocs to IndexReader

2008-09-04 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628355#action_12628355 ] Otis Gospodnetic commented on LUCENE-1131: -- I think so - applies and compiles. >

[jira] Commented: (LUCENE-1126) Simplify StandardTokenizer JFlex grammar

2008-09-04 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628368#action_12628368 ] Steven Rowe commented on LUCENE-1126: - {quote} Could we, alternatively, modify the pat

[jira] Commented: (LUCENE-1344) Make the Lucene jar an OSGi bundle

2008-09-04 Thread JIRA
[ https://issues.apache.org/jira/browse/LUCENE-1344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628370#action_12628370 ] Nicolas Lalevée commented on LUCENE-1344: - bq. Nicolas, does this mean we need to

Re: Is the COMPANY rule in StandardTokenizer valid?

2008-09-04 Thread Shai Erera
>> If I had to choose, this sounds reasonable. Which of the two sound reasonable: (1) or (2)? On Thu, Sep 4, 2008 at 3:47 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > > On Sep 4, 2008, at 2:43 AM, Shai Erera wrote: > > Hi >> >> The COMPANY rule in StandardTokenizer is defined like this: >> >

Re: Is the COMPANY rule in StandardTokenizer valid?

2008-09-04 Thread Grant Ingersoll
Sorry, 2. I realized after I sent it that my last sentence in the reply was ambiguous. On Sep 4, 2008, at 12:24 PM, Shai Erera wrote: >> If I had to choose, this sounds reasonable. Which of the two sound reasonable: (1) or (2)? On Thu, Sep 4, 2008 at 3:47 PM, Grant Ingersoll <[EMAIL PROT

[jira] Commented: (LUCENE-1126) Simplify StandardTokenizer JFlex grammar

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628387#action_12628387 ] Michael McCandless commented on LUCENE-1126: bq. I'll shortly provide a redone

[jira] Updated: (LUCENE-1126) Simplify StandardTokenizer JFlex grammar

2008-09-04 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1126: Attachment: LUCENE-1126.patch Refreshed original patch to include the Thai range {{[\u0e00-\u0e59]

[jira] Commented: (LUCENE-1126) Simplify StandardTokenizer JFlex grammar

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628406#action_12628406 ] Michael McCandless commented on LUCENE-1126: Steven, it looks like you ran JFl

[jira] Commented: (LUCENE-1126) Simplify StandardTokenizer JFlex grammar

2008-09-04 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628424#action_12628424 ] Steven Rowe commented on LUCENE-1126: - bq. Steven, it looks like you ran JFlex with a

[jira] Commented: (LUCENE-1126) Simplify StandardTokenizer JFlex grammar

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628428#action_12628428 ] Michael McCandless commented on LUCENE-1126: Steven, it's OK I can regen; I ju

[jira] Created: (LUCENE-1377) Add HTMLStripReader and WordDelimiterFilter from SOLR

2008-09-04 Thread Jason Rutherglen (JIRA)
Add HTMLStripReader and WordDelimiterFilter from SOLR - Key: LUCENE-1377 URL: https://issues.apache.org/jira/browse/LUCENE-1377 Project: Lucene - Java Issue Type: Improvement Comp

[jira] Resolved: (LUCENE-1126) Simplify StandardTokenizer JFlex grammar

2008-09-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1126. Resolution: Fixed Committed revision 692211. Thanks Steven! > Simplify StandardT

Re: Moving SweetSpotSimilarity out of contrib

2008-09-04 Thread Chris Hostetter
: Contrib lacks many requirements of core code - it can be java 1.5, it doesn't : have to be backward compatible, etc. Putting something in core ensures its : treated as a Lucene first class citizen, stuff in contrib is not held to such : strict standards. "Contribs" as an idea lack those require

[jira] Commented: (LUCENE-1372) Proposal: introduce more sensible sorting when a doc has multiple values for a term

2008-09-04 Thread Paul Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628480#action_12628480 ] Paul Smith commented on LUCENE-1372: Having a Document sorted last because it has "zeb

Re: Moving SweetSpotSimilarity out of contrib

2008-09-04 Thread Chris Hostetter
: Another important driver is the "out-of-the-box experience". I honestly have no idea what an OOTB experience for Lucene-Java means ... For Solr i understand, For Nutch i understand ... for a java library The closest thing we can do to describing an OOTB experience is making a good demo

Re: Moving SweetSpotSimilarity out of contrib

2008-09-04 Thread Chris Hostetter
: My thought was to move SSS to core as a step towards : making it the default, if and when there is more evidence it is : better than current default - it just felt right as a cautious : step - I mean first move it to core so that it is more exposed If people really want to make SSS the default

[jira] Commented: (LUCENE-1372) Proposal: introduce more sensible sorting when a doc has multiple values for a term

2008-09-04 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628496#action_12628496 ] Mark Miller commented on LUCENE-1372: - Ah, but right now, the documentation will tell

[jira] Commented: (LUCENE-1372) Proposal: introduce more sensible sorting when a doc has multiple values for a term

2008-09-04 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628509#action_12628509 ] Hoss Man commented on LUCENE-1372: -- bq. Right now it looks blatantly silly at the present

[jira] Commented: (LUCENE-1372) Proposal: introduce more sensible sorting when a doc has multiple values for a term

2008-09-04 Thread Paul Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628513#action_12628513 ] Paul Smith commented on LUCENE-1372: bq. I'm not following this argument. Will it be l

Re: [jira] Created: (LUCENE-1377) Add HTMLStripReader and WordDelimiterFilter from SOLR

2008-09-04 Thread Chris Hostetter
: SOLR has two classes HTMLStripReader and WordDelimiterFilter which are : very useful for a wide variety of use cases. It would be good to place : them into core Lucene. FWIW: Just about every concrete TokenFilter and Tokeinzer in Solr's code base could and probably should be promoted up int

[jira] Commented: (LUCENE-1372) Proposal: introduce more sensible sorting when a doc has multiple values for a term

2008-09-04 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628555#action_12628555 ] Hoss Man commented on LUCENE-1372: -- bq. We'd sort the list of attributes so that it would

[jira] Issue Comment Edited: (LUCENE-1373) Most of the contributed Analyzers suffer from invalid recognition of acronyms.

2008-09-04 Thread Mark Lassau (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628563#action_12628563 ] marklassau edited comment on LUCENE-1373 at 9/4/08 11:13 PM: --

[jira] Commented: (LUCENE-1151) Fix StandardAnalyzer to not mis-identify HOST as ACRONYM by default

2008-09-04 Thread Mark Lassau (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628568#action_12628568 ] Mark Lassau commented on LUCENE-1151: - Michael, Great work. I am glad we are moving to

[jira] Commented: (LUCENE-1151) Fix StandardAnalyzer to not mis-identify HOST as ACRONYM by default

2008-09-04 Thread Mark Lassau (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628573#action_12628573 ] Mark Lassau commented on LUCENE-1151: - I love the solution you have come up with, but