[jira] [Commented] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Uwe Schindler (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1308#comment-1308
 ] 

Uwe Schindler commented on LUCENE-3847:
---

As far as I remember, the main problem with the Map interface in Properties is 
the values coming from the parent object (passed in Parameter). When you wrap 
another properties object with one that has the other one as defaults given 
in ctor, then your are fcked up if you are using the Map interface. Which is 
not the case for system properties, but that's not guaranteed (e.g. the Java 
JVM could set some basic properties in another Properties object and make the 
changes available only through the overlayed properties object).

To fix the defaults problem I would go with the Enumeration? (works in Java 
5, too) in a for(Enumeration? e=propertyNames(); e.hasMoreElements();) loop.

If you use the Set interface, the inherited props would not have been seen and 
the system may *delete* properties which are in a parent/defaults properties 
object.

 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Issue Comment Edited] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Uwe Schindler (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1308#comment-1308
 ] 

Uwe Schindler edited comment on LUCENE-3847 at 3/5/12 8:09 AM:
---

As far as I remember, the main problem with the Map interface in Properties is 
the values coming from the parent defaults object. When you wrap another 
properties object with one that has the other one as defaults given in ctor, 
then your are fcked up if you are using the Map interface. Which is not the 
case for system properties, but that's not guaranteed (e.g. the Java JVM could 
set some basic properties in another Properties object and make the changes 
available only through the overlayed properties object).

To fix the defaults problem I would go with the Enumeration? (works in Java 
5, too) in a for(Enumeration? e=propertyNames(); e.hasMoreElements();) loop.

If you use the Set interface, the inherited props would not have been seen and 
the system may *delete* properties which are in a parent/defaults properties 
object.

  was (Author: thetaphi):
As far as I remember, the main problem with the Map interface in Properties 
is the values coming from the parent object (passed in Parameter). When you 
wrap another properties object with one that has the other one as defaults 
given in ctor, then your are fcked up if you are using the Map interface. Which 
is not the case for system properties, but that's not guaranteed (e.g. the Java 
JVM could set some basic properties in another Properties object and make the 
changes available only through the overlayed properties object).

To fix the defaults problem I would go with the Enumeration? (works in Java 
5, too) in a for(Enumeration? e=propertyNames(); e.hasMoreElements();) loop.

If you use the Set interface, the inherited props would not have been seen and 
the system may *delete* properties which are in a parent/defaults properties 
object.
  
 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Dawid Weiss (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1309#comment-1309
 ] 

Dawid Weiss commented on LUCENE-3847:
-

Yeah, it might have been that that I experienced. So back to enumeration we go 
:(

 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-1772) UpdateProcessor to prune empty values

2012-03-05 Thread Koji Sekiguchi (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1312#comment-1312
 ] 

Koji Sekiguchi commented on SOLR-1772:
--

Hoss: is this same with a part of SOLR-2802 
(RemoveBlankFieldUpdateProcessorFactory)?

 UpdateProcessor to prune empty values
 ---

 Key: SOLR-1772
 URL: https://issues.apache.org/jira/browse/SOLR-1772
 Project: Solr
  Issue Type: Wish
Reporter: Hoss Man

 Users seem to frequently get confused when some FieldTypes (typically the 
 numeric ones) complain about invalid field values when the inadvertantly 
 index an empty string.
 It would be cool to provide an UpdateProcessor that makes it easy to strip 
 out any fields being added as empty values ... it could be configured using 
 field (and/or field type) names or globs to select/ignore certain fields -- i 
 haven't thought it through all that hard

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Issue Comment Edited] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Uwe Schindler (Issue Comment Edited) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1317#comment-1317
 ] 

Uwe Schindler edited comment on LUCENE-3847 at 3/5/12 8:30 AM:
---

About your commit: It should be getProperty() [without cast] and not get()... 
Otherwise the defaults are again fcked up :-)

  was (Author: thetaphi):
About your commit: It should be getProperty() and not get()... Otherwise 
the defaults are again fcked up :-)
  
 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Uwe Schindler (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1317#comment-1317
 ] 

Uwe Schindler commented on LUCENE-3847:
---

About your commit: It should be getProperty() and not get()... Otherwise the 
defaults are again fcked up :-)

 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Dawid Weiss (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1320#comment-1320
 ] 

Dawid Weiss commented on LUCENE-3847:
-

Right, good catch. Properties are seriously messed up, aren't they?

 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Resolved] (SOLR-2437) Use IndexWriter.deleteAll() in DUH2.deleteAll() instead of closing and reopening writer w/ create=true

2012-03-05 Thread Koji Sekiguchi (Resolved) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi resolved SOLR-2437.
--

   Resolution: Fixed
Fix Version/s: 4.0

This has been fixed and committed in part of SOLR-2193 and its related issues.

 Use IndexWriter.deleteAll()  in DUH2.deleteAll() instead of closing and 
 reopening writer w/ create=true
 ---

 Key: SOLR-2437
 URL: https://issues.apache.org/jira/browse/SOLR-2437
 Project: Solr
  Issue Type: Improvement
Affects Versions: 1.4
Reporter: Koji Sekiguchi
Priority: Minor
 Fix For: 4.0

 Attachments: SOLR-2437.patch


 Use just IW.deleteAll() to abort background merges, remove buffered docs, etc.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Uwe Schindler (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1323#comment-1323
 ] 

Uwe Schindler commented on LUCENE-3847:
---

Horrible!!! I just say: die, die, die

 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3188) New admin page: Enable Polling button disappears after disabling polling and reloading page

2012-03-05 Thread Stefan Matheis (steffkes) (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1322#comment-1322
 ] 

Stefan Matheis (steffkes) commented on SOLR-3188:
-

Neil, i've checked that .. and the only part in the code where the error-box on 
the replication get's activated is this: 
https://github.com/steffkes/solr-admin/blob/master/js/scripts/replication.js#L89
 . so, i guess the easiest thing to try is, to request the replication urls 
manually and see what the response is.

{{/replication?command=disablereplicationwt=json}} and maybe additionally 
{{/replication?command=detailswt=json}} afterwards

 New admin page: Enable Polling button disappears after disabling polling 
 and reloading page
 -

 Key: SOLR-3188
 URL: https://issues.apache.org/jira/browse/SOLR-3188
 Project: Solr
  Issue Type: Bug
  Components: Schema and Analysis
Affects Versions: 4.0
Reporter: Neil Hooey
Priority: Minor
  Labels: admin

 When you go to this URL on a slave:
 http://localhost:8983/solr/#/singlecore/replication
 And click the Disable Polling button, you see a red bar that says 
 invalid_master. I'm not sure why I get this red bar, as I haven't tested it 
 outside of my own installation, but it seems normal.
 If you then refresh the page, the Replicate Now and Enable Polling 
 buttons disappear. It seems like their generation is being interrupted by the 
 invalid_master error.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Dawid Weiss (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1325#comment-1325
 ] 

Dawid Weiss commented on LUCENE-3847:
-

bq. As far as I remember, the main problem with the Map interface in Properties 
is the values coming from the parent defaults object. 

Exactly, I'm kind of slow today.

 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Dawid Weiss (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1324#comment-1324
 ] 

Dawid Weiss commented on LUCENE-3847:
-

Now I remember what it was that I had a problem with. Properties has the notion 
of defaults so it falls back to another Properties by default if the key is 
not present. Unfortunately get() is not overridden in Properties (and 
everything else is) so things like propertyNames or getProperty will return a 
value from the fallback set, but get() won't.
{code}
  public static void main(String[] args) {
Properties defaults = new Properties();
Properties p = new Properties(defaults);
String key = custom-key;
String value = value;
defaults.put(key, value);
for (Enumeration? e = p.propertyNames(); e.hasMoreElements();) {
  Object currentKey = e.nextElement();
  if (currentKey == key) {
System.out.println(Default key found in propertyNames(), but...);
System.out.println(p.get()= + p.get(key));
System.out.println(p.getProperty()= + p.getProperty(key));
  }
}
  }
{code}
You can easily corrupt other programs by inserting Object values into the 
system property set -- any enumeration will then fail with a classcast. Still 
don't remember where this caused a problem but it definitely was.

 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3846) Fuzzy suggester

2012-03-05 Thread Eks Dev (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1328#comment-1328
 ] 

Eks Dev commented on LUCENE-3846:
-

Robert, 
I am not talking from some abstract-theoretical point of view, I made my own 
experience on  nontrivial Lucene datasets that are unfortunately not for 
sharing. Having possibility to train cost matrices per edit operation brings a 
lot, but you may have had another experience (different problems, different 
data...).  

Without specifying concrete task (annotated data), there is no notion of 
better, so this argument simply does not help (show me it is better, no 
you show me all ones matrix  is better than any other, no, no...). It is 
simply about the experience we made in the past, different opinions.   

I personally would not try this argument with molecular biology teams, and tell 
them their POM and BLOSUM matrices are worthless or to someone in record 
linkage community (Lucene was used in this context a lot) or ... 




 Fuzzy suggester
 ---

 Key: LUCENE-3846
 URL: https://issues.apache.org/jira/browse/LUCENE-3846
 Project: Lucene - Java
  Issue Type: Improvement
Reporter: Michael McCandless
Assignee: Michael McCandless
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3846.patch


 Would be nice to have a suggester that can handle some fuzziness (like spell 
 correction) so that it's able to suggest completions that are near what you 
 typed.
 As a first go at this, I implemented 1T (ie up to 1 edit, including a 
 transposition), except the first letter must be correct.
 But there is a penalty, ie, the corrected suggestion needs to have a much 
 higher freq than the exact match suggestion before it can compete.
 Still tons of nocommits, and somehow we should merge this / make it work with 
 analyzing suggester too (LUCENE-3842).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Uwe Schindler (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1330#comment-1330
 ] 

Uwe Schindler commented on LUCENE-3847:
---

No prob. That is exactly the problem I wanted to prevent with the Enumeration 
and getProperty(). Code using those 2 functions is the only correct one to list 
all properties... :(

 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3195) timeAllowed is ignored for grouping queries

2012-03-05 Thread Martijn van Groningen (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1345#comment-1345
 ] 

Martijn van Groningen commented on SOLR-3195:
-

The difference between the elapsed time and qtime is that the first also takes 
into account the time spend on the trip from and to the shard where the shard 
request is executed. I think it is fine to include also this time in the time 
allowed for a query.

 timeAllowed is ignored for grouping queries
 ---

 Key: SOLR-3195
 URL: https://issues.apache.org/jira/browse/SOLR-3195
 Project: Solr
  Issue Type: Bug
  Components: search
Affects Versions: 3.5, 4.0
Reporter: Russell Black
Assignee: Martijn van Groningen
  Labels: patch
 Attachments: SOLR-3195-timeallowed-for-grouping-3x-backport.patch, 
 SOLR-3195-timeallowed-for-grouping.patch, 
 SOLR-3195-timeallowed-for-grouping.patch


 For grouping queries, the {{timeAllowed}} parameter is ignored.  I couldn't 
 find this limitation documented anywhere, and we would like to use that 
 feature in our production system.  I have created a patch that makes grouping 
 queries respect the {{timeAllowed}} parameter.  The test cases all pass, and 
 it and it seems to work well.  I added support for both distributed and 
 non-distributed grouping.  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-3195) timeAllowed is ignored for grouping queries

2012-03-05 Thread Martijn van Groningen (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-3195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martijn van Groningen updated SOLR-3195:


Fix Version/s: 4.0
   3.6

 timeAllowed is ignored for grouping queries
 ---

 Key: SOLR-3195
 URL: https://issues.apache.org/jira/browse/SOLR-3195
 Project: Solr
  Issue Type: Bug
  Components: search
Affects Versions: 3.5, 4.0
Reporter: Russell Black
Assignee: Martijn van Groningen
  Labels: patch
 Fix For: 3.6, 4.0

 Attachments: SOLR-3195-timeallowed-for-grouping-3x-backport.patch, 
 SOLR-3195-timeallowed-for-grouping.patch, 
 SOLR-3195-timeallowed-for-grouping.patch


 For grouping queries, the {{timeAllowed}} parameter is ignored.  I couldn't 
 find this limitation documented anywhere, and we would like to use that 
 feature in our production system.  I have created a patch that makes grouping 
 queries respect the {{timeAllowed}} parameter.  The test cases all pass, and 
 it and it seems to work well.  I added support for both distributed and 
 non-distributed grouping.  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[JENKINS] Solr-trunk - Build # 1783 - Still Failing

2012-03-05 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Solr-trunk/1783/

1 tests failed.
FAILED:  org.apache.solr.TestDistributedSearch.testDistribSearch

Error Message:
java.lang.AssertionError: Some threads threw uncaught exceptions!

Stack Trace:
java.lang.RuntimeException: java.lang.AssertionError: Some threads threw 
uncaught exceptions!
at 
org.apache.lucene.util.LuceneTestCase.tearDownInternal(LuceneTestCase.java:809)
at 
org.apache.lucene.util.LuceneTestCase.access$1100(LuceneTestCase.java:137)
at 
org.apache.lucene.util.LuceneTestCase$InternalSetupTeardownRule$1.evaluate(LuceneTestCase.java:633)
at 
org.apache.lucene.util.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:22)
at 
org.apache.lucene.util.LuceneTestCase$TestResultInterceptorRule$1.evaluate(LuceneTestCase.java:531)
at 
org.apache.lucene.util.LuceneTestCase$RememberThreadRule$1.evaluate(LuceneTestCase.java:593)
at 
org.apache.lucene.util.LuceneTestCaseRunner.runChild(LuceneTestCaseRunner.java:165)
at 
org.apache.lucene.util.LuceneTestCaseRunner.runChild(LuceneTestCaseRunner.java:57)
at 
org.apache.lucene.util.SystemPropertiesRestoreRule$1.evaluate(SystemPropertiesRestoreRule.java:20)
at 
org.apache.lucene.util.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:22)
at 
org.apache.lucene.util.LuceneTestCase.checkUncaughtExceptionsAfter(LuceneTestCase.java:837)
at 
org.apache.lucene.util.LuceneTestCase.tearDownInternal(LuceneTestCase.java:781)




Build Log (for compile errors):
[...truncated 9979 lines...]



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Resolved] (SOLR-3195) timeAllowed is ignored for grouping queries

2012-03-05 Thread Martijn van Groningen (Resolved) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-3195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martijn van Groningen resolved SOLR-3195.
-

Resolution: Fixed

Committed to trunk and 3x branch.

 timeAllowed is ignored for grouping queries
 ---

 Key: SOLR-3195
 URL: https://issues.apache.org/jira/browse/SOLR-3195
 Project: Solr
  Issue Type: Bug
  Components: search
Affects Versions: 3.5, 4.0
Reporter: Russell Black
Assignee: Martijn van Groningen
  Labels: patch
 Fix For: 3.6, 4.0

 Attachments: SOLR-3195-timeallowed-for-grouping-3x-backport.patch, 
 SOLR-3195-timeallowed-for-grouping.patch, 
 SOLR-3195-timeallowed-for-grouping.patch


 For grouping queries, the {{timeAllowed}} parameter is ignored.  I couldn't 
 find this limitation documented anywhere, and we would like to use that 
 feature in our production system.  I have created a patch that makes grouping 
 queries respect the {{timeAllowed}} parameter.  The test cases all pass, and 
 it and it seems to work well.  I added support for both distributed and 
 non-distributed grouping.  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3808) Switch LuceneTestCaseRunner to RandomizedRunner. Enforce Random sharing contracts. Enforce thread leaks.

2012-03-05 Thread Dawid Weiss (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dawid Weiss updated LUCENE-3808:


Description: 
Dev. branch at: https://github.com/dweiss/lucene_solr/tree/rr

Switch the runner to RandomizedRunner. Enforce the following:
- (/) Random sharing will result in a failure/ exception.
- (/) -Add a validator for testXXX without @Test annotation.- (custom test 
provider added).
- (/) Make sure tests are executed with assertions enabled (at least for 
solr/lucene packages).
- (/) Add a validator for static hook shadowing (no-no).
- (/) Modify custom execution groups in LTC to be real @Groups.
- Thread leaks will result in a failure (add lingering if needed, but no 
ignores). [this is done, but disabled]
- Add a validator for @Test method overrides (check how many of these we 
already have first).
- What to do with thread-shared Random instances copies in MockIndexWriter and 
MockAnalyzer?

  was:
Dev. branch at: https://github.com/dweiss/lucene_solr/tree/rr

Switch the runner to RandomizedRunner. Enforce the following:
- (/) Random sharing will result in a failure/ exception.
- (/) -Add a validator for testXXX without @Test annotation.- (custom test 
provider added).
- (/) Make sure tests are executed with assertions enabled (at least for 
solr/lucene packages).
- (/) Add a validator for static hook shadowing (no-no).
- (/) Modify custom execution groups in LTC to be real @Groups.
- Thread leaks will result in a failure (add lingering if needed, but no 
ignores). [this is done, but disabled]
- Add a validator for @Test method overrides (check how many of these we 
already have first).



 Switch LuceneTestCaseRunner to RandomizedRunner. Enforce Random sharing 
 contracts. Enforce thread leaks.
 

 Key: LUCENE-3808
 URL: https://issues.apache.org/jira/browse/LUCENE-3808
 Project: Lucene - Java
  Issue Type: Sub-task
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 4.0


 Dev. branch at: https://github.com/dweiss/lucene_solr/tree/rr
 Switch the runner to RandomizedRunner. Enforce the following:
 - (/) Random sharing will result in a failure/ exception.
 - (/) -Add a validator for testXXX without @Test annotation.- (custom test 
 provider added).
 - (/) Make sure tests are executed with assertions enabled (at least for 
 solr/lucene packages).
 - (/) Add a validator for static hook shadowing (no-no).
 - (/) Modify custom execution groups in LTC to be real @Groups.
 - Thread leaks will result in a failure (add lingering if needed, but no 
 ignores). [this is done, but disabled]
 - Add a validator for @Test method overrides (check how many of these we 
 already have first).
 - What to do with thread-shared Random instances copies in MockIndexWriter 
 and MockAnalyzer?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3850) Fix rawtypes warnings for Java 7 compiler

2012-03-05 Thread Uwe Schindler (Created) (JIRA)
Fix rawtypes warnings for Java 7 compiler
-

 Key: LUCENE-3850
 URL: https://issues.apache.org/jira/browse/LUCENE-3850
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 3.5, 4.0
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 3.6, 4.0


Java 7 changed the warnings a little bit:
- Java 6 only knew unchecked warning type, applying for all types of generics 
violations, like missing generics (raw types)
- Java 7 still knows unchecked but only emits warning if the call is really 
unchecked. Declaration of variables/fields or constructing instances without 
type param now emits rawtypes warning.

The changes above causes the Java 7 compile now emit lots of rawtypes 
warnings, where Java 6 is silent. The easy fix is to suppres both warning 
types: @SuppressWarnings({unchecked,rawtypes}) for all those places. 
Changes are easy to do, will provide patch later!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3850) Fix rawtypes warnings for Java 7 compiler

2012-03-05 Thread Uwe Schindler (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1371#comment-1371
 ] 

Uwe Schindler commented on LUCENE-3850:
---

This also makes the eclipse compiler more silent, because rawtypes was ported 
from the eclipse compiler to Java 7.

 Fix rawtypes warnings for Java 7 compiler
 -

 Key: LUCENE-3850
 URL: https://issues.apache.org/jira/browse/LUCENE-3850
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 3.5, 4.0
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 3.6, 4.0


 Java 7 changed the warnings a little bit:
 - Java 6 only knew unchecked warning type, applying for all types of 
 generics violations, like missing generics (raw types)
 - Java 7 still knows unchecked but only emits warning if the call is really 
 unchecked. Declaration of variables/fields or constructing instances without 
 type param now emits rawtypes warning.
 The changes above causes the Java 7 compile now emit lots of rawtypes 
 warnings, where Java 6 is silent. The easy fix is to suppres both warning 
 types: @SuppressWarnings({unchecked,rawtypes}) for all those places. 
 Changes are easy to do, will provide patch later!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-3154) SolrJ CloudServer should be leader and network aware when adding docs

2012-03-05 Thread Grant Ingersoll (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-3154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Ingersoll updated SOLR-3154:
--

Description: It would be good when indexing if the SolrJ CloudServer was 
leader aware so that we could avoid doing an extra hop for the data.  It would 
also be good if one could easily set things up based on data locality 
principles.  This might mean that CloudServer is aware of where on the network 
it is and would pick leaders that are as close as possible (i.e. local, 
perhaps.)  This would come in to play when working with tools like Hadoop or 
other grid computing frameworks.  (was: It would be good when indexing if the 
SolrJ CloudServer was leader aware so that we could avoid doing an extra hop 
for the data.)
Summary: SolrJ CloudServer should be leader and network aware when 
adding docs  (was: SolrJ CloudServer should be leader aware when adding docs)

 SolrJ CloudServer should be leader and network aware when adding docs
 -

 Key: SOLR-3154
 URL: https://issues.apache.org/jira/browse/SOLR-3154
 Project: Solr
  Issue Type: Improvement
  Components: SolrCloud
Affects Versions: 4.0
Reporter: Grant Ingersoll
Priority: Minor
 Fix For: 4.0


 It would be good when indexing if the SolrJ CloudServer was leader aware so 
 that we could avoid doing an extra hop for the data.  It would also be good 
 if one could easily set things up based on data locality principles.  This 
 might mean that CloudServer is aware of where on the network it is and would 
 pick leaders that are as close as possible (i.e. local, perhaps.)  This would 
 come in to play when working with tools like Hadoop or other grid computing 
 frameworks.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: errata for solr tutorial

2012-03-05 Thread Jan Høydahl
Hi,

Thanks for reporting. This is fixed now on the staging site, will be set live 
soon.

--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Solr Training - www.solrtraining.com

On 1. mars 2012, at 16:50, Nicolai Scheer wrote:

 Hi!
 
 Having just worked through the solr tutorial
 (http://lucene.apache.org/solr/tutorial.html) I think I found two minor
 bugs:
 
 1.
 The delete by query example
 
 java -Ddata=args -jar post.jar deletequeryname:DDR/query/delete
 
 should read
 
 java -Ddata=args -jar post.jar deletequeryname:DDR/query/delete
 
 2.
 The link to the mailing lists at the end of the tutorial seems to be
 dead as of now:
 
 http://lucene.apache.org/solr/mailing_lists.html
 
 Greetings,
 
 Nico


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: errata for solr tutorial

2012-03-05 Thread Jan Høydahl
I checked in the two changes but did not push live, seems like someone else is 
in the process of cleaning up the tutorial too.
http://lucene.staging.apache.org/solr/tutorial.html

--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Solr Training - www.solrtraining.com

On 5. mars 2012, at 13:10, Jan Høydahl wrote:

 Hi,
 
 Thanks for reporting. This is fixed now on the staging site, will be set live 
 soon.
 
 --
 Jan Høydahl, search solution architect
 Cominvent AS - www.cominvent.com
 Solr Training - www.solrtraining.com
 
 On 1. mars 2012, at 16:50, Nicolai Scheer wrote:
 
 Hi!
 
 Having just worked through the solr tutorial
 (http://lucene.apache.org/solr/tutorial.html) I think I found two minor
 bugs:
 
 1.
 The delete by query example
 
 java -Ddata=args -jar post.jar deletequeryname:DDR/query/delete
 
 should read
 
 java -Ddata=args -jar post.jar deletequeryname:DDR/query/delete
 
 2.
 The link to the mailing lists at the end of the tutorial seems to be
 dead as of now:
 
 http://lucene.apache.org/solr/mailing_lists.html
 
 Greetings,
 
 Nico
 
 
 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org
 


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3846) Fuzzy suggester

2012-03-05 Thread Robert Muir (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222303#comment-13222303
 ] 

Robert Muir commented on LUCENE-3846:
-

{quote}
I personally would not try this argument with molecular biology teams, and tell 
them their POM and BLOSUM matrices are worthless or to someone in record 
linkage community (Lucene was used in this context a lot) or ... 
{quote}

Thats ok, I would :)

I don't think we should complicate already-complicated things unless there is 
some clear benefit.

I'm not worried about offending anyone.


 Fuzzy suggester
 ---

 Key: LUCENE-3846
 URL: https://issues.apache.org/jira/browse/LUCENE-3846
 Project: Lucene - Java
  Issue Type: Improvement
Reporter: Michael McCandless
Assignee: Michael McCandless
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3846.patch


 Would be nice to have a suggester that can handle some fuzziness (like spell 
 correction) so that it's able to suggest completions that are near what you 
 typed.
 As a first go at this, I implemented 1T (ie up to 1 edit, including a 
 transposition), except the first letter must be correct.
 But there is a penalty, ie, the corrected suggestion needs to have a much 
 higher freq than the exact match suggestion before it can compete.
 Still tons of nocommits, and somehow we should merge this / make it work with 
 analyzing suggester too (LUCENE-3842).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: [jira] [Commented] (LUCENE-3846) Fuzzy suggester

2012-03-05 Thread eks dev
For that matter, I am worried not to offend anyone, just that type of person :)

But expressing his opinion, just as we did here has nothing to do with
it. Hope you did not read my comments as offending, this was by no
means my intention.

Just do not complain later, I warned you, molecular biologists can be
mean if you touch their matrices :)


(I took it on mailing list, as it adds only noise to Jira)


On Mon, Mar 5, 2012 at 1:38 PM, Robert Muir (Commented) (JIRA)
j...@apache.org wrote:

    [ 
 https://issues.apache.org/jira/browse/LUCENE-3846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222303#comment-13222303
  ]

 Robert Muir commented on LUCENE-3846:
 -

 {quote}
 I personally would not try this argument with molecular biology teams, and 
 tell them their POM and BLOSUM matrices are worthless or to someone in record 
 linkage community (Lucene was used in this context a lot) or ...
 {quote}

 Thats ok, I would :)

 I don't think we should complicate already-complicated things unless there is 
 some clear benefit.

 I'm not worried about offending anyone.


 Fuzzy suggester
 ---

                 Key: LUCENE-3846
                 URL: https://issues.apache.org/jira/browse/LUCENE-3846
             Project: Lucene - Java
          Issue Type: Improvement
            Reporter: Michael McCandless
            Assignee: Michael McCandless
             Fix For: 3.6, 4.0

         Attachments: LUCENE-3846.patch


 Would be nice to have a suggester that can handle some fuzziness (like spell 
 correction) so that it's able to suggest completions that are near what 
 you typed.
 As a first go at this, I implemented 1T (ie up to 1 edit, including a 
 transposition), except the first letter must be correct.
 But there is a penalty, ie, the corrected suggestion needs to have a much 
 higher freq than the exact match suggestion before it can compete.
 Still tons of nocommits, and somehow we should merge this / make it work 
 with analyzing suggester too (LUCENE-3842).

 --
 This message is automatically generated by JIRA.
 If you think it was sent incorrectly, please contact your JIRA 
 administrators: 
 https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
 For more information on JIRA, see: http://www.atlassian.com/software/jira



 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3850) Fix rawtypes warnings for Java 7 compiler

2012-03-05 Thread Uwe Schindler (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler updated LUCENE-3850:
--

Attachment: LUCENE-3850-part1.patch

Patch that fixes most warnings in Lucene Core and modules. Partly also Solr.

I will commit this now as it gets outdated soon. Will then backport / ask 
eclipse to fix 3.x.

 Fix rawtypes warnings for Java 7 compiler
 -

 Key: LUCENE-3850
 URL: https://issues.apache.org/jira/browse/LUCENE-3850
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 3.5, 4.0
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3850-part1.patch


 Java 7 changed the warnings a little bit:
 - Java 6 only knew unchecked warning type, applying for all types of 
 generics violations, like missing generics (raw types)
 - Java 7 still knows unchecked but only emits warning if the call is really 
 unchecked. Declaration of variables/fields or constructing instances without 
 type param now emits rawtypes warning.
 The changes above causes the Java 7 compile now emit lots of rawtypes 
 warnings, where Java 6 is silent. The easy fix is to suppres both warning 
 types: @SuppressWarnings({unchecked,rawtypes}) for all those places. 
 Changes are easy to do, will provide patch later!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3850) Fix rawtypes warnings for Java 7 compiler

2012-03-05 Thread Uwe Schindler (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222344#comment-13222344
 ] 

Uwe Schindler commented on LUCENE-3850:
---

Committed trunk Revision: 1297048

Now backporting...

 Fix rawtypes warnings for Java 7 compiler
 -

 Key: LUCENE-3850
 URL: https://issues.apache.org/jira/browse/LUCENE-3850
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 3.5, 4.0
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3850-part1.patch


 Java 7 changed the warnings a little bit:
 - Java 6 only knew unchecked warning type, applying for all types of 
 generics violations, like missing generics (raw types)
 - Java 7 still knows unchecked but only emits warning if the call is really 
 unchecked. Declaration of variables/fields or constructing instances without 
 type param now emits rawtypes warning.
 The changes above causes the Java 7 compile now emit lots of rawtypes 
 warnings, where Java 6 is silent. The easy fix is to suppres both warning 
 types: @SuppressWarnings({unchecked,rawtypes}) for all those places. 
 Changes are easy to do, will provide patch later!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3850) Fix rawtypes warnings for Java 7 compiler

2012-03-05 Thread Uwe Schindler (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222345#comment-13222345
 ] 

Uwe Schindler commented on LUCENE-3850:
---

Btw, I also changed lots of non-rawtypes warnings like accessing static fields 
on wrong type (subclass or more horrible, with instance). This was also 
producing warning in Java 7.

 Fix rawtypes warnings for Java 7 compiler
 -

 Key: LUCENE-3850
 URL: https://issues.apache.org/jira/browse/LUCENE-3850
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 3.5, 4.0
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3850-part1.patch


 Java 7 changed the warnings a little bit:
 - Java 6 only knew unchecked warning type, applying for all types of 
 generics violations, like missing generics (raw types)
 - Java 7 still knows unchecked but only emits warning if the call is really 
 unchecked. Declaration of variables/fields or constructing instances without 
 type param now emits rawtypes warning.
 The changes above causes the Java 7 compile now emit lots of rawtypes 
 warnings, where Java 6 is silent. The easy fix is to suppres both warning 
 types: @SuppressWarnings({unchecked,rawtypes}) for all those places. 
 Changes are easy to do, will provide patch later!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3838) IndexWriter.maybeMerge() removes deleted documents from index (Lucene 3.1.0 to 3.5.0)

2012-03-05 Thread Ivan Stojanovic (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222347#comment-13222347
 ] 

Ivan Stojanovic commented on LUCENE-3838:
-

Hi Michael,

of course this happens in version 3.1.0 as stated in the title (in parentheses).

Actually, it has never been stated that this is an internal implementation 
detail (if I can remember correctly). I'm very sure that we are not the only 
ones who were relying on this behavior. Also, this backward compatibility break 
wasn't stated in 3.1.0 changes log.

Anyway, we already have an ID field but we can't rely on it for long running 
operations. Suppose that an index export is in progress while there is a bunch 
of add/delete/search operations. Or worse, suppose that a batch delete is in 
progress (driven by a filter criteria) at the same time. I have to say here 
that we are using only one searcher per index here and also we are working with 
farms of indexes with size of 3-5 millions of documents per index. I can't even 
imagine the use of more than one searcher per index here. One searcher per 
index also gives us the best performance which is our top concern. Another 
thing. When an admin performs optimization then the index is locked so no one 
can access it in order to avoid disk overuse.

We also have a deletes filter :)
It is used as a ram filter that buffers deletes using BitSet and occasionally 
flushing this buffer to index (deleting documents marked as deletes). This 
gives us the lightning performance related to both deleting documents and 
search operation relying on a custom Collector wrapping this filter. If we used 
an application filter to skip documents than the search would suffer a 
significant slow down because of communication with an application filter for 
each document retrieved from index. if we do this than our ultra fast Lucene 
driven application loses its sense.

The suggestion with a custom codec sounds very promising. I only don't 
understand if we will have to wait for a Lucene 4.0 release for a custom codec 
implementation (with maybe an API that allows that) or I need to implement it 
for Lucene trunk. If I need to implement it for trunk than can you please give 
me a starting point where to begin from? I must say that I haven't dived deep 
in Lucene merge functionality. Also, can this approach differentiate between 
maybeMerge() and forceMerge(). We need to support document removal in 
forceMerge(), of course.

Greatest regards,
Ivan

 IndexWriter.maybeMerge() removes deleted documents from index (Lucene 3.1.0 
 to 3.5.0)
 -

 Key: LUCENE-3838
 URL: https://issues.apache.org/jira/browse/LUCENE-3838
 Project: Lucene - Java
  Issue Type: Bug
  Components: core/index
Affects Versions: 3.1, 3.2, 3.3, 3.4, 3.5
 Environment: Windows, Linux, OSX
Reporter: Ivan Stojanovic
Priority: Blocker
  Labels: api-change
 Attachments: TempTest.java


 My company uses Lucene for high performance, heavy loaded farms of 
 translation repositories with hundreds of simultaneous 
 add/delete/update/search/retrieve threads. In order to support this complex 
 architecture beside other things and tricks used here I rely on docId-s being 
 unchanged until I ask that explicitly (using IndexWriter.optimize() - 
 IndexWriter.forceMerge()).
 For this behavior LogMergePolicy is used.
 This worked fine until we raised the Lucene version from 3.0.2 to 3.5.0. 
 Until version 3.1.0 merge triggerred by IndexWriter.addDocument() didn't 
 expunge deleted documents ensuring that docId-s stayed unchanged and making 
 some critical jobs possible without impact on index size. 
 IndexWriter.optimize() did the actual deleted documents removal.
 From Lucene version 3.1.0 IndexWriter.maybeMerge() does the same thing as 
 IndexWriter.forceMerge() regarding deleted documents. There is no difference. 
 This leads to unpredictable internal index structure changes during simple 
 document add (and possible delete) operations and in undefined point in time. 
 I looked into the Lucene source code and can definitely confirm this.
 This issue makes our Lucene client code totally unusable.
 Solution steps:
 1) add a flag somewhere that will control whether the deleted documents 
 should be removed in maybeMerge(). Note that this is only a half of what we 
 need here.
 2) make forceMerge() always remove deleted documents no matter if 
 maybeMerge() removes them or not. Alternatively, there can be another 
 parameter added to forceMerge() that will also tell if deleted documents 
 should be removed from index or not.
 The sample JUnit code that can replicate this issue is added below.
 public class TempTest {
 private Analyzer _analyzer = new 

[jira] [Commented] (SOLR-3011) DIH MultiThreaded bug

2012-03-05 Thread Wenca Petr (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222354#comment-13222354
 ] 

Wenca Petr commented on SOLR-3011:
--

The problem was that the first thread that reached the end of data set the 
iterator to null and when other thread called SqlEntityProcessor.nextRow() then 
all the data to be indexed were loaded again and all the documents were 
analysed again because SqlEntityProcessor.nextRow() tests the iterator to be 
null (assuming that null means the beginning of the process). By the change a 
made all the threads then receive null from the iterator, so they know that 
there are no more data. Is this explanation ok? 

 DIH MultiThreaded bug
 -

 Key: SOLR-3011
 URL: https://issues.apache.org/jira/browse/SOLR-3011
 Project: Solr
  Issue Type: Sub-task
  Components: contrib - DataImportHandler
Affects Versions: 3.5, 4.0
Reporter: Mikhail Khludnev
Priority: Minor
 Fix For: 4.0

 Attachments: SOLR-3011.patch, SOLR-3011.patch, 
 patch-3011-EntityProcessorBase-iterator.patch


 current DIH design is not thread safe. see last comments at SOLR-2382 and 
 SOLR-2947. I'm going to provide the patch makes DIH core threadsafe. Mostly 
 it's a SOLR-2947 patch from 28th Dec. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3154) SolrJ CloudServer should be leader and network aware when adding docs

2012-03-05 Thread Mark Miller (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222359#comment-13222359
 ] 

Mark Miller commented on SOLR-3154:
---

The main difficulty here is that to get the readable to indexed id that we 
hash, we need the field type and the id field. The id field we can simply have 
the user supply - but technically each field type can impl a custom method to 
get the indexed id that we hash on. Not sure what we do about this, but 
initially I was thinking we could support one or two field types (int, string) 
and just copy the (simple) logic for these two types into the server impl. Then 
the user would also have to supply the type (int,string) of the id field as 
well - though we could default it to what the example schema uses.

 SolrJ CloudServer should be leader and network aware when adding docs
 -

 Key: SOLR-3154
 URL: https://issues.apache.org/jira/browse/SOLR-3154
 Project: Solr
  Issue Type: Improvement
  Components: SolrCloud
Affects Versions: 4.0
Reporter: Grant Ingersoll
Priority: Minor
 Fix For: 4.0


 It would be good when indexing if the SolrJ CloudServer was leader aware so 
 that we could avoid doing an extra hop for the data.  It would also be good 
 if one could easily set things up based on data locality principles.  This 
 might mean that CloudServer is aware of where on the network it is and would 
 pick leaders that are as close as possible (i.e. local, perhaps.)  This would 
 come in to play when working with tools like Hadoop or other grid computing 
 frameworks.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3847) LuceneTestCase should check for modifications on System properties

2012-03-05 Thread Mark Miller (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222361#comment-13222361
 ] 

Mark Miller commented on LUCENE-3847:
-

This is great Dawid, thanks!

 LuceneTestCase should check for modifications on System properties
 --

 Key: LUCENE-3847
 URL: https://issues.apache.org/jira/browse/LUCENE-3847
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/test
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3847.patch


 - fail the test if changes have been detected.
 - revert the state of system properties before the suite.
 - cleanup after the suite.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3778) Create a grouping convenience class

2012-03-05 Thread Martijn van Groningen (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222368#comment-13222368
 ] 

Martijn van Groningen commented on LUCENE-3778:
---

{quote}
One can actually use GroupingSearch in a shard'd env, on each shard,
right? It's just that then you merge them like normal on the front
end (ie, TopGroups.merge). Is that the only reason for the ... in a
non distributed environment javadoc warning?
{quote}
In a sharded env one needs to first execute all the first pass collector on all 
shards, merge the returned search groups, execute the second pass search (with 
the top N merged search groups as argument) on most of the shards and finally 
merge the TopGroups from all shards into a topN TopGroups. Also grouping by 
docblock and grouping features like allGroups and groupHead don't work in a 
normal sharded environment (unless you partition the groups properly). The docs 
caching also only makes sense when performing grouping on a local index. This 
usage is very different then non distributed grouping, that is why I think it 
is better to have a separate grouping convenience class for distributed 
grouping (DistributedGroupSearch?). 

bq. Maybe the ctor should take docValuesType / diskResidentValues
Makes sense!

bq. Maybe you should pass the groupSort, groupsOffset, groupsLimit to the 
search method (instead of setters)?
Maybe we just should have defaults for these options? Sort.RELEVANCE, 0 and 10?

 Create a grouping convenience class
 ---

 Key: LUCENE-3778
 URL: https://issues.apache.org/jira/browse/LUCENE-3778
 Project: Lucene - Java
  Issue Type: Improvement
  Components: modules/grouping
Reporter: Martijn van Groningen
 Attachments: LUCENE-3778.patch


 Currently the grouping module has many collector classes with a lot of 
 different options per class. I think it would be a good idea to have a 
 GroupUtil (Or another name?) convenience class. I think this could be a 
 builder, because of the many options 
 (sort,sortWithinGroup,groupOffset,groupCount and more) and implementations 
 (term/dv/function) grouping has.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3850) Fix rawtypes warnings for Java 7 compiler

2012-03-05 Thread Uwe Schindler (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler updated LUCENE-3850:
--

Attachment: LUCENE-3850-part1-branch3x.patch

Patch for 3.x (affecting also Lucene Core mainly, parts of Solr). Still contrib 
is (like in trunk) on TODO list (especially grouping!).

Will commit now.

 Fix rawtypes warnings for Java 7 compiler
 -

 Key: LUCENE-3850
 URL: https://issues.apache.org/jira/browse/LUCENE-3850
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 3.5, 4.0
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3850-part1-branch3x.patch, LUCENE-3850-part1.patch


 Java 7 changed the warnings a little bit:
 - Java 6 only knew unchecked warning type, applying for all types of 
 generics violations, like missing generics (raw types)
 - Java 7 still knows unchecked but only emits warning if the call is really 
 unchecked. Declaration of variables/fields or constructing instances without 
 type param now emits rawtypes warning.
 The changes above causes the Java 7 compile now emit lots of rawtypes 
 warnings, where Java 6 is silent. The easy fix is to suppres both warning 
 types: @SuppressWarnings({unchecked,rawtypes}) for all those places. 
 Changes are easy to do, will provide patch later!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3850) Fix rawtypes warnings for Java 7 compiler

2012-03-05 Thread Uwe Schindler (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222374#comment-13222374
 ] 

Uwe Schindler commented on LUCENE-3850:
---

Committed 3.x revision: 1297074

I will keep this issue open for more cleanups.

 Fix rawtypes warnings for Java 7 compiler
 -

 Key: LUCENE-3850
 URL: https://issues.apache.org/jira/browse/LUCENE-3850
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 3.5, 4.0
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3850-part1-branch3x.patch, LUCENE-3850-part1.patch


 Java 7 changed the warnings a little bit:
 - Java 6 only knew unchecked warning type, applying for all types of 
 generics violations, like missing generics (raw types)
 - Java 7 still knows unchecked but only emits warning if the call is really 
 unchecked. Declaration of variables/fields or constructing instances without 
 type param now emits rawtypes warning.
 The changes above causes the Java 7 compile now emit lots of rawtypes 
 warnings, where Java 6 is silent. The easy fix is to suppres both warning 
 types: @SuppressWarnings({unchecked,rawtypes}) for all those places. 
 Changes are easy to do, will provide patch later!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2719) REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing fl=my-field-name type usecases

2012-03-05 Thread Yonik Seeley (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222379#comment-13222379
 ] 

Yonik Seeley commented on SOLR-2719:


I've been saying for a while that using roughly java identifiers for field 
names was best practice, but we should document it somewhere.

I don't think we should change StrParser.getId to be more permissive though - 
that will just cause more problems in the future (say when we want to start 
adding infix and want a-b to be a minus b.  There's not a regression in that 
specific code since the function parser has never accepted - as part of a 
field name.

 REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing 
 fl=my-field-name type usecases
 -

 Key: SOLR-2719
 URL: https://issues.apache.org/jira/browse/SOLR-2719
 Project: Solr
  Issue Type: Bug
  Components: search
Affects Versions: 4.0
Reporter: Nik V. Babichev
Priority: Blocker
  Labels: field, fl, query, queryparser
 Fix For: 4.0

 Attachments: SOLR-2719.patch


 fl=my-hyphen-field in query params parsed as my instead of 
 my-hyphen-field.
 OAS.search.ReturnFields use method getId() from OAS.search.QueryParsing
 in which check chars if (!Character.isJavaIdentifierPart(ch)  ch != '.')
 Hyphen is not JavaIdentifierPart and this check break when first - is found.
 This problem solve by passing '-' to check:
 if (!Character.isJavaIdentifierPart(ch)  ch != '.'  ch != '-') break;
 But I don't know how it can affect on whole project.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3134) Include shard Information in response

2012-03-05 Thread Russell Black (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222381#comment-13222381
 ] 

Russell Black commented on SOLR-3134:
-

Ryan, 
Are you still planning to commit the 3x patch?  We'd love to have this feature 
in 3.6. 

 Include shard Information in response
 -

 Key: SOLR-3134
 URL: https://issues.apache.org/jira/browse/SOLR-3134
 Project: Solr
  Issue Type: Improvement
  Components: SearchComponents - other
Reporter: Ryan McKinley
Assignee: Ryan McKinley
 Fix For: 4.0

 Attachments: SOLR-3134-shard-info-3_5-backport.patch, 
 SOLR-3134-shard-info-3_x-backport.patch, SOLR-3134-shard-info-fix.patch, 
 SOLR-3134-shard-info.patch


 For distributed search where each shard represents a logically different 
 index (or physical location), it would be great to know the hit count for 
 each shard.
 In addition, it would be nice to get error info for each shard rather then 
 aborting the whole request when something fails.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2719) REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing fl=my-field-name type usecases

2012-03-05 Thread Luca Cavanna (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222394#comment-13222394
 ] 

Luca Cavanna commented on SOLR-2719:


Yonik, I see your point. On the other hand, the dash is a widely used character 
within field names. The regression is on the Solr behaviour, and I think it's 
pretty annoying from a user perspective.

Anyway, if that's the direction of the project, no problem. What matters more 
than anything else is consistency. We should document it somewhere as you 
wrote, but I'd also propose a field names validation at Solr startup, using the 
StrParser rules, so that Solr accepts only allowed field names and can 
guarantee the proper behaviour with all allowed field names.

What do you think?

 REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing 
 fl=my-field-name type usecases
 -

 Key: SOLR-2719
 URL: https://issues.apache.org/jira/browse/SOLR-2719
 Project: Solr
  Issue Type: Bug
  Components: search
Affects Versions: 4.0
Reporter: Nik V. Babichev
Priority: Blocker
  Labels: field, fl, query, queryparser
 Fix For: 4.0

 Attachments: SOLR-2719.patch


 fl=my-hyphen-field in query params parsed as my instead of 
 my-hyphen-field.
 OAS.search.ReturnFields use method getId() from OAS.search.QueryParsing
 in which check chars if (!Character.isJavaIdentifierPart(ch)  ch != '.')
 Hyphen is not JavaIdentifierPart and this check break when first - is found.
 This problem solve by passing '-' to check:
 if (!Character.isJavaIdentifierPart(ch)  ch != '.'  ch != '-') break;
 But I don't know how it can affect on whole project.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2719) REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing fl=my-field-name type usecases

2012-03-05 Thread Mark Miller (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222401#comment-13222401
 ] 

Mark Miller commented on SOLR-2719:
---

bq. but I'd also propose a field names validation at Solr startup,

+1 - rather than playing loosey goosey on what's a valid field name, we should 
doc and validate for it explicitly.

 REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing 
 fl=my-field-name type usecases
 -

 Key: SOLR-2719
 URL: https://issues.apache.org/jira/browse/SOLR-2719
 Project: Solr
  Issue Type: Bug
  Components: search
Affects Versions: 4.0
Reporter: Nik V. Babichev
Priority: Blocker
  Labels: field, fl, query, queryparser
 Fix For: 4.0

 Attachments: SOLR-2719.patch


 fl=my-hyphen-field in query params parsed as my instead of 
 my-hyphen-field.
 OAS.search.ReturnFields use method getId() from OAS.search.QueryParsing
 in which check chars if (!Character.isJavaIdentifierPart(ch)  ch != '.')
 Hyphen is not JavaIdentifierPart and this check break when first - is found.
 This problem solve by passing '-' to check:
 if (!Character.isJavaIdentifierPart(ch)  ch != '.'  ch != '-') break;
 But I don't know how it can affect on whole project.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2719) REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing fl=my-field-name type usecases

2012-03-05 Thread Yonik Seeley (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222408#comment-13222408
 ] 

Yonik Seeley commented on SOLR-2719:


bq. Yonik, I see your point. On the other hand, the dash is a widely used 
character within field names. The regression is on the Solr behaviour, and I 
think it's pretty annoying from a user perspective.

The easiest way to handle this would be this code in ReturnFields:
{code}

// short circuit test for a really simple field name
String key = null;
String field = sp.getId(null);
char ch = sp.ch();
{code}

Instead of using getId, we should hand-roll something that also accepts - as 
part of the field name.  That would leave function parser (and other users of 
getId) alone, but allow fieldnames with dashes in the fl param.

bq. What matters more than anything else is consistency.

If we really want to go for consistency, then we should not accept - anywhere 
(rather than attempting to expand it to everywhere).

 REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing 
 fl=my-field-name type usecases
 -

 Key: SOLR-2719
 URL: https://issues.apache.org/jira/browse/SOLR-2719
 Project: Solr
  Issue Type: Bug
  Components: search
Affects Versions: 4.0
Reporter: Nik V. Babichev
Priority: Blocker
  Labels: field, fl, query, queryparser
 Fix For: 4.0

 Attachments: SOLR-2719.patch


 fl=my-hyphen-field in query params parsed as my instead of 
 my-hyphen-field.
 OAS.search.ReturnFields use method getId() from OAS.search.QueryParsing
 in which check chars if (!Character.isJavaIdentifierPart(ch)  ch != '.')
 Hyphen is not JavaIdentifierPart and this check break when first - is found.
 This problem solve by passing '-' to check:
 if (!Character.isJavaIdentifierPart(ch)  ch != '.'  ch != '-') break;
 But I don't know how it can affect on whole project.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-3201) Recurrence resolver into VariableResolver

2012-03-05 Thread adrien ruffie (Created) (JIRA)
Recurrence resolver into VariableResolver
-

 Key: SOLR-3201
 URL: https://issues.apache.org/jira/browse/SOLR-3201
 Project: Solr
  Issue Type: Bug
 Environment: solr 3.1 solr dataimport 3.3.0
Reporter: adrien ruffie


Hello I have the following delta query: 

${dataimporter.functions.formatDate(${dataimporter.last_index_time}, 'MMdd 
HH:mm:ss:SSS')}

but during the VariableResolverImpl.resolve() the following action 
String[] parts = DOT_SPLIT.split(name, 0);
CURRENT_VARIABLE_RESOLVER.set(this);

split with '.' but the two recursive calling to ${dataimporter.last_index_time} 
in the first ${dataimporter.functions.formatDate()} isn't take into account.

Do you have a work around ?

Thank you

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3778) Create a grouping convenience class

2012-03-05 Thread Martijn van Groningen (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martijn van Groningen updated LUCENE-3778:
--

Attachment: LUCENE-3778.patch

Updated patch.
* Added jdocs.
* allGroups and allGroupHeads methods are implemented now.

 Create a grouping convenience class
 ---

 Key: LUCENE-3778
 URL: https://issues.apache.org/jira/browse/LUCENE-3778
 Project: Lucene - Java
  Issue Type: Improvement
  Components: modules/grouping
Reporter: Martijn van Groningen
 Attachments: LUCENE-3778.patch, LUCENE-3778.patch


 Currently the grouping module has many collector classes with a lot of 
 different options per class. I think it would be a good idea to have a 
 GroupUtil (Or another name?) convenience class. I think this could be a 
 builder, because of the many options 
 (sort,sortWithinGroup,groupOffset,groupCount and more) and implementations 
 (term/dv/function) grouping has.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2719) REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing fl=my-field-name type usecases

2012-03-05 Thread Luca Cavanna (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222443#comment-13222443
 ] 

Luca Cavanna commented on SOLR-2719:


How about trying to achieve both? I mean, are there many other places where we 
should do the same (adding the dash support)? QueryParsing#parseSort has the 
same problem. Anything else? I'm probably missing something.

Depending on where we need to add support for dash to add consistency, I would 
try to add support for the trailing dash here for backward compatibility (I'd 
have a patch almost ready), and work on validation as well.


 REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing 
 fl=my-field-name type usecases
 -

 Key: SOLR-2719
 URL: https://issues.apache.org/jira/browse/SOLR-2719
 Project: Solr
  Issue Type: Bug
  Components: search
Affects Versions: 4.0
Reporter: Nik V. Babichev
Priority: Blocker
  Labels: field, fl, query, queryparser
 Fix For: 4.0

 Attachments: SOLR-2719.patch


 fl=my-hyphen-field in query params parsed as my instead of 
 my-hyphen-field.
 OAS.search.ReturnFields use method getId() from OAS.search.QueryParsing
 in which check chars if (!Character.isJavaIdentifierPart(ch)  ch != '.')
 Hyphen is not JavaIdentifierPart and this check break when first - is found.
 This problem solve by passing '-' to check:
 if (!Character.isJavaIdentifierPart(ch)  ch != '.'  ch != '-') break;
 But I don't know how it can affect on whole project.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-2894) Implement distributed pivot faceting

2012-03-05 Thread Dan Cooper (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dan Cooper updated SOLR-2894:
-

Attachment: distribPatch-05-03-12.txt

 Implement distributed pivot faceting
 

 Key: SOLR-2894
 URL: https://issues.apache.org/jira/browse/SOLR-2894
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.0
Reporter: Erik Hatcher
 Attachments: distribPatch-05-03-12.txt


 Following up on SOLR-792, pivot faceting currently only supports 
 undistributed mode.  Distributed pivot faceting needs to be implemented.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2719) REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing fl=my-field-name type usecases

2012-03-05 Thread Yonik Seeley (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222449#comment-13222449
 ] 

Yonik Seeley commented on SOLR-2719:


bq. How about trying to achieve both? I mean, are there many other places where 
we should do the same (adding the dash support)?

I think this regression is limited to fl since that code was changed to 
support pseudo-fields.

bq. QueryParsing#parseSort has the same problem.

I just tried trunk with sort=a-b_s desc and it seemed to work fine.

 REGRESSION ReturnFields incorrect parse fields with hyphen - breaks existing 
 fl=my-field-name type usecases
 -

 Key: SOLR-2719
 URL: https://issues.apache.org/jira/browse/SOLR-2719
 Project: Solr
  Issue Type: Bug
  Components: search
Affects Versions: 4.0
Reporter: Nik V. Babichev
Priority: Blocker
  Labels: field, fl, query, queryparser
 Fix For: 4.0

 Attachments: SOLR-2719.patch


 fl=my-hyphen-field in query params parsed as my instead of 
 my-hyphen-field.
 OAS.search.ReturnFields use method getId() from OAS.search.QueryParsing
 in which check chars if (!Character.isJavaIdentifierPart(ch)  ch != '.')
 Hyphen is not JavaIdentifierPart and this check break when first - is found.
 This problem solve by passing '-' to check:
 if (!Character.isJavaIdentifierPart(ch)  ch != '.'  ch != '-') break;
 But I don't know how it can affect on whole project.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2894) Implement distributed pivot faceting

2012-03-05 Thread Dan Cooper (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222450#comment-13222450
 ] 

Dan Cooper commented on SOLR-2894:
--

Added a patch to provide distributed pivot faceting.  We've been running this 
code for a while now and it seems to work OK, also created a unit test to test 
distributed pivot faceting on a small set of data.

The patch was created against Solr trunk revision 1297102.

It should perform in much the same way as single shard pivot faceting.  It only 
sorts by count if you specify that option otherwise it returns results in the 
order they were generated (may be useful is performance is important but 
ordering is not).  Most will want to specify facet.sort=count.  This patch also 
supports limiting results using facet.limit.

To do the merge I'm converting the NamedList objects that get returned by each 
shard in a giant map (should be more efficient for merging the results) and 
then converting back into a NamedList when the merge is complete.  This merge 
should support N depth pivots but I've only properly tested a depth of 2.

I've added some new parameters to support the features we require from pivot 
faceting and thought they may as well go in the patch in case others find them 
useful.

* facet.pivot.limit.method
** set to 'combined' if you want only the N number of top results to be 
returned across all pivots, where N is set by facet.limit. e.g. if you pivoted 
by country,manufacturer and limited by 5, obviously the top 5 countries would 
be returned, but only the top 5 manufacturers by combined total would be 
returned too. e.g. Each country would return the same 5 manufacturers (or less 
if no results).

* facet.pivot.limit.ignore
** Ignores the specified field from the limiting operations. e.g. if you 
pivoted by country,manufacturer and limited by 5 and set 
facet.pivot.limit.ignore=country then you would get all available countries 
returned (not limited) but only 5 manufacturers for each country.

Can someone test the patch and give some feedback?

 Implement distributed pivot faceting
 

 Key: SOLR-2894
 URL: https://issues.apache.org/jira/browse/SOLR-2894
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.0
Reporter: Erik Hatcher
 Attachments: distribPatch-05-03-12.txt


 Following up on SOLR-792, pivot faceting currently only supports 
 undistributed mode.  Distributed pivot faceting needs to be implemented.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-2894) Implement distributed pivot faceting

2012-03-05 Thread Dan Cooper (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dan Cooper updated SOLR-2894:
-

Attachment: (was: distribPatch-05-03-12.txt)

 Implement distributed pivot faceting
 

 Key: SOLR-2894
 URL: https://issues.apache.org/jira/browse/SOLR-2894
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.0
Reporter: Erik Hatcher

 Following up on SOLR-792, pivot faceting currently only supports 
 undistributed mode.  Distributed pivot faceting needs to be implemented.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-2894) Implement distributed pivot faceting

2012-03-05 Thread Dan Cooper (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dan Cooper updated SOLR-2894:
-

Attachment: (was: SOLR-2894.patch)

 Implement distributed pivot faceting
 

 Key: SOLR-2894
 URL: https://issues.apache.org/jira/browse/SOLR-2894
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.0
Reporter: Erik Hatcher

 Following up on SOLR-792, pivot faceting currently only supports 
 undistributed mode.  Distributed pivot faceting needs to be implemented.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-2894) Implement distributed pivot faceting

2012-03-05 Thread Dan Cooper (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dan Cooper updated SOLR-2894:
-

Attachment: SOLR-2894.patch

 Implement distributed pivot faceting
 

 Key: SOLR-2894
 URL: https://issues.apache.org/jira/browse/SOLR-2894
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.0
Reporter: Erik Hatcher

 Following up on SOLR-792, pivot faceting currently only supports 
 undistributed mode.  Distributed pivot faceting needs to be implemented.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-2894) Implement distributed pivot faceting

2012-03-05 Thread Dan Cooper (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dan Cooper updated SOLR-2894:
-

Attachment: SOLR-2894.patch

 Implement distributed pivot faceting
 

 Key: SOLR-2894
 URL: https://issues.apache.org/jira/browse/SOLR-2894
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.0
Reporter: Erik Hatcher
 Attachments: SOLR-2894.patch


 Following up on SOLR-792, pivot faceting currently only supports 
 undistributed mode.  Distributed pivot faceting needs to be implemented.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2660) omitPositions improvements

2012-03-05 Thread Robert Muir (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222461#comment-13222461
 ] 

Robert Muir commented on SOLR-2660:
---

I think this could be a good option (in combination with shingles as 
mentioned), to accelerate 
the phrase queries that solr query parsers generate in order to boost closer 
matches.

Again the idea is to omit positions entirely, and instead use shinglefilter 
(unigrams and bigrams), approximating phrase 
queries with n-gram conjunctions. I think for the sloppy case, we should use an 
n-gram disjunction, perhaps interpreting 
slop factor as minNrShouldmatch?

This basically means you are substituting levenshtein distance for an n-gram 
approximation in both cases.

In general its a classic indexing/search tradeoff, in my tests on wikipedia 
indexing takes ~ twice as long with the shingles,
but the tradeoff is that for a lot of these use cases you don't need to consult 
the positions file at all.

As a parameter to the fieldtype its easily pluggable without messing with any 
queryparsers, and ordinary queries (term, boolean, etc)
are totally 'pass-thru', *however* the thing I guess I don't like about this 
patch is the fact that this is really a different 
'query intent', in other words, I think its a perfect approach when you just 
want to boost scores of close matches 
(e.g. when generated by dismax queryparser), but when your 'intent' is to 
actually limit matches to a phrase 
(e.g. when keyed in by a user directly), then this approximation isn't as good 
of a fit.

Either way I'm open to other opinions before doing anything (if we decide to do 
it, next step I think is to update the patch with 
the SloppyPhraseQuery approximation).


 omitPositions improvements
 --

 Key: SOLR-2660
 URL: https://issues.apache.org/jira/browse/SOLR-2660
 Project: Solr
  Issue Type: Improvement
Affects Versions: 3.3, 4.0
Reporter: Robert Muir
Priority: Minor
 Attachments: SOLR-2660.patch


 followup to LUCENE-2048:
 Adds factory methods getPhraseQuery/getMultiPhraseQuery to QP, this way you 
 can subclass it and customize behavior, particularly
 * by default, Solr throws exception here if the fieldtype omits positions: 
 rather than 3.x's silent failure of no results, and even for trunk its nicer 
 to fail during query parsing rather than waiting for lucene's failure during 
 execution.
 * adds phraseAsBoolean, which allows you to downgrade these 
 phrase/multiphrase queries to boolean queries: this is a nice option in 
 conjunction with our word n-gram filters (shingle/commongrams/etc)for a fast 
 approximation, if your application can tolerate some false positives, e.g. 
 foo bar - termQuery(foo_bar), foo bar baz - BQ(foo_bar AND bar_baz)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: Indexing Boolean Expressions

2012-03-05 Thread J. Delgado
I looked at LUCENE-2987 and its work on the query side (changes to the
accepted syntax to accept lower case 'or' and 'and'), which isn't really
related to my proposal.

What I'm proposing is to be able to index complex boolean expressions using
Lucene. This can be viewed as the opposite of the regular search task. The
objective here is find a set of relevant queries given a document
(assignment of values to fields).

This by itself may not sound that interesting but its a key piece
to efficiently implementing any MATCHING system which is effectively a
two-way search where constraints are defined both-ways. An example of this
would be:

1) Job matching: Potential employers define their job posting as a
documents along with complex boolean expressions used to narrow potential
candidates. Job searchers upload their profile and may formulate complex
queries when executing a search. Once a is search initiated from any of the
sides constraints need to satisfied both ways.
2) Advertising: Publishers define constraints on the type of
advertisers/ads they are willing to show in their sites. On the other hand,
advertisers define constraints (typically at the campaign level) on
publisher sites they want their ads to show at as well as on the user
audiences they are targeting to. While some attribute values are known at
definition time, others are only instantiated once the user visits a given
page which triggers a matching request that must be satisfied in
few milliseconds to select valid ads and then scored based on relevance.

So in a matching system a MATCH QUERY is considered to to be a tuple that
consists of a value assignment to attributes/fields (doc) + a boolean
expression (query) that goes against a double index also built on tuples
that  simultaneously boolean expressions and associated documents.

To do this efficiently we need to be able to build indexes on Boolean
expressions (Lucene queries) and retrieve the set of matching expressions
given a doc (typically few attributes with values assigned), which is the
core of what is described in this paper: Indexing Boolean Expressions
(See http://www.vldb.org/pvldb/2/vldb09-83.pdf)

-- J


So to effectively resolve the problem of realtime matching one can

On Tue, Feb 21, 2012 at 2:18 PM, Joe Cabrera calcmaste...@gmail.com wrote:

  On 02/21/2012 12:15 PM, Aayush Kothari wrote:




  So if Aayush Kothari is interested in working on this as a Student, all
 we need is a formal mentor (I can be the informal one).

  Anyone up for the task?


   Completely interested in working for and learning about the
 aforementioned subject/project. +1.

 This may be related to the work I'm doing with LUCENE-2987
 Basically changing the grammar to accepts conjunctions AND and OR in the
 query text.
 I would be interested in working with you on some of the details.

 However, I too am not a formal committer.

 --
 Joe Cabreraeminorlabs.com




[jira] [Commented] (LUCENE-3795) Replace spatial contrib module with LSP's spatial-lucene module

2012-03-05 Thread Adam Estrada (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222481#comment-13222481
 ] 

Adam Estrada commented on LUCENE-3795:
--

Yes...why not grow SIS seeing as its already in Apache Incubation? If another 
project is launched with the same intention as SIS, wouldn't that mean there is 
the potential for duplicate technologies sitting on an Apache server? There are 
a ton of different components to building out a proper geospatial core 
technology. GDAL/OGR, OpenLayers and the database storage techs therein 
represent some of the high points to operating a full GIS. See Wikipedia 
article (http://en.wikipedia.org/wiki/Geographic_information_system) and note 
that in order for there to be a full-fledged GIS, there has to be several 
mutually exclusive components that work in harmony with each other. I see 
Spatial4J being a great new way to implement geometric functions and storage 
within a Lucene index. That leaves the format/projection support, visualization 
and management components. Wouldn't it be nice to roll all of these fundamental 
concepts in to a single project like SIS? I am looking at the big picture 
rather than specific or individual components.

w/r,
Adam

 Replace spatial contrib module with LSP's spatial-lucene module
 ---

 Key: LUCENE-3795
 URL: https://issues.apache.org/jira/browse/LUCENE-3795
 Project: Lucene - Java
  Issue Type: New Feature
  Components: modules/spatial
Reporter: David Smiley
Assignee: David Smiley
 Fix For: 4.0


 I propose that Lucene's spatial contrib module be replaced with the 
 spatial-lucene module within Lucene Spatial Playground (LSP).  LSP has been 
 in development for approximately 1 year by David Smiley, Ryan McKinley, and 
 Chris Male and we feel it is ready.  LSP is here: 
 http://code.google.com/p/lucene-spatial-playground/  and the spatial-lucene 
 module is intuitively in svn/trunk/spatial-lucene/.
 I'll add more comments to prevent the issue description from being too long.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3850) Fix rawtypes warnings for Java 7 compiler

2012-03-05 Thread Uwe Schindler (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler updated LUCENE-3850:
--

Attachment: LUCENE-3850-part2.patch

2nd part of checnges for trunk. Grouping module is unfixable without knowledge 
on the generics. We should add rawtypes everywhere :-)

 Fix rawtypes warnings for Java 7 compiler
 -

 Key: LUCENE-3850
 URL: https://issues.apache.org/jira/browse/LUCENE-3850
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 3.5, 4.0
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3850-part1-branch3x.patch, 
 LUCENE-3850-part1.patch, LUCENE-3850-part2.patch


 Java 7 changed the warnings a little bit:
 - Java 6 only knew unchecked warning type, applying for all types of 
 generics violations, like missing generics (raw types)
 - Java 7 still knows unchecked but only emits warning if the call is really 
 unchecked. Declaration of variables/fields or constructing instances without 
 type param now emits rawtypes warning.
 The changes above causes the Java 7 compile now emit lots of rawtypes 
 warnings, where Java 6 is silent. The easy fix is to suppres both warning 
 types: @SuppressWarnings({unchecked,rawtypes}) for all those places. 
 Changes are easy to do, will provide patch later!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3851) TestTermInfosReaderIndex failing (always reproducible)

2012-03-05 Thread Dawid Weiss (Created) (JIRA)
TestTermInfosReaderIndex failing (always reproducible)
--

 Key: LUCENE-3851
 URL: https://issues.apache.org/jira/browse/LUCENE-3851
 Project: Lucene - Java
  Issue Type: Bug
Reporter: Dawid Weiss
Assignee: Robert Muir
Priority: Minor
 Fix For: 4.0


Always fails on branch (use reproduce string below):
git clone --depth 1 -b rr g...@github.com:dweiss/lucene_solr.git

{noformat}
[junit4] Running org.apache.lucene.codecs.lucene3x.TestTermInfosReaderIndex
[junit4] FAILURE 0.04s J0 | TestTermInfosReaderIndex.testSeekEnum
[junit4] Throwable #1: java.lang.AssertionError: 
expected:field9:z91ob3wozm6d but was::
[junit4]   at 
__randomizedtesting.SeedInfo.seed([C7597DFBBE0B3D7D:C6D9CEDD0700AAFF]:0)
[junit4]   at org.junit.Assert.fail(Assert.java:93)
[junit4]   at org.junit.Assert.failNotEquals(Assert.java:647)
[junit4]   at org.junit.Assert.assertEquals(Assert.java:128)
[junit4]   at org.junit.Assert.assertEquals(Assert.java:147)
[junit4]   at 
org.apache.lucene.codecs.lucene3x.TestTermInfosReaderIndex.testSeekEnum(TestTermInfosReaderIndex.java:137)
[junit4]   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
[junit4]   at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
[junit4]   at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
[junit4]   at java.lang.reflect.Method.invoke(Method.java:597)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1766)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner.access$1000(RandomizedRunner.java:141)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner$6.evaluate(RandomizedRunner.java:728)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner$7.evaluate(RandomizedRunner.java:789)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner$8.evaluate(RandomizedRunner.java:803)
[junit4]   at 
org.apache.lucene.util.LuceneTestCase$SubclassSetupTeardownRule$1.evaluate(LuceneTestCase.java:744)
[junit4]   at 
org.apache.lucene.util.LuceneTestCase$InternalSetupTeardownRule$1.evaluate(LuceneTestCase.java:636)
[junit4]   at 
org.apache.lucene.util.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:22)
[junit4]   at 
org.apache.lucene.util.LuceneTestCase$TestResultInterceptorRule$1.evaluate(LuceneTestCase.java:550)
[junit4]   at 
org.apache.lucene.util.LuceneTestCase$RememberThreadRule$1.evaluate(LuceneTestCase.java:600)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner.runSingleTest(RandomizedRunner.java:735)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner.access$600(RandomizedRunner.java:141)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner$3$1.run(RandomizedRunner.java:586)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner$3.evaluate(RandomizedRunner.java:605)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner$4.evaluate(RandomizedRunner.java:641)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner$5.evaluate(RandomizedRunner.java:652)
[junit4]   at 
org.apache.lucene.util.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:22)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner.runSuite(RandomizedRunner.java:533)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner.access$400(RandomizedRunner.java:141)
[junit4]   at 
com.carrotsearch.randomizedtesting.RandomizedRunner$2.run(RandomizedRunner.java:479)
[junit4] 
[junit4]   2 NOTE: reproduce with: ant test 
-Dtests.filter=*.TestTermInfosReaderIndex -Dtests.filter.method=testSeekEnum 
-Drt.seed=C7597DFBBE0B3D7D -Dargs=-Dfile.encoding=UTF-8
[junit4]   2
[junit4] (@AfterClass output)
[junit4]   2 NOTE: test params are: codec=Appending, sim=DefaultSimilarity, 
locale=en, timezone=Atlantic/Stanley
[junit4]   2 NOTE: all tests run in this JVM:
[junit4]   2 [TestLock, TestFileSwitchDirectory, TestWildcardRandom, 
TestVersionComparator, TestTermdocPerf, TestBitVector, TestParallelTermEnum, 
TestSimpleSearchEquivalence, TestNumericRangeQuery64, TestSort, TestIsCurrent, 
TestToken, TestIntBlockCodec, TestDocumentsWriterDeleteQueue, TestPagedBytes, 
TestThreadedForceMerge, TestOmitTf, TestSegmentTermEnum, TestIndexWriterConfig, 
TestCheckIndex, TestTermVectorsWriter, TestNumericTokenStream, TestSearchAfter, 
TestRegexpQuery, InBeforeClass, InAfterClass, InTestMethod, 
NonStringProperties, TestIndexWriterMergePolicy, TestVirtualMethod, 
TestFieldCache, TestSurrogates, TestSegmentTermDocs, 
TestMultiValuedNumericRangeQuery, TestBasicOperations, TestCodecs, 
TestDateSort, TestPositiveScoresOnlyCollector, 

[jira] [Commented] (LUCENE-3749) Similarity.java javadocs and simplifications for 4.0

2012-03-05 Thread Neil Hooey (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222521#comment-13222521
 ] 

Neil Hooey commented on LUCENE-3749:


Thanks Robert, I've got it working now. I just set my default similarity to 
{{SchemaSimilarityFactory}} and it works just as it did before.

 Similarity.java javadocs and simplifications for 4.0
 

 Key: LUCENE-3749
 URL: https://issues.apache.org/jira/browse/LUCENE-3749
 Project: Lucene - Java
  Issue Type: Task
Affects Versions: 4.0
Reporter: Robert Muir
Assignee: Robert Muir
 Fix For: 4.0

 Attachments: LUCENE-3749.patch, LUCENE-3749_part2.patch


 As part of adding additional scoring systems to lucene, we made a lower-level 
 Similarity
 and the existing stuff became e.g. TFIDFSimilarity which extends it.
 However, I always feel bad about the complexity introduced here (though I do 
 feel there
 are some excuses, that its a difficult challenge).
 In order to try to mitigate this, we also exposed an easier API 
 (SimilarityBase) on top of 
 it that makes some assumptions (and trades off some performance) to try to 
 provide something 
 consumable for e.g. experiments.
 Still, we can cleanup a few things with the low-level api: fix outdated 
 documentation and
 shoot for better/clearer naming etc.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3821) SloppyPhraseScorer sometimes misses documents that ExactPhraseScorer finds.

2012-03-05 Thread Doron Cohen (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Doron Cohen updated LUCENE-3821:


Attachment: LUCENE-3821.patch

Attached updated patch. 

Repeating PPs with multi-Phrase-query is handled as well.

This called for more cases in the sloppy phrase scorer and more code, and, 
although I think the code is cleaner now, I don't know to what extent is it 
easier to maintain. 

It definitely fixes wrong behavior that exists in current 3x and trunk (patch 
is for 3x).

However, although the random test passes for me even with -Dtests.iter=2000, it 
is possible to break the scorer - that is, create a document and a query 
which should match each other but would not. 

The patch adds just such a case as an @Ignored test case:  
TestMultiPhraseQuery.testMultiSloppyWithRepeats(). 

I don't see how to solve this specific case in the context of current sloppy 
phrase scorer. 

So there are 3 options:
# leave things as they are
# commit this patch and for now document the failing scenario (also keep it in 
the ignored test case). 
# devise a different algorithm for this.

I would love it to be the 3rd if I just knew how to do it. Otherwise I like the 
2nd, just need to keep in mind that the random test might from time to time 
create this scenario and so there will be noise in the test builds.

Preferences?

 SloppyPhraseScorer sometimes misses documents that ExactPhraseScorer finds.
 ---

 Key: LUCENE-3821
 URL: https://issues.apache.org/jira/browse/LUCENE-3821
 Project: Lucene - Java
  Issue Type: Bug
Affects Versions: 3.5, 4.0
Reporter: Naomi Dushay
Assignee: Doron Cohen
 Attachments: LUCENE-3821.patch, LUCENE-3821.patch, LUCENE-3821.patch, 
 LUCENE-3821.patch, LUCENE-3821_test.patch, schema.xml, solrconfig-test.xml


 The general bug is a case where a phrase with no slop is found,
 but if you add slop its not.
 I committed a test today (TestSloppyPhraseQuery2) that actually triggers this 
 case,
 jenkins just hasn't had enough time to chew on it.
 ant test -Dtestcase=TestSloppyPhraseQuery2 -Dtests.iter=100 is enough to make 
 it fail on trunk or 3.x

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3850) Fix rawtypes warnings for Java 7 compiler

2012-03-05 Thread Uwe Schindler (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler updated LUCENE-3850:
--

Attachment: LUCENE-3850-parts2-branch3x.patch

Part2 for branch3x. Will commit now.

 Fix rawtypes warnings for Java 7 compiler
 -

 Key: LUCENE-3850
 URL: https://issues.apache.org/jira/browse/LUCENE-3850
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 3.5, 4.0
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3850-part1-branch3x.patch, 
 LUCENE-3850-part1.patch, LUCENE-3850-part2.patch, 
 LUCENE-3850-parts2-branch3x.patch


 Java 7 changed the warnings a little bit:
 - Java 6 only knew unchecked warning type, applying for all types of 
 generics violations, like missing generics (raw types)
 - Java 7 still knows unchecked but only emits warning if the call is really 
 unchecked. Declaration of variables/fields or constructing instances without 
 type param now emits rawtypes warning.
 The changes above causes the Java 7 compile now emit lots of rawtypes 
 warnings, where Java 6 is silent. The easy fix is to suppres both warning 
 types: @SuppressWarnings({unchecked,rawtypes}) for all those places. 
 Changes are easy to do, will provide patch later!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3850) Fix rawtypes warnings for Java 7 compiler

2012-03-05 Thread Uwe Schindler (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222541#comment-13222541
 ] 

Uwe Schindler commented on LUCENE-3850:
---

Committed 2nd patch trunk revision: 1297162
Committed 2nd patch 3.x revision: 1297189

 Fix rawtypes warnings for Java 7 compiler
 -

 Key: LUCENE-3850
 URL: https://issues.apache.org/jira/browse/LUCENE-3850
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 3.5, 4.0
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3850-part1-branch3x.patch, 
 LUCENE-3850-part1.patch, LUCENE-3850-part2.patch, 
 LUCENE-3850-parts2-branch3x.patch


 Java 7 changed the warnings a little bit:
 - Java 6 only knew unchecked warning type, applying for all types of 
 generics violations, like missing generics (raw types)
 - Java 7 still knows unchecked but only emits warning if the call is really 
 unchecked. Declaration of variables/fields or constructing instances without 
 type param now emits rawtypes warning.
 The changes above causes the Java 7 compile now emit lots of rawtypes 
 warnings, where Java 6 is silent. The easy fix is to suppres both warning 
 types: @SuppressWarnings({unchecked,rawtypes}) for all those places. 
 Changes are easy to do, will provide patch later!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Resolved] (SOLR-1772) UpdateProcessor to prune empty values

2012-03-05 Thread Hoss Man (Resolved) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man resolved SOLR-1772.


   Resolution: Duplicate
Fix Version/s: 4.0
 Assignee: Hoss Man

yep ... covered as part of SOLR-2802

 UpdateProcessor to prune empty values
 ---

 Key: SOLR-1772
 URL: https://issues.apache.org/jira/browse/SOLR-1772
 Project: Solr
  Issue Type: Wish
Reporter: Hoss Man
Assignee: Hoss Man
 Fix For: 4.0


 Users seem to frequently get confused when some FieldTypes (typically the 
 numeric ones) complain about invalid field values when the inadvertantly 
 index an empty string.
 It would be cool to provide an UpdateProcessor that makes it easy to strip 
 out any fields being added as empty values ... it could be configured using 
 field (and/or field type) names or globs to select/ignore certain fields -- i 
 haven't thought it through all that hard

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Data Streaming algorithms

2012-03-05 Thread Joe Cabrera

I currently use Apache Lucene for analysis of text documents. However my new 
research involves the design of techniques and algorithms for data stream 
mining of social media. I do not think currently the Apache Lucene framework 
specifically contains
support for streaming data. If Apache Lucene does not I may consider forking 
the current Lucene project and developing a new one better suited to data 
streams. Any thoughts for the developers on the current Apache Lucene project?


--
Joe Cabrera
eminorlabs.com


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3174) Visualize Cluster State

2012-03-05 Thread Stefan Matheis (steffkes) (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222558#comment-13222558
 ] 

Stefan Matheis (steffkes) commented on SOLR-3174:
-

Created another one for ryan's radical view: 
http://files.mathe.is/solr-admin/cloud-state/jit-rgraph/ - i've added another 
shard w/ three nodes, just for layout purposes .. otherwise the generated 
layout looks a [bit 
odd|http://files.mathe.is/solr-admin/cloud-state/jit-rgraph/index-2.html] ;o

To continue w/ the integration of that view, it would be very helpful to get 
some real life data .. would that be possible? what would a cluster normally 
look like? how many shards w/ how many nodes? so i'd get a feeling on how large 
the Graphs could be .. where they could be integrated in the admin-ui and so on 
:)

 Visualize Cluster State
 ---

 Key: SOLR-3174
 URL: https://issues.apache.org/jira/browse/SOLR-3174
 Project: Solr
  Issue Type: New Feature
Reporter: Ryan McKinley

 It would be great to visualize the cluster state in the new UI. 
 See Mark's wish:
 https://issues.apache.org/jira/browse/SOLR-3162?focusedCommentId=13218272page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13218272

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3795) Replace spatial contrib module with LSP's spatial-lucene module

2012-03-05 Thread Ryan McKinley (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222578#comment-13222578
 ] 

Ryan McKinley commented on LUCENE-3795:
---

bq.  Wouldn't it be nice to roll all of these fundamental concepts in to a 
single project like SIS?

I agree -- but the elephant in the room is the LGPL library JTS.  

From previous discussions, I believe this was a non-starter for their 
compile/test environment, but I have emailed the sis-dev@ list to see if this 
is still their feeling.

With spatial4j, we want a solid ASL solution, but also support complex polygons 
if people choose to use JTS in their runtime environment.

 Replace spatial contrib module with LSP's spatial-lucene module
 ---

 Key: LUCENE-3795
 URL: https://issues.apache.org/jira/browse/LUCENE-3795
 Project: Lucene - Java
  Issue Type: New Feature
  Components: modules/spatial
Reporter: David Smiley
Assignee: David Smiley
 Fix For: 4.0


 I propose that Lucene's spatial contrib module be replaced with the 
 spatial-lucene module within Lucene Spatial Playground (LSP).  LSP has been 
 in development for approximately 1 year by David Smiley, Ryan McKinley, and 
 Chris Male and we feel it is ready.  LSP is here: 
 http://code.google.com/p/lucene-spatial-playground/  and the spatial-lucene 
 module is intuitively in svn/trunk/spatial-lucene/.
 I'll add more comments to prevent the issue description from being too long.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-3202) Remove JSP from solr webapp

2012-03-05 Thread Ryan McKinley (Created) (JIRA)
Remove JSP from solr webapp
---

 Key: SOLR-3202
 URL: https://issues.apache.org/jira/browse/SOLR-3202
 Project: Solr
  Issue Type: Task
Reporter: Ryan McKinley
Assignee: Ryan McKinley
 Fix For: 4.0


The 4.0 admin UI does not use JSP to create its interface (It uses javascript 
and RequestHandlers)

We only use JSP to fill in the path to the core in index.jsp -- this can be 
replaced with simple template text



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3851) TestTermInfosReaderIndex failing (always reproducible)

2012-03-05 Thread Robert Muir (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222586#comment-13222586
 ] 

Robert Muir commented on LUCENE-3851:
-

Thanks Dawid: i'll take a look at this first chance i get (unless Mike beats me 
to it).

This looks scary: we've had bugs with 3.x's terms dict around 
emptyField+emptyTerm before. I hope its just a test bug.

 TestTermInfosReaderIndex failing (always reproducible)
 --

 Key: LUCENE-3851
 URL: https://issues.apache.org/jira/browse/LUCENE-3851
 Project: Lucene - Java
  Issue Type: Bug
Reporter: Dawid Weiss
Assignee: Robert Muir
Priority: Minor
 Fix For: 4.0


 Always fails on branch (use reproduce string below):
 git clone --depth 1 -b rr g...@github.com:dweiss/lucene_solr.git
 {noformat}
 [junit4] Running org.apache.lucene.codecs.lucene3x.TestTermInfosReaderIndex
 [junit4] FAILURE 0.04s J0 | TestTermInfosReaderIndex.testSeekEnum
 [junit4] Throwable #1: java.lang.AssertionError: 
 expected:field9:z91ob3wozm6d but was::
 [junit4] at 
 __randomizedtesting.SeedInfo.seed([C7597DFBBE0B3D7D:C6D9CEDD0700AAFF]:0)
 [junit4] at org.junit.Assert.fail(Assert.java:93)
 [junit4] at org.junit.Assert.failNotEquals(Assert.java:647)
 [junit4] at org.junit.Assert.assertEquals(Assert.java:128)
 [junit4] at org.junit.Assert.assertEquals(Assert.java:147)
 [junit4] at 
 org.apache.lucene.codecs.lucene3x.TestTermInfosReaderIndex.testSeekEnum(TestTermInfosReaderIndex.java:137)
 [junit4] at sun.reflect.NativeMethodAccessorImpl.invoke0(Native 
 Method)
 [junit4] at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
 [junit4] at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
 [junit4] at java.lang.reflect.Method.invoke(Method.java:597)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1766)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner.access$1000(RandomizedRunner.java:141)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner$6.evaluate(RandomizedRunner.java:728)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner$7.evaluate(RandomizedRunner.java:789)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner$8.evaluate(RandomizedRunner.java:803)
 [junit4] at 
 org.apache.lucene.util.LuceneTestCase$SubclassSetupTeardownRule$1.evaluate(LuceneTestCase.java:744)
 [junit4] at 
 org.apache.lucene.util.LuceneTestCase$InternalSetupTeardownRule$1.evaluate(LuceneTestCase.java:636)
 [junit4] at 
 org.apache.lucene.util.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:22)
 [junit4] at 
 org.apache.lucene.util.LuceneTestCase$TestResultInterceptorRule$1.evaluate(LuceneTestCase.java:550)
 [junit4] at 
 org.apache.lucene.util.LuceneTestCase$RememberThreadRule$1.evaluate(LuceneTestCase.java:600)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner.runSingleTest(RandomizedRunner.java:735)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner.access$600(RandomizedRunner.java:141)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner$3$1.run(RandomizedRunner.java:586)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner$3.evaluate(RandomizedRunner.java:605)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner$4.evaluate(RandomizedRunner.java:641)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner$5.evaluate(RandomizedRunner.java:652)
 [junit4] at 
 org.apache.lucene.util.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:22)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner.runSuite(RandomizedRunner.java:533)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner.access$400(RandomizedRunner.java:141)
 [junit4] at 
 com.carrotsearch.randomizedtesting.RandomizedRunner$2.run(RandomizedRunner.java:479)
 [junit4] 
 [junit4]   2 NOTE: reproduce with: ant test 
 -Dtests.filter=*.TestTermInfosReaderIndex -Dtests.filter.method=testSeekEnum 
 -Drt.seed=C7597DFBBE0B3D7D -Dargs=-Dfile.encoding=UTF-8
 [junit4]   2
 [junit4] (@AfterClass output)
 [junit4]   2 NOTE: test params are: codec=Appending, sim=DefaultSimilarity, 
 locale=en, timezone=Atlantic/Stanley
 [junit4]   2 NOTE: all tests run in this JVM:
 [junit4]   2 [TestLock, TestFileSwitchDirectory, 

[jira] [Commented] (SOLR-3174) Visualize Cluster State

2012-03-05 Thread Erick Erickson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222599#comment-13222599
 ] 

Erick Erickson commented on SOLR-3174:
--

Way cool! I suspect that we should give them all the space they can have, but 
plan for 5 shards, three replicas? That covers quite a large installation 
actually. Almost immediately someone will have 20 shards and 10 replicas each, 
but at some point it gets ridiculous.

How about opening up a new tab and giving them the whole screen?

As you can tell, there's a dearth of real life data here, all we can do is 
guess.



 Visualize Cluster State
 ---

 Key: SOLR-3174
 URL: https://issues.apache.org/jira/browse/SOLR-3174
 Project: Solr
  Issue Type: New Feature
Reporter: Ryan McKinley

 It would be great to visualize the cluster state in the new UI. 
 See Mark's wish:
 https://issues.apache.org/jira/browse/SOLR-3162?focusedCommentId=13218272page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13218272

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-3203) SolrCloud should demand that the schema has the _version_ field

2012-03-05 Thread Lance Norskog (Created) (JIRA)
SolrCloud should demand that the schema has the _version_ field 


 Key: SOLR-3203
 URL: https://issues.apache.org/jira/browse/SOLR-3203
 Project: Solr
  Issue Type: Improvement
Reporter: Lance Norskog


SolrCloud does not function correctly without the _version_ field. It should 
fail at startup if this is not in the schema.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-3203) SolrCloud should demand that the schema has the _version_ field

2012-03-05 Thread Lance Norskog (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-3203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lance Norskog updated SOLR-3203:


  Component/s: SolrCloud
Fix Version/s: 4.0
 Priority: Minor  (was: Major)

 SolrCloud should demand that the schema has the _version_ field 
 

 Key: SOLR-3203
 URL: https://issues.apache.org/jira/browse/SOLR-3203
 Project: Solr
  Issue Type: Improvement
  Components: SolrCloud
Reporter: Lance Norskog
Priority: Minor
 Fix For: 4.0


 SolrCloud does not function correctly without the _version_ field. It should 
 fail at startup if this is not in the schema.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3174) Visualize Cluster State

2012-03-05 Thread Mark Miller (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222620#comment-13222620
 ] 

Mark Miller commented on SOLR-3174:
---

I would guestimate common distributed setups might have 3-15 shards (favoring 
lower). Common replication factor might be 2 or 3.

Of course those with billions of docs collections might get into 100+ shards 
area - again replication factor is still going to probably be 3-5 max I would 
guess.

 Visualize Cluster State
 ---

 Key: SOLR-3174
 URL: https://issues.apache.org/jira/browse/SOLR-3174
 Project: Solr
  Issue Type: New Feature
Reporter: Ryan McKinley

 It would be great to visualize the cluster state in the new UI. 
 See Mark's wish:
 https://issues.apache.org/jira/browse/SOLR-3162?focusedCommentId=13218272page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13218272

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3174) Visualize Cluster State

2012-03-05 Thread Cody Young (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222637#comment-13222637
 ] 

Cody Young commented on SOLR-3174:
--

18 shards, 3-5 replicas here.

 Visualize Cluster State
 ---

 Key: SOLR-3174
 URL: https://issues.apache.org/jira/browse/SOLR-3174
 Project: Solr
  Issue Type: New Feature
Reporter: Ryan McKinley

 It would be great to visualize the cluster state in the new UI. 
 See Mark's wish:
 https://issues.apache.org/jira/browse/SOLR-3162?focusedCommentId=13218272page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13218272

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3795) Replace spatial contrib module with LSP's spatial-lucene module

2012-03-05 Thread Ryan McKinley (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222729#comment-13222729
 ] 

Ryan McKinley commented on LUCENE-3795:
---

I have confirmed with SIS that a compile/test dependency on JTS in not 
possible.  (One of their main goals is to make an ASL version of JTS...  A 
great goal, but they are not yet to 1st base)

So, where does that leave us?

Is the spatial4j.jar a blocker for anyone?  I understand it is not everyone's 
preferred option -- but no option makes everyone happy.  

 Replace spatial contrib module with LSP's spatial-lucene module
 ---

 Key: LUCENE-3795
 URL: https://issues.apache.org/jira/browse/LUCENE-3795
 Project: Lucene - Java
  Issue Type: New Feature
  Components: modules/spatial
Reporter: David Smiley
Assignee: David Smiley
 Fix For: 4.0


 I propose that Lucene's spatial contrib module be replaced with the 
 spatial-lucene module within Lucene Spatial Playground (LSP).  LSP has been 
 in development for approximately 1 year by David Smiley, Ryan McKinley, and 
 Chris Male and we feel it is ready.  LSP is here: 
 http://code.google.com/p/lucene-spatial-playground/  and the spatial-lucene 
 module is intuitively in svn/trunk/spatial-lucene/.
 I'll add more comments to prevent the issue description from being too long.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-3202) Remove JSP from solr webapp

2012-03-05 Thread Ryan McKinley (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-3202:


Attachment: SOLR-3202-no-jsp.patch

This loads the admin UI with a simple servlet and:

{code:java}
+  String[] search = new String[] { 
+  ${contextPath}, 
+  ${adminPath} 
+  };
+  String[] replace = new String[] {
+  request.getContextPath(),
+  cores.getAdminPath()
+  };
+  
+  out.println( StringUtils.replaceEach(html, search, replace) );
{code}

 Remove JSP from solr webapp
 ---

 Key: SOLR-3202
 URL: https://issues.apache.org/jira/browse/SOLR-3202
 Project: Solr
  Issue Type: Task
Reporter: Ryan McKinley
Assignee: Ryan McKinley
 Fix For: 4.0

 Attachments: SOLR-3202-no-jsp.patch


 The 4.0 admin UI does not use JSP to create its interface (It uses javascript 
 and RequestHandlers)
 We only use JSP to fill in the path to the core in index.jsp -- this can be 
 replaced with simple template text

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3795) Replace spatial contrib module with LSP's spatial-lucene module

2012-03-05 Thread Chris A. Mattmann (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222739#comment-13222739
 ] 

Chris A. Mattmann commented on LUCENE-3795:
---

I'd encourage folks to read:

http://markmail.org/message/75yrfmumxlmdgxzz

and all of the surrounding discussion there.

Quoting Greg Stein:

{quote}
I simply think that it is a mistake for a PMC to create any sort of dependency 
upon code that is more restrictive than the Apache License. In this case, it 
means somebody must grab LGPL code in order to build our provided tarball. I 
would strongly advise against such a build dependency, whether the runtime 
requires it or not.
{quote}

I strongly agree with his interpretation.

 Replace spatial contrib module with LSP's spatial-lucene module
 ---

 Key: LUCENE-3795
 URL: https://issues.apache.org/jira/browse/LUCENE-3795
 Project: Lucene - Java
  Issue Type: New Feature
  Components: modules/spatial
Reporter: David Smiley
Assignee: David Smiley
 Fix For: 4.0


 I propose that Lucene's spatial contrib module be replaced with the 
 spatial-lucene module within Lucene Spatial Playground (LSP).  LSP has been 
 in development for approximately 1 year by David Smiley, Ryan McKinley, and 
 Chris Male and we feel it is ready.  LSP is here: 
 http://code.google.com/p/lucene-spatial-playground/  and the spatial-lucene 
 module is intuitively in svn/trunk/spatial-lucene/.
 I'll add more comments to prevent the issue description from being too long.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3202) Remove JSP from solr webapp

2012-03-05 Thread Uwe Schindler (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222759#comment-13222759
 ] 

Uwe Schindler commented on SOLR-3202:
-

+1

Except:
{noformat}
+  String html = IOUtils.toString(new FileReader(f));
{noformat}

Thats a bad idea as it uses default charset. Use IOUtils.toString(new 
FileInputStream(f), UTF-8) or like that. Of course it must also set the 
content-type correctly, including charset before getting the reader.

 Remove JSP from solr webapp
 ---

 Key: SOLR-3202
 URL: https://issues.apache.org/jira/browse/SOLR-3202
 Project: Solr
  Issue Type: Task
Reporter: Ryan McKinley
Assignee: Ryan McKinley
 Fix For: 4.0

 Attachments: SOLR-3202-no-jsp.patch


 The 4.0 admin UI does not use JSP to create its interface (It uses javascript 
 and RequestHandlers)
 We only use JSP to fill in the path to the core in index.jsp -- this can be 
 replaced with simple template text

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3795) Replace spatial contrib module with LSP's spatial-lucene module

2012-03-05 Thread Ryan McKinley (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222781#comment-13222781
 ] 

Ryan McKinley commented on LUCENE-3795:
---

bq.  PMC to create any sort of dependency upon code that is more restrictive 
than the Apache License. 

Note, the spatial4j.jar file is ASL

No user ever needs to touch LGPL code or artifacts

 Replace spatial contrib module with LSP's spatial-lucene module
 ---

 Key: LUCENE-3795
 URL: https://issues.apache.org/jira/browse/LUCENE-3795
 Project: Lucene - Java
  Issue Type: New Feature
  Components: modules/spatial
Reporter: David Smiley
Assignee: David Smiley
 Fix For: 4.0


 I propose that Lucene's spatial contrib module be replaced with the 
 spatial-lucene module within Lucene Spatial Playground (LSP).  LSP has been 
 in development for approximately 1 year by David Smiley, Ryan McKinley, and 
 Chris Male and we feel it is ready.  LSP is here: 
 http://code.google.com/p/lucene-spatial-playground/  and the spatial-lucene 
 module is intuitively in svn/trunk/spatial-lucene/.
 I'll add more comments to prevent the issue description from being too long.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3197) Allow firstSearcher and newSearcher listeners to run in multiple threads

2012-03-05 Thread Neil Hooey (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222790#comment-13222790
 ] 

Neil Hooey commented on SOLR-3197:
--

{quote}
The original rational was do we really want to peg more than one CPU with 
warmup while we are still serving live traffic?
But some people are using pretty big boxes these days.
{quote}

When Solr is configured _not_ to serve queries while warming, would it make 
sense to use as many CPUs as possible, since it's probably not doing anything 
else?

 Allow firstSearcher and newSearcher listeners to run in multiple threads
 

 Key: SOLR-3197
 URL: https://issues.apache.org/jira/browse/SOLR-3197
 Project: Solr
  Issue Type: Improvement
Reporter: Lance Norskog

 SolrCore submits all listeners (firstSearcher and newSearcher) to a java 
 ExecutorService, but uses a single-threaded one. 
 line 965 in the trunk: 
 {code}
 SolrCore.java around line 965: final ExecutorService searcherExecutor = 
 Executors.newSingleThreadExecutor(); 
 line 1280 in the trunk: 
 SolrCore.java around line 1280 runs first the, and then the first and new 
 searchers, all with the searcherExecutor object created at line 965. 
 Would it work if we changed this ExecutorService to a thread pool version? 
 This seems like it should work:
 {code}
 java.util.concurrent.Executors.newFixedThreadPool(int nThreads, ThreadFactory 
 threadFactory);
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-2667) Finish Solr Admin UI

2012-03-05 Thread Ryan McKinley (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-2667:


Component/s: web gui

 Finish Solr Admin UI
 

 Key: SOLR-2667
 URL: https://issues.apache.org/jira/browse/SOLR-2667
 Project: Solr
  Issue Type: Improvement
  Components: web gui
Reporter: Ryan McKinley
Assignee: Ryan McKinley
 Fix For: 4.0

 Attachments: SOLR-2667-110722.patch, 
 SOLR-2667-120223-file-structure.patch


 In SOLR-2399, we added a new admin UI. The issue has gotten too long to 
 follow, so this is a new issue to track remaining tasks.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-3162) Continue work on new admin UI

2012-03-05 Thread Ryan McKinley (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-3162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-3162:


Component/s: web gui

 Continue work on new admin UI
 -

 Key: SOLR-3162
 URL: https://issues.apache.org/jira/browse/SOLR-3162
 Project: Solr
  Issue Type: Improvement
  Components: Schema and Analysis, web gui
Affects Versions: 4.0
Reporter: Erick Erickson
Assignee: Erick Erickson
 Attachments: SOLR-3162-index.png, SOLR-3162-schema-browser.png, 
 SOLR-3162.patch, SOLR-3162.patch, SOLR-3162.patch, SOLR-3162.patch


 There have been more improvements to how the new UI works, but the current 
 open bugs are getting hard to keep straight. This is the new catch-all JIRA 
 for continued improvements.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-3174) Visualize Cluster State

2012-03-05 Thread Ryan McKinley (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-3174:


Component/s: web gui

 Visualize Cluster State
 ---

 Key: SOLR-3174
 URL: https://issues.apache.org/jira/browse/SOLR-3174
 Project: Solr
  Issue Type: New Feature
  Components: web gui
Reporter: Ryan McKinley

 It would be great to visualize the cluster state in the new UI. 
 See Mark's wish:
 https://issues.apache.org/jira/browse/SOLR-3162?focusedCommentId=13218272page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13218272

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Resolved] (SOLR-2635) FieldAnalysisRequestHandler; Expose Filter- Tokenizer-Settings

2012-03-05 Thread Ryan McKinley (Resolved) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley resolved SOLR-2635.
-

   Resolution: Fixed
Fix Version/s: 4.0

I think this was fixed a while back

 FieldAnalysisRequestHandler; Expose Filter-  Tokenizer-Settings
 

 Key: SOLR-2635
 URL: https://issues.apache.org/jira/browse/SOLR-2635
 Project: Solr
  Issue Type: Improvement
  Components: Schema and Analysis, web gui
Reporter: Stefan Matheis (steffkes)
Assignee: Uwe Schindler
Priority: Minor
 Fix For: 4.0


 The [current/old Analysis 
 Page|http://files.mathe.is/solr-admin/04_analysis_verbose_cur.png] exposes 
 the Filter-  Tokenizer-Settings -- the FieldAnalysisRequestHandler not :/
 This Information is already available on the 
 [Schema-Browser|http://files.mathe.is/solr-admin/06_schema-browser.png] 
 (through LukeRequestHandler) - so we could load this in parallel and grab the 
 required informations .. but it would be easier if we could add this 
 Information, so that we have all relevant Information at one Place.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-3181) New Admin UI, allow user to somehow cut/paste all the old Zookeeper info.

2012-03-05 Thread Ryan McKinley (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-3181:


Component/s: web gui

 New Admin UI, allow user to somehow cut/paste all the old Zookeeper info.
 ---

 Key: SOLR-3181
 URL: https://issues.apache.org/jira/browse/SOLR-3181
 Project: Solr
  Issue Type: Improvement
  Components: web gui
Affects Versions: 4.0
 Environment: n/a
Reporter: Erick Erickson
Assignee: Erick Erickson
Priority: Minor
 Attachments: SOLR-3181.patch


 When tracking down issues with ZK, the devs ask about various bits of data 
 from the cloud pages. It would be convenient to be able to just capture all 
 the data from the old /solr/admin/zookeeper.jsp page in the admin interface 
 to be able to send it to anyone debugging the info. 
 Perhaps just a get debug info for Apache. Or even more cool copy debug 
 info to clipboard if that's possible. Is this just the raw data that the 
 cloud view is manipulating? It doesn't have to be pretty although indentation 
 would be nice.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-3188) New admin page: Enable Polling button disappears after disabling polling and reloading page

2012-03-05 Thread Ryan McKinley (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-3188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-3188:


Component/s: (was: Schema and Analysis)
 web gui

 New admin page: Enable Polling button disappears after disabling polling 
 and reloading page
 -

 Key: SOLR-3188
 URL: https://issues.apache.org/jira/browse/SOLR-3188
 Project: Solr
  Issue Type: Bug
  Components: web gui
Affects Versions: 4.0
Reporter: Neil Hooey
Priority: Minor
  Labels: admin

 When you go to this URL on a slave:
 http://localhost:8983/solr/#/singlecore/replication
 And click the Disable Polling button, you see a red bar that says 
 invalid_master. I'm not sure why I get this red bar, as I haven't tested it 
 outside of my own installation, but it seems normal.
 If you then refresh the page, the Replicate Now and Enable Polling 
 buttons disappear. It seems like their generation is being interrupted by the 
 invalid_master error.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-3204) solr-commons-csv must not use the org.apache.commons.csv package

2012-03-05 Thread Emmanuel Bourg (Created) (JIRA)
solr-commons-csv must not use the org.apache.commons.csv package


 Key: SOLR-3204
 URL: https://issues.apache.org/jira/browse/SOLR-3204
 Project: Solr
  Issue Type: Bug
Affects Versions: 3.5
Reporter: Emmanuel Bourg
Priority: Blocker
 Fix For: 3.6


The solr-commons-csv artifact reused the code from the Apache Commons CSV 
project but the package wasn't changed to something else than 
org.apache.commons.csv in the process. This creates a compatibility issue as 
the Apache Commons team works toward an official release of Commons CSV. It 
prevents Commons CSV from using its own org.apache.commons.csv package, or 
forces the renaming of all the classes to avoid a classpath conflict.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-3205) Better error reporting from Analysis UI

2012-03-05 Thread Ryan McKinley (Created) (JIRA)
Better error reporting from Analysis UI
---

 Key: SOLR-3205
 URL: https://issues.apache.org/jira/browse/SOLR-3205
 Project: Solr
  Issue Type: Bug
  Components: web gui
Reporter: Ryan McKinley
 Fix For: 4.0


The new analysis UI does not behave well with invalid input.  To reproduce, 
from /#/singlecore/analysis
 * Select a number field (int) 
 * put in invalid text (hello)
 * click Analyse Values

The UI will have a red banner, but not say anything useful.  The log file will 
say:
{code}
SEVERE: org.apache.solr.common.SolrException: Invalid Number: hello
at 
org.apache.solr.analysis.TrieTokenizer.reset(TrieTokenizerFactory.java:113)
at 
org.apache.solr.analysis.TrieTokenizer.init(TrieTokenizerFactory.java:76)
{code}

Hopefully we can get the UI to say Invalid Number: hello


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Resolved] (SOLR-67) query interface with faceted browsing, highligting

2012-03-05 Thread Ryan McKinley (Resolved) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-67?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley resolved SOLR-67.
---

Resolution: Fixed

I think this is resolved in SOLR-2178

 query interface with faceted browsing, highligting
 --

 Key: SOLR-67
 URL: https://issues.apache.org/jira/browse/SOLR-67
 Project: Solr
  Issue Type: Wish
  Components: web gui
Reporter: Yonik Seeley

 It would be nice to a nice HTML page allowing the user to query and display
 1) some faceting info such as hit counts with links that allowed the user to 
 narrow their search results.
 2) highlighted summaries
 3) easy way to query the dismax handler as well as the standard request 
 handler
 Most likely this would be built into the admin pages (and have access to all 
 the field info).
 It would also seem useful to have an advanced query page... something like
 http://www.nabble.com/forum/AdvSearch.jtp or 
 http://www.google.com/advanced_search?hl=en
 that would allow one to easily customize and drop into their site.  This 
 might be best as contrib module run outside of Solr (a JSP, etc?)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-3202) Remove JSP from solr webapp

2012-03-05 Thread Ryan McKinley (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-3202:


Attachment: SOLR-3202-no-jsp.patch

This is a more complete patch that actually removes the JSP files and 
dependencies

 Remove JSP from solr webapp
 ---

 Key: SOLR-3202
 URL: https://issues.apache.org/jira/browse/SOLR-3202
 Project: Solr
  Issue Type: Task
Reporter: Ryan McKinley
Assignee: Ryan McKinley
 Fix For: 4.0

 Attachments: SOLR-3202-no-jsp.patch, SOLR-3202-no-jsp.patch


 The 4.0 admin UI does not use JSP to create its interface (It uses javascript 
 and RequestHandlers)
 We only use JSP to fill in the path to the core in index.jsp -- this can be 
 replaced with simple template text

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Remove old Solr UI from /trunk?

2012-03-05 Thread Ryan McKinley
Are we ready to remove the old UI from trunk?

I would like to move forward with SOLR-3202 (removing JSP)

I believe the Log level selection stuff is the only feature not
supported by the new UI (yet) -- anything else?

ryan

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3204) solr-commons-csv must not use the org.apache.commons.csv package

2012-03-05 Thread Ryan McKinley (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222911#comment-13222911
 ] 

Ryan McKinley commented on SOLR-3204:
-

Solr is using a snapshot commons CSV build (commons-csv-1.0-SNAPSHOT-r966014) 
-- solr does not change anything and would prefer to use a release build.

Has there been class refactorings that make replacing the .jar with the dev 
build impossible?  If so, perhaps we should just update the .jar file we are 
including.


 solr-commons-csv must not use the org.apache.commons.csv package
 

 Key: SOLR-3204
 URL: https://issues.apache.org/jira/browse/SOLR-3204
 Project: Solr
  Issue Type: Bug
Affects Versions: 3.5
Reporter: Emmanuel Bourg
Priority: Blocker
 Fix For: 3.6


 The solr-commons-csv artifact reused the code from the Apache Commons CSV 
 project but the package wasn't changed to something else than 
 org.apache.commons.csv in the process. This creates a compatibility issue as 
 the Apache Commons team works toward an official release of Commons CSV. It 
 prevents Commons CSV from using its own org.apache.commons.csv package, or 
 forces the renaming of all the classes to avoid a classpath conflict.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3206) FST package API refactoring

2012-03-05 Thread Robert Muir (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222910#comment-13222910
 ] 

Robert Muir commented on LUCENE-3206:
-

3.6 pruning: can we push this out to 4.0?

 FST package API refactoring
 ---

 Key: LUCENE-3206
 URL: https://issues.apache.org/jira/browse/LUCENE-3206
 Project: Lucene - Java
  Issue Type: Improvement
  Components: core/FSTs
Affects Versions: 3.2
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3206.patch


 The current API is still marked @experimental, so I think there's still time 
 to fiddle with it. I've been using the current API for some time and I do 
 have some ideas for improvement. This is a placeholder for these -- I'll post 
 a patch once I have a working proof of concept.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-2921) Now that we track the code version at the segment level, we can stop tracking it also in each file level

2012-03-05 Thread Robert Muir (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-2921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222912#comment-13222912
 ] 

Robert Muir commented on LUCENE-2921:
-

3.6 pruning: can we push this out to 4.0?

 Now that we track the code version at the segment level, we can stop tracking 
 it also in each file level
 

 Key: LUCENE-2921
 URL: https://issues.apache.org/jira/browse/LUCENE-2921
 Project: Lucene - Java
  Issue Type: Improvement
  Components: core/index
Reporter: Shai Erera
 Fix For: 3.6, 4.0


 Now that we track the code version that created the segment at the segment 
 level, we can stop tracking versions in each file. This has several major 
 benefits:
 # Today the constant names that use to track versions are confusing - they do 
 not state since which version it applies to, and so it's harder to determine 
 which formats we can stop supporting when working on the next major release.
 # Those format numbers are usually negative, but in some cases positive 
 (inconsistency) -- we need to remember to increase it one down for the 
 negative ones, which I always find confusing.
 # It will remove the format tracking from all the *Writers, and the *Reader 
 will receive the code format (String) and work w/ the appropriate constant 
 (e.g. Constants.LUCENE_30). Centralizing version tracking to SegmentInfo is 
 an advantage IMO.
 It's not urgent that we do it for 3.1 (though it requires an index format 
 change), because starting from 3.1 all segments track their version number 
 anyway (or migrated to track it), so we can safely release it in follow-on 3x 
 release.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3003) Move UnInvertedField into Lucene core

2012-03-05 Thread Robert Muir (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222914#comment-13222914
 ] 

Robert Muir commented on LUCENE-3003:
-

3.6 pruning: can we push this out to 4.0 (mark resolved?)

 Move UnInvertedField into Lucene core
 -

 Key: LUCENE-3003
 URL: https://issues.apache.org/jira/browse/LUCENE-3003
 Project: Lucene - Java
  Issue Type: Improvement
  Components: core/index
Reporter: Michael McCandless
Assignee: Michael McCandless
Priority: Minor
 Fix For: 3.6, 4.0

 Attachments: LUCENE-3003.patch, LUCENE-3003.patch, 
 byte_size_32-bit-openjdk6.txt


 Solr's UnInvertedField lets you quickly lookup all terms ords for a
 given doc/field.
 Like, FieldCache, it inverts the index to produce this, and creates a
 RAM-resident data structure holding the bits; but, unlike FieldCache,
 it can handle multiple values per doc, and, it does not hold the term
 bytes in RAM.  Rather, it holds only term ords, and then uses
 TermsEnum to resolve ord - term.
 This is great eg for faceting, where you want to use int ords for all
 of your counting, and then only at the end you need to resolve the
 top N ords to their text.
 I think this is a useful core functionality, and we should move most
 of it into Lucene's core.  It's a good complement to FieldCache.  For
 this first baby step, I just move it into core and refactor Solr's
 usage of it.
 After this, as separate issues, I think there are some things we could
 explore/improve:
   * The first-pass that allocates lots of tiny byte[] looks like it
 could be inefficient.  Maybe we could use the byte slices from the
 indexer for this...
   * We can improve the RAM efficiency of the TermIndex: if the codec
 supports ords, and we are operating on one segment, we should just
 use it.  If not, we can use a more RAM-efficient data structure,
 eg an FST mapping to the ord.
   * We may be able to improve on the main byte[] representation by
 using packed ints instead of delta-vInt?
   * Eventually we should fold this ability into docvalues, ie we'd
 write the byte[] image at indexing time, and then loading would be
 fast, instead of uninverting

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3138) IW.addIndexes should fail fast if an index is too old/new

2012-03-05 Thread Robert Muir (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222918#comment-13222918
 ] 

Robert Muir commented on LUCENE-3138:
-

3.6 pruning: can we push this out to 4.0?

 IW.addIndexes should fail fast if an index is too old/new
 -

 Key: LUCENE-3138
 URL: https://issues.apache.org/jira/browse/LUCENE-3138
 Project: Lucene - Java
  Issue Type: Improvement
  Components: core/index
Reporter: Shai Erera
Priority: Minor
 Fix For: 3.6, 4.0


 Today IW.addIndexes (both Dir and IR versions) do not check the format of the 
 incoming indexes. Therefore it could add a too old/new index and the app will 
 discover that only later, maybe during commit() or segment merges. We should 
 check that up front and fail fast.
 This issue is relevant only to 4.0 at the moment, which will not support 2.x 
 indexes anymore.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3204) solr-commons-csv must not use the org.apache.commons.csv package

2012-03-05 Thread Yonik Seeley (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222917#comment-13222917
 ] 

Yonik Seeley commented on SOLR-3204:


Background here: http://markmail.org/message/lsxtxoetpw7a47uf

As far as I can determine, this is just a maven meta-data issue?
i.e. it doesn't seem like we should change java package names, but just make 
sure that the maven meta-data is completely distinct from any official 
commons-csv releases?  (disclaimer: I still really know nothing of maven, other 
than it's becoming more a pain all the time ;-)

 solr-commons-csv must not use the org.apache.commons.csv package
 

 Key: SOLR-3204
 URL: https://issues.apache.org/jira/browse/SOLR-3204
 Project: Solr
  Issue Type: Bug
Affects Versions: 3.5
Reporter: Emmanuel Bourg
Priority: Blocker
 Fix For: 3.6


 The solr-commons-csv artifact reused the code from the Apache Commons CSV 
 project but the package wasn't changed to something else than 
 org.apache.commons.csv in the process. This creates a compatibility issue as 
 the Apache Commons team works toward an official release of Commons CSV. It 
 prevents Commons CSV from using its own org.apache.commons.csv package, or 
 forces the renaming of all the classes to avoid a classpath conflict.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-3204) solr-commons-csv must not use the org.apache.commons.csv package

2012-03-05 Thread Ryan McKinley (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-3204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222930#comment-13222930
 ] 

Ryan McKinley commented on SOLR-3204:
-

Emmanuel, can you confirm this is just a maven issue?

Can it be solved with adding exclusions?
{code:xml}
dependency
  groupIdorg.apache.solr/groupId
  artifactIdsolr-core/artifactId
  version${solr.version}/version
  exclusions
exclusion
  groupIdorg.apache.solr/groupId
  artifactIdsolr-commons-csv/artifactId
/exclusion
...
{code}

Obviously the best approach is to work with an official release... but until 
that exists, what do you suggest?  (Actually forking the project and changing 
the package names seems pretty bad) 


 solr-commons-csv must not use the org.apache.commons.csv package
 

 Key: SOLR-3204
 URL: https://issues.apache.org/jira/browse/SOLR-3204
 Project: Solr
  Issue Type: Bug
Affects Versions: 3.5
Reporter: Emmanuel Bourg
Priority: Blocker
 Fix For: 3.6


 The solr-commons-csv artifact reused the code from the Apache Commons CSV 
 project but the package wasn't changed to something else than 
 org.apache.commons.csv in the process. This creates a compatibility issue as 
 the Apache Commons team works toward an official release of Commons CSV. It 
 prevents Commons CSV from using its own org.apache.commons.csv package, or 
 forces the renaming of all the classes to avoid a classpath conflict.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3116) pendingCommit in IndexWriter is not thoroughly tested

2012-03-05 Thread Robert Muir (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222933#comment-13222933
 ] 

Robert Muir commented on LUCENE-3116:
-

its easy to add the sleep, but we dont even have good multithreaded tests with 
rollback() [except testing how exceptions are handled and not really asserting 
anything?]

Can we push this out to 4.0?

 pendingCommit in IndexWriter is not thoroughly tested
 -

 Key: LUCENE-3116
 URL: https://issues.apache.org/jira/browse/LUCENE-3116
 Project: Lucene - Java
  Issue Type: Test
  Components: core/index
Affects Versions: 3.2, 4.0
Reporter: Uwe Schindler
 Fix For: 3.6, 4.0


 When working on LUCENE-3084, I had a copy-paste error in my patch (see 
 revision 1124307 and corrected in 1124316), I replaced pendingCommit by 
 segmentInfos in IndexWriter, corrected by the following patch:
 {noformat}
 --- lucene/dev/trunk/lucene/src/java/org/apache/lucene/index/IndexWriter.java 
 (original)
 +++ lucene/dev/trunk/lucene/src/java/org/apache/lucene/index/IndexWriter.java 
 Wed May 18 16:16:29 2011
 @@ -2552,7 +2552,7 @@ public class IndexWriter implements Clos
  lastCommitChangeCount = pendingCommitChangeCount;
  segmentInfos.updateGeneration(pendingCommit);
  segmentInfos.setUserData(pendingCommit.getUserData());
 -rollbackSegments = segmentInfos.createBackupSegmentInfos(true);
 +rollbackSegments = pendingCommit.createBackupSegmentInfos(true);
  deleter.checkpoint(pendingCommit, true);
} finally {
  // Matches the incRef done in startCommit:
 {noformat}
 This did not cause any test failure.
 On IRC, Mike said:
 {quote}
 [19:21]   mikemccand: ThetaPh1: hmm
 [19:21]   mikemccand: well
 [19:22]   mikemccand: pendingCommit and sis only differ while commit() is 
 running
 [19:22]   mikemccand: ie if a thread starts commit
 [19:22]   mikemccand: but fsync is taking a long time
 [19:22]   mikemccand: and another thread makes a change to sis
 [19:22]   ThetaPh1: ok so hard to find that bug
 [19:22]   mikemccand: we need our mock dir wrapper to sometimes take a 
 long time syncing
 {quote}
 Maybe we need such a test, I feel bad when such stupid changes don't make any 
 test fail.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3844) Deprecate Token class and remove in trunk

2012-03-05 Thread Robert Muir (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13222938#comment-13222938
 ] 

Robert Muir commented on LUCENE-3844:
-

How much work is this? I think even removing would be difficult.

Maybe we should push deprecating this out to 4.0, giving us a lot of time to 
clean things up
before its totally removed.

 Deprecate Token class and remove in trunk
 -

 Key: LUCENE-3844
 URL: https://issues.apache.org/jira/browse/LUCENE-3844
 Project: Lucene - Java
  Issue Type: Task
  Components: core/other, modules/analysis
Affects Versions: 3.5
Reporter: Uwe Schindler
 Fix For: 3.6, 4.0


 After issues like LUCENE-3843, introducing new attributes, we should remove 
 Token class in trunk, as it leads to code that ignores those new attributes 
 (like PositionLengthAttribute, ScriptAttribute, KeywordAttribute,...). If you 
 want a holder for all Attributes a TokenStream, use TS.cloneAttributes().

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3161) consider warnings from the source compilation

2012-03-05 Thread Robert Muir (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-3161:


Fix Version/s: (was: 3.6)

 consider warnings from the source compilation
 -

 Key: LUCENE-3161
 URL: https://issues.apache.org/jira/browse/LUCENE-3161
 Project: Lucene - Java
  Issue Type: Task
  Components: general/build
Reporter: Robert Muir
  Labels: maybe32blocker
 Fix For: 4.0


 as Doron mentioned in his review: At compiling there are various warning 
 printed, I think it would be more assuring for downloaders if the build runs 
 without warning. These warnings are not a stopper.
 we could conditionalize these warnings so that they don't display when 
 compiling from actual releases, but I have to wonder if we should hide 
 these... being open source I think we should display all our warts, maybe 
 some contributor sees these warnings and decides they want to submit a patch 
 to fix some of them.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3755) Add examples to javadocs of Analyzer (4.0)/ReusableAnalyzerBase(3.6)

2012-03-05 Thread Robert Muir (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-3755:


Fix Version/s: (was: 3.6)

 Add examples to javadocs of Analyzer (4.0)/ReusableAnalyzerBase(3.6)
 

 Key: LUCENE-3755
 URL: https://issues.apache.org/jira/browse/LUCENE-3755
 Project: Lucene - Java
  Issue Type: Task
  Components: general/javadocs
Reporter: Robert Muir
  Labels: newdev
 Fix For: 4.0


 This stuff is great, it makes it easy to define analyzers:
 {code}
 Analyzer analyzer = new Analyzer() {
 @Override
 protected TokenStreamComponents createComponents(String fieldName, Reader 
 reader) {
   Tokenizer source = new FooTokenizer(reader);
   TokenStream filter = new FooFilter(source);
   filter = new BarFilter(filter);
   return new TokenStreamComponents(source, filter);
 }
   };
 {code}
 But, we should add basic examples to the javadocs I think (we can backport to 
 ReusableAnalyzerBase).
 Also it would be nice to throw in an example that adds a CharFilter.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-2974) the hudson nightly for lucene should check out lucene by itself

2012-03-05 Thread Robert Muir (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-2974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-2974:


Fix Version/s: (was: 3.6)

 the hudson nightly for lucene should check out lucene by itself
 ---

 Key: LUCENE-2974
 URL: https://issues.apache.org/jira/browse/LUCENE-2974
 Project: Lucene - Java
  Issue Type: Bug
  Components: general/build
Reporter: Robert Muir
 Fix For: 4.0


 Currently its too easy to break the lucene-only packaging and build.
 the hudson job for lucene should check out lucene by itself, this will
 prevent it from being broken.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



  1   2   >