Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Shalin Shekhar Mangar
Welcome Areek! On Thu, Nov 12, 2015 at 2:18 AM, Simon Willnauer wrote: > I'm pleased to announce that Areek has accepted the PMC's invitation to > join. > > Welcome Areek! > > Simon -- Regards, Shalin Shekhar Mangar. - To un

[JENKINS] Lucene-Solr-NightlyTests-5.x - Build # 1014 - Still Failing

2015-11-11 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-NightlyTests-5.x/1014/ 1 tests failed. FAILED: org.apache.solr.cloud.CollectionsAPIDistributedZkTest.test Error Message: Captured an uncaught exception in thread: Thread[id=15513, name=collection4, state=RUNNABLE, group=TGRP-CollectionsAPIDistrib

[jira] [Comment Edited] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread David Smiley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001688#comment-15001688 ] David Smiley edited comment on LUCENE-6874 at 11/12/15 4:47 AM: ---

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread david.w.smi...@gmail.com
Welcome Areek! On Wed, Nov 11, 2015 at 3:49 PM Simon Willnauer wrote: > I'm pleased to announce that Areek has accepted the PMC's invitation to > join. > > Welcome Areek! > > > Simon > -- Lucene/Solr Search Committer, Consultant, Developer, Author, Speaker LinkedIn: http://linkedin.com/in/david

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread David Smiley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001688#comment-15001688 ] David Smiley commented on LUCENE-6874: -- +1 I like it Uwe; nice job. Automating the

[jira] [Closed] (SOLR-7669) Add SelectStream to Streaming API

2015-11-11 Thread Dennis Gove (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-7669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Gove closed SOLR-7669. - Resolution: Implemented Fix Version/s: Trunk > Add SelectStream to Streaming API > -

[jira] [Commented] (SOLR-7669) Add SelectStream to Streaming API

2015-11-11 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-7669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001680#comment-15001680 ] ASF subversion and git services commented on SOLR-7669: --- Commit 17139

[jira] [Updated] (SOLR-7669) Add SelectStream to Streaming API

2015-11-11 Thread Dennis Gove (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-7669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Gove updated SOLR-7669: -- Attachment: SOLR-7669.patch Fixes for pre-commit failures. Add documentation on the operations. > Add Se

[JENKINS-EA] Lucene-Solr-trunk-Linux (64bit/jdk1.9.0-ea-b90) - Build # 14876 - Still Failing!

2015-11-11 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/14876/ Java: 64bit/jdk1.9.0-ea-b90 -XX:+UseCompressedOops -XX:+UseSerialGC 3 tests failed. FAILED: junit.framework.TestSuite.org.apache.solr.cloud.SaslZkACLProviderTest Error Message: 5 threads leaked from SUITE scope at org.apache.s

[jira] [Assigned] (SOLR-7669) Add SelectStream to Streaming API

2015-11-11 Thread Dennis Gove (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-7669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Gove reassigned SOLR-7669: - Assignee: Dennis Gove > Add SelectStream to Streaming API > - > >

[JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_66) - Build # 14875 - Failure!

2015-11-11 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/14875/ Java: 32bit/jdk1.8.0_66 -server -XX:+UseParallelGC 1 tests failed. FAILED: org.apache.solr.cloud.SyncSliceTest.test Error Message: timeout waiting to see all nodes active Stack Trace: java.lang.AssertionError: timeout waiting

[JENKINS] Lucene-Solr-Tests-trunk-Java8 - Build # 611 - Failure

2015-11-11 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-Tests-trunk-Java8/611/ 1 tests failed. FAILED: org.apache.solr.handler.TestReplicationHandler.doTestIndexAndConfigAliasReplication Error Message: Index: 0, Size: 0 Stack Trace: java.lang.IndexOutOfBoundsException: Index: 0, Size: 0 at _

[jira] [Updated] (SOLR-8280) TestCloudSchemaless + ChangedSchemaMergeTest fail weirdly if you try to use SolrCoreAware sim factory: SchemaSimilarityFactory

2015-11-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-8280: --- Description: Something about the code path(s) involved in TestCloudSchemaless & ChangedSchemaMergeTest don't p

[jira] [Updated] (SOLR-8280) TestCloudSchemaless fails weirdly if you try to use SolrCoreAware sim factory: SchemaSimilarityFactory

2015-11-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-8280: --- Attachment: SOLR-8280.patch After filing this, it occurred to me the original issue that lead me down this inv

[JENKINS] Lucene-Solr-5.x-Solaris (multiarch/jdk1.7.0) - Build # 182 - Still Failing!

2015-11-11 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-5.x-Solaris/182/ Java: multiarch/jdk1.7.0 -d64 -XX:+UseCompressedOops -XX:+UseConcMarkSweepGC 1 tests failed. FAILED: org.apache.solr.cloud.hdfs.StressHdfsTest.test Error Message: Could not load collection from ZK:delete_data_dir Stack Trace: or

[jira] [Updated] (SOLR-8271) use SchemaSimilarityFactory as default when no explicit (top level) sim is configured

2015-11-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-8271: --- Attachment: SOLR-8271.patch Initial simple patch, currently causes failures in TestCloudSchemaless & ChangedSc

[jira] [Commented] (SOLR-6406) ConcurrentUpdateSolrServer hang in blockUntilFinished.

2015-11-11 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001429#comment-15001429 ] Yonik Seeley commented on SOLR-6406: I was analyzing another "shards-out-of-sync" failu

[jira] [Updated] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Attachment: (was: unicode-ws-tokenizer.patch) > WhitespaceTokenizer should tokenize on NBSP

[jira] [Updated] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Attachment: (was: unicode-ws-tokenizer.patch) > WhitespaceTokenizer should tokenize on NBSP

[jira] [Updated] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Attachment: unicode-ws-tokenizer.patch > WhitespaceTokenizer should tokenize on NBSP >

[jira] [Updated] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Attachment: unicode-ws-tokenizer.patch Minor changes > WhitespaceTokenizer should tokenize on

[jira] [Assigned] (SOLR-8281) Add RollupMergeStream to Streaming API

2015-11-11 Thread Joel Bernstein (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Bernstein reassigned SOLR-8281: Assignee: Joel Bernstein > Add RollupMergeStream to Streaming API >

[jira] [Updated] (SOLR-8281) Add RollupMergeStream to Streaming API

2015-11-11 Thread Joel Bernstein (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Bernstein updated SOLR-8281: - Description: The RollupMergeStream merges the aggregate results emitted by the RollupStream on *wo

[jira] [Comment Edited] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001419#comment-15001419 ] Uwe Schindler edited comment on LUCENE-6874 at 11/12/15 12:42 AM: -

[jira] [Updated] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Attachment: unicode-ws-tokenizer.patch Here is my patch with the UnicodeWhitespaceTokenizer, sm

[jira] [Created] (SOLR-8281) Add RollupMergeStream to Streaming API

2015-11-11 Thread Joel Bernstein (JIRA)
Joel Bernstein created SOLR-8281: Summary: Add RollupMergeStream to Streaming API Key: SOLR-8281 URL: https://issues.apache.org/jira/browse/SOLR-8281 Project: Solr Issue Type: Bug

[jira] [Updated] (SOLR-8280) TestCloudSchemaless fails weirdly if you try to use SolrCoreAware sim factory: SchemaSimilarityFactory

2015-11-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-8280: --- Attachment: SOLR-8280.patch With the attached patch, it's trivial to trigger a suite failure in TestCloudSche

Re: Sharing a class across cores

2015-11-11 Thread Gus Heck
Thanks for the links and perspective Hoss 3443 is exactly the same type of problem only for spellchecking. My initial thought was also a map, perhaps I'll adopt that ticket. The basic idea would be a Map on CoreContainer, accessors, and some very very strongly worded javadoc about keeping things si

[jira] [Created] (SOLR-8280) TestCloudSchemaless fails weirdly if you try to use SolrCoreAware sim factory: SchemaSimilarityFactory

2015-11-11 Thread Hoss Man (JIRA)
Hoss Man created SOLR-8280: -- Summary: TestCloudSchemaless fails weirdly if you try to use SolrCoreAware sim factory: SchemaSimilarityFactory Key: SOLR-8280 URL: https://issues.apache.org/jira/browse/SOLR-8280

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Christian Moen
Congrats, Areek! Christian > On Nov 12, 2015, at 5:48 AM, Simon Willnauer > wrote: > > I'm pleased to announce that Areek has accepted the PMC's invitation to join. > > Welcome Areek! > > Simon

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Koji Sekiguchi
Welcome Areek! Koji On 2015/11/12 5:48, Simon Willnauer wrote: I'm pleased to announce that Areek has accepted the PMC's invitation to join. Welcome Areek! Simon - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org

Re: Sharing a class across cores

2015-11-11 Thread Gus Heck
I'm trying to stay within the realm of solr's resource loader, so that I don't need to tweak startup parameters (e.g. classpath, sysprops) or rely on hardcoded stuff. The data must be usable by a query SearchComponent, and those are loaded by the core using the resource loader... One question I ha

[jira] [Updated] (SOLR-7669) Add SelectStream to Streaming API

2015-11-11 Thread Dennis Gove (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-7669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Gove updated SOLR-7669: -- Attachment: SOLR-7669.patch Rebased against trunk. > Add SelectStream to Streaming API > ---

[jira] [Closed] (SOLR-8188) Add hash style joins to the Streaming API and Streaming Expressions

2015-11-11 Thread Dennis Gove (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Gove closed SOLR-8188. - Resolution: Implemented Still closed > Add hash style joins to the Streaming API and Streaming Expressions

[jira] [Updated] (SOLR-8188) Add hash style joins to the Streaming API and Streaming Expressions

2015-11-11 Thread Dennis Gove (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Gove updated SOLR-8188: -- Attachment: SOLR-8188.patch This is the patch that was applied to trunk. > Add hash style joins to the S

[jira] [Reopened] (SOLR-8188) Add hash style joins to the Streaming API and Streaming Expressions

2015-11-11 Thread Dennis Gove (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Gove reopened SOLR-8188: --- Forgot to attach a slightly modified patch file (rebased off trunk). > Add hash style joins to the Streami

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Joel Bernstein
Welcome Areek! Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Nov 11, 2015 at 7:13 PM, Nicholas Knize wrote: > Congrats Areek!! > > On Nov 11, 2015, at 2:48 PM, Simon Willnauer > wrote: > > I'm pleased to announce that Areek has accepted the PMC's invitation to > join. > > Welcome Areek!

Re: Sharing a class across cores

2015-11-11 Thread Chris Hostetter
: However, my take on it is that this seems like a pretty broad brush to : paint with to move *all* our classes up and out of the normal core loading : process. I assume there are good reasons for segregating this stuff into : separate class loaders to begin with. It would also be fairly burdensom

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Nicholas Knize
Congrats Areek!! > On Nov 11, 2015, at 2:48 PM, Simon Willnauer > wrote: > > I'm pleased to announce that Areek has accepted the PMC's invitation to join. > > Welcome Areek! > > Simon

Re: Sharing a class across cores

2015-11-11 Thread Gus Heck
Thought of that but I'm trying to avoid additional infrastructure... and it will be accessed 0 or 1 times per query, so speed matters a bit. On Wed, Nov 11, 2015 at 7:05 PM, Walter Underwood wrote: > Depending on how fast the access needs to be, you could put that big map > in memcache. > > wund

[jira] [Closed] (SOLR-8188) Add hash style joins to the Streaming API and Streaming Expressions

2015-11-11 Thread Dennis Gove (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Gove closed SOLR-8188. - Resolution: Implemented Fix Version/s: Trunk > Add hash style joins to the Streaming API and Streami

[jira] [Commented] (SOLR-8188) Add hash style joins to the Streaming API and Streaming Expressions

2015-11-11 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001376#comment-15001376 ] ASF subversion and git services commented on SOLR-8188: --- Commit 17139

Re: Sharing a class across cores

2015-11-11 Thread Benson Margulies
What is the connection of a blob of data and a class in a class loader? Is it a class of your own that you're using to store the data? Solr can't change fundamental facts about class loader; if an object of a class needs to be shared across class loaders, it has to be loaded into a common parent.

[jira] [Assigned] (SOLR-8188) Add hash style joins to the Streaming API and Streaming Expressions

2015-11-11 Thread Dennis Gove (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Gove reassigned SOLR-8188: - Assignee: Dennis Gove > Add hash style joins to the Streaming API and Streaming Expressions >

Re: Sharing a class across cores

2015-11-11 Thread Walter Underwood
Depending on how fast the access needs to be, you could put that big map in memcache. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Nov 11, 2015, at 4:04 PM, Gus Heck wrote: > > P.S. I posted the original message concurrently with the chat sessi

Re: Sharing a class across cores

2015-11-11 Thread Gus Heck
P.S. I posted the original message concurrently with the chat session's occurance I beleive, certainly before I had read it, so no I haven't actually tried what you suggest yet. On Wed, Nov 11, 2015 at 7:02 PM, Gus Heck wrote: > Yes asked by a colleague :). The chat session is now in our jira ti

Re: Sharing a class across cores

2015-11-11 Thread Gus Heck
Yes asked by a colleague :). The chat session is now in our jira ticket :). However, my take on it is that this seems like a pretty broad brush to paint with to move *all* our classes up and out of the normal core loading process. I assume there are good reasons for segregating this stuff into sep

Re: Sharing a class across cores

2015-11-11 Thread Shawn Heisey
On 11/11/2015 4:11 PM, Gus Heck wrote: > I have a case where a component loads up a large CSV file (2.5 million > lines) to build a map. This worked ok in a case where we had a single > core, but it isn't working so well with 40 cores because each core loads > a new copy of the component in a new c

[jira] [Updated] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Attachment: (was: icu-datasucker.patch) > WhitespaceTokenizer should tokenize on NBSP > ---

[jira] [Updated] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Attachment: icu-datasucker.patch now the right patch. > WhitespaceTokenizer should tokenize on

[jira] [Updated] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Attachment: (was: icu-datasucker.patch) > WhitespaceTokenizer should tokenize on NBSP > ---

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001349#comment-15001349 ] Uwe Schindler commented on LUCENE-6874: --- Sorry updated my post, recognized this a m

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Steve Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001346#comment-15001346 ] Steve Rowe commented on LUCENE-6874: Uwe, you're using UCharacter,isWhitespace(), but

[jira] [Issue Comment Deleted] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Comment: was deleted (was: Result when running: {noformat} unicode-tokenizers: [groovy] Uni

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001345#comment-15001345 ] Uwe Schindler commented on LUCENE-6874: --- Sorry my fault, must be UCharacter.isUWhit

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001338#comment-15001338 ] Uwe Schindler commented on LUCENE-6874: --- Result when running: {noformat} unicode-t

[jira] [Updated] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Attachment: icu-datasucker.patch New simplified version, now printing unicode version (7.0 at m

[jira] [Updated] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-6874: -- Attachment: icu-datasucker.patch Here is my ICU datasucker. The lines with System.println shoul

[jira] [Created] (LUCENE-6895) TestGeo3DPointField.testBasic() failure

2015-11-11 Thread Steve Rowe (JIRA)
Steve Rowe created LUCENE-6895: -- Summary: TestGeo3DPointField.testBasic() failure Key: LUCENE-6895 URL: https://issues.apache.org/jira/browse/LUCENE-6895 Project: Lucene - Core Issue Type: Bug

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Steve Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001312#comment-15001312 ] Steve Rowe commented on LUCENE-6874: bq. My idea was to create the whitespace chars a

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001307#comment-15001307 ] Uwe Schindler commented on LUCENE-6874: --- ...hacking Groovy script using ICU4J as sp

Sharing a class across cores

2015-11-11 Thread Gus Heck
I have a case where a component loads up a large CSV file (2.5 million lines) to build a map. This worked ok in a case where we had a single core, but it isn't working so well with 40 cores because each core loads a new copy of the component in a new classloader and I get 40 new versions of the sam

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Erick Erickson
Congrats! On Wed, Nov 11, 2015 at 2:43 PM, Jan Høydahl wrote: > Welcome Areek! > > -- > Jan Høydahl, search solution architect > Cominvent AS - www.cominvent.com > > 11. nov. 2015 kl. 21.48 skrev Simon Willnauer : > > I'm pleased to announce that Areek has accepted the PMC's invitation to > join.

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001292#comment-15001292 ] Uwe Schindler commented on LUCENE-6874: --- My idea was to create the whitespace chars

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Steve Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001283#comment-15001283 ] Steve Rowe commented on LUCENE-6874: bq. Would this work? Yes, but I think ICU4J is

[jira] [Commented] (LUCENE-6894) Improve DISI.cost() by assuming independence for match probabilities

2015-11-11 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001272#comment-15001272 ] Paul Elschot commented on LUCENE-6894: -- That one is actually solved nowadays by the

[jira] [Commented] (LUCENE-6894) Improve DISI.cost() by assuming independence for match probabilities

2015-11-11 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001269#comment-15001269 ] Paul Elschot commented on LUCENE-6894: -- Suppose the query is a conjunction of 2 phra

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Jan Høydahl
Welcome Areek! -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 11. nov. 2015 kl. 21.48 skrev Simon Willnauer : > > I'm pleased to announce that Areek has accepted the PMC's invitation to join. > > Welcome Areek! > > Simon

[jira] [Comment Edited] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001257#comment-15001257 ] Uwe Schindler edited comment on LUCENE-6874 at 11/11/15 10:39 PM: -

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001257#comment-15001257 ] Uwe Schindler commented on LUCENE-6874: --- Cool! So my idea would be to write a smal

[jira] [Commented] (SOLR-8275) Unclear error message during recovery

2015-11-11 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001245#comment-15001245 ] Mark Miller commented on SOLR-8275: --- Also, I don't think BAD_REQUEST is probably the corr

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Alan Woodward
Welcome Areek! Alan Woodward www.flax.co.uk On 11 Nov 2015, at 20:48, Simon Willnauer wrote: > I'm pleased to announce that Areek has accepted the PMC's invitation to join. > > Welcome Areek! > > Simon

[jira] [Commented] (SOLR-8275) Unclear error message during recovery

2015-11-11 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001243#comment-15001243 ] Mark Miller commented on SOLR-8275: --- You should see the full info in the wait loop loggin

[jira] [Commented] (LUCENE-6894) Improve DISI.cost() by assuming independence for match probabilities

2015-11-11 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001241#comment-15001241 ] Adrien Grand commented on LUCENE-6894: -- bq. The independence that is assumed is norm

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Tomás Fernández Löbbe
Congratulations Areek!! On Wed, Nov 11, 2015 at 2:18 PM, Michael McCandless < luc...@mikemccandless.com> wrote: > Welcome Areek! > > Mike McCandless > > http://blog.mikemccandless.com > > > On Wed, Nov 11, 2015 at 3:48 PM, Simon Willnauer > wrote: > > I'm pleased to announce that Areek has accep

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Michael McCandless
Welcome Areek! Mike McCandless http://blog.mikemccandless.com On Wed, Nov 11, 2015 at 3:48 PM, Simon Willnauer wrote: > I'm pleased to announce that Areek has accepted the PMC's invitation to > join. > > Welcome Areek! > > Simon

[jira] [Commented] (LUCENE-6276) Add matchCost() api to TwoPhaseDocIdSetIterator

2015-11-11 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001228#comment-15001228 ] Adrien Grand commented on LUCENE-6276: -- I'm +1 on the patch. I'll do some more testi

[JENKINS] Lucene-Solr-5.x-MacOSX (64bit/jdk1.8.0) - Build # 2809 - Still Failing!

2015-11-11 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-5.x-MacOSX/2809/ Java: 64bit/jdk1.8.0 -XX:-UseCompressedOops -XX:+UseConcMarkSweepGC 1 tests failed. FAILED: org.apache.solr.cloud.CollectionsAPIDistributedZkTest.test Error Message: Error from server at http://127.0.0.1:64209/ljm/awholynewcollec

[jira] [Updated] (LUCENE-6894) Improve DISI.cost() by assuming independence for match probabilities

2015-11-11 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Elschot updated LUCENE-6894: - Description: The DocIdSetIterator.cost() method returns an estimation of the number of matching

[jira] [Commented] (LUCENE-6276) Add matchCost() api to TwoPhaseDocIdSetIterator

2015-11-11 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001142#comment-15001142 ] Paul Elschot commented on LUCENE-6276: -- I have opened LUCENE-6894 for the independen

[jira] [Updated] (LUCENE-6894) Improve DISI.cost() by assuming independence for match probabilities

2015-11-11 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Elschot updated LUCENE-6894: - Attachment: LUCENE-6894.patch Patch of 11 Nov 2015. Most of the changes are to pass numDocs down

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Varun Thacker
Congratulations Areek! On Wed, Nov 11, 2015 at 1:14 PM, Adrien Grand wrote: > Welcome Areek! > > > Le mer. 11 nov. 2015 à 21:49, Simon Willnauer > a écrit : > >> I'm pleased to announce that Areek has accepted the PMC's invitation to >> join. >> >> Welcome Areek! >> >> >> Simon >> > -- Reg

[jira] [Created] (LUCENE-6894) Improve DISI.cost() by assuming independence for match probabilities

2015-11-11 Thread Paul Elschot (JIRA)
Paul Elschot created LUCENE-6894: Summary: Improve DISI.cost() by assuming independence for match probabilities Key: LUCENE-6894 URL: https://issues.apache.org/jira/browse/LUCENE-6894 Project: Lucene

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Adrien Grand
Welcome Areek! Le mer. 11 nov. 2015 à 21:49, Simon Willnauer a écrit : > I'm pleased to announce that Areek has accepted the PMC's invitation to > join. > > Welcome Areek! > > > Simon >

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Yonik Seeley
Welcome Areek! -Yonik On Wed, Nov 11, 2015 at 3:48 PM, Simon Willnauer wrote: > I'm pleased to announce that Areek has accepted the PMC's invitation to > join. > > Welcome Areek! > > Simon - To unsubscribe, e-mail: dev-unsubsc

Re: Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Steve Rowe
Welcome Areek! Steve > On Nov 11, 2015, at 3:48 PM, Simon Willnauer > wrote: > > I'm pleased to announce that Areek has accepted the PMC's invitation to join. > > Welcome Areek! > > Simon - To unsubscribe, e-mail: dev-unsu

Welcome Areek Zillur to the Lucene / Solr PMC

2015-11-11 Thread Simon Willnauer
I'm pleased to announce that Areek has accepted the PMC's invitation to join. Welcome Areek! Simon

[JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_66) - Build # 14872 - Still Failing!

2015-11-11 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/14872/ Java: 32bit/jdk1.8.0_66 -server -XX:+UseConcMarkSweepGC 1 tests failed. FAILED: org.apache.solr.cloud.HttpPartitionTest.test Error Message: Didn't see all replicas for shard shard1 in c8n_1x2 come up within 3 ms! ClusterSt

[jira] [Updated] (SOLR-7989) Down replica elected leader, stays down after successful election

2015-11-11 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-7989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated SOLR-7989: -- Attachment: SOLR-7989.patch New version. Added some logging. Changed so we won't publish ACTIVE after a

[jira] [Commented] (SOLR-7036) Faster method for group.facet

2015-11-11 Thread Jim Musil (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-7036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001020#comment-15001020 ] Jim Musil commented on SOLR-7036: - I'm still working on this. Quite a bit changed under the

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Steve Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001000#comment-15001000 ] Steve Rowe commented on LUCENE-6874: bq. My idea was to use a Unicode data file and e

[jira] [Updated] (SOLR-8279) Add a new SolrCloud test that stops and starts the cluster while indexing data.

2015-11-11 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated SOLR-8279: -- Attachment: SOLR-8279.patch > Add a new SolrCloud test that stops and starts the cluster while indexing

[jira] [Updated] (SOLR-8279) Add a new SolrCloud test that stops and starts the cluster while indexing data.

2015-11-11 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-8279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated SOLR-8279: -- Attachment: SOLR-8279.patch Started some work on a new test. > Add a new SolrCloud test that stops and s

[jira] [Created] (SOLR-8279) Add a new SolrCloud test that stops and starts the cluster while indexing data.

2015-11-11 Thread Mark Miller (JIRA)
Mark Miller created SOLR-8279: - Summary: Add a new SolrCloud test that stops and starts the cluster while indexing data. Key: SOLR-8279 URL: https://issues.apache.org/jira/browse/SOLR-8279 Project: Solr

[jira] [Updated] (LUCENE-6892) various lucene.index initialCapacity tweaks

2015-11-11 Thread Christine Poerschke (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christine Poerschke updated LUCENE-6892: Lucene Fields: (was: New) > various lucene.index initialCapacity tweaks > ---

[jira] [Commented] (LUCENE-6892) various lucene.index initialCapacity tweaks

2015-11-11 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15000954#comment-15000954 ] ASF subversion and git services commented on LUCENE-6892: - Commit

[jira] [Resolved] (LUCENE-6892) various lucene.index initialCapacity tweaks

2015-11-11 Thread Christine Poerschke (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christine Poerschke resolved LUCENE-6892. - Resolution: Fixed Fix Version/s: 5.4 Trunk > various lu

[jira] [Commented] (LUCENE-6874) WhitespaceTokenizer should tokenize on NBSP

2015-11-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15000940#comment-15000940 ] Uwe Schindler commented on LUCENE-6874: --- bq. Why persist the bitset and deal with t

query complexity

2015-11-11 Thread search engine
Hi, I've been thinking how to use big O annotation to show complexity for different types of queries, like term query, prefix query, phrase query, wild card and fuzzy query. Any ideas? thanks, Zong

[JENKINS] Lucene-Solr-5.x-Linux (32bit/jdk1.7.0_80) - Build # 14580 - Failure!

2015-11-11 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-5.x-Linux/14580/ Java: 32bit/jdk1.7.0_80 -client -XX:+UseParallelGC 3 tests failed. FAILED: junit.framework.TestSuite.org.apache.solr.cloud.CollectionsAPIDistributedZkTest Error Message: 5 threads leaked from SUITE scope at org.apache.solr.clou

  1   2   >