[jira] [Updated] (LUCENE-6654) KNearestNeighborClassifier not taking in consideration Class ranking

2015-07-14 Thread Alessandro Benedetti (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-6654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alessandro Benedetti updated LUCENE-6654:
-
Lucene Fields: Patch Available  (was: New,Patch Available)

 KNearestNeighborClassifier not taking in consideration Class ranking
 

 Key: LUCENE-6654
 URL: https://issues.apache.org/jira/browse/LUCENE-6654
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/classification
Affects Versions: 5.2.1
Reporter: Alessandro Benedetti
Priority: Minor
  Labels: classification, knn
 Attachments: LUCENE-6654.patch


 Currently the KNN Classifier assign the score for a ClassificationResult, 
 based only on the frequency of the class in the top K results.
 This is conceptually a simplification.
 Actually the ranking must take a part.
 If not this can happen :
 Top 4
 1) Class1
 2) Class1
 3) Class2
 4) Class2
 As a result of this Top 4 , both the classes will have the same score.
 But the expected result is that Class1 has a better score, as the MLT score 
 the documents accordingly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Assigned] (SOLR-7766) support creation of a coreless collection

2015-07-14 Thread Ramkumar Aiyengar (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ramkumar Aiyengar reassigned SOLR-7766:
---

Assignee: Ramkumar Aiyengar

 support creation of a coreless collection
 -

 Key: SOLR-7766
 URL: https://issues.apache.org/jira/browse/SOLR-7766
 Project: Solr
  Issue Type: New Feature
Reporter: Christine Poerschke
Assignee: Ramkumar Aiyengar
Priority: Minor

 By supplying a deliberately empty create node set (via 
 {{.../solr/admin/collections?action=CREATEcreateNodeSet=name=myCollection...}})
  the collection can be created in the usual manner but it will just not yet 
 contain any cores for any of its shards. Later on 
 {{.../solr/admin/collections?wt=jsonaction=ADDREPLICAcollection=myCollection...}}
  calls can create cores as and when and where required.
 github pull request with proposed changes to follow.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-6641) Idea CodeSyle should be enriched

2015-07-14 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-6641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14627183#comment-14627183
 ] 

Michael McCandless commented on LUCENE-6641:


You're welcome [~alessandro.benedetti]!  Thank you for raising this.

 Idea CodeSyle should be enriched
 

 Key: LUCENE-6641
 URL: https://issues.apache.org/jira/browse/LUCENE-6641
 Project: Lucene - Core
  Issue Type: Improvement
  Components: -tools
Affects Versions: 5.2.1
Reporter: Alessandro Benedetti
  Labels: codestyle
 Attachments: Eclipse-Lucene-Codestyle.xml


 Currently Idea CodeStyle has been fixed for latest intelljIdea version but it 
 is not complete.
 For example it does not contain spaces management ( space within method 
 params, space between operators ext )for the Java language ( and maybe for 
 the other languages involved as well ) .
 We should define a complete standard ( as some inconsistencies are in the 
 committed code as well) .



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Assigned] (SOLR-7765) TokenizerChain without char filters cause NPE in luke request handler

2015-07-14 Thread Hoss Man (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-7765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man reassigned SOLR-7765:
--

Assignee: Hoss Man

 TokenizerChain without char filters cause NPE in luke request handler
 -

 Key: SOLR-7765
 URL: https://issues.apache.org/jira/browse/SOLR-7765
 Project: Solr
  Issue Type: Bug
Affects Versions: 5.2.1
Reporter: Konstantin Gribov
Assignee: Hoss Man
Priority: Minor
 Attachments: SOLR-7765.patch


 {{TokenizerChain}} created using 2-arg constructor has {{null}} in 
 {{charFilters}}, so {{LukeRequestHandler}} throws NPE on iterating it.
 Will create PR in a couple of minutes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-7765) TokenizerChain without char filters cause NPE in luke request handler

2015-07-14 Thread Hoss Man (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-7765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated SOLR-7765:
---
Attachment: SOLR-7765.patch

Konstantin: i tried to whip up a quick patch to demonstrate the bug you were 
fixing (we always like to have test cases to help minimize the chance of 
regression in bug fixes) but i couldn't reproduce the problem (ie: the test 
passed)

can you please elaborate on the steps to reproduce in order to trigger an NPE 
in the Luke handler?

 TokenizerChain without char filters cause NPE in luke request handler
 -

 Key: SOLR-7765
 URL: https://issues.apache.org/jira/browse/SOLR-7765
 Project: Solr
  Issue Type: Bug
Affects Versions: 5.2.1
Reporter: Konstantin Gribov
Priority: Minor
 Attachments: SOLR-7765.patch


 {{TokenizerChain}} created using 2-arg constructor has {{null}} in 
 {{charFilters}}, so {{LukeRequestHandler}} throws NPE on iterating it.
 Will create PR in a couple of minutes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.8.0_45) - Build # 13465 - Failure!

2015-07-14 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/13465/
Java: 64bit/jdk1.8.0_45 -XX:+UseCompressedOops -XX:+UseG1GC

3 tests failed.
FAILED:  junit.framework.TestSuite.org.apache.solr.core.TestLazyCores

Error Message:
ERROR: SolrIndexSearcher opens=51 closes=50

Stack Trace:
java.lang.AssertionError: ERROR: SolrIndexSearcher opens=51 closes=50
at __randomizedtesting.SeedInfo.seed([680F5D1EE70417EF]:0)
at org.junit.Assert.fail(Assert.java:93)
at 
org.apache.solr.SolrTestCaseJ4.endTrackingSearchers(SolrTestCaseJ4.java:465)
at org.apache.solr.SolrTestCaseJ4.afterClass(SolrTestCaseJ4.java:232)
at sun.reflect.GeneratedMethodAccessor32.invoke(Unknown Source)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:497)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1627)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$5.evaluate(RandomizedRunner.java:799)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
com.carrotsearch.randomizedtesting.rules.SystemPropertiesRestoreRule$1.evaluate(SystemPropertiesRestoreRule.java:57)
at 
org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
org.apache.lucene.util.TestRuleStoreClassName$1.evaluate(TestRuleStoreClassName.java:42)
at 
com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:39)
at 
com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:39)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
org.apache.lucene.util.TestRuleAssertionsRequired$1.evaluate(TestRuleAssertionsRequired.java:54)
at 
org.apache.lucene.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:48)
at 
org.apache.lucene.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:65)
at 
org.apache.lucene.util.TestRuleIgnoreTestSuites$1.evaluate(TestRuleIgnoreTestSuites.java:55)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:365)
at java.lang.Thread.run(Thread.java:745)


FAILED:  junit.framework.TestSuite.org.apache.solr.core.TestLazyCores

Error Message:
1 thread leaked from SUITE scope at org.apache.solr.core.TestLazyCores: 1) 
Thread[id=8235, name=searcherExecutor-3585-thread-1, state=WAITING, 
group=TGRP-TestLazyCores] at sun.misc.Unsafe.park(Native Method)
 at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175) 
at 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:2039)
 at 
java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:442) 
at 
java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:1067)   
  at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1127) 
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) 
at java.lang.Thread.run(Thread.java:745)

Stack Trace:
com.carrotsearch.randomizedtesting.ThreadLeakError: 1 thread leaked from SUITE 
scope at org.apache.solr.core.TestLazyCores: 
   1) Thread[id=8235, name=searcherExecutor-3585-thread-1, state=WAITING, 
group=TGRP-TestLazyCores]
at sun.misc.Unsafe.park(Native Method)
at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
at 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:2039)
at 
java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:442)
at 
java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:1067)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1127)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
at __randomizedtesting.SeedInfo.seed([680F5D1EE70417EF]:0)


FAILED:  junit.framework.TestSuite.org.apache.solr.core.TestLazyCores

Error Message:
There are still zombie threads that couldn't be terminated:1) 
Thread[id=8235, 

[jira] [Commented] (SOLR-7036) Faster method for group.facet

2015-07-14 Thread Erick Erickson (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14627497#comment-14627497
 ] 

Erick Erickson commented on SOLR-7036:
--

Man, talk about a while. Anyway, I can't apply the patch and I spent some time 
tonight trying to figure out what needed to change but then it went into the 
weeds, compounded by the fact that I'm not all that familiar with the code.

[~jimtronic] Could I ask you to update the patch to _trunk_ a.k.a 6.x? Then 
I'll merge it into 5x if all goes well.

[~yo...@apache.org][~hossman_luc...@fucit.org] You guys should know I'm an 
amateur at all the faceting code. I'll look over what I can but if either of 
you would like to weigh in on any new patch that would be great. Or anyone else 
for that matter.

 Faster method for group.facet
 -

 Key: SOLR-7036
 URL: https://issues.apache.org/jira/browse/SOLR-7036
 Project: Solr
  Issue Type: Improvement
  Components: faceting
Affects Versions: 4.10.3
Reporter: Jim Musil
Assignee: Erick Erickson
 Fix For: 5.3, Trunk

 Attachments: SOLR-7036.patch, SOLR-7036.patch


 This is a patch that speeds up the performance of requests made with 
 group.facet=true. The original code that collects and counts unique facet 
 values for each group does not use the same improved field cache methods that 
 have been added for normal faceting in recent versions.
 Specifically, this approach leverages the UninvertedField class which 
 provides a much faster way to look up docs that contain a term. I've also 
 added a simple grouping map so that when a term is found for a doc, it can 
 quickly look up the group to which it belongs.
 Group faceting was very slow for our data set and when the number of docs or 
 terms was high, the latency spiked to multiple second requests. This solution 
 provides better overall performance -- from an average of 54ms to 32ms. It 
 also dropped our slowest performing queries way down -- from 6012ms to 991ms.
 I also added a few tests.
 I added an additional parameter so that you can choose to use this method or 
 the original. Add group.facet.method=fc to use the improved method or 
 group.facet.method=original which is the default if not specified.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_45) - Build # 13459 - Still Failing!

2015-07-14 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/13459/
Java: 32bit/jdk1.8.0_45 -client -XX:+UseParallelGC

1 tests failed.
FAILED:  org.apache.solr.cloud.CleanupOldIndexTest.test

Error Message:
Address already in use

Stack Trace:
java.net.BindException: Address already in use
at 
__randomizedtesting.SeedInfo.seed([9AB39A3B1B01A02C:12E7A5E1B5FDCDD4]:0)
at sun.nio.ch.Net.bind0(Native Method)
at sun.nio.ch.Net.bind(Net.java:437)
at sun.nio.ch.Net.bind(Net.java:429)
at 
sun.nio.ch.ServerSocketChannelImpl.bind(ServerSocketChannelImpl.java:223)
at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:74)
at 
org.eclipse.jetty.server.ServerConnector.open(ServerConnector.java:321)
at 
org.eclipse.jetty.server.AbstractNetworkConnector.doStart(AbstractNetworkConnector.java:80)
at 
org.eclipse.jetty.server.ServerConnector.doStart(ServerConnector.java:236)
at 
org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:68)
at org.eclipse.jetty.server.Server.doStart(Server.java:366)
at 
org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:68)
at 
org.apache.solr.client.solrj.embedded.JettySolrRunner.start(JettySolrRunner.java:386)
at 
org.apache.solr.cloud.CleanupOldIndexTest.test(CleanupOldIndexTest.java:109)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:497)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1627)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$6.evaluate(RandomizedRunner.java:836)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$7.evaluate(RandomizedRunner.java:872)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$8.evaluate(RandomizedRunner.java:886)
at 
org.apache.solr.BaseDistributedSearchTestCase$ShardsRepeatRule$ShardsFixedStatement.callStatement(BaseDistributedSearchTestCase.java:960)
at 
org.apache.solr.BaseDistributedSearchTestCase$ShardsRepeatRule$ShardsStatement.evaluate(BaseDistributedSearchTestCase.java:935)
at 
com.carrotsearch.randomizedtesting.rules.SystemPropertiesRestoreRule$1.evaluate(SystemPropertiesRestoreRule.java:57)
at 
org.apache.lucene.util.TestRuleSetupTeardownChained$1.evaluate(TestRuleSetupTeardownChained.java:50)
at 
org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46)
at 
org.apache.lucene.util.TestRuleThreadAndTestName$1.evaluate(TestRuleThreadAndTestName.java:49)
at 
org.apache.lucene.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:65)
at 
org.apache.lucene.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:48)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:365)
at 
com.carrotsearch.randomizedtesting.ThreadLeakControl.forkTimeoutingTask(ThreadLeakControl.java:798)
at 
com.carrotsearch.randomizedtesting.ThreadLeakControl$3.evaluate(ThreadLeakControl.java:458)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner.runSingleTest(RandomizedRunner.java:845)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$3.evaluate(RandomizedRunner.java:747)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$4.evaluate(RandomizedRunner.java:781)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$5.evaluate(RandomizedRunner.java:792)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
com.carrotsearch.randomizedtesting.rules.SystemPropertiesRestoreRule$1.evaluate(SystemPropertiesRestoreRule.java:57)
at 
org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
org.apache.lucene.util.TestRuleStoreClassName$1.evaluate(TestRuleStoreClassName.java:42)
at 
com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:39)
at 
com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:39)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 

<    1   2