Re: Solr 7.X negative filter not working

2018-09-20 Thread Till Kinstler

On 20.09.2018 at 10:04, damian.pawski wrote:


The problematic search "JobTitle:(NOT programmer)" is constructed via C#
code, so I cannot easily update to "-JobTitle".


JobTitle:(* NOT programmer) should also work, if that helps...
I think NOT without a left-hand term was never officially supported. I 
remember reading "Note: The NOT operator cannot be used with just one 
term." already many years ago in the Lucene documentation (in the 
current Lucene release documentation: 
<http://lucene.apache.org/core/7_4_0/queryparser/org/apache/lucene/queryparser/classic/package-summary.html#NOT>). 
But you are right, it once worked in Solr...
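
If the query string cannot be changed where it is built, one option is to patch it just before it is sent to Solr. The following is only a minimal sketch in Python (the original code is C#, so it would need porting); the function name add_match_all and the regular expression are my own assumptions, and it only handles the simple pattern field:(NOT term) without nested parentheses:

```python
import re

# Matches a field-scoped, purely negative clause such as "JobTitle:(NOT programmer)".
# Assumption: no nested parentheses inside the clause.
_PURE_NOT = re.compile(r'(\w+):\(\s*NOT\s+([^()]+?)\s*\)')

def add_match_all(query: str) -> str:
    """Rewrite field:(NOT term) into field:(* NOT term)."""
    return _PURE_NOT.sub(r'\1:(* NOT \2)', query)

print(add_match_all("JobTitle:(NOT programmer)"))
# JobTitle:(* NOT programmer)
```

Clauses that already have a left-hand term, such as JobTitle:(* NOT programmer), are left unchanged.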


Till

--
Till Kinstler
Verbundzentrale des Gemeinsamen Bibliotheksverbundes (VZG)
Platz der Göttinger Sieben 1, D 37073 Göttingen
kinst...@gbv.de, +49 (0) 551 39-31414, http://www.gbv.de/


Re: SolrCloud constantly crashes after upgrading to Solr 4.7

2014-03-18 Thread Till Kinstler

On 18.03.2014 at 15:26, Martin de Vries wrote:

Martin, I’ve committed the SOLR-5875 fix, including to the
lucene_solr_4_7 branch.

Any chance you could test the fix?


Hi Steve,

I'm very happy you found the bug. We are running the version from SVN on
one server and it has already been running fine for 5 hours. If it's still
stable tomorrow, then we are absolutely sure. I will report it here.


Same here. We had the same OOM trouble as Martin very quickly after 
upgrading our Cloud to 4.7.0 (so we switched back to 4.6.1 immediately).
This morning, I compiled the current lucene_solr_4_7 branch and, after 
having it run without issues on one node for a few hours, I upgraded all 
our 30 nodes about 6 hours ago. So far no OOMs as with 4.7.0. Seems fine 
to me...


Till




Re: Multilanguage

2009-02-17 Thread Till Kinstler

Paul Libbrecht wrote:

Clearly, then, something that matches words in a dictionary and decides 
on the language based on the language of the majority could do a decent 
job to decide the analyzer.


Does such a tool exist?


I once played around with http://ngramj.sourceforge.net/ for language 
guessing. It did a good job. It doesn't use dictionaries for language 
identification but a statistical approach using ngrams.
I don't have any precise numbers, but out of about 1 documents in 
different languages (most in English, German and French, few in other 
European languages like Polish) only about 10 were not identified 
correctly.
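
To illustrate the statistical idea, here is a rough Python sketch of n-gram based language guessing using ranked character n-gram profiles and an "out-of-place" distance. This is not the ngramj API; the function names and the tiny sample texts are assumptions for illustration only, and real profiles would be trained on much larger corpora:

```python
from collections import Counter

def ngram_profile(text, n_max=3, top=300):
    """Return the most frequent character n-grams (length 1..n_max), in rank order."""
    counts = Counter()
    for word in text.lower().split():
        padded = f"_{word}_"  # mark word boundaries
        for n in range(1, n_max + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    return [gram for gram, _ in counts.most_common(top)]

def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences; n-grams missing from the language profile get a fixed penalty."""
    ranks = {gram: rank for rank, gram in enumerate(lang_profile)}
    penalty = len(lang_profile)
    return sum(abs(rank - ranks[gram]) if gram in ranks else penalty
               for rank, gram in enumerate(doc_profile))

def guess_language(text, profiles):
    """Pick the language whose profile is closest to the text's profile."""
    doc = ngram_profile(text)
    return min(profiles, key=lambda lang: out_of_place(doc, profiles[lang]))

# Toy training data (hypothetical, far too small for real use).
SAMPLES = {
    "en": "the library holds many books and the readers search the catalogue every day",
    "de": "die Bibliothek hat viele Bücher und die Leser suchen jeden Tag im Katalog",
}
profiles = {lang: ngram_profile(text) for lang, text in SAMPLES.items()}

print(guess_language("the readers search for books", profiles))
```

No dictionary is involved: only the frequency ranking of short character sequences is compared.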


Till

--
Till Kinstler
Verbundzentrale des Gemeinsamen Bibliotheksverbundes (VZG)
Platz der Göttinger Sieben 1, D 37073 Göttingen
kinst...@gbv.de, +49 (0) 551 39-13431, http://www.gbv.de


Re: new faceting algorithm

2008-12-05 Thread Till Kinstler

Yonik Seeley wrote:


We'd love some feedback on how it works to
ensure that it actually is a win for the majority and should be the
default.


I just did a quick test using Solr nightly 2008-11-30. I have an index 
of about 2.9 million bibliographic records, size: 16G. I tested faceting 
on author names. Each index document may contain multiple author names, so 
author names go into a multivalued field (not analyzed). Queries used 
for testing were extracted from log files of a prototype application.
With facet.method=enum, 50 request threads, I get an average response 
time of about 19(!) ms, no cache evictions. With 1 request thread: 
about 1800 ms.
With facet.method=fc, 50 threads I get an average response time of 
around 300 ms. 1 thread: 16 ms.

Seems to be a major improvement at first sight :-)
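
For anyone who wants to repeat such a comparison, the two variants differ only in the facet.method request parameter. A small Python sketch for building the request URLs follows; the base URL, the field name "author" and the helper name facet_url are assumptions, not my actual test setup:

```python
from urllib.parse import urlencode

def facet_url(base_url, query, field, method):
    """Build a Solr select URL that facets on one field with the given facet.method."""
    params = {
        "q": query,
        "rows": 0,               # only facet counts are of interest here
        "facet": "true",
        "facet.field": field,
        "facet.method": method,  # "enum" or "fc"
    }
    return f"{base_url}/select?{urlencode(params)}"

# Hypothetical host and field name.
for method in ("enum", "fc"):
    print(facet_url("http://localhost:8983/solr", "goethe", "author", method))
```

Sending the same logged queries against both URLs, with the same number of request threads, gives directly comparable response times.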

Regards,
Till

--
Till Kinstler
Verbundzentrale des Gemeinsamen Bibliotheksverbundes (VZG)
Platz der Göttinger Sieben 1, D 37073 Göttingen
[EMAIL PROTECTED], +49 (0) 551 39-13431, http://www.gbv.de