Re: Simple search low speed
Perhaps belated and IANA Solr expert, but take a look at mergeFactor and ramBufferSizeMB. I've had a situation where after Solr upgrade old values were kept. Those are used for buffering during indexing AFAIR. If buffer size is exceeded, merge happens. If merge factor is exceeded, coalesced segments are (AFAIR) written to IO, thus you might see high CPU. pozdrawiam, LAFK 2015-04-24 14:39 GMT+02:00 Joel Bernstein joels...@gmail.com: Try breaking down the query to see which part of it is slow. If it turns out to be the range query you may want to look into using an frange postfilter. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Apr 24, 2015 at 6:50 AM, Norgorn lsunnyd...@mail.ru wrote: Thanks for your reply. Yes, 100% CPU is used by SOLR (100% - I mean 1 core, not all cores), I'm totally sure. I have more than 80 GB RAM on test machine and about 50 is cached as disk cache, SOLR uses about 8, Xmx=40G. I use GC1, but it can't be the problem, cause memory usage is much lower than GC start limit (45% of heap). I think, the problem can be in fully optimized index, and search over one big segment is much slower than parallel search over lot of segments, but it sounds weird, so I'm not sure. Setups with big indexes which I know are all with optimized indexes. Index scheme: field name=text type=text_general indexed=true stored=false termVectors=true termPositions=true termOffsets=true / field name=title type=text_general indexed=true stored=false termVectors=true termPositions=true termOffsets=true / field name=cat type=tint indexed=true stored=true multiValued=false required=true omitNorms=true omitTermFreqAndPositions=true/ field name=date type=date indexed=true stored=false omitNorms=true omitTermFreqAndPositions=true/ field name=field1 type=int indexed=true stored=true required=false omitNorms=true omitTermFreqAndPositions=true/ field name=field2 type=int indexed=true stored=true required=false omitNorms=true omitTermFreqAndPositions=true/ -- View this message in context: http://lucene.472066.n3.nabble.com/Simple-search-low-speed-tp4202135p4202157.html Sent from the Solr - User mailing list archive at Nabble.com.
trec_collection
Hi, I have a very general question. Is it possible to index a TREC collection with Solr (not Lucene Benchmark) using an own schema.xml (analyzer defintions,...)? I want to index a TREC collection with different settings to see which fits best.
Odp.: Remote connection to Solr
Try running a different server first and see if that ain't windows shutting down network when sleeping / hibernating and not waking it up later. Or blocking the sockets. @LAFK_PL Oryginalna wiadomość Od: Steven White Wysłano: piątek, 24 kwietnia 2015 16:04 Do: solr-user@lucene.apache.org Odpowiedz: solr-user@lucene.apache.org Temat: Remote connection to Solr Hi Everyone, This maybe a Jetty question but let me start here first. I have Solr running on my laptop and from my desktop I have no issue accessing it. However, if I take my laptop home and connect it to my home network, the next day when I connect the laptop to my office network, I no longer can access Solr from my desktop. A restart of Solr will not do, the only fix is to restart my Windows 8.1 OS (that's what's on my laptop). I have not been able to figure out why this is happening and I'm suspecting it has to do something with Jetty because I have Solr 3.6 running on my laptop in a WebSphere profile and it does not run into this issue. Any ideas what could be causing this? Is this question for the Jetty mailing list? Thanks Steve
Re: trec_collection
Hi Onur, Of course it is possible, you communicate solr with SolrJ for example. https://cwiki.apache.org/confluence/display/solr/Using+SolrJ Ahmet On Sunday, April 26, 2015 3:25 PM, onur kasimlar kasimlar.o...@gmail.com wrote: Hi, I have a very general question. Is it possible to index a TREC collection with Solr (not Lucene Benchmark) using an own schema.xml (analyzer defintions,...)? I want to index a TREC collection with different settings to see which fits best.
Re: and stopword in user query is being change to q.op=AND
Thank you Hoss from correcting my understanding, again i missed this concept of edismax. Do we have any solrj class or helper to handle the scenario to pass on the query terms (by stripping stopwords ) to edismax using solrj api. for ex: if user queries for *term1 and term2* build and query to pass on this to edismax so that this user query will be parsed as *parsedquery: (+(DisjunctionMaxQuery((textSpell:term1) DisjunctionMaxQuery((textSpell:term2/no_coord * *Thanks,* *Rajesh**.* On Fri, Apr 24, 2015 at 1:13 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : I was under understanding that stopwords are filtered even before being : parsed by search handler, i do have the filter in collection schema to : filter stopwords and the analysis shows that this stopword is filtered Generally speaking, your understanding of the order of operations for query parsing (regardless of hte parser) and analysis (regardless of the fields/analyzers/filters/etc...) is backwards. the query parser gets, as it's input, the query string (as a *single* string) and the request params. it inspects/parses the string according to it's rules options syntax and based on what it finds in that string (and in other request params) it passes some/all of that string to the analyzer for one or more fields, and uses the results of those analyzers as the terms for building up a query structure. ask yourself: if the raw user query input was first passed to an analyzer (for stop word filtering as you suggest) before the being passed to the query parser -- how would solr know what analyzer to use? in many parsers (like lucene and edismax) the fields to use can be specified *inside* the query string itself likewise: how would you ensure that syntactically significant string sequences (like ( and : and AND etc..) that an analyzer might normally strip out based on the tokenizer/tokenfilters would be preserved so that the query parser could have them and use them to drive hte resulting query structure? -Hoss http://www.lucidworks.com/
Re: Highlighting in Solr
I supposed currently the only way to show the highlighting snippets in xml and json output is via a separate section at the bottom, and it is currently not possible to show the highlighted snippets together with the rest of the response? Regards, Edwin On 22 April 2015 at 21:57, Zheng Lin Edwin Yeo edwinye...@gmail.com wrote: Hi, I'm currently implementing highlighting on my Solr-5.0.0. When I issue the following command: http://localhost:8983/solr/collection1/select?q=conducted http://localhost:8983/solr/edmtechnical/select?q=conducted hl=truehl.fl=Content,Summarywt=jsonindent=truerows=10, the highlighting result is listed at the bottom of the output, instead of together with the rest of the response above. The result is shown below: response:{numFound:10,start:0,docs:[ { id:1-1, Summary:i} Trial conducted, Content:Completed, _version_:1498407036159787020}, highlighting:{ 1-1:{ Summary:[i) Trial emconducted/em]} Is there any way to get the highlighted output to be displayed together with the rest of the response, instead of having it display separately at the bottom? Which is something like this response:{numFound:10,start:0,docs:[ { id:1-1, Summary:i} Trial emconducted/em, Content:Completed, _version_:1498407036159787020}, Regards, Edwin