Re: Simple search low speed

2015-04-26 Thread Tomasz Borek
Perhaps belated, and I'm no Solr expert, but take a look at mergeFactor and
ramBufferSizeMB. I've had a situation where the old values were kept after
a Solr upgrade. Both control buffering during indexing, AFAIR. When the RAM
buffer fills up, it is flushed to disk as a new segment; once the number of
segments reaches the merge factor, they are merged into a larger one (AFAIR),
so you might see high CPU and IO while that happens.

Regards,
LAFK

2015-04-24 14:39 GMT+02:00 Joel Bernstein joels...@gmail.com:

 Try breaking down the query to see which part of it is slow. If it turns
 out to be the range query you may want to look into using an frange
 postfilter.

 Joel Bernstein
 http://joelsolr.blogspot.com/
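A minimal sketch of what the frange suggestion could look like as request parameters. The field name `cat` comes from the schema quoted further down; the main query and the bounds 1 to 10 are placeholders, not values from this thread. In Solr, a filter that supports post filtering runs as a post filter when it carries `cache=false` and a `cost` of 100 or more.

```python
from urllib.parse import urlencode


def frange_postfilter(field, lower, upper, cost=100):
    """Build an fq value asking Solr to apply the range as a post filter."""
    # cache=false together with cost >= 100 requests post-filter execution
    return "{!frange l=%s u=%s cache=false cost=%d}%s" % (lower, upper, cost, field)


params = {
    "q": "text:solr",                       # hypothetical main query
    "fq": frange_postfilter("cat", 1, 10),  # illustrative bounds
    "debugQuery": "true",                   # adds timing info to the response
}
query_string = urlencode(params)
print(query_string)
```

Running the main query with and without the `fq`, with `debugQuery=true` in both cases, is one way to see which part is slow.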

 On Fri, Apr 24, 2015 at 6:50 AM, Norgorn lsunnyd...@mail.ru wrote:

  Thanks for your reply.
 
  Yes, 100% CPU is used by Solr (by 100% I mean one core, not all cores);
  I'm totally sure.
 
  I have more than 80 GB of RAM on the test machine; about 50 GB is used as
  disk cache, and Solr uses about 8 GB with Xmx=40G.
 
  I use the G1 collector, but that can't be the problem, because memory usage
  is well below the threshold at which GC kicks in (45% of heap).
 
  I think the problem may be the fully optimized index: searching one big
  segment could be much slower than searching many segments in parallel.
  That sounds odd, though, so I'm not sure. All the big-index setups I know
  of use optimized indexes.
 
  Index schema:
  <field name="text" type="text_general" indexed="true" stored="false"
   termVectors="true" termPositions="true" termOffsets="true" />
  <field name="title" type="text_general" indexed="true" stored="false"
   termVectors="true" termPositions="true" termOffsets="true" />
 
  <field name="cat" type="tint" indexed="true" stored="true"
   multiValued="false" required="true" omitNorms="true"
   omitTermFreqAndPositions="true"/>
 
  <field name="date" type="date" indexed="true" stored="false"
   omitNorms="true" omitTermFreqAndPositions="true"/>
 
  <field name="field1" type="int" indexed="true" stored="true"
   required="false" omitNorms="true" omitTermFreqAndPositions="true"/>
  <field name="field2" type="int" indexed="true" stored="true"
   required="false" omitNorms="true" omitTermFreqAndPositions="true"/>
 
 
 
 



trec_collection

2015-04-26 Thread onur kasimlar
Hi,
I have a very general question. Is it possible to index a TREC collection
with Solr (not Lucene Benchmark) using my own schema.xml (analyzer
definitions, ...)? I want to index a TREC collection with different settings
to see which fits best.


Re: Remote connection to Solr

2015-04-26 Thread LAFK
Try running a different server first, to check whether it is Windows shutting
down the network when sleeping / hibernating and not waking it up later, or
blocking the sockets.

@LAFK_PL
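
A rough sketch of how the check above could be done from the desktop, assuming some other server (for example Python's built-in `http.server`) is started on the laptop on a second port. The host and ports in the comment are placeholders; 8983 is only Solr's usual default port.

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # covers refused connections, timeouts and name-resolution failures
        return False


# Example (placeholder address): compare Solr's port with the test server's port
#   can_connect("192.168.1.20", 8983)
#   can_connect("192.168.1.20", 8000)
```

If both ports are unreachable after the network change, the problem is more likely in Windows networking or the firewall than in Jetty; if only the Solr port fails, Jetty becomes the better suspect.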
  Original message  
From: Steven White
Sent: Friday, 24 April 2015 16:04
To: solr-user@lucene.apache.org
Reply-To: solr-user@lucene.apache.org
Subject: Remote connection to Solr

Hi Everyone,

This may be a Jetty question, but let me start here first.

I have Solr running on my laptop, and from my desktop I have no issue
accessing it. However, if I take my laptop home and connect it to my home
network, then the next day, when I connect the laptop to my office network, I
can no longer access Solr from my desktop. Restarting Solr does not help; the
only fix is to restart my Windows 8.1 OS (that's what's on my laptop).

I have not been able to figure out why this is happening. I suspect it has
something to do with Jetty, because I have Solr 3.6 running on my laptop in
a WebSphere profile and it does not run into this issue.

Any ideas what could be causing this? Is this a question for the Jetty
mailing list?

Thanks

Steve


Re: trec_collection

2015-04-26 Thread Ahmet Arslan
Hi Onur,

Of course it is possible. You can talk to Solr with SolrJ, for example:
https://cwiki.apache.org/confluence/display/solr/Using+SolrJ

Ahmet
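
A minimal sketch of the same idea without SolrJ: parse TREC-style `<DOC>` records and build a JSON body for Solr's update handler. The target field names `id` and `text` are assumptions and must match whatever the schema.xml under test defines; the sample records are made up.

```python
import json
import re

DOC_RE = re.compile(r"<DOC>(.*?)</DOC>", re.S)


def tag(body, name):
    """Return the stripped content of the first <name>...</name> in body, or ''."""
    m = re.search(r"<%s>(.*?)</%s>" % (name, name), body, re.S)
    return m.group(1).strip() if m else ""


def trec_to_solr_docs(raw):
    """Turn TREC-style <DOC> records into dicts for Solr's JSON update format."""
    docs = []
    for body in DOC_RE.findall(raw):
        docs.append({
            "id": tag(body, "DOCNO"),                      # assumed uniqueKey field
            "text": " ".join(tag(body, "TEXT").split()),   # collapse whitespace
        })
    return docs


sample = """
<DOC>
<DOCNO> FT911-1 </DOCNO>
<TEXT>
First   sample document.
</TEXT>
</DOC>
<DOC>
<DOCNO> FT911-2 </DOCNO>
<TEXT>Second sample document.</TEXT>
</DOC>
"""

payload = json.dumps(trec_to_solr_docs(sample))
print(payload)
```

The payload can then be POSTed with `Content-Type: application/json` to the core's `/update` handler. Re-running the same script against cores with different analyzer settings gives the comparison asked about.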



On Sunday, April 26, 2015 3:25 PM, onur kasimlar kasimlar.o...@gmail.com 
wrote:
Hi,
I have a very general question. Is it possible to index a TREC collection
with Solr (not Lucene Benchmark) using my own schema.xml (analyzer
definitions, ...)? I want to index a TREC collection with different settings
to see which fits best.


Re: and stopword in user query is being change to q.op=AND

2015-04-26 Thread Rajesh Hazari
Thank you, Hoss, for correcting my understanding; once again I had missed
this aspect of edismax.

Is there a SolrJ class or helper that handles this scenario, i.e. stripping
stopwords from the query terms before passing them to edismax through the
SolrJ API? For example, if the user queries for "term1 and term2", build a
query to pass to edismax so that it is parsed as:


parsedquery: (+(DisjunctionMaxQuery((textSpell:term1))
DisjunctionMaxQuery((textSpell:term2))))/no_coord

Thanks,
Rajesh.
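
A naive sketch of the client-side preprocessing asked about above, shown in Python rather than SolrJ. The stopword list is illustrative and would need to mirror the one used by the field's analyzer. The sketch splits on whitespace only, so it ignores quoted phrases and field prefixes. `lowercaseOperators` is an edismax request parameter controlling whether lowercase "and" / "or" are treated as operators.

```python
STOPWORDS = {"a", "an", "and", "or", "the", "of"}  # illustrative list only
OPERATORS = {"AND", "OR", "NOT"}                   # uppercase operators are kept


def strip_stopwords(user_query):
    """Drop stopwords from a whitespace-separated query, keeping operators."""
    kept = [
        t for t in user_query.split()
        if t in OPERATORS or t.lower() not in STOPWORDS
    ]
    return " ".join(kept)


def edismax_params(user_query, qf="textSpell"):
    """Build request params for an edismax query over the given field(s)."""
    return {
        "defType": "edismax",
        "q": strip_stopwords(user_query),
        "qf": qf,
        # ask edismax not to treat lowercase "and"/"or" as operators
        "lowercaseOperators": "false",
    }


print(edismax_params("term1 and term2"))
```

The same parameters could be set on a SolrJ query object; setting `lowercaseOperators=false` alone may already be enough if the only concern is "and" being read as an operator.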

On Fri, Apr 24, 2015 at 1:13 PM, Chris Hostetter hossman_luc...@fucit.org
wrote:


 : I was under the understanding that stopwords are filtered even before
 : being parsed by the search handler. I do have the filter in the collection
 : schema to filter stopwords, and the analysis shows that this stopword is
 : filtered

 Generally speaking, your understanding of the order of operations for
 query parsing (regardless of the parser) and analysis (regardless of the
 fields/analyzers/filters/etc...) is backwards.


 the query parser gets, as its input, the query string (as a *single*
 string) and the request params.  it inspects/parses the string according
 to its rules & options & syntax, and based on what it finds in that string
 (and in other request params) it passes some/all of that string to the
 analyzer for one or more fields, and uses the results of those analyzers
 as the terms for building up a query structure.

 ask yourself: if the raw user query input was first passed to an analyzer
 (for stop word filtering as you suggest) before being passed to the
 query parser -- how would solr know what analyzer to use?  in many parsers
 (like lucene and edismax) the fields to use can be specified *inside* the
 query string itself.

 likewise: how would you ensure that syntactically significant string
 sequences (like "(" and ":" and "AND" etc..) that an analyzer might
 normally strip out based on the tokenizer/tokenfilters would be preserved
 so that the query parser could have them and use them to drive the
 resulting query structure?
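
The ordering described above can be illustrated with a toy model. This is not Solr's parser: the field names, stopword sets and the whitespace-based syntax are all invented for the illustration. The point is only that syntax (operators, field prefixes) is handled first, and that the field found by the parser decides which analyzer sees the text.

```python
import re


def make_analyzer(stopwords):
    """Build a stand-in analyzer: lowercase, split, drop the given stopwords."""
    def analyze(text):
        tokens = re.split(r"[^a-z0-9]+", text.lower())
        return [t for t in tokens if t and t not in stopwords]
    return analyze


# hypothetical per-field analysis chains
ANALYZERS = {
    "title": make_analyzer({"the", "and"}),
    "body": make_analyzer(set()),
}


def parse(query, default_field="title"):
    """Toy parser: consume syntax first, then analyze the text per field."""
    operators, terms = [], []
    for token in query.split():
        if token in ("AND", "OR", "NOT"):
            operators.append(token)      # parser syntax, never analyzed
            continue
        field, _, text = token.rpartition(":")
        field = field or default_field   # field prefix chooses the analyzer
        for term in ANALYZERS[field](text):
            terms.append((field, term))
    return operators, terms


print(parse("title:The AND body:The and cats"))
```

In this toy, the uppercase `AND` never reaches an analyzer, while the lowercase `and` is only dropped because the default field's analyzer happens to treat it as a stopword.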



 -Hoss
 http://www.lucidworks.com/



Re: Highlighting in Solr

2015-04-26 Thread Zheng Lin Edwin Yeo
I suppose that currently the only way to show the highlighting snippets in
the XML and JSON output is via a separate section at the bottom, and that it
is not possible to show the highlighted snippets together with the rest of
the response?

Regards,
Edwin


On 22 April 2015 at 21:57, Zheng Lin Edwin Yeo edwinye...@gmail.com wrote:

 Hi,

 I'm currently implementing highlighting on my Solr 5.0.0. When I issue the
 following request:
 http://localhost:8983/solr/collection1/select?q=conducted&hl=true&hl.fl=Content,Summary&wt=json&indent=true&rows=10
 the highlighting result is listed at the bottom of the output, instead of
 together with the rest of the response above. The result is shown below:

   "response":{"numFound":10,"start":0,"docs":[
   {
     "id":"1-1",
     "Summary":"i} Trial conducted",
     "Content":"Completed",
     "_version_":1498407036159787020},


   "highlighting":{
     "1-1":{
       "Summary":["i) Trial <em>conducted</em>"]}


 Is there any way to get the highlighted output displayed together with
 the rest of the response, instead of separately at the bottom?
 Something like this:


   "response":{"numFound":10,"start":0,"docs":[
   {
     "id":"1-1",
     "Summary":"i} Trial <em>conducted</em>",
     "Content":"Completed",
     "_version_":1498407036159787020},


 Regards,
 Edwin
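
One way to get the shape asked for above is to merge the two sections on the client after the response arrives. A minimal sketch, assuming the JSON response has already been parsed into a dict and that the `highlighting` section is keyed by the document's `id`, as in the output quoted above. The sample data is adapted from that output.

```python
def merge_highlights(solr_response, id_field="id"):
    """Return copies of the docs with highlighted snippets replacing field values."""
    highlighting = solr_response.get("highlighting", {})
    merged = []
    for doc in solr_response["response"]["docs"]:
        doc = dict(doc)  # shallow copy so the original response is untouched
        for field, snippets in highlighting.get(doc[id_field], {}).items():
            if snippets:
                # join multiple snippets; the separator is an arbitrary choice
                doc[field] = " ... ".join(snippets)
        merged.append(doc)
    return merged


sample = {
    "response": {"numFound": 1, "start": 0, "docs": [
        {"id": "1-1", "Summary": "i) Trial conducted", "Content": "Completed"},
    ]},
    "highlighting": {"1-1": {"Summary": ["i) Trial <em>conducted</em>"]}},
}

print(merge_highlights(sample))
```

Fields without a highlight entry (here `Content`) keep their stored value.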