Re:Re: solrcloud performance problem

2016-05-09 Thread lltvw
Shawn, By using jps command double check the parms used to start solr, i found that the max heap size already set to 10G. So I made a big mistake yesterday. But by using solr admin UI, I select the collection with performance problem, in the overview page I find that the heap memory is about

Using Ping Request Handler in SolrCloud within a load balancer

2016-05-09 Thread Sandy Foley
A couple of questions ... We've upconfig'd the ping request handler to ZooKeeper within the solrconfig.xml.  SolrCloud and ZooKeeper are working fine. I understand that the /solr/admin/ping command is for a ping on its local server only (not from a remote machine).  This is working.I also

Re: building solr with an error UNRESOLVED DEPENDENCIES

2016-05-09 Thread Erick Erickson
Not that particular one, but every once in a while the Ivy cache goes a bit wonky on my local machine. It's a bit painful, but on my mac there's a directory ~/.ivy2 I go there and "rm -r cache" then restart the build. Be prepared to wait a while while the build process downloads a lot of jars.

Re: Filter queries & caching

2016-05-09 Thread Erick Erickson
You're confusing a query clause with fq when thinking about filter() I think. Essentially they don't need to be used together, i.e. q=myclause AND filter(field:value) is identical to q=myclause=field:value both in docs returned and filterCache usage. q=myclause(fq=field:value) actually uses

Re: Streaming expressions join operations

2016-05-09 Thread Ryan Cutter
Yes, the people collection has the personId and pets has ownerId, as described. On May 9, 2016 8:55 PM, "Joel Bernstein" wrote: > The example is using two collections: people and pets. So these collections > would need to be present for the join expression to work. > > Joel

Re: Solr 5.x bug with Service installation script?

2016-05-09 Thread Erick Erickson
How do you shut down your Solrs? Any kind of un-graceful stopping (kill -9 is a favorite) may leave the lock file around. It can't be coming from nowhere, so my guess is that it's present in the source or destination before you do your copy... Best, Erick On Mon, May 9, 2016 at 10:30 AM, A

Re: Solr re-indexing in case of store=false

2016-05-09 Thread Erick Erickson
Stored data is compressed by default, anecdotally there's about a 2:1 compression ratio. But the _other_ reason not to store all the data is that it then gets replicated. If you have master/slave or SolrCloud with replicas, you have N copies of your index and each and every one of them has a copy

Re: Replicate Between sites

2016-05-09 Thread Erick Erickson
bq: How similar thing could be done in 4.9.1? That's not going to happen. More precisely, there is zero chance that anyone will take on that work unless it's a custom one-off that you hire done or develop internally. And even if someone took this on, it'd never be officially released. IOW, if

Re: Advice to add additional non-related fields to a collection or create a subset of it?

2016-05-09 Thread Erick Erickson
Not quite sure where you are at with this. It sounds like your slow loading is fixed and was a coding issue on your part, that happens to us all. bq: Is it advisable to has as less number of queries to solr in a page? Of course it is advisable to have as few Solr queries executed to display a

Re: Streaming expressions join operations

2016-05-09 Thread Joel Bernstein
The example is using two collections: people and pets. So these collections would need to be present for the join expression to work. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, May 9, 2016 at 10:43 PM, Ryan Cutter wrote: > Thanks Joel, I added the personId and

Re: Solr edismax field boosting

2016-05-09 Thread Nick D
One thing to note: you can also take on wt=ruby=true it makes the debug explain data look better for pasting. But what I am seeing is a score is being all based on that fact that it found the content you were looking for in an unboosted field, i.e. *_text_*, so your boosts don't look to be having

Re: solrcloud performance problem

2016-05-09 Thread Shawn Heisey
On 5/9/2016 9:11 PM, lltvw wrote: > You are right, the max heap is 512MB, thanks. 90 million documents split into 12 shards means 7.5 million documents per shard. With that many documents and a 512MB heap, you're VERY lucky if Solr doesn't experience OutOfMemoryError problems -- which will make

Re: Query String Limit

2016-05-09 Thread Shawn Heisey
On 5/9/2016 8:31 PM, Zheng Lin Edwin Yeo wrote: > Would like to check, for the increasing of maxBooleanClauses to a large > values, if there are 50 collections in my core, I will have to increase > that in the solrconfig.xml for all the 50 collections, before it will work? The maxBooleanClauses

Re:Re: solrcloud performance problem

2016-05-09 Thread lltvw
Hi shawn, You are right, the max heap is 512MB, thanks. -- 发自我的网易邮箱手机智能版 在 2016-05-10 10:02:44,"Shawn Heisey" 写道: >On 5/9/2016 4:41 PM, lltvw wrote: >> Shawn, thanks. >> >> Each machine with 48G memory installed, and now with 20G free, I check JVM >> heap size use

RE: Solr edismax field boosting

2016-05-09 Thread Megha Bhandari
To clarify on the debug information given earlier , we changed the query factor to the following to ignore title field completely metatag.description^9 h1^7 h2^6 h3^5 h4^4 _text_^1 id^0.5" But still title results are coming on top Full response with debug on: Full response {

Re: Streaming expressions join operations

2016-05-09 Thread Ryan Cutter
Thanks Joel, I added the personId and ownerId fields before ingested a little data. I made them to be stored=true/multiValue=false/longs (and strings, later). Is additional schema required? On Mon, May 9, 2016 at 6:45 PM, Joel Bernstein wrote: > Hi, > > The example in the

RE: Solr edismax field boosting

2016-05-09 Thread Megha Bhandari
Hi Following is the debug information with debug=true Excerpt of debug information : "debug":{ "rawquerystring":"Upendra", "querystring":"Upendra", "parsedquery":"(+DisjunctionMaxQuery(((metatag.description:Upendra)^9.0 | (h1:Upendra)^7.0 | (h2:Upendra)^6.0 | (h3:Upendra)^5.0 |

Re: Query String Limit

2016-05-09 Thread Zheng Lin Edwin Yeo
Hi Prasanna Would like to check, for the increasing of maxBooleanClauses to a large values, if there are 50 collections in my core, I will have to increase that in the solrconfig.xml for all the 50 collections, before it will work? Regards, Edwin On 6 May 2016 at 23:42, Erick Erickson

Re: Solr edismax field boosting

2016-05-09 Thread Nick D
You can add the debug flag to the end of the request and see exactly what the scoring is and why things are happening. =ALL will show you everything including the scoring. Showing the result of the debug query should help you, or adding that into your question here, decipher what is going on

Solr edismax field boosting

2016-05-09 Thread Megha Bhandari
Hi We are trying to boost certain fields with relevancy. However we are not getting results as per expectation. Below is the configuration in solr-config.xml. Even though the title field has a lesser boost than metatag.description results for title field are coming higher. We even created

Re: solrcloud performance problem

2016-05-09 Thread Shawn Heisey
On 5/9/2016 4:41 PM, lltvw wrote: > Shawn, thanks. > > Each machine with 48G memory installed, and now with 20G free, I check JVM > heap size use solr admin UI, the heap size is about 20M. What is the *max* heap? An unmodified install of Solr 5.x or later has a max heap of 512MB. In the admin

Re: Streaming expressions join operations

2016-05-09 Thread Joel Bernstein
Hi, The example in the cwiki would require setting up the people and pets collections. Unless I'm mistaken this won't work with the out of the box schemas. So you'll need to setup some test schemas to get started. Although having out of the box streaming schemas is a great idea. Joel Bernstein

building solr with an error UNRESOLVED DEPENDENCIES

2016-05-09 Thread ??????jwos
when i try to compile solr 5.4.0 with src, i met an error: [ivy:retrieve] report for org.apache.solr#core;working@zozt-PC test.MiniKdc produced in F:\solr-5.4.0-src (1)\solr-5.4.0\lucene\build\ivy-resolution-cache\org.apache.solr-core-test.MiniKdc.xml [ivy:retrieve] resolve done (1982ms

Re: Indexing xml documents using solrj 6.0 + solr 6.0

2016-05-09 Thread Abdel Belkasri
did you look at this: https://cwiki.apache.org/confluence/display/solr/Using+SolrJ Regards, --Abdel. On Mon, May 9, 2016 at 1:32 PM, Mat San wrote: > Hello, > > Could I ask please for urgent help since I'm new to solrj and solr. I've > read all documentation but I did

Streaming expressions join operations

2016-05-09 Thread Ryan Cutter
Hello, I'm checking out the cool stream join operations in Solr 6.0 but can't seem to the example listed on the wiki to work: https://cwiki.apache.org/confluence/display/solr/Streaming+Expressions#StreamingExpressions-innerJoin innerJoin( search(people, q=*:*, fl="personId,name",

Re:Re: solrcloud performance problem

2016-05-09 Thread lltvw
Shawn, thanks. Each machine with 48G memory installed, and now with 20G free, I check JVM heap size use solr admin UI, the heap size is about 20M. -- 发自我的网易邮箱手机智能版 在 2016-05-10 02:04:22,"Shawn Heisey" 写道: >On 5/9/2016 10:52 AM, lltvw wrote: >> Sorry, I missed the

Error on creating new collection with existing configs

2016-05-09 Thread Jay Potharaju
Hi, I created a new config and uploaded it to zk with the name test_conf. And then created a collection which uses this config. CREATE COLLECTION: /solr/admin/collections?action=CREATE=test2=1=2=test_conf When indexing the data using DIH I get an error.

Indexing xml documents using solrj 6.0 + solr 6.0

2016-05-09 Thread Mat San
Hello, Could I ask please for urgent help since I'm new to solrj and solr. I've read all documentation but I did not find a full complete example in java how to index arbitrary xml documents and rich documents. (These documents are placed in a folder). Can somebody provide some examples please

Solr Grouping

2016-05-09 Thread Srinivas Mudam
Hi How can i customize the group limit, I have 5 groups, i want different limit for each group like 3,3,3,2,1. Could please provide solution for this. Thanks, Srinivas Mudam.

Re: Filter queries & caching

2016-05-09 Thread Jay Potharaju
Thanks Ahmet...but I am not still clear how is adding filter() option better or is it the same as filtercache? My question is below. "As mentioned above adding filter() will add the filter query to the cache. This would mean that results are fetched from cache instead of running n number of

Re: Re-ranking query: issue with sort criteria and how to disable it

2016-05-09 Thread Andrea Gazzarini
Hi Joel, just created [1] a new issue for that. Many thanks again Andrea [1] https://issues.apache.org/jira/browse/SOLR-9095 On 06/05/16 20:21, Joel Bernstein wrote: Maybe one ticket would work. Something like: "ReRanker should gracefully handle sorts without score". Then you can describe

Re: Replicate Between sites

2016-05-09 Thread Abdel Belkasri
Hi Alex, just started reading about CDCR, looks very promissing. Is this only in 6.0? our PROD server are running 4.9.1 and we cannot upgrade just yet. How similar thing could be done in 4.9.1? Thanks, --Abdel On Mon, May 9, 2016 at 2:59 PM, Alexandre Rafalovitch wrote: >

auto purge for embedded zookeeper

2016-05-09 Thread tedsolr
I have a development environment that is using an embedded zookeeper, and the zoo_data folder continues to grow. It's filled with snapshot files that are not getting purged. zoo.cfg has properties autopurge.snapRetainCount=10 autopurge.purgeInterval=1 Perhaps it's not in the correct location so

Re: Replicate Between sites

2016-05-09 Thread Alexandre Rafalovitch
Have you looked at Cross Data Center replication that's the new big feature in Solr 6.0? Regards, Alex. Newsletter and resources for Solr beginners and intermediates: http://www.solr-start.com/ On 10 May 2016 at 02:13, Abdel Belkasri wrote: > Hi there, > > we have

Re: solrcloud performance problem

2016-05-09 Thread Shawn Heisey
On 5/9/2016 10:52 AM, lltvw wrote: > Sorry, I missed the size of each shard, the size is about 3G each. Thanks. > > 在 2016-05-10 00:41:13,lltvw 写道: >> Recently we setup a 4.10 solrcloud env with about 9000 doc indexed in >> it,this solrcloud with 12 shards, each shard on one

Re: Solr 5.x bug with Service installation script?

2016-05-09 Thread A Laxmi
yes, I always shutdown both source and destination Solr before copying the index over from one to another. Somehow the write.lock only happens when Solr restarts from service script. If loads just fine when started manually. On Mon, May 9, 2016 at 1:20 PM, Abdel Belkasri

Re: Solr 5.x bug with Service installation script?

2016-05-09 Thread Abdel Belkasri
Did you copy the core while solr is running? if yes, first shuown source and destination solr, copy intex to the other solr, then restat solr nodes. Lock files get written to the core while solr is running and doing indexing or searching, etc. On Mon, May 9, 2016 at 12:38 PM, A Laxmi

Re:solrcloud performance problem

2016-05-09 Thread lltvw
Sorry, I missed the size of each shard, the size is about 3G each. Thanks. - 在 2016-05-10 00:41:13,lltvw 写道: > >Hi all, > >Recently we setup a 4.10 solrcloud env with about 9000 doc indexed in >it,this solrcloud with 12 shards, each shard on one separate machine, but

solrcloud performance problem

2016-05-09 Thread lltvw
Hi all, Recently we setup a 4.10 solrcloud env with about 9000 doc indexed in it,this solrcloud with 12 shards, each shard on one separate machine, but when we try to search some infor on solrcloud, the response time is about 300ms. Seems that the performance is not good, please advice

Solr 5.x bug with Service installation script?

2016-05-09 Thread A Laxmi
Hi, I have installed Solr 5.3.1 using the Service Installation Script. I was able to successfully start and stop Solr using service solr start/stop commands and Solr loads up just fine. However, when I stop Solr service and copy an index of a core from one server to another with same exact

Replicate Between sites

2016-05-09 Thread Abdel Belkasri
Hi there, we have the main site setup as follows: solrCould: App --> smart Client (solrj) --> ensemble of zookeeper --> SolrCloud Noes (with slice/shard/recplica) Works fine. On the DR site we have a mirror setup, how can we keep the two site in sync, so that if something happened we point

Re: Solr 5.4.1 Mergeindexes duplicate rows

2016-05-09 Thread Kalpana
Hello Can anyone help me with a merge. Currently I have the two cores already pulling data from SQL Table based on the query I set up. Solr is running I also have a third core set up with schema similar to the first two. and then I wrote this in the url and hit enter

Re: Solr 6 / Solrj RuntimeException: First tuple is not a metadata tuple

2016-05-09 Thread Joel Bernstein
Yes, /sql has been in trunk / master for a long time (more then 6 months). But it was not released in the 5x branch because it requires Java 8. I'm wondering what the issue is with the server setup that you have that is throwing the NPE. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, May

Re: CloudSolrStream returns only top 30 results.

2016-05-09 Thread Joel Bernstein
Yes, you need to specify the /export request handler using the following named paramter: qt="/export" This is described in the documentation in a number of places. https://cwiki.apache.org/confluence/display/solr/Streaming+Expressions By default CloudSolrStream uses the /select handler and uses

Re: Solr re-indexing in case of store=false

2016-05-09 Thread Ali Nazemian
Dear Erick, Hi, Thank you very much. About the storing part you are right, unless the primary datastore uses some kind of data compression which in my case it does (I am using Cassandra as a primary datastore), and I am not sure about Solr that it has any kind of compression or not. According to

Re: Re: Re: MoreLikeThis Component - how to get fields of documents

2016-05-09 Thread Dr. Jan Frederik Maas
Hey Alessandro, it seems that the edismax MLThandler is not able to work correctly with a solr cloud/sharding: https://issues.apache.org/jira/browse/SOLR-4414 Using the MLThandler we got randomly a response for only very few requests, while the MLTcomponent works fine (except for the

CloudSolrStream returns only top 30 results.

2016-05-09 Thread Roshan Kamble
Hello, I have a plan to use streaming API for join queries. But it has been observed that CloudSolrStream returns only top 30 matching records. Is there any configuration which needs to be done to retrieve ALL records or to have pagination like provision to specify start or rows attributes?

RE: Nodes appear twice in state.json

2016-05-09 Thread Markus Jelsma
Issue created! https://issues.apache.org/jira/browse/SOLR-9089 -Original message- > From:Shalin Shekhar Mangar > Sent: Thursday 5th May 2016 12:58 > To: solr-user@lucene.apache.org > Subject: Re: Nodes appear twice in state.json > > Hmm not good. Definitely a

Re: Re: MoreLikeThis Component - how to get fields of documents

2016-05-09 Thread Alessandro Benedetti
Hey Jan, any reason you are using the MLTComponent, instead of a specific Request Handler with the MLT query parser ? Entities Similarity edismax 5 *,score explicit json {!mlt qf=field1,field2,field3,field4 mintf=1 mindf=5

Re: Facet ignoring repeated word

2016-05-09 Thread Ahmet Arslan
Hi, I understand the word cloud part. It looks like you want to use within-resultList term frequency information.In your first mail, I thought you want within-document term frequency. TermsComponent reports within-collection term frequency. I am not sure how to retrieve within-resultList

issue of the popped message "ERROR: Solr at http://xxxx:8983/solr did not come online within 30 seconds!" in Solr with basic authentication

2016-05-09 Thread tjlp
Hi, I am using Solr 5.5.0. I config the basic authentication to Solr following by http://muddyazian.blogspot.com/2013/11/how-to-require-password-authentication.html. Then when I start solr, the message "ERROR: Solr at http://:8983/solr did not come online within 30 seconds!" is popup on

Re: Re: MoreLikeThis Component - how to get fields of documents

2016-05-09 Thread Dr. Jan Frederik Maas
Hi Edwin, thanks for your reply - we currently use 5.0.0 Best wishes, Jan Am 05.05.2016 um 05:26 schrieb Zheng Lin Edwin Yeo: Hi Jan, Which version of Solr are you using? Regards, Edwin On 26 April 2016 at 23:46, Dr. Jan Frederik Maas < jan.m...@sub.uni-hamburg.de> wrote: Hello, I want

RE: Facet ignoring repeated word

2016-05-09 Thread G, Rajesh
Hi Ahmet, Please let me know if I am not clear Thanks Rajesh CEB India Private Limited. Registration No: U741040HR2004PTC035324. Registered office: 6th Floor, Tower B, DLF Building No.10 DLF Cyber City, Gurgaon, Haryana-122002, India. This e-mail and/or its attachments are intended only

Re: Solr Suggester no results

2016-05-09 Thread Grigoris Iliopoulos
Yes, i also realized that stored="false" was the problem..It is also stated clearly in the documentation : "To be used as the basis for a suggestion, the field must be stored." Thanks for your time, Grigoris 2016-05-06 19:42 GMT+03:00 Erick Erickson : > First off,

Re: Solr 6 / Solrj RuntimeException: First tuple is not a metadata tuple

2016-05-09 Thread deniz
was able to get "gettingstarted" example running with sql, on my local only with a single zk... still not sure why the core/collection i have tried, didnt work till now... thanks a lot for pointing out for the version related issues, it made me change my focus from client to server side :)

Re: Issues of deploying mutiple web applications to Solr Jetty

2016-05-09 Thread Jan Høydahl
Hi Don’t add multiple webapps to Solr’s Jetty. Install a separate Jetty on another port for your other apps. Solr using Jetty is an implementation detail which may suddenly disappear in a future release. You will find that Jetty is very light weight and it is a much cleaner approach to run a

Issues of deploying mutiple web applications to Solr Jetty

2016-05-09 Thread tjlp
Hi, At present I try to deploy multiple web applications to Solr Jetty. So I add context files for each web application under the folder $SOLR.HOME/contexts. Now I get one library conflict issues:one of web applications uses old version of slf-4j, which conflict with the slf-4j libraries used