Re: Collations are not working fine.

2015-02-12 Thread Nitin Solanki
Hi James Dyer, I did the same as you told me. Used WordBreakSolrSpellChecker instead of shingles. But still collations are not coming or working. For instance, I tried to get collation of "gone with the wind" by searching "gone wthh thes wint" on field=gram_ci but didn't s

Need autocomplete on whole phrase for multiple words .

2015-02-12 Thread vamshi kiran
Hi solrExperts, Need autocomplete on whole phrase for multiple words . When I typed *br, *the results are brad , brad pitt but I need only brad pitt to come. I’m using shinglefilterfactory + terms component for autocomplete feature , the query is something like http://localhost:8080/solr/acto

Re: Index directory containing only segments.gen

2015-02-12 Thread Erick Erickson
OK, I think this is the root of your problem: bq: Everything was setup using the - now deprecated - tags and inside solr.xml. There are a bunch of ways this could go wrong. I'm pretty sure you have something that would take quite a while to untangle, so unless you have a _very_ good reason fo

Re: creating a new collection fails as SearchHandler can't be found

2015-02-12 Thread Lee Carroll
Hi it was jars copied into a solr-zk-cli directory to allow easy running of solr zk cmd line client. well i think that is what fixed tomcat! I've also tried with jetty with a clean solr home and that also works and seems a much cleaner way of running multiple instances (probably more to do with ru

Re: SASL with zkcli.sh

2015-02-12 Thread Chris Hostetter
: I'm trying to start a SolrCloud cluster with a kerberized Zookeeper. I'm not : sure if it is possible, I have a Hadoop Cluster with an already running : zookeeper and I do not think running two zoo in parallel would be the wise : choice. : Is there a way to use SASL with SolrCloud ? Work has b

Re: Index directory containing only segments.gen

2015-02-12 Thread Zisis Tachtsidis
>From the logs I've got one instance failing as described in my first comment and the other two failing during PeerSync recovery when trying to communicate with the server that was missing the segments_* files. The exception follows org.apache.solr.client.solrj.SolrServerException: IOException oc

Re: Multy-tenancy and quarantee of service per application (tenant)

2015-02-12 Thread Jack Krupansky
There are two main, distinct forms of multi-tenancy: 1. The service provider controls the app and the Solr server and the app is carefully coded to isolate the data and load of the various tenants, such as adding a filter query with the tenant ID and throttling requests in an app server. 2. Each t

RE: How to make SolrCloud more elastic

2015-02-12 Thread Toke Eskildsen
Matt Kuiper [matt.kui...@issinc.com] wrote: > Thanks for your reply. Yes, I believe I will be working with a write > once archive. However, my understanding is that all shards are > defined up front, with the option to split later. Our situation might be a bit special as a few minutes downtime -

Re: Index directory containing only segments.gen

2015-02-12 Thread Zisis Tachtsidis
Well, I don't know If I'm being helpful but here goes. My clusterstate.json actually has no leader for the shard in question. I have 2 nodes as "recovery_failed" and one as "down". No leaders there. I've not used core admin or collections api to create anything. Everything was setup using the - now

RE: How to make SolrCloud more elastic

2015-02-12 Thread Matt Kuiper
Toke, Thanks for your reply. Yes, I believe I will be working with a write once archive. However, my understanding is that all shards are defined up front, with the option to split later. Can you describe, or point me to documentation, on how to create shards one at a time? Thanks, Matt

RE: How to make SolrCloud more elastic

2015-02-12 Thread Matt Kuiper
Otis, Thanks for your reply. I see your point about too many shards and search efficiency. I also agree that I need to get a better handle on customer requirements and expected loads. Initially I figured that with the shard splitting option, I would need to double my Solr nodes every time

Re: Index directory containing only segments.gen

2015-02-12 Thread Erick Erickson
So after adding some docs to the index (and committing) with those two nodes active, do segment files magically appear? My _guess_ is that there's something radially wrong with you set up the collection. Did you by any chance use the core admin API to create the cores? That can lead to "interestin

Re: 43sec commit duration - blocked by index merge events?

2015-02-12 Thread Otis Gospodnetic
If you are using Solr and SPM for Solr, you can check a report that shows the # of files in an index and the report that shows you the max docs-num docs delta. If you see the # of files drop during a commit, that's a merge. If you see a big delta change, that's probably a merge, too. You could a

Re: Solrcloud performance issues

2015-02-12 Thread Otis Gospodnetic
Hi, Did you say you have 150 servers in this cluster? And 10 shards for just 90M docs? If so, that 150 hosts sounds like too much for all other numbers I see here. I'd love to see some metrics here. e.g. what happens with disk IO around those commits? How about GC time/size info? Are JVM mem

Re: Multy-tenancy and quarantee of service per application (tenant)

2015-02-12 Thread Otis Gospodnetic
Not really, not 100%, if tenants share the same hardware and there is no isolation through things like containers (in which case they don't share the same SolrCloud cluster, really). Otis -- Monitoring * Alerting * Anomaly Detection * Centralized Log Management Solr & Elasticsearch Support * http:

RE: How to make SolrCloud more elastic

2015-02-12 Thread Matt Kuiper
Thanks Alex. Per your recommendation I checked out the presentation and it was very informative. While my problem space will not reach the scale addressed in this talk, some of the topics may be helpful. Those being the improvements to shard splitting and the new 'migrate' API. Thanks, Matt

Re: Solrcloud performance issues

2015-02-12 Thread Timothy Potter
Hi Vijay, We're working on SOLR-6816 ... would love for you to be a test site for any improvements we make ;-) Curious if you've experimented with changing the mergeFactor to a higher value, such as 25 and what happens if you set soft-auto-commits to something lower like 15 seconds? Also, make s

Re: American /British Dictionary for solr-4.10.2

2015-02-12 Thread Walter Underwood
You are looking for this sort of thing? elevator, lift blueberry, whortleberry, bilberry rutabega, swede hood, bonnet convertible top, hood trunk, boot daycare, preschool, nursery, playgroup arugula, rocket sidewalk, pavement sweater, jumper kerosene, paraffin paraffin, wax pants, trousers underwe

Index directory containing only segments.gen

2015-02-12 Thread Zisis Tachtsidis
I'm using SolrCloud 4.10.3 and the current setup is simple using 3 nodes with 1 shard. After a rolling restart of the Solr cluster I've ended up with 2 failing nodes reporting the following org.apache.solr.servlet.SolrDispatchFilter null:org.apache.solr.common.SolrException: SolrCore 'core' is not

Multy-tenancy and quarantee of service per application (tenant)

2015-02-12 Thread Victor Rondel
Hi everyone, I am wondering about multy-tenancy and garantee of service in SolrCloud : *Multy-tenant cluster* : Is there a way to *guarantee a level of service* / capacity planning for *each tenant* using the cluster (its *own collections*) ? Thanks,

Solrcloud performance issues

2015-02-12 Thread Vijay Sekhri
Hi Erick, We have following configuration of our solr cloud 1. 10 Shards 2. 15 replicas per shard 3. 9 GB of index size per shard 4. a total of around 90 mil documents 5. 2 collection viz search1 serving live traffic and search 2 for indexing. We swap collection when indexing fin

Re: Possible to dump clusterstate, system stats into solr log?

2015-02-12 Thread Erick Erickson
Jim: Not that I know of. I'm guessing that accessing ZK directly and dumping from there is also not possible? Best, Erick On Wed, Feb 11, 2015 at 10:47 AM, Jim.Musil wrote: > Hi, > > Is it possible to periodically dump the cluster state contents (or system > diagnostics) into the main solr log

Use cases - Production examples: datas, queries, cluster hardware and conf, and statistics

2015-02-12 Thread Victor Rondel
Hi everyone, I am considering moving one or several Solr clusters to production. Although Solr's documentation and community is *great*, I am strongly startled not to find any *complete use-case story* stretching from application(s) needs and data considerations to hardware ones. Indeed, I underst

SASL with zkcli.sh

2015-02-12 Thread Simon Minery
Hello, I'm trying to start a SolrCloud cluster with a kerberized Zookeeper. I'm not sure if it is possible, I have a Hadoop Cluster with an already running zookeeper and I do not think running two zoo in parallel would be the wise choice. Is there a way to use SASL with SolrCloud ? Thank you

Re: Stopwords in shingles suggester

2015-02-12 Thread O. Klein
With more and more people starting to use the Suggester it seems that enablePositionIncrements for StopFilterFactory is still needed. Not sure why it is being removed from Solr5, but is there a way to keep the functionality beyond lucene 4.3 ? Or can this feature be reinstated? -- View this mes

Re: Analytics Component not working Solr-5.0

2015-02-12 Thread sumitj25
Can somebody help, has anyone used analytics component here? -- View this message in context: http://lucene.472066.n3.nabble.com/Analytics-Component-not-working-Solr-5-0-tp4185666p4185977.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: ApacheCon 2015 at Austin, TX

2015-02-12 Thread Dmitry Kan
Hi, Looks like I'll be there. So if you want to discuss luke / lucene / solr, will be happy to de-virtualize. Dmitry On Mon, Jan 12, 2015 at 6:32 PM, CP Mishra wrote: > Hi, > > I am planning to attend ApacheCon 2015 at Austin, TX (Apr 13-16th) and > wondering if there will be lucene/solr sessi

Re: American British Dictionary for Solr

2015-02-12 Thread François Schiettecatte
Dinesh See this: http://wordlist.aspell.net/varcon/ You will need to do some work to convert to a SOLR friendly format though. Cheers François > On Feb 12, 2015, at 12:22 AM, dinesh naik wrote: > > Hi , > We are looking for a dictionary to support American/British English synonym.

Re: Multi words query

2015-02-12 Thread melb
I am using rub gem rsolr and querying simply the collection by this query: response = solr.get 'select', :params => { :q=>query, :fl=> 'id,title,description,body' :rows=>10 } response["response"]["docs"].each{|doc| puts doc["id"] } I created a text field to copy all the fields to and the q

Solr suggest is related to second letter, not to initial letter

2015-02-12 Thread Volkan Altan
Hello Everyone, All I want to do with Solr suggester is obtaining the fact that the asserted suggestions for the second letter whose entry actualizes after the initial letter is actually related to initial letter, itself. But; just like the initial letters, the second letters rotate independe

variaton on boosting recent documents gives exception

2015-02-12 Thread Michael Lackhoff
Since my field to measure recency is not a date field but a string field (with only year-numbers in it), I tried a variation on the suggested boost function for recent documents: recip(sub(2015,min(sortyear,2015)),1,10,10) But this gives an exception when used in a boost or bf parameter. I guess

RE: American /British Dictionary for solr-4.10.2

2015-02-12 Thread Markus Jelsma
There are no dictionaries that sum up all possible conjugations, using a heuristics based normalizer would be more appropriate. There are nevertheless some good sources to start: Contains lots of useful spelling issues, incl. british/american/canadian/australian http://grammarist.com/spelling

Batch updates and separate update request processor chain for atomic document updates

2015-02-12 Thread Andreas Hubold
Hi, we're using a SolrJ client which either adds (or overwrites) existing documents or updates some meta-data fields of existing documents. Our default update request processor chain is configured with a processor for language detection. To avoid setting a wrong language, we're using a diffe