Re: index fields with custom gaps between terms

2017-12-18 Thread Amin Raeiszadeh
Shawn, I think your way is good. I will study more about it. Thanks, Amin On Tue, Dec 19, 2017 at 9:57 AM, Amin Raeiszadeh wrote: > Erick, in your example, if the first entry contains 10 terms then I need to > start the second entry's position from 100, not from 110. > Thanks, > Amin > > On Tue, Dec 19, 2

Re: index fields with custom gaps between terms

2017-12-18 Thread Amin Raeiszadeh
Erick, in your example, if the first entry contains 10 terms then I need to start the second entry's position from 100, not from 110. Thanks, Amin On Tue, Dec 19, 2017 at 3:25 AM, Shawn Heisey wrote: > On 12/18/2017 12:29 AM, Amin Raeiszadeh wrote: >> thanks too much Erick and mikhail. >> i change SloppyP

Re: Solr 7.1 Solrcloud dynamic/automatic replicas

2017-12-18 Thread Erick Erickson
What have you configured to add the replica when a new node is spun up? If you're just copying the entire directory including the core.properties file, you're just getting lucky. The legacyCloud=true default is _probably_ adding the replica with a new URL and thus making it distinct. Please detail

Solr 7.1 Solrcloud dynamic/automatic replicas

2017-12-18 Thread Greg Roodt
Hi Background: * I am looking to upgrade from Solr 6.1 to Solr 7.1. * Currently the system is run in cloud mode with a single collection and single shard per node. * Currently when a new node is added to the cluster, it becomes a replica and copies the data / core "automagically". Question: Is it

Span queries

2017-12-18 Thread Sreenivas.T
Hi, I'm writing a span query within my own custom query parser. I get the tokens from the query analyzer and create span term queries from each token. Later I'm doing a span near query with all these span term queries. This would work if all the tokens are present in the index within the specified slop. Howev

Re: index fields with custom gaps between terms

2017-12-18 Thread Shawn Heisey
On 12/18/2017 12:29 AM, Amin Raeiszadeh wrote: > thanks too much Erick and mikhail. > i change SloppyPhraseScorer class for my custom behavior with some fields. > so i need to index some fields with customized gap between terms of fields. > i'm not profession with solr and i think with schema.xml o

Re: ICUTransformFilter with traditional to simplified Chinese

2017-12-18 Thread Shawn Heisey
On 12/18/2017 9:49 AM, Eyal Naamati wrote: > We are using the ICUTransformFilter to normalize traditional Chinese text to > simplified Chinese. > We received feedback from some of our Chinese customers that there are some > traditional characters that are not converted to their simplified variant

Re: request dependent analyzer

2017-12-18 Thread Doug Turnbull
Yes, I would like to get around to implementing that. You might find our match query parser useful for selecting analyzers at query time https://github.com/o19s/match-query-parser -- Consultant, OpenSource Connections. Contact info at http://o19s.com/about-us/doug-turnbull/; Free/Busy (http://b

RE: request dependent analyzer

2017-12-18 Thread Markus Jelsma
Thanks, interesting ticket (missed it, following now), this is similar what we use. Construct an analyzer (or get from cache) with a field as argument as its base config. But allow Java's variable arguments syntax that you can use for adding or disabling filters, or modifying a specific filter's

Re: request dependent analyzer

2017-12-18 Thread Stefan Matheis
Hendrik, this doesn't exactly answer your question, but I do remember reading a thread on the lucene-dev list which became a jira ticket eventually - not that long ago. Doug asked for something that sounds at least a little bit similar to what you're asking: https://issues.apache.org/jira/browse/

Re: using rank queries(rq) with grouping in solr cloud

2017-12-18 Thread Diego Ceccarelli
Hi Tomerg, 1. Did you consider using the collapse component? https://lucene.apache.org/solr/guide/6_6/collapse-and-expand-results.html it is compatible with rq. 2. If you implement group reranking as a separate component you will end up with a lot of code duplicated from QueryComponent, you could

RE: request dependent analyzer

2017-12-18 Thread Markus Jelsma
Hi - for example, in edismax where the query analyzer is retrieved, you can create your specific analyzer with a custom SynonymsFilter with its own synonyms file. Of course keep a cache of already constructed analyzers. We also create custom analyzers based on config but with a per-request m

Re: CDCR configuration in solrconfig

2017-12-18 Thread Webster Homer
We also have the same configurations used in different environments. We upload the configset to zookeeper and use the Config API to overlay environment specific settings in the solrconfig.xml. We have avoided having collections share the same configsets, basically for this reason. If CDCR supporte

Re: OOM spreads to other replica's/HA when OOM

2017-12-18 Thread Bojan Vukojevic
UNSUBSCRIBE On Mon, Dec 18, 2017 at 12:57 PM Susheel Kumar wrote: > Technically I agree with you, Shawn, on fixing the cause of the OOME. In fact it is not > an issue any more, but I was testing for HA when planning for any failures. > At the same time, it's hard to convince business folks that HA wouldn't be there in

Re: OOM spreads to other replica's/HA when OOM

2017-12-18 Thread Susheel Kumar
Technically I agree with you, Shawn, on fixing the cause of the OOME. In fact it is not an issue any more, but I was testing for HA when planning for any failures. At the same time, it's hard to convince business folks that HA wouldn't be there in case of OOME. I think the best option is to enable timeAllowed for now. T

Re: How to restart solr in docker?

2017-12-18 Thread Rick Leir
Christine Have a look at the API Lucene.apache.org/solr/guide/6_6/config-api.html (Choose whatever version of the doc that is appropriate) Various parts of solrconfig can be overlayed. Or do your debugging with a normally installed Solr, then dockerize. Cheers -- Rick On December 18, 2017 12:28

Re: index fields with custom gaps between terms

2017-12-18 Thread Erick Erickson
You probably are aware of this already, but I want to be sure. positionIncrementGap is _only_ applied between the last term of one multiValued entry and the first term of the next. So say I have a text field and the input looks like: some stuff other words and my positionIncrementGap is
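The position arithmetic discussed in this thread can be illustrated with a small sketch. It is not Solr or Lucene code: the function name is hypothetical, and it assumes positions start at 0 and every term advances the position by exactly 1 (no stop-word holes). Under those assumptions, a first entry of 10 terms with a gap of 100 puts the next entry's first term at 110, which is the figure quoted in the replies.

```python
def first_term_positions(term_counts, position_increment_gap):
    """Return the position of the first term of each multiValued entry.

    Hypothetical helper for illustration only. Assumes each term
    advances the position by 1 and the gap is added once between
    consecutive entries.
    """
    positions = []
    last_position = -1  # nothing indexed yet, so the first term lands on 0
    for index, count in enumerate(term_counts):
        if index > 0:
            # The gap is applied only between one entry and the next.
            last_position += position_increment_gap
        positions.append(last_position + 1)
        last_position += count
    return positions


# First entry has 10 terms, second has 3, gap of 100.
print(first_term_positions([10, 3], 100))  # [0, 110]
```

With a gap of 0 the second entry would simply continue at position 10, so phrase and span queries could match across entry boundaries.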

Re: How to sort on dates?

2017-12-18 Thread Erick Erickson
You _might_ (and I haven't tried this) be able to use sorting by function with min/max for a date range field, see: "Sort by Function" here: https://lucene.apache.org/solr/guide/6_6/function-queries.html. On Mon, Dec 18, 2017 at 5:09 AM, Michael Kuhlmann wrote: > On 16.12.2017 at 19:39, Ge

Re: CDCR configuration in solrconfig

2017-12-18 Thread Erick Erickson
CDCR doesn't do this yet but WDYT about an option where the target collection was _assumed_ to be the same as the source? You're right, SOLR-8389 (and associated) should address this but I don't know what the progress is on that. Seems like a reasonable default in any case. Erick On Mon, Dec 18,

Filtering Solr pivot facet values

2017-12-18 Thread Arun Rangarajan
Solr version: 6.6.0 There are two multi-valued string fields in my schema: * interests * hierarchy. Goal is to run a pivot facet query on both these fields, but only for specific values of `interests` field. This query: ``` /select ?wt=json &rows=0 &q=interests:(hockey OR soccer) &facet=true &fa

Re: Learning to Rank (LTR) with grouping

2017-12-18 Thread Roopa Rao
Hi - I merged the code from the bloomberg master-solr-8776 branch to branch_6_6 on Solr. When I tried to compile the solr source code, I am getting multiple compilation errors (Attached), which seems to be due to the fact that the branch master-solr-8776 may not be compatible with branch_6_6. Co

CDCR configuration in solrconfig

2017-12-18 Thread Elaine Cario
We've recently been exploring options for disaster recovery, and took a look at CDCR for our SolrCloud(s). It seems to meet our needs, but we've stumbled into a couple of issues with configuration. The first issue is that currently CDCR is configured as a request handler in solrconfig, but becaus

Re: How to restart solr in docker?

2017-12-18 Thread Buckler, Christine
That makes sense. I am trying to add the “Suggest” plugin so I modified the solrconfig.xml file. Is there a better way to do what I am trying to do? I have not been able to add the plugin successfully. Do you have a resource page that shows how to add the config file under a volume? On 12/16/1

ICUTransformFilter with traditional to simplified Chinese

2017-12-18 Thread Eyal Naamati
Hi All, We are using the ICUTransformFilter to normalize traditional Chinese text to simplified Chinese. We received feedback from some of our Chinese customers that there are some traditional characters that are not converted to their simplified variants. For example: "�w" should be converted to

Re: OOM spreads to other replica's/HA when OOM

2017-12-18 Thread Shawn Heisey
On 12/18/2017 9:01 AM, Susheel Kumar wrote: > Any thoughts on how one can provide HA in these situations. As I have said already a couple of times today on other threads, there are *exactly* two ways to deal with OOME.  No other solution is possible. 1) Configure the system to allow the process t

Re: OOM spreads to other replica's/HA when OOM

2017-12-18 Thread Susheel Kumar
Shawn/Emir - its the Java heap space issue. I can see in GCViewer sudden heap utilization and finally Full GC lines and oom killer script killing the solr. What I wonder is if there is retry from coordinating node which is causing this OOM query to spread to next set of replica's then how can we

Re: OOM spreads to other replica's/HA when OOM

2017-12-18 Thread Emir Arnautović
Ah, I misunderstood your usecase - it is not node that receives query that OOMs but nodes that are included in distributed queries are the one that OOMs. I would also say that it is expected because queries to particular shards fails and coordinating node retries using other replicas causing all

Re: OOM spreads to other replica's/HA when OOM

2017-12-18 Thread Shawn Heisey
On 12/18/2017 7:36 AM, Susheel Kumar wrote: Yes, Emir. If I repeat the query, it will spread to other nodes but that's not the case. This is my test env and i am deliberately executing the query with very high offset and wildcard to cause OOM but executing only one time. So it shouldn't spread

Re: OOM spreads to other replica's/HA when OOM

2017-12-18 Thread Susheel Kumar
Yes, Emir. If I repeat the query, it will spread to other nodes but that's not the case. This is my test env and i am deliberately executing the query with very high offset and wildcard to cause OOM but executing only one time. So it shouldn't spread to other replica sets and at the end of my te

Re: Implicit routing changes to Composite while re-deploy changes and make Upconfig command

2017-12-18 Thread Shawn Heisey
On 12/18/2017 7:21 AM, Nimesh Aghera wrote: Thanks Shawn for reply, Need to do below step. After creating collection using the Collection API clear the version-2 folder from zoo_data and after that I will start server and make upconfig the collection is up but the routing changes from implici

Re: OOM spreads to other replica's/HA when OOM

2017-12-18 Thread Emir Arnautović
Hi Susheel, The fact that only the node that received the query OOMs tells us that it is about merging results from all shards and providing the final result. It is expected that repeating the same query on some other node will result in similar behaviour - it just means that Solr does not have enough memory t

RE: Implicit routing changes to Composite while re-deploy changes and make Upconfig command

2017-12-18 Thread Nimesh Aghera
Hi Thanks Shawn for reply, Need to do below step. After creating collection using the Collection API clear the version-2 folder from zoo_data and after that I will start server and make upconfig the collection is up but the routing changes from implicit to explicit with solr 6.1.0 while I do

Re: Identify Reference Leak in Custom Code related to Solr

2017-12-18 Thread Emir Arnautović
Hi Amrit, I suppose it is related to https://github.com/sematext/solr-researcher/issues/6 Can you please continue with conversation there, and one of us can update ML when issue is resolved in case there are others who could be interested in

OOM spreads to other replica's/HA when OOM

2017-12-18 Thread Susheel Kumar
Hello, I was testing Solr to see if a query which would cause OOM and would limit the OOM issue to only the replica set's which gets hit first. But the behavior I see that after all set of first replica's went down due to OOM (gone on cloud view) other replica's starts also getting down. Total 6

Re: Solr5.1 delete out of memory

2017-12-18 Thread Shawn Heisey
On 12/17/2017 9:36 PM, soul wrote: hi! I'm using solr5.1. There have about 0.3 billion doc in my solr. I can insert and select doc in my solr, while failed to delete doc. It remind me that this writer hit an OutOfMemoryError : cannot commit. I am curious that what cause this reason? The nature

Re: Identify Reference Leak in Custom Code related to Solr

2017-12-18 Thread Amrit Sarkar
Emir, Solr version: 6.6, SolrCloud We followed the instructions on README.md on the github project. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@s

Re: How to sort on dates?

2017-12-18 Thread Michael Kuhlmann
On 16.12.2017 at 19:39, Georgios Petasis wrote: > Even if the DateRangeField field can store a range of dates, doesn't > Solr understand that I have used single timestamps? No. It could theoretically, but sorting just isn't implemented in DateRangeField. > I have even stored the dates. > My prob

Re: Identify Reference Leak in Custom Code related to Solr

2017-12-18 Thread Emir Arnautović
Hi Amrit, I’ll check with my colleague that worked on this. In the meantime, can you provide more info about setup: Solr version, M-S or cloud and steps that we can do to reproduce it. Thanks, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Supp

Identify Reference Leak in Custom Code related to Solr

2017-12-18 Thread Amrit Sarkar
Hi, We incorporated *https://github.com/sematext/solr-researcher * into our project and it is responsible for memory leak / reference leak which is causing multiple *SolrIndexSearcher *objects in the heap dump. 37 instances of *"org.apache.solr.search.

Re: request dependent analyzer

2017-12-18 Thread Hendrik Haddorp
Hi, how do multiple analyzers help? On 18.12.2017 10:25, Markus Jelsma wrote: Hi - That is impossible. But you can construct many analyzers instead. -Original message- From:Hendrik Haddorp Sent: Monday 18th December 2017 8:35 To: solr-user Subject: request dependent analyzer Hi,

RE: request dependent analyzer

2017-12-18 Thread Markus Jelsma
Hi - That is impossible. But you can construct many analyzers instead. -Original message- > From:Hendrik Haddorp > Sent: Monday 18th December 2017 8:35 > To: solr-user > Subject: request dependent analyzer > > Hi, > > currently we use a lot of small collections that all basically have

Re: SOLR nested dataimport issues

2017-12-18 Thread Emir Arnautović
Hi, I did not check it but it seems to me that it might be related to using the full path in your fields' xpath: you are iterating over hash-es and you should probably set field paths assuming it is the new root. E.g. for id it would be: > HTH, Emir -- Monitoring - Log Management - Alerting - Anomaly Det