Re: Solr Streaming Expression failures

2020-03-26 Thread Aroop Ganguly
I have personally not used streaming expressions to commit data to a collection (have used them a lot of querying), and would not recommend it for bulk indexing unless Joel recommends it :) On the other hand we have had decent success in indexing at scale and 12 million is not a big number.

Solr Streaming Expression failures

2020-03-26 Thread Mohamed Sirajudeen Mayitti Ahamed Pillai
Hi Everyone, We are using Solr 7.4 with 3 external ZKs and 7 Solr node in a cloud setup. We are using Streaming expression to pull 12million records from a different Solr Cloud using below expression.

Ingest data from multiple databases into a single solr collection

2020-03-26 Thread Guillermo Lopez Mackinnon
Hi, I've posted a question within stackoverflow regarding the subject of this email. https://stackoverflow.com/questions/60876128/ingest-data-from-multiple-databases-into-a-single-solr-collection If someone from this list could provide some help it'll be highly appreciated! Thanks in advance!

Re: Apache Solr 8.4.1 Basic Authentication

2020-03-26 Thread lstusr 5u93n4
Hey Emmanuel, If you're using Java, I'd highly suggest using solrj, it'll do the work that you need it to do: SolrRequest req ;//create a new request object req.setBasicAuthCredentials(userName, password); solrClient.request(req); If that doesn't work for you for some reason, you need to

Apache Solr 8.4.1 Basic Authentication

2020-03-26 Thread Altamirano, Emmanuel
Hello everyone, We recently enable Solr Basic Authentication in our Dev environment and we are testing Solr security. We followed the instructions provided in the Apache Solr website and it is working using curl command. If you could provide us any advice of how do we need to send the

Re: deduplication of suggester results are not enough

2020-03-26 Thread Michal Hlavac
Hi Roland, I wrote AnalyzingInfixSuggester that deduplicates data on several levels at index time. I will publish it in few days on github. I'll wrote to this thread when done. m. On štvrtok 26. marca 2020 16:01:57 CET Szűcs Roland wrote: > Hi All, > > I follow the discussion of the suggester

deduplication of suggester results are not enough

2020-03-26 Thread Szűcs Roland
Hi All, I follow the discussion of the suggester related discussions quite a while ago. Everybody agrees that it is not the expected behaviour from a Suggester where the terms are the entities and not the documents to return the same string representation several times. One suggestion was to

Re: edge ngram/find as you type sorting

2020-03-26 Thread Erick Erickson
From other mails, it looks like you’re inheriting something you had no input in building. My sympathies ;) Unless you’ve explicitly changed the memory by specifying -Xmx and -Xms at startup, you’re operating with 512M of memory, which is far too small for most Solr installations. the -m parameter

Re: Solr Instance Migration - Server Access

2020-03-26 Thread matthew sporleder
If it's solrcloud + zookeeper you can get most of the configs from the "tree" browser on the console: /solr/#/~cloud?view=tree You can otherwise derive a lot of the configs/schema/data-import properties from the web console and api, neither of which require server access. It is also possible to

Re: edge ngram/find as you type sorting

2020-03-26 Thread matthew sporleder
That explains the OOM's I've been getting in the initial test cycle. I'm working with about 50M (small) documents. On Thu, Mar 26, 2020 at 7:58 AM Erick Erickson wrote: > > the ngramming is a time/space tradeoff. Typically, > if you restrict the wildcards to have three or more > “real”

Autoscaling question

2020-03-26 Thread Kudrettin Güleryüz
Hi, I'd like to balance freedisk and cores across eight nodes. Here is my cluster-preferences and cluster-policy: { "responseHeader":{ "status":0, "QTime":0}, "cluster-preferences":[{ "precision":10, "maximize":"freedisk"} ,{ "minimize":"cores",

Re: edge ngram/find as you type sorting

2020-03-26 Thread Erick Erickson
the ngramming is a time/space tradeoff. Typically, if you restrict the wildcards to have three or more “real” characters performance is fine. One real character (i.e. a*) will be your worst-case. I’ve seen requiring two characters in the prefix work well too. It Depends (tm). Conceptually what

Re: Cross DC CloudSolr Client

2020-03-26 Thread Erick Erickson
I’ve never even heard of someone trying to put different ensembles in the same connection string for a single client. Create N CloudSolrClients, one for each DC. And why do you want to try to contact individual nodes? CloudSolrClient will do that for you. Best, Erick > On Mar 26, 2020, at 2:38

Re: Solr Instance Migration - Server Access

2020-03-26 Thread Charlie Hull
If you can get the server login details you should be able to copy the Solr installation and its configuration. If not, then Solr itself doesn't provide any way to get them - it's just a search engine, it's not responsible for securing a server in any way. Charlie On 26/03/2020 02:13, Landon

suggestion with multiple context field

2020-03-26 Thread Szűcs Roland
Hi All, Is there any way to define multiple context fields with the suggester? It is typical use case in an ecommerce environment that the facets are listed in the sidebar, and they are acting as filter queries, when the user select them. I am looking for similar functionality for the

Solr Instance Migration - Server Access

2020-03-26 Thread Landon Cowan
Hello! I’m working on a website for a client that was migrated from another website development company. The previous company used Solr to build out the site search – but they did not send us the server credentials. The developers who built the tool are no longer with the company – is there

Cross DC CloudSolr Client

2020-03-26 Thread Lucky Sharma
Hi all, Just wish to confirm on the cross DC connection situation from the CloudSolrClient. Scenario: We have multiple DC with the same collection data. Can we add the zookeeper connect string of the DC's to the cloud SolrClient. Will it work like this: The client will utilise this connection