Re: org.apache.solr.common.SolrException: this IndexWriter is closed

2021-03-05 Thread Dominique Bejean
Hi, You are using RAMDirectoryFactory without enough RAM ? regards Dominique Le ven. 5 mars 2021 à 16:18, 李世明 a écrit : > Hello: > > Have you encountered the following exception that will cause the index to > not be written? But you can query > Version:8.7.0 > >

Re: Collection Creation across DC

2021-02-11 Thread Dominique Bejean
Hi, Sorry, it is in French, but here is my suggestion in order to replace the deprecated CDCR and achieve HA https://www.eolya.fr/2020/11/16/solrcloud-disaster-recovery-alternative-a-cdcr/ In short, each shard has one PULL replica on remote datacenter and these PULL replicas are excluded from

Re: NRT - Indexing

2021-02-02 Thread Dominique Bejean
Hi, The issue was buildOnCommit=true on a SuggestComponent. Dominique Le mar. 2 févr. 2021 à 00:54, Shawn Heisey a écrit : > On 2/1/2021 12:08 AM, haris.k...@vnc.biz wrote: > > Hope you're doing good. I am trying to configure NRT - Indexing in my > > project. For this reason, I have

Re: NRT - Indexing

2021-02-01 Thread Dominique Bejean
rvice that fetches data from the Postgres > database and indexes it to solr. The service runs with a delay of 5 seconds. > > > Regards > > > Mit freundlichen Grüssen / Kind regards > > > Muhammad Haris Khan > > > *VNC - Virtual Network Consult* > > &g

Re: Tweaking Shards and Replicas for high volume queries and updates

2021-02-01 Thread Dominique Bejean
Hi, Some suggestions. * 64GB JVM Heap Are you sure you really need this heap size ? Did you check in your GC logs (with gceasy.io) ? A best practice is to minimize as possible the heap size and never more than 31 GB. * OS Caching Did you set swappiness to 1 ? * Put two instances of Solr on

Re: NRT - Indexing

2021-02-01 Thread Dominique Bejean
Hi, What is your Solr version ? Can you share your solrconfig.xml ? How is your sharding ? Did you grep your solr logs on with the "commit' pattern in order to see hard and soft commit occurrences ? How are you pushing new docs or updates in the collection ? Regards. Dominique Le lun. 1

Re: Solrcloud load balancing / failover

2020-12-26 Thread Dominique Bejean
ed and > the request may succeed. > > IMO, these are two weak areas of Solr right now. Suggestions/patches > are welcome :-) > > On 12/11/20, Dominique Bejean wrote: > > Hi, > > > > Is there in Solrcloud any load balancing based on CPU load on Solr nodes > ? &g

Solrcloud load balancing / failover

2020-12-11 Thread Dominique Bejean
Hi, Is there in Solrcloud any load balancing based on CPU load on Solr nodes ? If for shard a replica fails to handle a query, the query is sent to another replica in order to be completed ? Regards Dominique

Re: SolrCloud 6.6.2 suddenly crash due to slow queries and Log4j issue

2020-10-19 Thread Dominique Bejean
requests and slow requests. I suggested removing these two extra appenders. Regards Dominique Le lun. 19 oct. 2020 à 15:48, Dominique Bejean a écrit : > Hi Shawn, > > Thank you for your response. > > You are confirming my diagnosis. > > This is in fact a 8 nodes cl

Re: SolrCloud 6.6.2 suddenly crash due to slow queries and Log4j issue

2020-10-19 Thread Dominique Bejean
. [image: 2020-10-19 15_48_06-Photos.png] Regards Dominique Le dim. 18 oct. 2020 à 22:03, Shawn Heisey a écrit : > On 10/18/2020 3:22 AM, Dominique Bejean wrote: > > A few months ago, I reported an issue with Solr nodes crashing due to the > > old generation heap growing suddenly and

SolrCloud 6.6.2 suddenly crash due to slow queries and Log4j issue

2020-10-18 Thread Dominique Bejean
Hi, A few months ago, I reported an issue with Solr nodes crashing due to the old generation heap growing suddenly and generating OOM. This problem occurred again this week. I have threads dumps for each minute during the 3 minutes the problem occured. I am using fastthread.io in order to analyse

Re: Returning fields a specific order

2020-09-29 Thread Dominique Bejean
Hi, If data are in json format, you should use jq -S https://stackoverflow.com/a/38210345/5998915 Regards Dominique Le lun. 28 sept. 2020 à 18:30, gnandre a écrit : > Hi, > > I have a use-case where I want to compare stored fields values of Solr > documents from two different Solr

Re: Delete from Solr console fails

2020-09-25 Thread Dominique Bejean
Hi Goutham, I agree with Rahul, avoid large deletebyquery. It you can, prefere one query to get all the ids first than use ids with deletebyid Regards Dominique Le ven. 25 sept. 2020 à 06:50, Goutham Tholpadi a écrit : > I spoke too soon. I am getting the "Connection lost" error again. > >

Re: solr performance with >1 NUMAs

2020-09-25 Thread Dominique Bejean
Hi, This would be a Java VM option, not something Solr itself can know about. Take a look at this article in comments. May be it will help. https://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html?showComment=1347033706559#c229885263664926125 Regards Dominique Le jeu. 24

Re: Autoscaling Rule for replica distribution across zones

2020-09-21 Thread Dominique Bejean
t;policyzone": [{"replica": "50%", "shard": "#EACH", "nodeset":{ "sysprop.zone": "dc1"}}, {"replica": "50%", "shard": "#EACH", "nodeset":{"sysprop.zone": "dc2"}}

Autoscaling Rule for replica distribution across zones

2020-09-18 Thread Dominique Bejean
Hi, I have 4 nodes solrcloud cluster. 2 nodes (solr1 and solr3) are started with the parametrer -Dzone=dc1 and the 2 other nodes (solr 2 and Solr4) are started with the parametrer -Dzone=dc2 I want to create Autoscaling placement Rule in order to equally distribute replicas of a shard over zone

Re: "timeAllowed" param with "numFound" having a count value but doc list is empty

2020-09-15 Thread Dominique Bejean
Hi, 1. Yes, your analysis is correct 2. Yes, it can occurs too with very slow query. Regards Dominique Le mar. 15 sept. 2020 à 15:14, Mark Robinson a écrit : > Hi, > > When in a sample query I used "timeAllowed" as low as 10mS, I got value for > > "numFound" as say 2000, but no docs were

Re: Multi-word Synonyms not working properly with Edismax

2020-09-08 Thread Dominique Bejean
Hi, Can you try to remove the RemoveDuplicatesTokenFilter ? Dominique Le mar. 8 sept. 2020 à 13:52, Manish Bafna a écrit : > Hi, > > We are using the following configuration: > > > > -- > > *Schema: * > > > positionIncrementGap="100"

Re: schema.xml version attribute

2020-09-06 Thread Dominique Bejean
a écrit : > On 9/5/2020 3:30 AM, Dominique Bejean wrote: > > Hi, I often see a bad usage of the version attribute in shema.xml. For > > instance The version attribute is to > > specify the schema syntax and semantics version to be used by Solr. > > The current

schema.xml version attribute

2020-09-05 Thread Dominique Bejean
Hi, I often see a bad usage of the version attribute in shema.xml. For instance The version attribute is to specify the schema syntax and semantics version to be used by Solr. The current value is 1.6 It is clearly specified in schema.xml comments "It should not normally be changed by

Re: Understanding Solr heap %

2020-09-01 Thread Dominique Bejean
Hi, As all Java applications the Heap memory is regularly cleaned by the garbage collector (some young items moved to the old generation heap zone and unused old items removed from the old generation heap zone). This causes heap usage to continuously grow and reduce. Regards Dominique Le

Re: Rule-Based permissions for cores

2020-08-31 Thread Dominique Bejean
Hi, It looks like this issue I opened a long time ago. https://issues.apache.org/jira/browse/SOLR-13097 Regards Dominique Le lun. 31 août 2020 à 23:02, Thomas Corthals a écrit : > Hi, > > I'm trying to configure the Rule-Based Authorization Plugin in Solr 8.4.0 > in standalone mode. My goal

Re: How to Prevent Recovery?

2020-08-31 Thread Dominique Bejean
Hi, Even if it is not the root cause, I suggest to try to respect some basic best practices and so not have "2 Zk running on the same nodes where Solr is running". Maybe you can achieve this by just stopping these 2 Zk (and move them later). Did you increase ZK_CLIENT_TIMEOUT to 3 ? Did you

Re: Solr TLOG Replicas going in recovery

2020-08-29 Thread Dominique Bejean
Hi, Can you provide more information : Solr version, how are you indexing (DIH, threading, ...), more details in Solr logs ? Did you analyse JVM Gc logs ? Regards Dominique Le ven. 28 août 2020 à 22:53, amit3281 a écrit : > Hi, > > > > I am using Solr on EXT4 partition and have TLOG

Re: Odd Solr zkcli script behavior

2020-08-27 Thread Dominique Bejean
Hi, You can also connect to ZK element and use zkCli.sh tools http://www.mtitek.com/tutorials/zookeeper/zkCli.php Regards Dominique Le jeu. 27 août 2020 à 17:28, Webster Homer < webster.ho...@milliporesigma.com> a écrit : > I am using solr 7.7.2 solr cloud > > We version our collection and

Re: Error from server at http://localhost:8983/solr/search: Expected mime type application/octet-stream but got text/html

2020-08-27 Thread Dominique Bejean
Hi, There were few discussions about similar issues these days. A JIRA issue was created https://issues.apache.org/jira/browse/SOLR-14768 Regards Dominique Le jeu. 27 août 2020 à 15:00, Divino I. Ribeiro Jr. < divinoirj.ib...@gmail.com> a écrit : > Hello everyone! > When I run an query to

Re: Solr collections gets wiped on restart

2020-08-27 Thread Dominique Bejean
Hi, Which Solr version ? Restart which node ? Solr ? ZK ? Only one node ? Collections are missing in Solr console (lost in Zookeeper) but cores are still present ? Why put Zk data and datalog in a "temp" directory (dataDir=/applis/24374-iplsp-00/IPLS/apache-zookeeper-3.5.5-bin/temp) ? This

Re: Simple query

2020-08-24 Thread Dominique Bejean
Hi, We need to know how is analyzed your catch_all field at index and search time. I think you are using a stemming filter and "apache" is stemmed as "apach". So "apache" and "apach" match the document and not "apac". You can use the console in order to see how terms are removed or transformed

Re: IOException occured when talking to server

2020-08-17 Thread Dominique Bejean
ide with the IOException, > so I'm not sure they are related. > Reinaldo > > On Mon, Aug 17, 2020 at 12:59 PM Dominique Bejean < > dominique.bej...@eolya.fr> wrote: > >> Hi, >> >> It looks like this issues >> https://github.com/eclipse/jetty.pro

Re: IOException occured when talking to server

2020-08-17 Thread Dominique Bejean
I mean add this parameter on your client application JVM :) Le lun. 17 août 2020 à 18:36, Dominique Bejean a écrit : > If you want a more detailed debug information from your client > application, you can add this parameter while starting Solr JVM. > -Djavax.net.debug=all >

Re: IOException occured when talking to server

2020-08-17 Thread Dominique Bejean
If you want a more detailed debug information from your client application, you can add this parameter while starting Solr JVM. -Djavax.net.debug=all It is very verbose ! Dominique Le lun. 17 août 2020 à 17:59, Dominique Bejean a écrit : > Hi, > > It looks like this issue

Re: IOException occured when talking to server

2020-08-17 Thread Dominique Bejean
t > org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:210) > ... 3 more > > - > > I did consider using the solrJ cloud or lb clients, but nginx gives me > more flexibility in controlling how

Re: IOException occured when talking to server

2020-08-17 Thread Dominique Bejean
Hi, Can you provide more information ? - Solr and ZK version - full error stacktrace generated by SolrJ - any concomitant and relevant information in solr nodes logs or zk logs Just to know, why not use a load balanced LBHttp... Solr Client ? Regards. Dominique Le lun. 17 août 2020 à 00:41,

Re: Solr ping taking 600 seconds

2020-08-15 Thread Dominique Bejean
Hi, How long to display the solr console ? What about CPU and iowait with top ? You should start by eliminate network issue between your solr nodes by testing it with netcat on solr port. http://deice.daug.net/netcat_speed.html Dominique Le ven. 14 août 2020 à 23:40, Susheel Kumar a écrit :

Re: Adding additional zookeeper on top

2020-08-14 Thread Dominique Bejean
Hi, About the number of Zookeeper elements in an ensemble, you can find this good information in this page. It applies to Solr. https://www.cloudkarafka.com/blog/2018-07-04-cloudkarafka-how-many-zookeepers-in-a-cluster.html - 1 Node: no fault tolerance, no maintenance possibilities - 3

Re: [Subquery] Transform Documents across Collections

2020-08-12 Thread Dominique Bejean
Hi Norbert, The field name in collection2 is "reporting_to" not "reporting". Dominique Le mer. 12 août 2020 à 11:59, Norbert Kutasi a écrit : > Hello, > > We have been using [subquery] to come up with arbitrary complex hierarchies > in our document responses. > > It works well as long as

Re: Backups in SolrCloud using snapshots of individual cores?

2020-08-11 Thread Dominique Bejean
Le mar. 11 août 2020 à 14:07, Erick Erickson a écrit : > Dominique: > > Alternatives are under discussion, there isn’t a recommendation yet. > > Erick > > > On Aug 11, 2020, at 7:49 AM, Dominique Bejean > wrote: > > > > I missed that ! > &g

Re: Backups in SolrCloud using snapshots of individual cores?

2020-08-11 Thread Dominique Bejean
I missed that ! Are you aware about an alternative ? Regards Dominique Le mar. 11 août 2020 à 13:15, Erick Erickson a écrit : > CDCR is being deprecated. so I wouldn’t suggest it for the long term. > > > On Aug 10, 2020, at 9:33 PM, Ashwin Ramesh > wrote: > > > > I would love an answer to

Re: Solrcloud tlog are not deleted

2020-08-11 Thread Dominique Bejean
Hi, Did you disable CDCR buffer ? solr//cdcr?action=DISABLEBUFFER You can check with "cdcr?action=STATUS" Regards Dominique Le mar. 11 août 2020 à 10:57, Michel Bamouni a écrit : > Hello, > > > We had setup a synchronization between our solr instances on 2 datacenters > by using the CDCR.

Re: Backups in SolrCloud using snapshots of individual cores?

2020-08-11 Thread Dominique Bejean
Hi, This procedure looks fine but it is a little complexe to automatize. Why not consider backup based on CDCR for Solrcloud or Replication for Solr standalone ? For Solrcloud, CDCR can be configured with source and target collections in the same Solrcloud cluster. The target collection can

Re: Production Issue: TIMED_WAITING - Will net.ipv4.tcp_tw_reuse=1 help?

2020-08-10 Thread Dominique Bejean
rjRMWv5ritf5QxgbFxDa0kPzVlXdbySe/view?usp=sharing > > You have to analyze GC on all nodes ! Your heap is very big. According to full GC frequency, I don't think you really need such a big heap for only indexing. May be when you will perform queries. Did you check your network performances ? Did y

Re: Production Issue: TIMED_WAITING - Will net.ipv4.tcp_tw_reuse=1 help?

2020-08-10 Thread Dominique Bejean
Hi Doss, See a lot of TIMED_WATING connection occurs with high tcp traffic infrastructure as in a LAMP solution when the Apache server can't anymore connect to the MySQL/MariaDB database. In this case, tweak net.ipv4.tcp_tw_reuse is a possible solution (but never net.ipv4.tcp_tw_recycle as you

Re: Solr Down Issue

2020-08-10 Thread Dominique Bejean
Hi, Did you analyse your gc logs ? If not, it is the first action to do. Enable gc logs and use a tool like https://gceasy.io/ Please provide more details about your configuration (JVM settings, ...) and use case (QPS, queries, ...) We just know you have 28 million indexed books (just metadata

Re: java.lang.StackOverflowError if pass long string in q parameter

2020-08-10 Thread Dominique Bejean
Hi, It looks like uid field is a text field with graph filter. Do you really need this for this specific large "OR" query ? Can't you use a string field instead ? Do you need to compute the score for this query ? Maybe you can use fq instead of q ? You will have performance improvements by not

Re: Lucene-Solr project split

2020-06-09 Thread Dominique Bejean
https://lists.apache.org/thread.html/raab13cabe321d12b6cda7dc6e529176f51ece31d30f00997dd36570a%40%3Cdev.lucene.apache.org%3E >> >> Ilan >> >> On Tue, Jun 9, 2020 at 3:10 PM Dominique Bejean < >> dominique.bej...@eolya.fr> wrote: >> >>> Hi, >>> >>> One of my clients claims that

Re: Solrcloud 6.6 becomes nuts

2020-06-09 Thread Dominique Bejean
e customer not enable debug in production. Can query debug cause so high memory usage ? Regards. Dominique Le lun. 18 mai 2020 à 09:42, Dominique Bejean a écrit : > Hi Shawn, > > In fact, I was using logs from a core at WARN log level so with only slow > queries (>500ms). > > I

Lucene-Solr project split

2020-06-09 Thread Dominique Bejean
Hi, One of my clients claims that the Lucene-Solr project will split into two separate projects after a vote of the community. I cannot find any trace of discussions on this subject. Is it true ? Regards. Dominique

Re: Solrcloud 6.6 becomes nuts

2020-05-18 Thread Dominique Bejean
rds Dominique Le lun. 18 mai 2020 à 03:28, Shawn Heisey a écrit : > On 5/17/2020 4:18 PM, Dominique Bejean wrote: > > I was not thinking that queries using facet with fields with high number > > of unique value but with low hits count can be the origin of this > problem. > >

Re: Solrcloud 6.6 becomes nuts

2020-05-17 Thread Dominique Bejean
, Mikhail Khludnev a écrit : > Hello, Dominique. > What did it log? Which exception? > Do you have a chance to review heap dump? What did consume whole heap? > > On Sun, May 17, 2020 at 11:05 AM Dominique Bejean < > dominique.bej...@eolya.fr> wrote: > >> Hi, >> >

Re: Solrcloud 6.6 becomes nuts

2020-05-17 Thread Dominique Bejean
long. We will investigate this ! I was not thinking that queries using facet with fields with high number of unique value but with low hits count can be the origin of this problem. Regards Dominique Le dim. 17 mai 2020 à 21:45, Shawn Heisey a écrit : > On 5/17/2020 2:05 AM, Domini

Solrcloud 6.6 becomes nuts

2020-05-17 Thread Dominique Bejean
Hi, I have a six node Solrcoud that suddenly has its six nodes failed with OOM at the same time. This can happen even when the Solrcloud is not under heavy load and there is no indexing. I do not see any raison for this to happen. Here are the description of the issue. Thank you for your

Re: Solr Cloud on Docker?

2020-02-05 Thread Dominique Bejean
ations pitfalls can be > avoided if you do your Docker orchestration through kubernetes. > > > k/r, > Scott > > On Tue, Jan 28, 2020 at 3:34 AM Dominique Bejean < > dominique.bej...@eolya.fr> > wrote: > > > Hi Dwane, > > > > Thank you for sharing

Re: Solr Cloud on Docker?

2020-01-28 Thread Dominique Bejean
Hi Dwane, Thank you for sharing this great solr/docker user story. According to your Solr/JVM memory requirements (Heap size + MetaSpace + OffHeap size) are you specifying specific settings in docker-compose files (mem_limit, mem_reservation, mem_swappiness, ...) ? I suppose you are limiting

Re: Convert TLOG collection to NRT

2019-12-10 Thread Dominique Bejean
Thank you Shawn. You're right ! It is better to read the good version of the Collection API documentation. Le mar. 10 déc. 2019 à 19:49, Shawn Heisey a écrit : > On 12/10/2019 11:25 AM, Dominique Bejean wrote: > > I would like to convert a collection (3 shards x 3 replicas)

Convert TLOG collection to NRT

2019-12-10 Thread Dominique Bejean
Hi, I would like to convert a collection (3 shards x 3 replicas) from TLOG to NRT. The only solution I imagine is something like : * with collection API, remove replicas in order to keep only 1 replica per 3 shard * update the collection state.json in zookeer * with collection API, reload the

Re: Zk upconfig command is appending local directory to default confdir

2019-11-18 Thread Dominique Bejean
Hi Michael, It seems Sorl really don't find any solrconfig.xml file or a conf/solrconfig.xml file in the local path you specified. The last try is to look in "/opt/solr-6.5.1/server/solr/configsets/", but obviously it doesn't work has you didn't specify a confiset name. The code is here -

Re: When does Solr write in Zookeeper ?

2019-11-18 Thread Dominique Bejean
Thanh you Shawn Le lun. 18 nov. 2019 à 19:28, Shawn Heisey a écrit : > On 11/18/2019 8:39 AM, Dominique Bejean wrote: > > How Solr nodes know that something was changed in Zookeeper by an other > > node ? Is there any notification from ZK or do Solr nodes read > > systemat

Re: When does Solr write in Zookeeper ?

2019-11-18 Thread Dominique Bejean
thout _either_ reading or writing to ZK. > > One rather obscure cause for ZK writes is when using “schemaless” mode. > When a new field is detected, the schema (and thus the collection’s > configuration) is changed, which generates writes.. > > Best, > Erick > > > > On

Re: $deleteDocByQuery and $deleteDocByID

2019-11-15 Thread Dominique Bejean
Hi Paresh, Due to deleteDocByQuery impact on commits and searcher reopen, if a lot of deletions are done it is preferable when possible to use deletebyid . Regards Dominique Le mar. 12 nov. 2019 à 07:03, Paresh a écrit : > Hi Erik, > > I am also looking for some example of deleteDocByQuery.

Re: When does Solr write in Zookeeper ?

2019-11-15 Thread Dominique Bejean
g or writing to ZK. > > One rather obscure cause for ZK writes is when using “schemaless” mode. > When a new field is detected, the schema (and thus the collection’s > configuration) is changed, which generates writes.. > > Best, > Erick > > > > On Nov 15, 2019, at 12

When does Solr write in Zookeeper ?

2019-11-15 Thread Dominique Bejean
Hi, I would like to be certain to understand how Solr use Zookeeper and more precisely when Solr write into Zookeeper. Solr stores various informations in ZK - globale configuration (autoscaling, security.json) - collection configuration (configs) - collections state (state.json,

Re: NRT vs TLOG bulk indexing performances

2019-10-30 Thread Dominique Bejean
s replicating changed segments and that’s slowing down > ingestion? > > It’d be interesting to index to NRT, leader-only and also a single TLOG > collection. > > > Best, > Erick > > > On Oct 25, 2019, at 8:28 AM, Dominique Bejean > wrote: > > > > Shawn

Re: NRT vs TLOG bulk indexing performances

2019-10-25 Thread Dominique Bejean
10/25/2019 1:16 AM, Dominique Bejean wrote: > > For collection created with all replicas as NRT > > > > * Indexing time : 22 minutes > > > > > For collection created with all replicas as TLOG > > > > * Indexing time : 34 minutes > > NRT indexes sim

Re: NRT vs TLOG bulk indexing performances

2019-10-25 Thread Dominique Bejean
est? > > > Am 25.10.2019 um 09:16 schrieb Dominique Bejean < > dominique.bej...@eolya.fr>: > > > > Hi, > > > > I made some benchmarks for bulk indexing in order to compare performances > > and ressources usage for NRT versus TLOG replica. > > >

NRT vs TLOG bulk indexing performances

2019-10-25 Thread Dominique Bejean
Hi, I made some benchmarks for bulk indexing in order to compare performances and ressources usage for NRT versus TLOG replica. Environnent : * Solrcloud with 4 Solr nodes (8 Gb RAM, 4 Gb Heap) * 1 collection with 2 shards x 2 replicas (all NRT or all TLOG) * 1 core per Solr Server Indexing of

Re: Minimum Tomcat version that supports latest Solr version

2019-10-15 Thread Dominique Bejean
Hi, Solr is not tested with Tomcat since version 4. Why not using the embedded Jetty server ? Regards Dominique Le mar. 15 oct. 2019 à 10:44, vikas shinde a écrit : > Dear Solr team, > > Which is the latest Tomcat version that supports the latest Solr version > 8.2.0? > > Also provide

Re: solr.log explanations for update handler

2019-10-03 Thread Dominique Bejean
pdate/processor/LogUpdateProcessorFactory.java#L212 > > > On Wed, Oct 2, 2019 at 12:56 PM Dominique Bejean < > dominique.bej...@eolya.fr> > wrote: > > > Hi, > > > > I don't find explanations on what are the 2 numeric values mean at the > end > > of these log lines. >

solr.log explanations for update handler

2019-10-02 Thread Dominique Bejean
Hi, I don't find explanations on what are the 2 numeric values mean at the end of these log lines. Regards. Dominique 2019-09-30 09:19:17.474 INFO (qtp2051853139-9577) [c:maCollection3s3r s:shard1 r:core_node11 x:maCollection3s3r_shard1_replica_t2] o.a.s.u.p.LogUpdateProcessorFactory

Re: Synonym filters memory usage

2019-10-02 Thread Dominique Bejean
SynonymMaps. > >>>> > >>>> Regards > >>>> Bernd > >>>> > >>>> > >>>> Am 30.09.19 um 08:41 schrieb Andrea Gazzarini: > >>>>> Hi, > >>>>> looking at the stateful nature of

Synonym filters memory usage

2019-09-29 Thread Dominique Bejean
Hi, My concern is about memory used by synonym filter, especially if synonyms resources files are large. If in my schema, there are two field types "TypeSyno1" and "TypeSyno2" using synonym filter with the same synonyms files. For each of these two field types there are two fields Field1 type

Re: [ZOOKEEPER] - Error - HEAP MEMORY

2019-07-30 Thread Dominique Bejean
Hi, I don’t find any documentation about the parameter zookeeper_server_java_heaps in zoo.cfg. The way to control java heap size is either the java.env file of the zookeeper-env.sh file. In zookeeper-env.sh SERVER_JVMFLAGS="-Xmx=512m" How many RAM on your server ? Regards Dominique Le lun.

Field value different over replicas

2019-07-26 Thread Dominique Bejean
Hi, We have a date field with default set to “now”. For this field, some documents of the collection don’t have the same value in all replicas. The difference can be 3 or 4 minutes ! The collection has 1 shard and 2 NRT replicas. Solr version is 7.5. Collection is populated with DIH. Any ideas

Re: RuleBasedAuthorizationPlugin configuration

2019-01-01 Thread Dominique Bejean
Hi, I created a Jira issue https://issues.apache.org/jira/browse/SOLR-13097 Regards. Dominique Le lun. 31 déc. 2018 à 11:26, Dominique Bejean a écrit : > Hi, > > In debugging mode, I discovered that only in SolrCloud mode the collection > name is extract from the request path

Re: RuleBasedAuthorizationPlugin configuration

2018-12-31 Thread Dominique Bejean
in Solr standalone mode, only authentication is fully fonctionnal, not authorization ! Regards. Dominique Le dim. 30 déc. 2018 à 13:40, Dominique Bejean a écrit : > Hi, > > After reading more carefully the log file, here is my understanding. > > The request > > http://2:

Re: RuleBasedAuthorizationPlugin configuration

2018-12-30 Thread Dominique Bejean
? Regards Dominique Le ven. 21 déc. 2018 à 10:46, Dominique Bejean a écrit : > Hi, > > I am trying to configure security.json file, in order to define the > following users and permissions : > >- user "admin" with all permissions on all collections >- u

Re: Zookeeper timeout issue -

2018-12-21 Thread Dominique Bejean
Hi, What is the scenario ? High query activity ? High update activity ? Regards. Dominique Le mer. 19 déc. 2018 à 13:44, AshB a écrit : > Hi, > > We are facing issue with solr/zookeeper where zookeeper timeouts after > 1ms. Error below. > > *SolrException:

Re: Is there a common tool for SOLR benckmark?

2018-12-21 Thread Dominique Bejean
Hi, There are the powerfull JMeter obviously and also SolrMeter ( https://github.com/tflobbe/solrmeter). Regards Dominique Le jeu. 20 déc. 2018 à 03:17, zhenyuan wei a écrit : > Hi all, >Is there a common tool for SOLR benckmark? YCSB is not very > suitable for SOLR. Currently, Is

Re: ZooKeeper for Solr 7.6

2018-12-21 Thread Dominique Bejean
Hi, This is a Solr side issue not a Zookeeper side issue. Zookeeper 3.4.13 is 5 monthes old version so you can use it on server side with the zookeeper client 3.4.11 provided by Solr. Dominique Le jeu. 20 déc. 2018 à 01:53, Yasufumi Mizoguchi a écrit : > Hi, > > I searched JIRA and found

RuleBasedAuthorizationPlugin configuration

2018-12-21 Thread Dominique Bejean
Hi, I am trying to configure security.json file, in order to define the following users and permissions : - user "admin" with all permissions on all collections - user "read" with read permissions on all collections - user "1" with only read permissions on biblio collection -

Re: CMS GC - Old Generation collection never finishes (due to GC Allocation Failure?)

2018-10-12 Thread Dominique Bejean
Hi, 1/ As previously said by other persons, my first action would be to understand why you need so much heap ? The first step is to maximize your heap size to 31Gb (or obviously less if possible). https://blog.codecentric.de/en/2014/02/35gb-heap-less-32gb-java-jvm-memory-oddities/ Can you

Re: Index size issue in SOLR-6.5.1

2018-10-08 Thread Dominique Bejean
HI, In the Solr Admin console, you can access for each core to the "Segment info" page. You can see if there are more deleted documents in segments on server X. Dominique Le lun. 8 oct. 2018 à 07:29, SOLR4189 a écrit : > About which details do you ask? Yesterday we restarted all our solr >

Re: Index size issue in SOLR-6.5.1

2018-10-07 Thread Dominique Bejean
Hi, What about cores segment details in admin UI interface ? More deleted documents ? Regards Dominique Le dim. 7 oct. 2018 à 08:22, SOLR4189 a écrit : > Hi all, > > We use SOLR-6.5.1 and we have very strange issue. In our collection index > size is very different from server to server

Re: Docker and Solr Indexing

2018-09-12 Thread Dominique Bejean
Hi, Are you aware about issues in Java applications in Docker if java version is not 10 ? https://blog.docker.com/2018/04/improved-docker-container-integration-with-java-10/ Regards. Dominique Le mer. 12 sept. 2018 à 05:42, Shawn Heisey a écrit : > On 9/11/2018 9:20 PM, solrnoobie wrote: >

Re: SOLR zookeeper connection timeout during startup is hardcoded to 10000ms

2018-08-27 Thread Dominique Bejean
Hi, We also experimenting time-out issues from time to time. I sent this message one month ago, by mistake in the dev list. Why use hardcoded values just in ZkClientClusterStateProvider.java file while there are existing parameters for these time-out ? Regards Dominique

Solr and ZK timeout issues

2018-07-17 Thread Dominique Bejean
Hi, We are experimenting an issue related to Zk Timeout Stacktrace is : ERROR 19 juin 2018 06:24:07,152 - h.concurrent.ConcurrentService:67 - Erreur dans l'attente de la fin de l'exécution d'un thread ERROR 19 juin 2018 06:24:07,152 - h.concurrent.ConcurrentService:68 -

Re: Silk from LucidWorks

2018-07-16 Thread Dominique Bejean
Hi, Use Grafana with Solr starting version 7 si very easy and well documented. https://lucene.apache.org/solr/guide/7_3/monitoring-solr-with-prometheus-and-grafana.html Dominique Le lun. 16 juil. 2018 à 06:56, Aroop Ganguly a écrit : > How do you use Grafana with Solr ? Did you build a http

Re: Removed nodes still visible as gone in Solrcloud graph

2018-05-29 Thread Dominique Bejean
to ZK # server/scripts/cloud-scripts/zkcli.sh -z "xxx.xxx.xxx.xxx:2181" -cmd putfile /collections/xx/state.json /tmp/-state-local.json - Start all Solr nodes Dominique Le mar. 29 mai 2018 à 14:19, Dominique Bejean a écrit : > Hi, > > On a node, I accide

Removed nodes still visible as gone in Solrcloud graph

2018-05-29 Thread Dominique Bejean
Hi, On a node, I accidentally changed the SOLR_HOST value from uppercase to lowercase and I restarted the node. After I fixed the error, I restarted again the node but the node name in lowercase is still visible as "gone". How to definitively remove a gone node from the Solrcloud graph ?

Re: Howto disable PrintGCTimeStamps in Solr

2018-05-07 Thread Dominique Bejean
Hi, Which version of Solr are you using ? Regards Dominique Le ven. 4 mai 2018 à 09:13, Bernd Fehling a écrit : > Hi list, > > this sounds simple but I can't disable PrintGCTimeStamps in solr_gc > logging. > I tried with GC_LOG_OPTS in start scripts and

Re: What are descent disk I/O for Solr and Zookeeper ?

2018-03-11 Thread Dominique Bejean
”. Regards Dominique Le ven. 9 mars 2018 à 00:40, Shawn Heisey <apa...@elyograg.org> a écrit : > On 3/8/2018 2:55 PM, Dominique Bejean wrote: > > Disk I/O are critical for high performance Solrcloud. > > This statement has truth to it, but if your system is correctly size

What are descent disk I/O for Solr and Zookeeper ?

2018-03-08 Thread Dominique Bejean
Hi, Disk I/O are critical for high performance Solrcloud. I am looking for relevante disk I/O tests for both Solr node or Zookeeper element and with these tests what are bad, correct or good results. For instance how to know if these results with basic dd utility reports correct disk

Re: Multi words query time synonyms

2018-02-11 Thread Dominique Bejean
t; "parsedquery_toString":"+(((name_text_gp:maillot) ((name_text_gp:om (+name_text_gp:olympiqu +name_text_gp:marseil~1)", The query result are the same for all queries. It looks like this could be an acceptable workaround. Thank you Dominique Le dim. 11 févr. 2018 à 10:31, Dominiqu

Re: Multi words query time synonyms

2018-02-11 Thread Dominique Bejean
text_gp:maillot) (((+name_text_gp:olympiqu +name_text_gp:de > +name_text_gp:marseil) name_text_gp:om))) > > (btw my stop list only has “de” on it) > > Thanks, > > -- > Steve > www.lucidworks.com > > > On Feb 10, 2018, at 2:12 AM, Dominique Bejean <domin

Re: Multi words query time synonyms

2018-02-10 Thread Dominique Bejean
) olympiqu om marseil maillot So, i suspect an issue with edismax query parser. Regards. Dominique Le ven. 9 févr. 2018 à 18:25, Dominique Bejean <dominique.bej...@eolya.fr> a écrit : > Hi, > > I am trying multi words query time synonyms with Solr 6.6.2and > SynonymGraphFi

Multi words query time synonyms

2018-02-09 Thread Dominique Bejean
Hi, I am trying multi words query time synonyms with Solr 6.6.2and SynonymGraphFilterFactory filter as explain in this article https://lucidworks.com/2017/04/18/multi-word-synonyms-solr-adds-query-time-support/ My field type is :

Re: Solr JVM best pratices

2017-12-04 Thread Dominique Bejean
Thank you Shaw for replying each items I start to figure out better all these tricky jvm stuff. Dominique Le dim. 3 déc. 2017 à 01:30, Shawn Heisey <apa...@elyograg.org> a écrit : > On 12/2/2017 8:43 AM, Dominique Bejean wrote: > > I would like to have some advices on best pr

Re: Solr JVM best pratices

2017-12-02 Thread Dominique Bejean
; This has been solid in production with a 32 node Solr Cloud cluster. We do > not do faceting. > > wunder > Walter Underwood > wun...@wunderwood.org > http://observer.wunderwood.org/ (my blog) > > > > On Dec 2, 2017, at 7:43 AM, Dominique Bejean <dominique.bej...@eol

Re: JVM GC Issue

2017-12-02 Thread Dominique Bejean
:NON^203+size_facet_boost_exact:"velo"^299+size_facet_boost:velo^296+size_facet_relative_boost:velo^292+marque_boost_exact:"velo"^359+marque_boost:velo^356+marque_relative_boost:velo^352+=velo=200=velo=edismax=textSearch=true=1=true=json=EUR_0_price_decimal=sort_EUR_0_special_p

Solr JVM best pratices

2017-12-02 Thread Dominique Bejean
Hi, I would like to have some advices on best practices related to Heap Size, MMap, direct memory, GC algorithm and OS Swap. This is a waste subject and sorry for this long question but all these items are linked in order to have a stable Solr environment. My understanding and questions. About

Re: JVM GC Issue

2017-12-02 Thread Dominique Bejean
10,000 then that return > packet is obviously 1,000 times as large and must be assembled in > memory. > > I rather doubt the phonetic filter is to blame. But you can test this > by just omitting the field containing the phonetic filter in the > search query. I've certainly been wrong befor

  1   2   >