Re: optimize boosting parameters

2020-12-07 Thread Radu Gheorghe
Hi Derek, Ah, then my reply was completely off :) I don’t really see a better way. Maybe other than changing termfreq to field, if the numeric field has docValues? That may be faster, but I don’t know for sure. Best regards, Radu -- Sematext Cloud - Full Stack Observability - https

Re: doc for REQUESTSTATUS

2020-12-07 Thread Radu Gheorghe
Hi Elisabeth, This is the doc for REQUESTSTATUS, apparently only request ID is supported indeed: https://lucene.apache.org/solr/guide/8_6/coreadmin-api.html#coreadmin-requeststatus Best regards, Radu -- Sematext Cloud - Full Stack Observability - https://sematext.com Solr and Elasticsearch
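
For reference, a minimal sketch of how a REQUESTSTATUS call by request ID might be built. The base URL and the helper name `requeststatus_url` are assumptions for illustration; only the `action=REQUESTSTATUS` and `requestid` parameters come from the CoreAdmin API doc linked above.

```python
from urllib.parse import urlencode


def requeststatus_url(request_id, base_url="http://localhost:8983/solr"):
    """Build a CoreAdmin REQUESTSTATUS URL for one async request ID.

    base_url is an assumed default for a local Solr node; adjust as needed.
    """
    query = urlencode({"action": "REQUESTSTATUS", "requestid": request_id})
    return f"{base_url}/admin/cores?{query}"


if __name__ == "__main__":
    # Prints the URL to poll for the async request with ID "1000".
    print(requeststatus_url("1000"))
```

The request ID is the only selector here, which matches the observation in the reply that nothing else is supported.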

Re: optimize boosting parameters

2020-12-07 Thread Radu Gheorghe
overall measure of how well you’re doing. Or even better, you can have something like Quaerite play with boost values for you: https://github.com/tballison/quaerite/blob/main/quaerite-examples/README.md#genetic-algorithms-ga-runga Best regards, Radu -- Sematext Cloud - Full Stack Observability

Re: Proximity Search with phrases

2020-12-03 Thread Radu Gheorghe
://lucene.apache.org/solr/guide/8_4/other-parsers.html#xml-query-parser Best regards, Radu -- Sematext Cloud - Full Stack Observability - https://sematext.com Solr and Elasticsearch Consulting, Training and Production Support > On 27 Nov 2020, at 14:25, Mark R wrote: > > Use Case: Is it possible t

Re: Shard Lock

2020-12-03 Thread Radu Gheorghe
Wild shot here: two Solr instances started on the same data directory? Best regards, Radu -- Sematext Cloud - Full Stack Observability - https://sematext.com Solr and Elasticsearch Consulting, Training and Production Support > On 1 Dec 2020, at 06:25, sambasivarao giddaluri > wrote: >

Re: Facet to part of search results

2020-12-03 Thread Radu Gheorghe
> On 3 Dec 2020, at 20:18, Shawn Heisey wrote: > > On 12/3/2020 9:55 AM, Jae Joo wrote: >> Is there any way to apply facet to the partial search result? >> For ex, we have 10m return by "dog" and like to apply facet to first 10K. >> Possible? > > The point of facets is to provide accurate numb

Re: facet.method=smart

2020-12-03 Thread Radu Gheorghe
, Radu -- Sematext Cloud - Full Stack Observability - https://sematext.com Solr and Elasticsearch Consulting, Training and Production Support > On 30 Nov 2020, at 22:46, Jae Joo wrote: > > Is "smart" really smarter than one explicitly defined? > > For "enum"

What do you usually look for in Solr logs?

2020-11-26 Thread Radu Gheorghe
nMonitor OR thread:closeThreadPool* OR thread:coreCloseExecutor*) for shutdown. Do you see better criteria? Last but not least, what do you usually look for in Solr logs? Anything that we don't cover in the above? Any feedback will be very much appreciated! Thanks and best regards,

Re: Is metrics api enabled by default in solr 8.2

2020-10-14 Thread Radu Gheorghe
Hi, Yes, the API works by default on 8.2: https://lucene.apache.org/solr/guide/8_2/metrics-reporting.html I don’t know of a way to disable it, but the configuration is described on the page above (i.e. how to configure different reporters). Best regards, Radu -- Sematext Cloud - Full Stack
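
As a quick check that the Metrics API answers on a default install, one could query `/admin/metrics` directly. The sketch below only builds the URL; the base URL and the helper name `metrics_url` are assumptions, while `group`, `prefix` and `wt` are parameters from the metrics-reporting page linked above.

```python
from urllib.parse import urlencode


def metrics_url(group=None, prefix=None, base_url="http://localhost:8983/solr"):
    """Build a Metrics API URL, optionally narrowed by group and prefix.

    base_url is an assumed default for a local Solr node.
    """
    params = {"wt": "json"}
    if group:
        params["group"] = group    # e.g. jvm, node, core, jetty
    if prefix:
        params["prefix"] = prefix  # keep only metrics whose names start with this
    return f"{base_url}/admin/metrics?{urlencode(params)}"


if __name__ == "__main__":
    # Prints a URL that would return JVM heap metrics from the local node.
    print(metrics_url(group="jvm", prefix="memory.heap"))
```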

Re: how to config split authentication methods -- BasicAuth for WebUI, & none (or SSL client) for client connections?

2020-10-14 Thread Radu Gheorghe
HTTP access to Solr, then it’s not really secure. Best regards, Radu -- Sematext Cloud - Full Stack Observability - https://sematext.com Solr and Elasticsearch Consulting, Training and Production Support > On 12 Oct 2020, at 05:11, PGNet Dev wrote: > > I'm running, > >

Re: Solr Document Update issues

2020-10-14 Thread Radu Gheorghe
#UpdateHandlersinSolrConfig-Commits Best regards, Radu -- Sematext Cloud - Full Stack Observability - https://sematext.com Solr and Elasticsearch Consulting, Training and Production Support > On 13 Oct 2020, at 07:18, aparana bhatt wrote: > > Hi , > > I have been facing lot of issue

Re: Question regarding replica leader

2020-07-19 Thread Radu Gheorghe
each missed some updates). Best regards, Radu -- Sematext Cloud - Full Stack Observability - https://sematext.com Solr and Elasticsearch Consulting, Training and Production Support > On 20 Jul 2020, at 04:17, Vishal Vaibhav wrote: > > Hi any pointers on this ? > > On Wed, 15 Jul 2

Re: Log4J Logging to Http

2020-06-17 Thread Radu Gheorghe
/installation-docker/) which can parse Solr logs out of the box (see https://github.com/sematext/logagent-js/blob/master/patterns.yml#L140). But there are other options, like Fluentd or Logstash. Best regards, Radu > On 17 Jun 2020, at 10:33, Krönert Florian wrote: > > Hello everyone, >

Re: How to determine why solr stops running?

2020-06-08 Thread Radu Gheorghe
check the last logs after it crashed. Best regards, Radu https://sematext.com > On 8 Jun 2020, at 16:28, Ryan W wrote: > > "If Solr auto-restarts" > > It doesn't auto-restart. Is there some auto-restart functionality? I'm > not aware of that. > >

Re: Getting to grips with auto-scaling

2020-06-08 Thread Radu Gheorghe
u suggested, but it's simpler. So simple that I've seen it done without autoscaling (just a few scripts from when you add nodes in each tier). Best regards, Radu https://sematext.com On Fri, 5 Jun 2020, 21:59 Tom Evans wrote: > Hi > > I'm trying to get a handle on the new

Re: How to determine why solr stops running?

2020-06-08 Thread Radu Gheorghe
logs for what happened before Solr was restarted. Best regards, Radu https://sematext.com/ On Thu, 4 Jun 2020, 19:24 Ryan W wrote: > Happened again today. Solr stopped running. Apache hasn't stopped in 10 > days, so this is not due to a server reboot. > > Solr is not being r

Re: Shingles behavior

2020-05-21 Thread Radu Gheorghe
Turns out, it’s down to setting enableGraphQueries=false in the field definition. I completely missed that :( > On 21 May 2020, at 07:49, Radu Gheorghe wrote: > > Hi Alex, long time no see :) > > I tried with sow, and that basically invalidates query-time shingles (it only &

Re: Shingles behavior

2020-05-20 Thread Radu Gheorghe
order shingles naturally get lower frequencies, meaning they get a "natural" boost. Best regards, Radu On Thu, 21 May 2020, 00:28 Alexandre Rafalovitch wrote: > Did you try it with 'sow' parameter both ways? I am not sure I fully > understand the question, especiall

Shingles behavior

2020-05-19 Thread Radu Gheorghe
the two, on the analysis side, is that tokens start at 0 in Elasticsearch and at 1 in Solr. I doubt that’s the problem, because I see that the default “text_en”, for example, also starts at position 1. Is it just a bug that mm doesn’t work in the context of shingles? Or is there a workaround? Thanks and best regards, Radu

Re: Which Solr metrics do you find important?

2020-04-29 Thread Radu Gheorghe
regards, Radu -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ On Tue, Apr 28, 2020 at 6:47 PM Walter Underwood wrote: > I also have some Python that pull stuff from clusterstatus and sends it to >

Re: Which Solr metrics do you find important?

2020-04-28 Thread Radu Gheorghe
Thanks a lot, Matthew! OK, so you do care about the size of tlogs. As well as Collections API stuff (clusterstatus, overseerstatus). And DIH, I didn't think that these stats would be interesting, but surely they are for people who use DIH :) Best regards, Radu -- Monitoring - Log Manag

Which Solr metrics do you find important?

2020-04-28 Thread Radu Gheorghe
that are up/down/recovering... as well as replica types. We plan on adding those, but there's a challenge in de-duplicating metrics. Because one would install one agent per node, and I'm not aware of a way to show only local shards in the Collections API -> CLUSTERSTATUS. Thanks in

Re: Filtered join in Solr?

2020-02-05 Thread Radu Gheorghe
still route by movie ID. But this means more expensive updates/indexing :( Best regards, Radu On Wed, Feb 5, 2020 at 1:00 AM Edward Ribeiro wrote: > > Just for the sake of an imagined scenario, you could use the [subquery] doc > transformer. A query like the one below: > > /select?q=f

Filtered join in Solr?

2020-02-04 Thread Radu Gheorghe
ry ' [{ "id": "user_1", "name": "Jane", "family": "Smith", "born": "1990-01-01", "watched_movies": ["1", "3"] }, { "id": "user_2", "title": "Joe", "

solr-diagnostics: utility for collecting info from the Solr installation

2020-01-16 Thread Radu Gheorghe
feedback. Or even better, please open an issue or submit a PR :) Thanks and best regards, Radu -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/

Re: Partial results from streaming expressions (i.e. making them "stream")

2018-01-17 Thread Radu Gheorghe
nderstood something), but I can't get partial results to the HTTP client issuing the original requests. Does this clarify my issue? Thanks again and best regards, Radu -- Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * http://sematext.com/ On

Re: Partial results from streaming expressions (i.e. making them "stream")

2018-01-17 Thread Radu Gheorghe
east globally to the /stream handler. What do you think? Would such a patch be welcome, to add these parameters? But it still requires chunked encoding - would reverting SOLR-8669 be a problem? Or maybe there's a more elegant way to enable chunked encoding, maybe only for streams? Best rega

Partial results from streaming expressions (i.e. making them "stream")

2018-01-15 Thread Radu Gheorghe
e configurable per request. As for manual flushing, that would require changes to the streaming expressions themselves. Would that be the way to go? What do you think? [1] https://issues.apache.org/jira/secure/attachment/12787283/SOLR-8669.patch Best regards, Radu -- Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * http://sematext.com/

autoAddReplicas doesn't respect replicationFactor?

2017-10-03 Thread Radu Gheorghe
e=hdfs://localhost:9000/solr At this point I have two replicas of my shard (one on each node). Am I missing something or is this a bug? Maybe replicationFactor=1 is an edge case? Best regards, Radu -- Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * http://sematext.com/

Re: Multiple rollups/facets in one streaming aggregation?

2016-08-16 Thread Radu Gheorghe
he merge the metrics and then emit the >> merged metrics in an EOF Tuple. >> >> If you think this meets your needs, feel free to create a jira and >> begin a patch and I can help get it committed. >> >> >> Joel Bernstein >> http://joelsolr.blogsp

Multiple rollups/facets in one streaming aggregation?

2016-08-16 Thread Radu Gheorghe
t of the box? B) whatever the best way is, could we do it in a way that can be contributed back to Solr? Any hints on how to do that? Just another decorator? Thanks and best regards, Radu -- Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * ht

Re: SolrCloud commit process is too time consuming, even if documents are light

2013-07-25 Thread Radu Ghita
Forgot to attach server and Solr configurations: SolrCloud 4.1, internal ZooKeeper, 16 shards, custom Java importer. Server: Intel(R) Xeon(R) CPU E5-2670 0 @ 2.60GHz, 32 cores, 192 GB RAM, 10 TB SSD and 50 TB SAS storage On Thu, Jul 25, 2013 at 3:20 PM, Radu Ghita wrote: > > Hi, > > W

SolrCloud commit process is too time consuming, even if documents are light

2013-07-25 Thread Radu Ghita
alse 200 The big problem lies in Solr: I've run the MySQL queries on their own and they are fast, but as time passes the Solr add function takes far too long and then blocks, even though the server is top-tier and has plenty of resources. I'm new to this, so please assist.

How to disable\clear filterCache(from SolrIndexSearcher ) in a custom searchComponent

2013-01-11 Thread radu
ueryCommand().replaceFlags() but SolrIndexSearcher.NO_CHECK_FILTERCACHE|NO_CHECK_QCACHE|NO_SET_QCACHE are not public either. *Question*: How to disable\clear filterCache (from SolrIndexSearcher) *only* for a custom search component? Do I have other options\approaches? Best regards, Radu

Re: Entity with multiple datasources

2012-02-16 Thread Radu Toev
st probably) not globally, there will > be collisions. To avoid this, I would prepend a record_id with some static > value, like: concat("t1", CONVERT(id, CHAR(8))). > > Dmitry > > On Thu, Feb 16, 2012 at 4:47 PM, Radu Toev wrote: > > > I'm not sure I fo

Re: Entity with multiple datasources

2012-02-16 Thread Radu Toev
by the same data-config: > > > > > > > > > > > > > > > > > It's the 'rootEntity="false" that makes the child entity a document. > -- > > Dmitry > > On Thu, Feb 16, 2012 at 2:37 PM, Radu Toev

Re: Entity with multiple datasources

2012-02-16 Thread Radu Toev
the unique field in > the db? > > On Thu, Feb 16, 2012 at 2:13 PM, Radu Toev wrote: > > > I tried running with just one datasource (the one that has 6k entries) and > > it indexes them ok. > > The same happens if I index the 1k database separately. It indexes ok. > >

Re: Entity with multiple datasources

2012-02-16 Thread Radu Toev
om the smaller dataset. That's why you have 2000. If you just have a > handler pointed to the bigger data set (6k), do you manage to get all 6k db > entries into solr? > > On Thu, Feb 16, 2012 at 1:46 PM, Radu Toev wrote: > > > 1. Nothing in the logs > > 2. No. >

Re: Entity with multiple datasources

2012-02-16 Thread Radu Toev
1. Nothing in the logs 2. No. On Thu, Feb 16, 2012 at 12:44 PM, Dmitry Kan wrote: > 1. Do you see any errors / exceptions in the logs? > 2. Could you have duplicates? > > On Thu, Feb 16, 2012 at 10:15 AM, Radu Toev wrote: > > > Hello, > > > > I created a data

Entity with multiple datasources

2012-02-16 Thread Radu Toev
Hello, I created a data-config.xml file where I define a datasource and an entity with 12 fields. In my use case I have 2 databases with the same schema, so I want to combine the 2 databases in one index. I defined a second dataSource tag and duplicated the entity with its field (changed the name

Solr multiple cores - multiple databases approach

2012-02-15 Thread Radu Toev
Hello, I have a use case where I'm trying to integrate Solr: - 2 databases with the same schema - I want to index multiple entities from those databases My question is what is the best way of approaching this topic: - should I create a core for each database and inside that core create a document w

Re: Fields not indexed?

2012-02-08 Thread Radu Toev
I just realized that as I pushed the send button :P Thanks, I'll have a look. On Wed, Feb 8, 2012 at 2:58 PM, Dmitry Kan wrote: > well, you should add these fields in schema.xml, otherwise solr won't know > them. > > On Wed, Feb 8, 2012 at 2:48 PM, Radu Toev wrote: >

Re: Fields not indexed?

2012-02-08 Thread Radu Toev
The schema.xml is the default file that comes with Solr 3.5; I didn't change anything there. On Wed, Feb 8, 2012 at 2:45 PM, Dmitry Kan wrote: > What does your schema for the fields look like? > > On Wed, Feb 8, 2012 at 2:41 PM, Radu Toev wrote: > > > Hi, > > >