Re: SOLR 8.6 Synonyms search and out of context results

2021-01-22 Thread Colvin Cowie
Hello, Do you mean that you want searches for "gain" to match documents with "revenue" on them, but do *not* want searches for "revenue" to match documents with "gain" on them? If that's what you mean, how have you defined your synonyms? If you're using the SynonymGraphFilterFactory

Re: Solr cloud issuse: Async exception during distributed update

2020-12-10 Thread Colvin Cowie
Hi, sounds like https://issues.apache.org/jira/browse/SOLR-13963 which was fixed in Solr 8.3.1 On Thu, 10 Dec 2020 at 06:20, Ritvik Sharma wrote: > Hi Houston, > Thanks for reply > > We dont have this kind of field. It's a field value and it is coming > randomly, not all the time. > We are

Re: What's the most efficient way to check if there are any matches for a query?

2020-12-07 Thread Colvin Cowie
uspect to improve that, > depends on whether you think those are fast enough… > > Best, > Erick > > > > On Dec 5, 2020, at 5:04 AM, Colvin Cowie > wrote: > > > > Hello, > > > > I was just wondering. If I don't care about the number of matches for a

What's the most efficient way to check if there are any matches for a query?

2020-12-05 Thread Colvin Cowie
Hello, I was just wondering. If I don't care about the number of matches for a query, let alone what the matches are, just that there is *at least 1* match for a query, what's the most efficient way to execute that query (on the /select handler)? (Using Solr 8.7) As a general approach for a

Re: Solr 8.6.2 - Admin UI Issue

2020-10-08 Thread Colvin Cowie
Images won't be included on the mailing list. You need to put them somewhere else and link to them. With that said, if you're switching between versions, maybe your browser has the old UI cached? Try clearing the cache / viewing it in a private window and see if it's any different. On Wed, 7 Oct

Re: Unexpected Performance decrease when upgrading Solr 5.5.2 to 8.5.2

2020-09-16 Thread Colvin Cowie
Hello, Your images won't appear on the mailing list. You'll need to post them elsewhere and link to them. On Tue, 15 Sep 2020 at 09:44, Keene Chen wrote: > Hi Solr users community, > > > We have been doing some performance tests on Solr 5.5.2 and Solr 8.5.2 as > part of an upgrading process,

Re: "timeAllowed" param with "numFound" having a count value but doc list is empty

2020-09-16 Thread Colvin Cowie
Hi Mark, If queries taking 10 (or however many) seconds isn't acceptable, then either you need to a) prevent or optimize those queries, b) improve the performance of your index, c) use timeAllowed and accept that queries taking that long may fail or provide incomplete results, or d) a combination

Re: SynonymFilterFactory deprecated, documentation and search

2020-07-30 Thread Colvin Cowie
That does some like an unhelpful example to have, though https://lucene.apache.org/solr/guide/8_6/filter-descriptions.html#synonym-filter does clearly state that it is deprecated in favour of SynonymGraphFilterFactory . Deprecated classes will (should) continue to work, but are likely to be

Re: Production sizing and scaling guidelines -- Solr

2020-07-28 Thread Colvin Cowie
Maybe not the most up to date or relevant example for your usage but https://sbdevel.wordpress.com/2016/11/30/70tb-16b-docs-4-machines-1-solrcloud/ is one that sticks in my mind I definitely remember seeing a list of these sorts of blogs somewhere a long time ago... don't know where though On

Re: NumberFormatException when acessing ZK Status page in 8.6.0

2020-07-27 Thread Colvin Cowie
Hello, ZkDynamicConfig is something that's only been added in 8.6 as part of https://issues.apache.org/jira/browse/SOLR-14371 I'm not familiar with dynamic reconfig in zookeeper, so I don't know whether the client port number is supposed to optional or not.My reading of

Re: Reinstall broken?

2020-07-21 Thread Colvin Cowie
Hi, When you say you uninstalled 8.x what exactly does that mean? That you deleted the directory of the binary *and* the solr home where the index data was stored? Either way, check your logs (solr/server/logs in the location you extracted the binary by default) and you will see the exception

Re: [ANNOUNCE] Apache Solr 8.6.0 released

2020-07-16 Thread Colvin Cowie
Perhaps the deprecation notices should feature on https://lucene.apache.org/solr/news.html ? Because right now, they're not *very *visible in the changes. On Thu, 16 Jul 2020 at 01:18, Aroop Ganguly wrote: > May we ask what in hdfs support is being deprecated? Is Hdfs backup and > restore being

Re: Solr Float/Double multivalues fields

2020-07-02 Thread Colvin Cowie
The order of values within a multivalued field should match the insertion order. -- we certainly rely on that in our product. Order is guaranteed to be maintained for values in a multi-valued field. >

Re: Solr 8.5.1 cannot add replica using solr admin UI

2020-06-24 Thread Colvin Cowie
Have you checked the logs to see what is happening? What error messages are you getting? On Wed, 24 Jun 2020 at 15:23, Shreyas Kothiya < shreyas.koth...@archivesocial.com> wrote: > Hello > > I was trying to add replica using solr admin UI. but it seems not working > on solr 8.5.1 . > > I was

Re: Solr takes time to warm up core with huge data

2020-06-08 Thread Colvin Cowie
ries that match the filter, in that case the > doc IDs are stored. Consider if maxDoc is 1M and only 2 docs > match the query, it’s much more efficient to store two ints > rather than 1M/8. > > You can also limit the RAM used by specifying maxRamMB. > > Best, > Erick > &g

Re: Solr takes time to warm up core with huge data

2020-06-08 Thread Colvin Cowie
Sorry to hijack this a little bit. Shawn, what's the calculation for the size of the filter cache? Is that 1 bit per document in the core / shard? Thanks On Fri, 5 Jun 2020 at 17:20, Shawn Heisey wrote: > On 6/5/2020 12:17 AM, Srinivas Kashyap wrote: > >

Re: Multiple Solr instances using same ZooKeepers

2020-06-02 Thread Colvin Cowie
You can specify a different "chroot" directory path in zookeeper for each cloud https://lucene.apache.org/solr/guide/8_5/setting-up-an-external-zookeeper-ensemble.html#using-a-chroot On Tue, 2 Jun 2020 at 09:33, Gell-Holleron, Daniel < daniel.gell-holle...@gb.unisys.com> wrote: > Hi there, > >

Re: Solr Ref Guide Redesign coming in 8.6

2020-04-29 Thread Colvin Cowie
In addition to those points, I think it generally does look good but the thing I've noticed is that increase in text size on rollover in the menu makes it quite jumpy: https://drive.google.com/open?id=15EF0T_C_l8OIDuW8QHOFunL4VzxtyVyb On Wed, 29 Apr 2020 at 08:15, Bernd Fehling wrote: > +1 > >

Re: FuzzyQuery causing Out of Memory Errors in 8.5.x

2020-04-23 Thread Colvin Cowie
https://issues.apache.org/jira/browse/SOLR-14428 On Thu, 23 Apr 2020 at 08:45, Colvin Cowie wrote: > I created a little test that fires off fuzzy queries from random UUID > strings for 5 minutes > *FIELD_NAME + ":" + UUID.randomUUID().toString().replace("-", "

Re: FuzzyQuery causing Out of Memory Errors in 8.5.x

2020-04-23 Thread Colvin Cowie
512mb heap, and with a 4GB heap it only just stayed alive. On 8.3.1 it was completely happy. I'm guessing that the memory might be being leaked if the FuzzyQuery objects are referenced from the cache, while the FuzzyTermsEnum would not have been. I'm going to raise an issue On Wed, 22 Apr 2020

FuzzyQuery causing Out of Memory Errors in 8.5.x

2020-04-22 Thread Colvin Cowie
Hello, I'm moving our product from 8.3.1 to 8.5.1 in dev and we've got tests failing because Solr is getting OOMEs with a 512mb heap where it was previously fine. I ran our tests on both versions with jconsole to track the heap usage. Here's a little comparison. 8.5.1 dies part way through

Re: "SolrCore Initialization Failures" error message appears briefly in Solr 8.5.1 Admin UI

2020-04-21 Thread Colvin Cowie
>From a (very) brief googling it seems like using the ng-cloak attribute is the right way to fix this, and it certainly seems to work for me. https://issues.apache.org/jira/browse/SOLR-14422 On Mon, 20 Apr 2020 at 18:12, Colvin Cowie wrote: > Sorry if this has already been raised, but I

"SolrCore Initialization Failures" error message appears briefly in Solr 8.5.1 Admin UI

2020-04-20 Thread Colvin Cowie
Sorry if this has already been raised, but I didn't see it. When loading / refreshing the Admin UI in 8.5.1, it briefly but *visibly* shows a placeholder for the "SolrCore Initialization Failures" error message, with a lot of redness. It looks like there is a real problem. Obviously the message

Re: Refresh doesn't work in the new Nodes view in Admin UI on Windows

2020-04-20 Thread Colvin Cowie
I have opened https://issues.apache.org/jira/browse/SOLR-14416 for this On Thu, 20 Jun 2019 at 17:01, Colvin Cowie wrote: > On Solr 8.1.1 / 7.7.2 with Oracle 1.8.0_191 25.191-b12 with Solr running > on Windows 10 > > In the Nodes view of the Admin UI, > http://localhost:8983/sol

Re: Optimal size for queries?

2020-04-15 Thread Colvin Cowie
Hi, I can't answer the question as to what the optimal size of rows per request is. I would expect it to depend on the number of stored fields being marshaled, and their type, and your hardware. But using start + rows is a *bad thing* for deep paging. You need to use cursorMark, which looks like

Re: ReversedWildcardFilter - should it be applied only at the index time?

2020-04-15 Thread Colvin Cowie
You only need apply it in the index analyzer: https://lucene.apache.org/solr/8_4_0/solr-core/org/apache/solr/analysis/ReversedWildcardFilterFactory.html If it appears in the index analyzer, the query part of it is automatically applied at query time. The ReversedWildcardFilter indexes *every*

Re: CVE-2017-7525 fix for Solr 7.7.x

2019-12-19 Thread Colvin Cowie
Sorry, in Solr 8 and master there are some additional users of Jackson. But they still don't appear to use default typing or unrestricted subtypes. On Thu, 19 Dec 2019 at 16:50, Colvin Cowie wrote: > Hi, > > We've got users on Solr 6 (and use Jackson ourselves), so I had a look at &

Re: CVE-2017-7525 fix for Solr 7.7.x

2019-12-19 Thread Colvin Cowie
Hi, We've got users on Solr 6 (and use Jackson ourselves), so I had a look at this CVE and related Jackson exploits, to see whether they are actually exploitable in Solr. - What parts of Solr actually use Jackson (I thought noggit was used for the JSON de/serialization)? - Do any of the

Re: [ANNOUNCE] Apache Solr 8.3.1 released

2019-12-09 Thread Colvin Cowie
Cheers On Mon, 9 Dec 2019 at 11:19, Ishan Chattopadhyaya wrote: > Thanks, I'll fix. > > On Mon, Dec 9, 2019 at 4:15 PM Colvin Cowie > wrote: > > > > Oh, just looking at the way the announcement reads on > > http://lucene.apache.org/solr/news.html : >

Re: [ANNOUNCE] Apache Solr 8.3.1 released

2019-12-09 Thread Colvin Cowie
Oh, just looking at the way the announcement reads on http://lucene.apache.org/solr/news.html : Solr 8.3.1 Release Highlights: - JavaBinCodec has concurrent modification of CharArr resulting in corrupt internode updates That kind of sounds like the corrupt internode updates is something

Re: NPE on exceeding timeAllowed on SOLR-8.1.1

2019-12-04 Thread Colvin Cowie
Hi, this is an issue (or a variation of it) that we encountered when using timeAllowed since we started using Solr (5 or 6). I see that there's a couple of JIRA issues that cover similar problems e.g. https://issues.apache.org/jira/browse/SOLR-9882 which is marked as resolved in 8.1. It's possible

Re: solr 8.3 indexing wrong values in some fields

2019-12-02 Thread Colvin Cowie
This sounds like https://issues.apache.org/jira/browse/SOLR-13963 Solr 8.3.1 is likely to be available soon - RC2 is at https://dist.apache.org/repos/dist/dev/lucene/lucene-solr-8.3.1-RC2-reva3d456fba2cd1b9892defbcf46a0eb4d4bb4d01f/solr/ Re-index on it, and see if you still have issues. On Sun, 1

Re: Possible data corruption in JavaBinCodec in Solr 8.3 during distributed update?

2019-11-23 Thread Colvin Cowie
https://issues.apache.org/jira/browse/SOLR-13963 I'll see about modifying the test I have to fit in with the existing tests, and if there's a better option then open to whatever On Sat, 23 Nov 2019 at 16:43, Colvin Cowie wrote: > I've found the problem, JavaBinCodec has a CharArr,* arr*, wh

Re: Possible data corruption in JavaBinCodec in Solr 8.3 during distributed update?

2019-11-23 Thread Colvin Cowie
block into _readStr() fixes the problem. At least as far as my test goes. I'll raise a JIRA issue and can provide a patch with the synchronized block, but not sure what test(s) should be updated / added to cover this? On Thu, 21 Nov 2019 at 18:23, Colvin Cowie wrote: > *> the difference is b

Re: Possible data corruption in JavaBinCodec in Solr 8.3 during distributed update?

2019-11-21 Thread Colvin Cowie
6 INFO (zkConnectionManagerCallback-7-thread-1) [ ] o.a.s.c.c.ConnectionManager zkClient has connected2019-11-21 17:25:19.396 INFO (main) [ ] o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper* On Thu, 21 Nov 2019 at 17:30, Colvin Cowie wrote: > I've been a bit snowed under, but I

Re: Possible data corruption in JavaBinCodec in Solr 8.3 during distributed update?

2019-11-21 Thread Colvin Cowie
I've been a bit snowed under, but I've found the difference is because the _default config has the dynamic schema building in it, which I assume is pushing it down a different code path. I'm using the vanilla Solr 8.3.0 binary8.3.0 2aa586909b911e66e1d8863aa89f173d69f86cd2 - ishan - 2019-10-25

Re: Possible data corruption in JavaBinCodec in Solr 8.3 during distributed update?

2019-11-20 Thread Colvin Cowie
. Thanks On Wednesday, 20 November 2019, Noble Paul wrote: > Can you share the test please > > On Thu, Nov 21, 2019 at 7:02 AM Noble Paul wrote: > > > > Thanks Colvin, I'll take a look > > > > On Thu, Nov 21, 2019 at 4:24 AM Colvin Cowie > wrote: > > &g

Re: Possible data corruption in JavaBinCodec in Solr 8.3 during distributed update?

2019-11-20 Thread Colvin Cowie
iterations of indexing 2500 documents, with getRawValue() it succeeds for the 500 iterations I'm running it for) I'll see about providing a test that can be shared that demonstrates the problem, and see if we can find what is going wrong in the codec... On Tue, 19 Nov 2019 at 13:48, Colvin Co

Possible data corruption in JavaBinCodec in Solr 8.3 during distributed update?

2019-11-19 Thread Colvin Cowie
Hello Apologies for the lack of actual detail in this, we're still digging into it ourselves. I will provide more detail, and maybe some logs, once I have a better idea of what is actually happening. But I thought I might as well ask if anyone knows of changes that were made in the Solr 8.3

The waitForZk (SOLR_WAIT_FOR_ZK) setting does not work

2019-07-24 Thread Colvin Cowie
Hello, I didn't see an existing issue for this in Jira: The system property *waitForZk* was added in https://issues.apache.org/jira/browse/SOLR-5129 and is supposed to increase the timeout for an initial connection to Solr at startup, From the solr.in.sh: *# By default Solr will try to connect

Re: different numFound value /select vs. /export

2019-06-28 Thread Colvin Cowie
*/stream?explain=true=sear* *ch(myCore,zkHost=”192.168.1.10:2181 ",qt=”/export”,q=”*:*”, fl=”id”,sort=”id asc”) returns* * 'search(myCore,zkHost=”192.168.1.10:2181 \",qt=”/export”,q=”** *:*”, fl=”id”,sort=”id asc”)' is not a proper expression

Re: refused connection

2019-06-28 Thread Colvin Cowie
I've not seen that error before (except when it's a failed JVM_BIND because the port is in use), but a quick google suggests it might be related to file descriptor limits being enforced by your OS https://groups.google.com/forum/#!topic/gatling/rRpv8LPa51I On Fri, 28 Jun 2019 at 09:34, Midas A

Re: REINDEXCOLLECTION does not work with (basic) authentication

2019-06-23 Thread Colvin Cowie
Patch available for review on https://issues.apache.org/jira/browse/SOLR-13566 On Thu, 20 Jun 2019 at 10:02, Colvin Cowie wrote: > I've raised SOLR-13566 > > On Thu, 20 Jun 2019 at 09:14, Jan Høydahl wrote: > >> I think this may be a case where the (background) job should us

Refresh doesn't work in the new Nodes view in Admin UI on Windows

2019-06-20 Thread Colvin Cowie
On Solr 8.1.1 / 7.7.2 with Oracle 1.8.0_191 25.191-b12 with Solr running on Windows 10 In the Nodes view of the Admin UI, http://localhost:8983/solr/#/~cloud?view=nodes there is a refresh button. However when you click it, the only thing that gets visibly refreshed is the 'bar chart' (not sure

Re: REINDEXCOLLECTION does not work with (basic) authentication

2019-06-20 Thread Colvin Cowie
gt; 19. jun. 2019 kl. 20:50 skrev Colvin Cowie : > > > > Hello > > > > I'm on the Solr 8.1 branch off commit > > f26388d034fe5eadca7416aa63b509b8db2c7688 so I have the authentication > fixes > > from SOLR-13510 (intermittent 401s for internode requests) > >

REINDEXCOLLECTION does not work with (basic) authentication

2019-06-19 Thread Colvin Cowie
Hello I'm on the Solr 8.1 branch off commit f26388d034fe5eadca7416aa63b509b8db2c7688 so I have the authentication fixes from SOLR-13510 (intermittent 401s for internode requests) When trying to use the new REINDEXCOLLECTION command with basic auth enabled, the daemon stream fails with repeated

Re: Increased disk space usage 8.1.1 vs 7.7.1

2019-06-13 Thread Colvin Cowie
Hello, For context it would probably be helpful to know some more info about the collection. e.g. it's 1GB bigger, but what percentage increase does that represent? Like is it 0.5% or 50%? On Thu, 13 Jun 2019 at 11:19, Markus Jelsma wrote: > Hello, > > We are upgrading to Solr 8. One of our

Re: Loading of zkCredentialsProvider has changed in Solr 7 or 8?

2019-06-12 Thread Colvin Cowie
I realize that attachments might not work on the mailing list, so here is the log on Drive https://drive.google.com/file/d/0B7mypFpwbHptWkp0X2U0azU2dGREb1k2WGlpeUM3MlRIWmRB/view?usp=sharing On Tue, 11 Jun 2019 at 11:21, Colvin Cowie wrote: > Hello all > > I hit another problem in mo

Re: ContentStreamUpdateRequest no longer closes stream

2019-06-12 Thread Colvin Cowie
I realize that attachments might not work on the mailing list, so here is the test case on Drive https://drive.google.com/file/d/0B7mypFpwbHptTE5nZE0weURFOExFSHphRFlUV0EyTElaOC0w/view?usp=sharing On Mon, 10 Jun 2019 at 13:17, Colvin Cowie wrote: > Hello, I'm in the process of moving from Sol

Re: Intermittent BasicAuthPlugin Not Authorized

2019-06-12 Thread Colvin Cowie
Jun 2019 at 23:43, Brian Lininger wrote: > Thanks Erick, > I had read thru https://issues.apache.org/jira/browse/SOLR-13510 earlier > today but it seemed specific to Solr 8 as Colvin Cowie wasn't able to > reproduce on 7.7.0 or 7.7.1. I am going to see if the 'forwardCredentials'

Loading of zkCredentialsProvider has changed in Solr 7 or 8?

2019-06-11 Thread Colvin Cowie
Hello all I hit another problem in moving from Solr 6 to 8. We secure our ZooKeeper entirely (there's a restrictive ACL for every znode) To pass the ZooKeeper credentials to Solr we implemented ZkCredentialsProvider and ZkACLProvider to load the credentials from a file on disk, which has the

Re: Basic Authentication in Standalone Configuration ?

2019-06-10 Thread Colvin Cowie
Hello, You need to use the *set *command in windows cmd files to set values. The example solr.in.cmd has commented out examples, e.g. *REM Settings for authenticationREM Please configure only one of SOLR_AUTHENTICATION_CLIENT_BUILDER or SOLR_AUTH_TYPE parametersREM set

ContentStreamUpdateRequest no longer closes stream

2019-06-10 Thread Colvin Cowie
Hello, I'm in the process of moving from Solr 6. to Solr 8. We have a client application that streams CSV files to Solr using ContentStreamUpdateRequest and then deletes the CSV file once the data is indexed. That worked fine in Solr 6, but when using 8, the file is locked and can't be deleted.

Re: Unexpected behaviour when Solr 6 Admin UI pages are cached and server is Solr 8?

2019-06-06 Thread Colvin Cowie
I've raised https://issues.apache.org/jira/browse/SOLR-13522 - feel free to update the description as you like Cheers On Wed, 5 Jun 2019 at 21:48, Shawn Heisey wrote: > On 6/5/2019 2:40 PM, Gus Heck wrote: > > Experiences that force the user to think about the browser cache are > > sub-par :).

Unexpected behaviour when Solr 6 Admin UI pages are cached and server is Solr 8?

2019-06-05 Thread Colvin Cowie
Hello, I have just hit this and wondered if anyone has seen similar before since the login page was added to the Admin UI? I'm using Solr 6.6.6 currently, but I'm in the process of moving to 8.1.x. That means I've been accessing the UI from 6 and had it cached. I switched off 6 and run the 8.1.x

Re: Intermittent error 401 with JSON Facet query to retrieve count all collections

2019-06-03 Thread Colvin Cowie
ns at all.) > > > - I tried looking for a pattern in which requests saw the 401s, but > > > didn't have any luck. The 401 occurs when talking to the whole > > > collection or targeting individual cores directly. It occurs when > > > curl hits a host containing a

Re: Intermittent error 401 with JSON Facet query to retrieve count all collections

2019-06-02 Thread Colvin Cowie
Hello. I encountered this issue too and wrote this up before I found this thread, but I thought I might as well post it still, if it helps... Currently I'm trying to move our product on to Solr 8.1.1. We are currently using 6.6.6, so things have definitely moved on. We use the BasicAuthPlugin +