Hi Solr users,
I can see there are some parameters, such as maxQueryFrequency and
thresholdTokenFrequency, that help control the trigger condition for the
spellcheck mechanism or filter the spelling suggestions. I could not find a
parameter that will filter the suggestions based on (origFreq/freq) r
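A ratio filter of this kind could be applied on the client side, after the response comes back. The following is only a minimal sketch: it assumes an extended-results entry carrying `origFreq` and a `suggestion` list with `word` and `freq`, and the function name, the threshold and the sample numbers are made up for illustration. It is not a Solr parameter.

```python
def filter_by_frequency_ratio(entry, max_ratio):
    """Keep suggestions whose origFreq/freq ratio is at most max_ratio."""
    orig_freq = entry["origFreq"]
    kept = []
    for suggestion in entry["suggestion"]:
        freq = suggestion["freq"]
        # Skip zero-frequency suggestions to avoid dividing by zero.
        if freq > 0 and orig_freq / freq <= max_ratio:
            kept.append(suggestion)
    return kept


# Illustrative numbers only, not taken from a real index.
entry = {
    "origFreq": 40,
    "suggestion": [
        {"word": "plot", "freq": 5934},
        {"word": "port", "freq": 50},
    ],
}
filtered = filter_by_frequency_ratio(entry, max_ratio=0.1)
```

With these sample values only "plot" survives, because 40/5934 is below the threshold while 40/50 is not.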
I know I can apply PatternReplaceFilterFactory to remove control characters
from the indexed value. However, is it possible to do a similar thing for the stored
value? Because of some control characters included in the indexing request,
Solr throws Illegal Character Exception.
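One option is to clean the values in the indexing client before the request is sent, so that both the indexed and the stored value are free of control characters. The sketch below assumes the exception is caused by C0 control characters that are not legal in XML 1.0 (everything below U+0020 except tab, line feed and carriage return). The function name and the sample document are hypothetical.

```python
import re

# C0 control characters that XML 1.0 does not allow
# (tab \x09, line feed \x0a and carriage return \x0d are legal and kept).
_CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def strip_control_chars(value):
    """Return value with illegal control characters removed."""
    return _CONTROL_CHARS.sub("", value)


# Hypothetical document with control characters embedded in a field.
doc = {"id": "1", "title": "bad\x00 title\x0b here"}
clean = {field: strip_control_chars(text) for field, text in doc.items()}
```

Because the cleaning happens before indexing, no analysis-chain filter is needed for the stored value.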
When 'polt' is passed as keyword, both suggestions and collations
parameters are returned. But if I pass 'tag:polt' as search query then only
suggestions parameter is returned. Is this a bug?
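For comparing the two cases side by side, the request URLs can be generated rather than typed by hand. This is a rough sketch only: the host and core name are placeholders, the parameter set is a small subset of the usual spellcheck parameters, and the helper name is made up.

```python
from urllib.parse import urlencode

# Placeholder host and core name.
BASE = "http://solr:8983/solr/myapp/select"


def spellcheck_url(q, spellcheck_q=None):
    """Build a select URL with spellchecking and collation enabled."""
    params = {
        "q": q,
        "wt": "json",
        "spellcheck": "true",
        "spellcheck.extendedResults": "true",
        "spellcheck.collate": "true",
    }
    if spellcheck_q is not None:
        # Spellcheck only the bare term while the main query stays fielded.
        params["spellcheck.q"] = spellcheck_q
    return BASE + "?" + urlencode(params)


plain = spellcheck_url("polt")
fielded = spellcheck_url("tag:polt")
fielded_with_override = spellcheck_url("tag:polt", spellcheck_q="polt")
```

Note that urlencode percent-encodes the colon, so the fielded query is sent as q=tag%3Apolt.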
Thanks for replying. I tried spellcheck.q=polt and it does not help.
Here is what the query looks like:
http://solr:8983/solr/myapp/select?q=tag:polt&wt=json&indent=true&spellcheck=true&spellcheck.count=7&spellcheck.onlyMorePopular=true&spellcheck.extendedResults=true&spellcheck.collate=true&spell
with spellcheck.q I don't get anything back at all.
{
"responseHeader": {
"status": 0,
"QTime": 10,
"params": {
"spellcheck.collateExtendedResults": "true",
"spellcheck.q": "tag:polt",
"indent": "true",
"spellcheck": "true",
"spellcheck.accuracy": "0.72"
"numFound": 7,
"startOffset": 4,
"endOffset": 9,
"origFreq": 0,
"suggestion": [
{
"word": "plot",
"freq": 5934
},
{
"word": "port",
I tried spellcheck.q=polt and q=tag:polt. I get collations, but they are
only for polt and not tag:polt. Because of that, the hits that I get back
are for the frequency of plot and not the frequency of tag:plot.
{
"responseHeader": {
"status": 0,
"QTime": 20,
"params": {
"spellcheck.col
Let me know if I should open a JIRA issue for this. Thanks.
On Tue, Oct 17, 2017 at 10:40 AM, Arnold Bronley
wrote:
> I tried spellcheck.q=polt and q=tag:polt. I get collations, but they are
> only for polt and not tag:polt. Because of that, the hits that I get back
> are for frequenc
Hi,
Before implementing an optimistic concurrency solution, I wrote a test
case to check whether two threads atomically writing two different fields (say
f1 and f2) of the same document (say d) run into a conflict.
Thread t1 atomically writes counter c1 to field f1 of document d, commits
and
Thanks for replying, Chris.
1) depending on the number of CPUs / load on your solr server, it's
possible you're just getting lucky. it's hard to "prove" with a
multithreaded test that concurrency bugs exist.
- Agreed. However, across 200k total calls, the race condition did not happen
even once - I f
hl.encoder escapes html characters in highlight text response except the
highlight html characters that Solr uses. Is there something similar
available for field text that we get back as response from Solr?
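If nothing on the Solr side covers this, the escaping can be done in the client after the response is parsed. A minimal sketch, assuming the response documents are plain dictionaries; the helper name and the sample document are hypothetical, and this is client-side post-processing rather than a Solr feature.

```python
import html


def escape_fields(doc, fields):
    """Return a copy of doc with the named string fields HTML-escaped."""
    return {
        name: html.escape(value) if name in fields and isinstance(value, str) else value
        for name, value in doc.items()
    }


# Hypothetical document as it might come back in a response.
doc = {"id": "1", "title": "<b>Tom & Jerry</b>"}
safe = escape_fields(doc, {"title"})
```

Only the listed fields are touched, so ids and numeric values pass through unchanged.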
Hi,
Are there any features that are exclusive to SolrCloud?
e.g. when I am reading Streaming Expressions documentation, first sentence
there says 'Streaming Expressions provide a simple yet powerful stream
processing language for Solr Cloud.'
So, does this mean that streaming expressions ar
e and not in Solr master-slave mode?
>
> yes, and its annoying as there are features of solr cloud I do not like.
> as far as a comprehensive list, that I do not know but would be interested
> in one as well
>
> On Thu, Feb 14, 2019 at 5:07 PM Arnold Bronley
> wrote:
>
>
On Sat, Feb 16, 2019 at 7:07 PM Arnold Bronley
wrote:
> I am glad to learn that there are others in similar need. A list for
> SolrCloud exclusive features will be really awesome.
> Can any Solr devs please reply to this thread?
>
>
> On Fri, Feb 15, 2019 at 8:39
Hi,
I am using ExactStatsCache in SolrCloud (7.7.1) by adding the following to
the solrconfig.xml file for all collections. I restarted and indexed the
documents of all collections after this change just to be sure.
However, when I do a multi-collection query, the scores do not change before
and after ad
Hi,
I used unidirectional CDCR in SolrCloud (7.7.1) without any issues. But
after setting up bidirectional cdcr configuration, I am not able to index a
document.
Following is the error that I am getting:
Async exception during distributed update: Error from server at
http://host1:8983/solr/techp
" to the "fl" param -- if different (for a given doc) than the score
> as reported in debug, then it's probably working as intended ... just a
> little confusing in the debug output.
>
> On Thu, Mar 14, 2019 at 3:23 PM Arnold Bronley
> wrote:
>
> > Hi,
> >
in.com/in/sarkaramrit2
> Medium: https://medium.com/@sarkaramrit2
>
>
> On Fri, Mar 15, 2019 at 1:03 AM Arnold Bronley
> wrote:
>
> > Hi,
> >
> > I used unidirectional CDCR in SolrCloud (7.7.1) without any issues. But
> > after setting up bidirectional cdcr
Configuration is almost identical for both clusters in terms of cdcr except
for zkHost parameter configuration.
On Thu, Mar 14, 2019 at 3:45 PM Arnold Bronley
wrote:
> Exactly. I have it defined in both clusters. I am following the
> instructions from here .
> https://lucene.apache
gard if you have already considered that in your configuration.
> I had a lot of issues trying to figure out the issue when I realized that
> it was a documentation error.
>
> Thanks
> Nishant
>
>
> On Thu, Mar 14, 2019, 2:54 PM Arnold Bronley wrote:
>
> > Configurati
Hi,
Is it possible to use CDCR with one source SolrCloud cluster and multiple
target SolrCloud clusters? I tried to edit the zkHost setting in the source
cluster's solrconfig file by adding multiple comma-separated values for
target zkHosts for multiple target clusters. But the CDCR replication
happen
cannot use cdcr
for one source- multiple target cluster situation.
Can somebody please confirm whether this is even supported?
On Wed, Mar 20, 2019 at 1:12 PM Arnold Bronley
wrote:
> Hi,
>
> is it possible to use CDCR with one source SolrCloud cluster and multiple
> target SolrCloud clust
, 2019 at 10:40 AM Arnold Bronley
wrote:
> I see a similar question asked but no answers there too.
> http://lucene.472066.n3.nabble.com/CDCR-Replication-from-one-source-to-multiple-targets-td4308717.html
> OP there is using multiple cdcr request handlers but in my case I am using
> mul
Hi,
Does the Solr spellcheck collator consider proximity between words in a
multi-word search phrase?
i.e. instead of returning spelling suggestions by considering each individual
word separately, does it consider them as a group if the words often occur
together?
E.g. bll gats should return bill gates ins
Hi,
In a spellchecker call, if I don't get back a collations object in the
response, is it correct to assume that even if I create a query myself by
joining the individually spell-corrected words in the suggestions object in
the response, it will have 0 results?
E.g. In the following spellchecker response
Thanks for replying Alessandro.
I am passing these parameters:
q=polt&spellcheck.q=polt&wt=json&indent=true&spellcheck=true&spellcheck.count=7&spellcheck.onlyMorePopular=true&spellcheck.extendedResults=true&spellcheck.collate=true&spellcheck.maxCollations=3&spellcheck.maxCollationTries=3&spellche
Hi,
I am using MoreLikeThis handler to get related documents for a given
document. To determine if I am getting good results or not, here is what I
do:
The same original document should be returned as a top match.
If it is not, then there is some problem with the relevancy.
Then, as same input
Thanks for the reply, Alessandro.
Can you please elaborate on a point "a document which has a score 50% of
the original doc score, it doesn't
mean it is 50% similar"? I did not understand this for two reasons:
1. In the end, we are calculating similarity score between documents when
we are solv
Hi,
How much of a problem would it be if I used OpenJDK 11 with Solr 6.3? I am
aware that the system requirements page for Solr mentions that 'You should
avoid Java 9 or later for Lucene/Solr 6.x or earlier.' I am interested in
knowing what sort of functionality would break in Solr if I try to use
O
Hi,
I use Solr docker images from https://hub.docker.com/_/solr/. It uses
Oracle OpenJDK. It is a no go for where I work. What is the best way to
replace this JDK with some other OpenJDK such as Amazon Corretto OpenJDK
for my docker containers if I still want to use above images?
rom-Oracle-Java-Licensing-fee
> >
> >
> >> On Fri, Jan 31, 2020 at 3:45 PM Erick Erickson >
> >> wrote:
> >>
> >> Why is it a no-go? It’s free too.
> >>
> >>> On Jan 31, 2020, at 12:31 AM, Arnold Bronley
> >> wrote
d90afb072284bce5f1715345c53c/8.4/Dockerfile
> >>
> >> which uses openjdk:11-stretch
> >>
> >> and looking up that is
> >>
> >>
> >>
> https://github.com/docker-library/openjdk/blob/1b6e2ef66a086f47315f5d05ecf7de3dae7413f2/11/jdk/Dock
= AdoptOpenJDK
> java.version = 11.0.4
> java.version.date = 2019-07-16
> java.vm.specification.vendor = Oracle Corporation
> java.vm.specification.version = 11
> java.vm.vendor = AdoptOpenJDK
> java.vm.version = 11.0.4+11
> os.version = 5.0.0-32-generic
> openjdk
Thanks, Jan.
The issue is created here -
https://github.com/docker-solr/docker-solr/issues/289
On Fri, Jan 31, 2020 at 7:13 PM Jan Høydahl wrote:
> I poked around a bit and ended up reading this thread
> https://github.com/docker-library/openjdk/issues/320 <
> https://github.com/docker-library/
Following is the definition of the getFields function
in org.apache.lucene.document.Document class. As you can see, it can't
handle the dynamic fields because dynamic fields have pattern like
field_name_*, so the equals condition won't match in following function.
Shouldn't we use matches function
Hi,
We are trying to decide whether we should upgrade to Solr 7.7.2 version or
Solr 8.2.0 version. We are currently on Solr 6.3.0 version.
On one hand 8.2.0 version feels like a good choice because it is the latest
version. But then experience tells that initial versions usually have lot
of bugs
Hi,
I have a topic vector calculated for each Solr document in a
collection. Topic vector is calculated using LDA (
https://en.wikipedia.org/wiki/Latent_Dirichlet_allocation). Now I want to
return the similar document to a given document from this collection. I can
simply use normalized d
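For reference, one common way to compare topic vectors outside Solr is cosine similarity. The sketch below is an assumption about what a brute-force comparison could look like, not a description of the approach in the message, which is cut off in the archive. The document ids, the vectors and the function names are made up.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(target_id, vectors, top_n=3):
    """Rank all other documents by similarity to the target document."""
    target = vectors[target_id]
    scored = [
        (doc_id, cosine_similarity(target, vector))
        for doc_id, vector in vectors.items()
        if doc_id != target_id
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Made-up three-topic vectors keyed by document id.
vectors = {
    "d1": [0.7, 0.2, 0.1],
    "d2": [0.6, 0.3, 0.1],
    "d3": [0.1, 0.1, 0.8],
}
ranked = most_similar("d1", vectors)
```

This compares the target against every other document, so it only scales to small collections; it is meant to show the calculation, not how to run it inside Solr.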
Hi,
Is there a way to turn off CDCR for only selected target clusters?
Say I have a configuration like the following. I have 3 target clusters
targetZkHost1, targetZkHost2 and targetZkHost3. Is it possible to turn off
the CDCR for targetZkHost2 and targetZkHost3 but keep it on for
targetZkHost1?
Hi,
I am getting the following warning in the Solr admin UI logs. I did not get this
warning in Solr 8.1.1.
Please note that I am using the Solr docker slim image from here -
https://hub.docker.com/_/solr/
Unable to load jetty, not starting JettyAdminServer
ch steps do you follow to run Solr docker?
>
> Kind Regards,
> Furkan KAMACI
>
> On Tue, Aug 20, 2019 at 1:25 AM Arnold Bronley
> wrote:
>
> > Hi,
> >
> > I am getting following warning in Solr admin UI logs. I did not get this
> > warning in Solr 8.1.1
ctive for ZooKeeper instances really makes no sense
> as the
> point of ZK ensembles is to keep running even if one goes away.
>
> So can you rephrase the question? Or state the problem you’re trying to
> solve another way?
>
> Best,
> Erick
>
> > On Aug 13, 2019, at 1:
Hi,
In custom Solr plugin code,
req.getCore().getCoreContainer().getCore(core_name) is returning null even
though the core named core_name is loaded and up in Solr. req is an object
of the SolrQueryRequest class. I am using Solr 8.2.0 in SolrCloud mode.
Any ideas on why this might be the case?
l docker log output).
>
> — Martijn
>
> > On 20 Aug 2019, at 19:26, Arnold Bronley
> wrote:
> >
> > Hi,
> >
> > I am using 8.2.0-slim version. I wrap it in my own image by specifying
> some
> > additional settings in Dockerfile (all it does is
PM, Arnold Bronley wrote:
> > I have configured the SolrCloud collection-wise only and there is no
> other
> > way. The way you have defined 3 zkHosts (comma separated values for
> zkHost
> > property), I tried that one before as it was more intuitive. But it did
> not
>
core
information to be abstracted out that way.
On Wed, Aug 28, 2019 at 5:13 PM Erick Erickson
wrote:
> Hmmm, should work. What is your core_name? There’s strings like
> collection1_shard1_replica_n4 and core_node6. Are you sure you’re using the
> right one?
>
> > On Aug 28, 2019, at
ink you need to get a core? What do you want to _do_
> with the doc that you need the core it resides in?
>
> Best,
> Erick
>
> > On Aug 28, 2019, at 5:28 PM, Arnold Bronley
> wrote:
> >
> > Wait, would I need to use core name like collection1_shard1_replica_n4
(in this case a CloudSolrClient) instead of dealing
> with such low-level details? The actual location of the document you are
> looking for would be completely abstracted.
>
> Best,
> Andrea
>
> On Thu, 29 Aug 2019, 18:50 Arnold Bronley,
> wrote:
>
> > So, here is the pr
it from
> _collection_ B. Since you have the doc ID (presumably the <uniqueKey>),
> using get-by-id instead of a standard query will be very efficient. I can
> imagine
> under very heavy load this might introduce too much overhead, but it’s
> where I’d start.
>
> Best,
> Erick
>
>
h.
>
> Andrea
>
> On Thu, 29 Aug 2019, 21:32 Arnold Bronley,
> wrote:
>
> > @Andrea: I agree with you. Do you know if there is a way to initialize
> > SolrCloudClient directly from some information that I get
> > from SolrQueryRequest or from Add
Hi,
Is there a way to create a SolrClient from inside the processAdd function of a
custom update processor for the same Solr instance on which it is executing?
to do this…..
>
> Best,
> Erick
>
> > On Sep 3, 2019, at 6:17 PM, Arnold Bronley
> wrote:
> >
> > Hi,
> >
> > Is there a way to create SolrClient from inside processAdd function for
> > custom update processor for the same Solr on which it is executing?
>
>
s for indexing to incoming documents (and recall
> discussing this with David Smiley), but eventually decided against this
> approach on the grounds of complexity.
>
> -Simon
>
> On Wed, Sep 4, 2019 at 2:10 PM Arnold Bronley
> wrote:
>
> > I need to search some other c
Why is atomic indexing not the default mode of indexing in Solr? That way
the ownership model of the content changes from document level to field
level for clients. Multiple clients can participate in the contribution
process of the same Solr document without overwriting each other.
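As an illustration of field-level ownership, here is roughly what two clients' atomic update payloads could look like in Solr's JSON update format, using the `set` modifier. The document id `d` and the field names `f1` and `f2` are placeholders, and the helper function is hypothetical.

```python
import json


def atomic_update(doc_id, field, value, op="set"):
    """Build an atomic update that touches a single field of one document."""
    return {"id": doc_id, field: {op: value}}


# Client 1 owns f1, client 2 owns f2; neither sends the other's field.
client1_payload = json.dumps([atomic_update("d", "f1", 1)])
client2_payload = json.dumps([atomic_update("d", "f2", 2)])
```

Each payload would be posted to the collection's update handler; because only the named field is sent, one client does not need to know the other client's values.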
need the SolrClient
> > > _for_? I suspect there’s an easier way to do this…..
> > >
> > > Best,
> > > Erick
> > >
> > > > On Sep 3, 2019, at 6:17 PM, Arnold Bronley
> > > wrote:
> > > >
> > > > Hi,
> > > >
> > > > Is there a way to create SolrClient from inside processAdd function
> for
> > > > custom update processor for the same Solr on which it is executing?
> > >
> > >
> >
>
Hi,
Is there a way to get host/port information for the current Solr from inside a
custom Solr URP plugin? One way is to use 'localhost:8983', but I feel a
little uncomfortable with such hardcoding of the port.
Hi,
In SolrTestCaseJ4 there is the initCore function. After using this function, how
can I find out which host and port Solr is running on? The same question
applies to EmbeddedSolrServer:
how can I find out which port and host it is running on?
> > -Original message-
> > > > From:Arnold Bronley
> > > > Sent: Wednesday 4th September 2019 20:10
> > > > To: solr-user@lucene.apache.org
> > > > Subject: Re: SolrClient from inside processAdd function
> > > >
> > > >
}
});
}
On Fri, Sep 6, 2019 at 5:40 PM Arnold Bronley
wrote:
> Hi Markus,
>
> "Depending on cloudMode we create new SolrClient instances based on these
> classes. "
>
> But I still do not see SolrClient creation anywhere in your code snippet.
> Am I missin
Hi,
In Solr 6.3, I was able to use the OR and AND operators in a case-insensitive
manner.
E.g.
If I have two documents like the following in my corpus:
document 1:
{
id:1
author:rick
}
document 2:
{
id:2
author:morty
}
Then if I pass 'rick OR morty' to q param then I would get both documents
back. I woul
Hi,
Why is it that I need to reload collection after I created it? CDCR runs
into issues if I do not do this.
Hi,
I am not changing the config to enable CDCR. I am just using the CDCR
API to start it. Does that count as changing configuration?
On Thu, Sep 19, 2019 at 12:20 PM Shawn Heisey wrote:
> On 9/19/2019 9:36 AM, Arnold Bronley wrote:
> > Why is it that I need to reload collectio
Hi,
I am using Solr version 8.2.0 and I see that there is one critical JIRA
issue open (link below) for CDCR. The issue does not mention anything about
8.2.0 but it says that it is fixed in 8.3.0. Does this mean that CDCR is
not functional in Solr 8.2.0 and should I wait for 8.3.0 to be released?
h
Here is the detailed question on Stack Overflow. Please help.
https://stackoverflow.com/questions/14280506/how-to-update-range-of-dynamic-fields-in-solr-4
Hi,
I have a Solr collection 'products' for different products that users
interact with. With MoreLikeThis, I can retrieve for a given product
another related product. Now, I want to create a Solr collection for users
such that I can use MoreLikeThis approach between users and products. Not
just t
I am also facing similar issue. I have also switched to other browsers to
solve this issue.
On Tue, Dec 10, 2019 at 2:22 PM Webster Homer <
webster.ho...@milliporesigma.com> wrote:
> It seems like the Solr Admin console has become slow when you use it on
> the chrome browser. If I go to the query
Hi,
I have one custom Solr plugin that uses MoreLikeThis class. AFAIK,
MoreLikeThis handler does not support distributed mode and the issue is
still open for that - https://issues.apache.org/jira/browse/SOLR-5480.
However, I saw that there is some possibility to use CloudMLTQParser to
work around
Hi,
I have one custom Solr plugin that uses following logic to access some
other core present on the same Solr instance.
request.getCore().getCoreContainer().getCore(otherCoreName) where request
is an object of type SolrQueryRequest
This works fine in master-slave mode.
Now if I try to use the sa
rsing the query and, in this
> case
> getting the relevant from the indicated document to add to the
> query while doing some sanity checking. The bits that
> distribute the query to shards and collate the results are elsewhere.
>
> Best,
> Erick
>
> > On Jan 3, 20
ogic on SolrCloud is making it harder than it needs to be.
>
> Best,
> Erick
>
> > On Jan 6, 2020, at 2:39 PM, Arnold Bronley
> wrote:
> >
> > Hi,
> >
> > I have one custom Solr plugin that uses following logic to access some
> > other core prese
Hi,
I have a running dockerized instance of Solr which runs fine with the
following setting for command option for solr service in docker-compose.yml
file
command: >
bash -c "solr start -cloud -s /var/lib/solr -t /var/data/solr;
set -x; export; wait-for-solr.sh;"
Recently, I wanted
LR_OPTS=-agentlib:jdwp=transport=dt_socket,server=y,suspend=n,address=5005”
>
> The thing to look for in the logs is: solr1| Listening for transport
> dt_socket at address: 5005
>
> BTW, when looking at these sort of subtleties, it’s always useful to exec
> into t
n
> foreground:
>
> command: >
> bash -c "./myscript.sh& exec solr start -c -f -a
> -agentlib:jdwp=transport=dt_socket,server=y,suspend=n,address=5005; "
>
> See that there's an 'exec' keyword between the script and the solr
> launching command.
>
Hi,
I am trying to upgrade my system from Solr master-slave architecture to
SolrCloud architecture.
Meanwhile, I stumbled upon this very negative post about SolrCloud.
https://lucene.472066.n3.nabble.com/A-Last-Message-to-the-Solr-Users-td4452980.html
Given that it is from one of the initial au
Hi,
I have a Lucene query as follows (toString representation of Lucene's
Query object):
+(topics:29)^2 (topics:38)^3 +(-id:41135)
It works fine when I am using it as a lucene query in
SolrIndexSearcher.getDocList function.
However, now I want to use it as a Solr query and query against a
coll
Hi,
BooleanQueryBuilder is not adding parentheses around the query. It
only adds + sign at the start of the query but not the parentheses around
the query. Why is that? How should I add it?
booleanQueryBuilder.add(query, BooleanClause.Occur.MUST)
lder.build(), BooleanClause.Occur.SHOULD);
> builder.add(query3, BooleanClause.Occur.SHOULD);
>
> BooleanQuery booleanQuery = builder.build();
>
> This booleanQuery.toString() will be:
>
> (+text:toys +text:children) age:12
>
> That is the parsing of "(text:chil
; - that was the key
point I was missing.
On Wed, Jan 22, 2020 at 2:40 PM Arnold Bronley
wrote:
> Thanks, Edward. This was the exact answer I was looking for :)
>
> On Wed, Jan 22, 2020 at 1:08 PM Edward Ribeiro
> wrote:
>
>> If you are using Lucene's BooleanQuery
Hi,
I have the following code that does some parsing with a QParser plugin. I noticed
that it does not retain the double quotes in the filterQueryString. How
should I make it retain the double quotes?
QParser.getParser(filterQueryString, null, req).getQuery();
filterQueryString passed = id:"x:1234"
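This does not answer the question of retaining the quotes. It is a sketch of an alternative, assuming the goal is only for the colon inside the value to be treated literally: backslash-escape the query-syntax characters in the value before the filter string is built. The function name is hypothetical, and the character list is the usual set of Lucene query syntax special characters.

```python
import re

# Characters (and the two-character operators && and ||) that have special
# meaning in the Lucene query syntax.
_SPECIAL = re.compile(r'([+\-!(){}\[\]^"~*?:\\/]|&&|\|\|)')


def escape_query_chars(value):
    """Prefix every special character in value with a backslash."""
    return _SPECIAL.sub(r"\\\1", value)


# The value from the filter above, without the surrounding quotes.
filter_query = "id:" + escape_query_chars("x:1234")
```

The resulting filter string is id:x\:1234, in which the second colon is part of the term instead of a field separator.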
78 matches