Re: Enabling/disabling docValues

2019-06-10 Thread John Davis
You have made many assumptions which might not always be realistic a) TextField is always tokenized b) Users care about precise counts and c) Users have the luxury or ability to do a full re-index anytime. These are real issues and there is no black/white solution. I will ask Lucene folks on the

How to increase maximum size of files allowed in configuration for MiniSolrCloudCluster

2019-06-10 Thread Pratik Patel
Hi, I am trying to upload a configuration to "MiniSolrCloudCluster" in my unit test. This configuration has some binary files for NLP related functionality. Some of these binary files are bigger than 5 MB. If I try to upload configuration with these files then it doesn't work. I can set up the

Re: Sort date stored in text field?

2019-06-10 Thread Shawn Heisey
On 6/10/2019 3:26 PM, Dave Beckstrom wrote: I have a field called metatag.date that is field-type: org.apache.solr.schema.TextFieldThe field is being populated by NUTCH, which grabs the date from the html: I'm trying to sort by date (metatag.date desc) passed on the URL and it's not

Sort date stored in text field?

2019-06-10 Thread Dave Beckstrom
Hi Everyone, Running SOLR 7.3.1 I have a field called metatag.date that is field-type: org.apache.solr.schema.TextFieldThe field is being populated by NUTCH, which grabs the date from the html: and stores it in the metatag.date field in SOLR. I'm trying to sort by date (metatag.date

Scoring to Synonym in query

2019-06-10 Thread Rathor, Piyush
Hi Team Do we have a mechanism to provide score in query for synonym search. My synonym field is called - "First_syn" Thanks This message (including any attachments) contains confidential information intended for a specific individual and purpose, and is protected by law. If you are not the

Re: Collections API timeout

2019-06-10 Thread Софія Строчик
Yes, I've checked them and all nodes are pointing to the sme IP and the same port (2181). Also all of them are visible in the SolrCloud Graph section so this would mean they are part of the same cloud. Largest file is solrconfig which is 58K so this shouldn't be a problem either. The potential

Re: Collections API timeout

2019-06-10 Thread Erick Erickson
Hmmm, I didn’t really look carefully at the end of your e-mail. There not being an /overseer znode _looks_ like one or more of your Solr nodes isn’t connecting to the proper ZooKeeper ensemble. bq. All of the instances are able to talk to zookeeper (they are > >>> displayed as active in the

[ANNOUNCE] Apache Solr Reference Guide for 8.0 released

2019-06-10 Thread Cassandra Targett
The Lucene PMC is pleased to announce that the Solr Reference Guide for 8.0 is available. This 1,452 page PDF is the definitive guide to Apache Solr, the search server built on Apache Lucene. The PDF can be downloaded from:

Re: Collections API timeout

2019-06-10 Thread Софія Строчик
Hi Erick, thanks for your reply! I didn't mention it but we have tried async requests. Then it does not time out of course, but instead appears to run indefinitely, with REQUESTSTATUS response like this: { "responseHeader":{ "status":0, "QTime":1}, "status":{ "state":"submitted",

Re: Collections API timeout

2019-06-10 Thread Erick Erickson
Certainly at times some things just take a long time. The 180 second timeout is fairly arbitrary. GC pauses, creating a zillion replicas etc. can cause timeouts like this to be exceeded. Rather than rely on lengthening some magic timeout value and hoping, I suggest you use the async option,

Collections API timeout

2019-06-10 Thread Софія Строчик
Hi everyone, recently when trying to delete a collection we have noticed that all calls to the Collections API time out after 180s. Something similar is described here however restarting the instance or the server does

Re: Enabling/disabling docValues

2019-06-10 Thread Erick Erickson
bq. Does lucene look at %docs in each state, or the first doc or something else? Frankly I don’t care since no matter what, the results of faceting mixed definitions is not useful. tl;dr; “When I use a word,’ Humpty Dumpty said in rather a scornful tone, ‘it means just what I choose it to

Re: Loading pre created index files into MiniSolrCloudCluster of test framework

2019-06-10 Thread Pratik Patel
So, I found a way to programmatically restore a collection from a backup. I though that I could create a backup of a collection, put it on the classpath, restore it during unit test set up and run the queries against newly created collection using restore. Theoretically, it sounded like it would

Re: highlighting not working as expected

2019-06-10 Thread David Smiley
Please try hl.method=unified and tell us if that helps. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Mon, Jun 3, 2019 at 4:06 AM Martin Frank Hansen (MHQ) wrote: > Hi, > > I am having some difficulties making highlighting work. For some reason

Re: Streaming expression function which can give parent document along with its child documents ?

2019-06-10 Thread Pratik Patel
If your children documents have a link to parent documents (like parent id or something) then you can use graph traversal to do this. On Mon, Jun 10, 2019 at 8:01 AM Jai Jamba wrote: > Can anyone help me in this ? > > > > -- > Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Query takes a long time Solr 6.1.0

2019-06-10 Thread Shawn Heisey
On 6/10/2019 3:24 AM, vishal patel wrote: We have 27 collections and each collection has many schema fields and in live too many search and index create requests come and most of the searching requests are sorting, faceting, grouping, and long query. So approx average 40GB heap are used so we

RE: [SPAM] Re: query parsed in different ways in two identical solr instances

2019-06-10 Thread Danilo Tomasoni
Yes I identical because the configuration (solrconfig.xml etc) is identical, just some fields changed. Sorry I was not so precise in the description of the environment. Nice to know it's already fixed. Danilo Tomasoni Fondazione The Microsoft Research - University of Trento Centre for

RE: No files to download for index generation

2019-06-10 Thread Oakley, Craig (NIH/NLM/NCBI) [C]
Does anyone yet have any insight on interpreting the severity of this message? -Original Message- From: Oakley, Craig (NIH/NLM/NCBI) [C] Sent: Tuesday, June 04, 2019 4:07 PM To: solr-user@lucene.apache.org Subject: No files to download for index generation We have occasionally been

Re: [SPAM] Re: query parsed in different ways in two identical solr instances

2019-06-10 Thread Alexandre Rafalovitch
Ok, great. We now moved from "identical setup breaks things in a bugfix version" to "strange behavior when field does not exist". The "identical" part was actually throwing us off the trail. And all this leads us to https://issues.apache.org/jira/browse/SOLR-5163 , fixed in 8.0. Hope it helps,

RE: [SPAM] Re: query parsed in different ways in two identical solr instances

2019-06-10 Thread Danilo Tomasoni
Hello I was able to reproduce this behaviour in an isolated environment, and performed some differential analysis between the two versions (that has different schemas, diff of schemas attached) With the schema of solr1, the query is parsed as +(+() +()) while with the schema of

Re: Basic Authentication in Standalone Configuration ?

2019-06-10 Thread Colvin Cowie
Hello, You need to use the *set *command in windows cmd files to set values. The example solr.in.cmd has commented out examples, e.g. *REM Settings for authenticationREM Please configure only one of SOLR_AUTHENTICATION_CLIENT_BUILDER or SOLR_AUTH_TYPE parametersREM set

ContentStreamUpdateRequest no longer closes stream

2019-06-10 Thread Colvin Cowie
Hello, I'm in the process of moving from Solr 6. to Solr 8. We have a client application that streams CSV files to Solr using ContentStreamUpdateRequest and then deletes the CSV file once the data is indexed. That worked fine in Solr 6, but when using 8, the file is locked and can't be deleted.

Re: Streaming expression function which can give parent document along with its child documents ?

2019-06-10 Thread Jai Jamba
Can anyone help me in this ? -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Basic Authentication in Standalone Configuration ?

2019-06-10 Thread Paul
Hi, I am not sure if Basic Authentication is possible in SOLR standalone configuration (version 7.6). I have a working SOLR installation using SSL. When following the docs I add options into solr.in.cmd, as in: SOLR_AUTH_TYPE="basic" SOLR_AUTHENTICATION_OPTS="-Dbasicauth=solr:SolrRocks" When I

Re: query parsed in different ways in two identical solr instances

2019-06-10 Thread Alexandre Rafalovitch
Were you able to simplify it to the simplest use case showing the issue? Or reproduce it on the stock Solr with stock example? Because otherwise, we would be just as stuck in a Jira as now. It is the same people helping For example, is the _query_ part significant? Also, did you try running

RE: query parsed in different ways in two identical solr instances

2019-06-10 Thread Danilo Tomasoni
Hello all, maybe I should consider this as a bug and open an issue? Danilo Tomasoni Fondazione The Microsoft Research - University of Trento Centre for Computational and Systems Biology (COSBI) Piazza Manifattura 1, 38068 Rovereto (TN), Italy tomas...@cosbi.eu http://www.cosbi.eu As for the

Re: Query takes a long time Solr 6.1.0

2019-06-10 Thread vishal patel
> An 80GB heap is ENORMOUS. And you have two of those per server. Do you > *know* that you need a heap that large? You only have 50 million > documents total, two instances that each have 80GB seems completely > unnecessary. I would think that one instance with a much smaller heap > would