SolrDeletionPolicy & Core Reload

2021-01-02 Thread John Davis
Hi, Does Core Reload pick up changes to SolrDeletionPolicy in solrconfig.xml or does the solr server needs to be restarted? And what would be the best way to check the current values of SolrDeletionPolicy (eg

Re: [EXTERNAL] Re: Getting rid of Master/Slave nomenclature in Solr

2020-06-18 Thread John Gallagher
While on the topic of renaming roles, I'd like to propose finding a better term than "overseer" which has historical slavery connotations as well. Director, perhaps? John Gallagher On Thu, Jun 18, 2020 at 8:48 AM Jason Gerlowski wrote: > +1 to rename master/slave, and +

Require java 8 upgrade

2020-05-21 Thread Akhila John
Hi Team, We use solr 5.3.1 for sitecore 8.2. We require to upgrade Java version to 'Java 8 Update 251' and remove / Upgrade Wireshark to 3.2.3 in our application servers. Could you please advise if this would have any impact on the solr. Does solr 5.3.1 support Java 8. Thanks and regards,

Re: Solrcloud Garbage Collection Suspension linked across nodes?

2020-05-13 Thread John Blythe
can we get this person blocked? -- John Blythe On Wed, May 13, 2020 at 1:05 PM ART GALLERY wrote: > check out the videos on this website TROO.TUBE don't be such a > sheep/zombie/loser/NPC. Much love! > https://troo.tube/videos/watch/aaa64864-52ee-4201-922f-41300032f219 > > On M

Blocking certain queries

2020-02-03 Thread John Davis
Hello, Is there a way to block certain queries in solr? For eg a delete for *:* or if there is a known query that causes problems, can these be blocked at the solr server layer.

Re: Haystack CFP is open, come and tell us how you tune relevance for Lucene/Solr

2020-01-28 Thread John Blythe
t; including Think Like a Relevance Engineer, Hello LTR and NLP) - > hopefully this will give you enough information for budgeting. > > Speakers get a small discount too! > > Cheers > > Charlie > > On 27/01/2020 22:21, John Blythe wrote: > > Hey Doug. Do you

Re: Haystack CFP is open, come and tell us how you tune relevance for Lucene/Solr

2020-01-27 Thread John Blythe
gt; > > > > > -- > *Doug Turnbull **| CTO* | OpenSource Connections > <http://opensourceconnections.com>, LLC | 240.476.9983 > Author: Relevant Search <http://manning.com/turnbull> > This e-mail and all contents, including attachments, is considered to be > Company Confidential unless explicitly stated otherwise, regardless > of whether attachments are marked as such. > -- John Blythe

Solr Payloads

2019-09-20 Thread John Davis
131|*1568263581*", "payload(payload_field, 573131)": *1568263550* ... "payload_field": "573131|1568263582", "payload(payload_field, 573131)": 1568263550 Field definition: John

Re: Solr with encrypted HDFS

2019-09-12 Thread John Thorhauer
don't seem to play well with HDFS and triple replication of the blocks > in HDFS. But encryption did not add anything issues for us. > > regards, > Hendrik > > On 11.09.19 22:53, John Thorhauer wrote: > > Hi, > > > > I am interested in encrypting/prot

Solr with encrypted HDFS

2019-09-11 Thread John Thorhauer
has anyone had experience running this? Either good or bad? Thanks, John

Re: Enabling/disabling docValues

2019-06-11 Thread John Davis
& resources, and if we empower power users to understand the system better it will help making more informed tradeoffs. On Tue, Jun 11, 2019 at 6:52 AM Gus Heck wrote: > On Mon, Jun 10, 2019 at 10:53 PM John Davis > wrote: > > > You have made many assumptions which might not

Re: Enabling/disabling docValues

2019-06-10 Thread John Davis
ly happens…. > > Best, > Erick > > P.S. I _think_ Lucene tries to use the definition from the first segment, > but since whether the lists of segments to be merged don’t look at the > field definitions at all. Whether the first segment in the list has > SortableText or not

Re: Enabling/disabling docValues

2019-06-09 Thread John Davis
tructing low-level analysis chains. > > So I’d _strongly_ recommend you re-index your corpus to a new collection > with the current definition, then perhaps use CREATEALIAS to seamlessly > switch. > > Best, > Erick > > > On Jun 9, 2019, at 12:50 PM, John Davis &

Enabling/disabling docValues

2019-06-09 Thread John Davis
Hi there, We recently changed a field from TextField + no docValues to SortableTextField which has docValues enabled by default. Once I did this I do not see any facet values for the field. I know that once all the docs are re-indexed facets should work again, however can someone clarify the

Re: Solr Heap Usage

2019-06-07 Thread John Davis
figure out questions like number of shards/replicas, heap size, memory etc. > Hard data, good process and regular testing will trump guesswork every time > > Greg > > On Tue, Jun 4, 2019 at 9:22 AM John Davis > wrote: > > > You might want to test with softcommit of hours

Re: Solr Heap Usage

2019-06-04 Thread John Davis
overhead associated with it. On Tue, Jun 4, 2019 at 8:03 AM Erick Erickson wrote: > I need to update that, didn’t understand the bits about retaining internal > memory structures at the time. > > > On Jun 4, 2019, at 2:10 AM, John Davis > wrote: > > > > Erick -

Re: Solr Heap Usage

2019-06-04 Thread John Davis
they’d be something like this: > Do a hard commit with openSearcher=false every 60 seconds. > Do a soft commit every 5 minutes. > > I’d actually be surprised if you were able to measure differences between > those settings and just hard commit with openSearcher=true every 60 > s

Adding Multiple JSON Documents

2019-06-02 Thread John Davis
Hi there, I was looking at the solr documentation for indexing multiple documents via json and noticed inconsistency in the docs. Should the POST url be /update/*json/docs *instead of just /update. It does look like former does work, unless both will work just fine?

Re: Solr Heap Usage

2019-06-02 Thread John Davis
see: https://issues.apache.org/jira/browse/SOLR-12962. > > In short, there’s not enough information until you dive in and test > bunches of stuff to tell. > > Best, > Erick > > > > On Jun 2, 2019, at 2:22 AM, John Davis > wrote: > > > > This makes sense, any

Re: Solr Heap Usage

2019-06-02 Thread John Davis
and does streaming merge it shouldn't matter? On Sat, Jun 1, 2019 at 9:24 AM Walter Underwood wrote: > > On May 31, 2019, at 11:27 PM, John Davis > wrote: > > > > 2. Merging segments - does solr load the entire segment in memory or > chunks > > of it? if la

Solr Heap Usage

2019-06-01 Thread John Davis
or chunks of it? if later how large are these chunks 3. Queries, facets, caches - anything else major? John

Re: Facet count incorrect

2019-05-23 Thread John Davis
leValued or vice versa (particularly with docValues) > etc. are all “fraught”. > > My usual reply is “if you’re going to reindex everything anyway, why not > just do it to a new collection and alias when you’re done?” It’s much safer. > > Best, > Erick > > > On May 22, 2019,

Facet count incorrect

2019-05-22 Thread John Davis
Hi there - Our facet counts are incorrect for a particular field and I suspect it is because we changed the type of the field from StrField to TextField. Two questions: 1. If we do re-index all the documents in the index, would these counts get fixed? 2. Is there a "safe" way of changing field

Re: Optimizing fq query performance

2019-04-18 Thread John Davis
FYI https://issues.apache.org/jira/browse/SOLR-11437 https://issues.apache.org/jira/browse/SOLR-12488 On Thu, Apr 18, 2019 at 7:24 AM Shawn Heisey wrote: > On 4/17/2019 11:49 PM, John Davis wrote: > > I did a few tests with our instance solr-7.4.0 and field:* vs field:[* TO > >

Re: Optimizing fq query performance

2019-04-17 Thread John Davis
wrote: > On 4/17/2019 1:21 PM, John Davis wrote: > > If what you describe is the case for range query [* TO *], why would > lucene > > not optimize field:* similar way? > > I don't know. Low level lucene operation is a mystery to me. > > I have seen first-hand that

Re: Optimizing fq query performance

2019-04-17 Thread John Davis
If what you describe is the case for range query [* TO *], why would lucene not optimize field:* similar way? On Wed, Apr 17, 2019 at 10:36 AM Shawn Heisey wrote: > On 4/17/2019 10:51 AM, John Davis wrote: > > Can you clarify why field:[* TO *] is lot more efficient than field:*

Re: Optimizing fq query performance

2019-04-17 Thread John Davis
Can you clarify why field:[* TO *] is lot more efficient than field:* On Sun, Apr 14, 2019 at 12:14 PM Shawn Heisey wrote: > On 4/13/2019 12:58 PM, John Davis wrote: > > We noticed a sizable performance degradation when we add certain fq > filters > > to the query even tho

Re: Optimizing fq query performance

2019-04-14 Thread John Davis
for indexed fields because all terms for the > field need to be iterated (e.g. does term1 match doc1, does term2 match > doc1, etc) > One can optimize this by indexing a term in a different field to turn it > into a single term query (i.e. exists:field1) > > -Yonik > &g

Optimizing fq query performance

2019-04-13 Thread John Davis
is of type tlongs while field2 is a string. Any tips for optimizing this? John

Re: What causes new searcher to be created?

2019-03-10 Thread John Davis
s that until a new searcher is created all the > > newly indexed docs will not be visible > > This should be the case. So regardless of what the admin says, _can_ > you see newly indexed documents? > > Best, > Erick > > > On Mar 9, 2019, at 7:24 PM, John Davis &

What causes new searcher to be created?

2019-03-09 Thread John Davis
console I do see a new one being created every few minutes but I could not find the parameter that controls it. John

Re: child docs

2019-03-07 Thread John Blythe
thanks for the quick response! that was my inkling from what i've read thus far, but was curious if any benefits could make it potentially worthwhile. interested in other "gotchas" the nesting may cause us to incur. thanks again! -- John Blythe On Thu, Mar 7, 2019 at 10:23 AM Eric

child docs

2019-03-07 Thread John Blythe
] child1: {tranx4} could be up to several hundred child docs per entity, though usually will be double digits only (per entity), sometimes as low as < 10. hope this makes sense. thanks for any insight! best, -- John Blythe

Improve indexing speed?

2019-01-01 Thread John Milton
? Note : All the fields contains maximum 20 characters only. Field type is text general with case insensitive. Thanks, John Milton

PC hang while running Solr cloud instance?

2018-12-30 Thread John Milton
, but task manager shows 95% usage of CPU and memory. How to solve this problem? Thanks, John Milton

Config change needs reindex?

2018-12-21 Thread John Milton
Hi Solr Team, We are using Solr cloud and storing all my application log in Solr only. For some features or new field add or copy field generation, if I change my schema and upload it to zookeeper, am I need to reindex all the data? Or need to restart is enough? Thanks, John Milton

Re: unsubscribe

2018-12-07 Thread John Santosuosso
Unsubscribe  Sent from Yahoo Mail for iPhone On Friday, December 7, 2018, 9:57 AM, samuel kim wrote: Sent from Outlook From: samuel kim Sent: Monday, July 31, 2017 3:48 PM To: solr-user@lucene.apache.org Subject:

Re: Solr on Java 11?

2018-11-30 Thread John Gallagher
We're interested in this as well. It is tracked here - https://issues.apache.org/jira/browse/SOLR-12809 And you can see the test status for different JDKs here: https://jenkins.thetaphi.de/job/Lucene-Solr-7.x-Linux/ 8 and 9 pass completely; 10, 11, 12ea don't On Fri, Nov 30, 2018 at 10:23 AM

Re: Time-Routed Alias Not Distributing Wrongly Placed Docs

2018-11-30 Thread John Nashorn
Hi Gus, thanks for writing a detailed answer. I've written some bits between quotings from your post. On 2018/11/30 05:15:10, Gus Heck wrote: > Hi John, > > TRA's really do require that you index via the alias. Internally the code > is wrapping the Distributed Upda

Re: is SearchComponent the correct way?

2018-11-29 Thread John Thorhauer
Tue, Nov 13, 2018 at 6:36 AM John Thorhauer > wrote: > > > Mikhail, > > > > Where do I implement the buffering? I can not do it in then collect() > > method. > > Please clarify why exactly? Notice my statement about one segment only. > > > >

How to implement ssl for Solr cloud?

2018-11-27 Thread John Milton
Hi Solr Team, In my Solr cloud cluster, I am having 3 Zookeeper external ensemble and 2 Solr cloud instance. Is it needs to implement ssl for all the available Solr instance? Based on the ssl implementation any additional configuration needed in zookeeper? Thanks, John Milton

Time-Routed Alias Not Distributing Wrongly Placed Docs

2018-11-27 Thread John Nashorn
Hello Everyone, I'm using "hive-solr" from Lucidworks to index my data into Solr (v:7.5, cloud mode). As written in the Solr Manual, TRA expects documents to be indexed using its alias name, and not directly into the collections under it. Unfortunately, hive-solr doesn't allow using TRA names

How to use multiple data drives?

2018-11-15 Thread John Milton
as a data directory. Is it possible to do with Solr cloud? Thanks, John Milton

Solr cloud change collection index directory

2018-11-13 Thread John Milton
drives, for example if my Solr instance is installed on C drive means, I want to store all the collection index on D drive. How to achieve this in Solr 7.5? Kindly give the suggestion to solve this... Thanks, John Milton

Re: is SearchComponent the correct way?

2018-11-12 Thread John Thorhauer
Mikhail, Where do I implement the buffering? I can not do it in then collect() method. I can not see how I can get access to what I need in the finish() method. Thanks, John On Tue, Nov 6, 2018 at 12:44 PM Mikhail Khludnev wrote: > Not really. It expect to work segment by segment. So it

Re: is SearchComponent the correct way?

2018-11-06 Thread John Thorhauer
to be able to iterate through the documents, gathering all the critical ID's and then send a single query to redis, getting back my security related data, and then iterate through the documents, pulling out the ones that the user should not see. Is this possible? Thanks again for your help! John On Tue

is SearchComponent the correct way?

2018-11-06 Thread John Thorhauer
this by creating a SearchComponent. I am able to iterate thru the results and identify the items I want to remove from the results but I am not sure how to proceed in removing them. Is SearchComponent the best way to do this? If so, any thoughts on how to proceed? Thanks, John Thorhauer

Re: More Like This Query problems

2018-10-18 Thread John Bickerstaff
Found it. My SOLR does NOT store fields and after some careful checking, it turns out we do NOT do term vectors either... So, according to the docs, MLT will not work. Thanks for the response David! On Thu, Oct 18, 2018 at 1:44 PM John Bickerstaff wrote: > Thanks. There are many d

Re: More Like This Query problems

2018-10-18 Thread John Bickerstaff
ur doc id” > Also be certain there are other documents that will meet your criteria for > a test case. Remember it’s unique words in your core/collection > > On Oct 18, 2018, at 2:43 PM, John Bickerstaff <mailto:j...@johnbickerstaff.com>> wrote: > > All, > >

More Like This Query problems

2018-10-18 Thread John Bickerstaff
All, I am having trouble with a “more like this” query in Solr. Here’s what I think should be happening: 1. Query contains Document ID (q=id:"942316176:9009:66

Re: Faceting with a multi valued field

2018-09-25 Thread John Blythe
you can update your filter query to be a facet query, this will apply the query to the resulting facet set instead of the Communities field itself. -- John Blythe On Tue, Sep 25, 2018 at 4:15 PM Hanjan, Harinder wrote: > Hello! > > I am doing faceting on a field which has multip

Term insight with facets

2018-09-22 Thread John Blythe
on a dynamically set limit? Thanks for any thoughts! -- John Blythe

admin auth

2018-09-21 Thread John Blythe
*does* require it. any low hanging fruit ideas we could try out to help resolve this? thanks! -- John Blythe

Re: 6.x to 7.x differences

2018-09-12 Thread John Blythe
thanks, shawn. yep, i saw the multi term synonym discussion when googling around a bit after your first reply. pretty jazzed about finally getting to tinker w that instead of creating our regex ducktape solution for_multi_term_synonyms! thanks again- -- John Blythe On Wed, Sep 12, 2018 at 2:15

enquoted searches

2018-09-12 Thread John Blythe
r results as expected, too. from the bit of reading i did on the spanquery stuff i was thinking that maybe it was related to positioning issues, specifically with 'sterile'. in the Analysis tab, however, it's in position 6 in both indexing and querying output. thanks for any thoughts or assists here! best, -- John Blythe

Re: large query producing graph error ... maybe?

2018-09-12 Thread John Blythe
be the value add on fq + cache=false variation? thanks for the help! -- John Blythe On Wed, Sep 12, 2018 at 1:02 PM Erick Erickson wrote: > Looks like your SKU field is points-based? Strings would probably be > better, if you switched to points-based it's new code. > > And maxBo

large query producing graph error ... maybe?

2018-09-12 Thread John Blythe
of times (maybe 600? idk). no mention of maxBooleanClause issues specifically in the output, shows as a stack overflow error. is this something we can solve in our solr/cloud/zk configuration or is it somewhere else to be solved? thanks! -- John Blythe

Re: 6.x to 7.x differences

2018-09-12 Thread John Blythe
batty by the analysis vs. query results though, so looking forward to playing w that some more. for our immediate purposes, however, i think this solves it! -- John Blythe On Wed, Sep 12, 2018 at 1:35 AM Preeti Bhat wrote: > Hi John, > > Please check the solrQueryParser option, it wa

Re: parent/child rows in solr

2018-09-11 Thread John Smith
On Tue, Sep 11, 2018 at 11:05 PM Walter Underwood wrote: > Have you tried modeling it with multivalued fields? > > That's an interesting idea, but I don't think that would work. We would lose the concept of "rows". So let's say child1 has col "a" and col "b", both are turned into multi-value

Re: parent/child rows in solr

2018-09-11 Thread John Smith
On Tue, Sep 11, 2018 at 11:00 PM Shawn Heisey wrote: > On 9/11/2018 8:35 PM, John Smith wrote: > > The problem is that the math isn't a simple case of adding up all the row > > counts. These are "left outer join"s. In sql, it would be this query: > > I think we'll

Re: parent/child rows in solr

2018-09-11 Thread John Smith
On Tue, Sep 11, 2018 at 9:32 PM Shawn Heisey wrote: > On 9/11/2018 7:07 PM, John Smith wrote: > > header: 223,580 > > > > child1: 124,978 > > child2: 254,045 > > child3: 127,917 > > child4:1,009,030 > > child5:

6.x to 7.x differences

2018-09-11 Thread John Blythe
. i'm left scratching my head at this point. i'm guessing it's from the lucene parser? hoping to get some clarity from you guys! thanks! -- John Blythe

Re: parent/child rows in solr

2018-09-11 Thread John Smith
> > On 9/7/2018 7:44 PM, John Smith wrote: > > Thanks Shawn, for your comments. The reason why I don't want to go flat > > file structure, is due to all the wasted/duplicated data. If a department > > has 100 employees, then it's very wasteful in terms of disk space to >

Re: 504 timeout

2018-09-11 Thread John Blythe
ah, great thought. didn't even think of that. we already have a couple ngram-based fields. will send over to the stakeholder who was attempting this. thanks! -- John Blythe On Sun, Sep 9, 2018 at 11:31 PM Erick Erickson wrote: > First of all, wildcards are evil. Be sure that the rea

504 timeout

2018-09-09 Thread John Blythe
w our server configuration or is there some solr/cloud config'ing that we could work on that would allow better response to these sorts of queries (though it'd be at a cost, i'd imagine!). thanks for any insight! best, -- John Blythe

Re: parent/child rows in solr

2018-09-07 Thread John Smith
9:32 PM Shawn Heisey wrote: > On 9/7/2018 3:06 PM, John Smith wrote: > > Hi, I have a document structure like this (this is a made up schema, my > > data has nothing to do with departments and employees, but the structure > > holds true to my real data): > > > > d

parent/child rows in solr

2018-09-07 Thread John Smith
Hi, I have a document structure like this (this is a made up schema, my data has nothing to do with departments and employees, but the structure holds true to my real data): department 1 employee 11 employee 12 employee 13 room 11 room 12 room 13 department 2 employee

Re: Local development and SolrCloud

2018-08-23 Thread John Blythe
e a slave instance and > > fire it up. Also, load benchmarking is easier when indexing is on a > > separate instance. > > > > In prod, we have 45 Solr hosts in four clusters. > > > > wunder > > Walter Underwood > > wun...@wunderwood.org > >

Local development and SolrCloud

2018-08-22 Thread John Blythe
a “standard” (read: non-cloud) local solr environment locally for our development workflow and using cloud only for our remote environments. Curious to know how wise or stupid that play would be. Thanks for any info! -- John Blythe

Ignored fields and copyfield

2018-08-06 Thread John Davis
Hi there, If a field is set as "ignored" (indexed=false, stored=false) can it be used for another field as part of copyfield directive which might index/store it. John

Index size by document fields

2018-08-04 Thread John Davis
Hi, Is there a way to monitor the size of the index broken by individual fields across documents? I understand there are different parts - the inverted index and the stored fields - and an estimate would be good start. Thanks John

Re: Preferred PHP Client Library

2018-07-17 Thread John Blythe
migrating to cloud, but have also found that it's not entirely necessary (i can't yet speak to the cost/benefit of using the extension when we deploy to cloud) best of luck- -- John Blythe On Tue, Jul 17, 2018 at 11:57 AM Christian Spitzlay < christian.spitz...@biologis.com> wrote: > Hi,

Re: Preferred PHP Client Library

2018-07-16 Thread John Blythe
what are the benefits of > switching to it? > > Thanks! > TZ > -- John Blythe

Re: Sort by payload value

2018-05-25 Thread John Davis
; guts of the payload > calcs. > > FYI, ties are broken by the internal Lucene doc ID. If the theory that > you are getting > no matches, then your sort order is determined by this value which you > don't really > have much access to. > > Best, > Erick > > On Thu, May

Sort by payload value

2018-05-24 Thread John Davis
Hello, We are trying to use payload values as described in [1] and are running into issues when issuing *sort by* payload value. Would appreciate any pointers to what we might be doing wrong. We are running solr 6.6.0. * Here's the payload value definition:

Re: replication

2018-04-13 Thread John Blythe
great. thanks, erick! -- John Blythe On Wed, Apr 11, 2018 at 12:16 PM, Erick Erickson <erickerick...@gmail.com> wrote: > bq: are you simply flagging the fact that we wouldn't direct the queries > to A > v. B v. C since SolrCloud will make the decisions itself as to which part

Re: replication

2018-04-11 Thread John Blythe
the background will point to the ELB the slaves are sitting behind. we'll also begin setting up a simple solrcloud instance to toy with per your suggestion above. inb4 tons more questions on my part :) thanks! -- John Blythe On Tue, Apr 10, 2018 at 11:14 AM, Erick Erickson <erickerick...@gmail

Re: replication

2018-04-09 Thread John Blythe
for writing and user access in NRT events, but slaves for the heavier backend processing. Thoughts? - anyone do consulting on this that would be interested in chatting? Thanks again! On Mon, Apr 9, 2018 at 18:18 Shawn Heisey <apa...@elyograg.org> wrote: > On 4/9/2018 12:15 PM, John Bly

replication

2018-04-09 Thread John Blythe
couple seconds for 4 replicas. that said, i'm going *entirely* off of assumption at this point and wanted to check in w you all to see any nuances, gotchas, hidden landmines, etc. that we should be considering before rolling things out. thanks for any info! -- John Blythe

Re: statistics in hitlist

2018-03-16 Thread John Smith
gt; > The echo parameter tells the let expression which variables to output. > > > > > > > > Joel Bernstein > > http://joelsolr.blogspot.com/ > > > > On Thu, Mar 15, 2018 at 3:13 PM, Erick Erickson <erickerick...@gmail.com > > > > wrote: &g

Re: statistics in hitlist

2018-03-15 Thread John Smith
"true", > > a=random(tx_prod_production, q="*:*", fq="isParent:true", rows="15", > > fl="oil_first_90_days_production,oil_last_30_days_production"), > > b=col(a, oil_first_90_days_production)) > > > > > &g

Re: Solr Developer needed urgently

2018-03-15 Thread John Bickerstaff
Hi - thanks for thinking of me! I'm currently lead on the Solr team for Ancestry - and having a good time. I might be interested, but moving to New York isn't going to work for me. If there is a good chance of working from home, then I might be interested... Let me know... On Wed, Mar 14, 2018

Re: HDInsight with Solr 4.9.0 Create Collection

2018-03-09 Thread john spooner
would be nice to not get this email. On 3/9/2018 1:23 PM, Abhi Basu wrote: This has been resolved! Turned out to be schema and config file version diff between 4.10 and 4.9. Thanks, Abhi On Fri, Mar 9, 2018 at 11:41 AM, Abhi Basu <9000r...@gmail.com> wrote: That was due to a folder not

Re: CDCR performance issues

2018-03-09 Thread john spooner
please unsubscribe i tried to manaually unsubscribe On 3/9/2018 12:59 PM, Tom Peters wrote: Thanks. This was helpful. I did some tcpdumps and I'm noticing that the requests to the target data center are not batched in any way. Each update comes in as an independent update. Some follow-up

Re: statistics in hitlist

2018-03-05 Thread John Smith
ut I consider this a bug. It should accept multiple. I'll create ticket > > for getting this fixed. > > > > > > > > Joel Bernstein > > http://joelsolr.blogspot.com/ > > > > On Thu, Mar 1, 2018 at 4:55 PM, John Smith <localde...@gmai

Re: statistics in hitlist

2018-03-01 Thread John Smith
The > values in fieldB are stored in variable "c". > > Then the regress function performs a simple linear regression on arrays > stored in variables "b" and "c". > > The output of the regress function is a map containing the regression > result. Thi

Re: statistics in hitlist

2018-02-23 Thread John Smith
.com/ > > On Fri, Feb 23, 2018 at 12:17 PM, John Smith <localde...@gmail.com> wrote: > > > I'm using solr, and enabling stats as per this page: > > https://lucene.apache.org/solr/guide/6_6/the-stats-component.html > > > > I want to get more stat values though. Spe

statistics in hitlist

2018-02-23 Thread John Smith
I'm using solr, and enabling stats as per this page: https://lucene.apache.org/solr/guide/6_6/the-stats-component.html I want to get more stat values though. Specifically I'm looking for r-squared (coefficient of determination). This value is not present in solr, however some of the pieces used

Re: SolrCloud: How best to do backups?

2018-02-08 Thread John Bickerstaff
that deep before... https://n2ws.com/how-to-guides/automate-amazon-ec2-instance-backup.html On Thu, Feb 8, 2018 at 2:00 PM, John Bickerstaff <j...@johnbickerstaff.com> wrote: > Hmmm... > > Can you (fairly quickly) reproduce this AWS environment (including the > indexes)? Or

Re: SolrCloud: How best to do backups?

2018-02-08 Thread John Bickerstaff
Hmmm... Can you (fairly quickly) reproduce this AWS environment (including the indexes)? Or does it require that several week process to provision new Solr boxes...? What happens now if one of those ec2 instances gets into trouble? Do you have autoscaling groups set up? On Thu, Feb 8, 2018 at

Solr needs a restart to recover from "No space left on device"

2018-02-06 Thread John Davis
Hi there! We ran out of disk on our solr instance. However even after cleaning up the disk solr server did not realize that there is free disk available. It only got fixed after a restart. Is this a known issue? Or are there workarounds that don't require a restart? Thanks John

Matching within list fields

2018-01-29 Thread John Davis
to find a comment that matches all the terms. Short of creating separate documents for each element in the list, is there any other efficient way of accomplishing this? Thanks John

Re: Bitnami, or other Solr on AWS recommendations?

2018-01-26 Thread John Bickerstaff
I guess I'd say test with the image - especially if you're deploying a larger number of Solr boxes. We do a lot of them where I work and (unfortunately, for reasons I won't bother you with) can't use an image. The time it takes to install solr is noticeable when we deploy Solr on our 100 plus EC2

Re: Profanity

2018-01-08 Thread John Blythe
Gladly. Good luck! On Mon, Jan 8, 2018 at 8:27 PM Sadiki Latty <sla...@uottawa.ca> wrote: > Thanks for the feedback John, > > This is a genius idea if I don’t want to create my own processor. I could > simply check that field for data for my reports. Either the field will have

Re: Profanity

2018-01-08 Thread John Blythe
you could use the keepwords functionality. have a field that only keeps profanity and then you can query against that field having its default value vs. profane text -- John Blythe On Mon, Jan 8, 2018 at 3:12 PM, Sadiki Latty <sla...@uottawa.ca> wrote: > Hey > > I would like to

Re: SolrCloud

2017-12-15 Thread John Davis
; new_collection, basically all your routing is the same. You can create > aliases pointing to multiple collections or specify multiple > collections on the query, don't know if that fits your use case or not > though. > > > Best, > Erick > > On Fri, Dec 15, 2017 at 9:03 AM, Joh

SolrCloud

2017-12-15 Thread John Davis
loud? Thanks, John

PayloadScoreQuery always returns score of zero

2017-12-13 Thread John Anonymous
The PayloadScoreQuery always returns a score of zero, regardless of payloads. The PayloadCheckQParser works fine, so I know that I am successfully indexing the payloads. Details below *payload field that I am searching on:* *definition of payload field type:*

Solr index size statistics

2017-12-02 Thread John Davis
Hello, Is there a way to get index size statistics for a given solr instance? For eg broken by each field stored or indexed. The only things I know of is running du on the index data files and getting counts per field indexed/stored, however each field can be quite different wrt size. Thanks John

Re: does the payload_check query parser have support for simple query parser operators?

2017-11-30 Thread John Anonymous
Erik > > > On Nov 30, 2017, at 02:41, John Anonymous <orro...@gmail.com> wrote: > > > > I would like to use wildcards and fuzzy search with the payload_check > query > > parser. Are these supported? > > > > {!payload_check f=text payloads

  1   2   3   4   5   6   7   8   9   >