Re: Urgent- General Question about document Indexing frequency in solr

2021-02-04 Thread Scott Stults
r the sole use of the > intended recipient and may contain confidential and privileged information. > Any unauthorized view, use, disclosure or distribution is prohibited. If > you are not the intended recipient, please contact the sender by reply > email and destroy all copies of the

Distributing and scaling Lucene Monitor?

2021-02-03 Thread Scott Stults
seem to deal with CRUD operations on the monitor queries themselves. So has anyone built this or given some thought about how to incorporate the monitor index into SolrCloud? Thank you, Scott -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780

Re: Solr Cloud on Docker?

2020-01-29 Thread Scott Stults
experience running a big Solr Cloud cluster on Docker > > containers? By “big”, I mean 35 million docs, 40 nodes, 8 shards, with 36 > > CPU instances. We are running version 6.6.2 right now, but could upgrade. > > > > If people have specific things to do or avoid, I’d really appreciate it. > > > > I got a couple of responses on the Slack channel, but I’d love more > > stories from the trenches. This is a direction for our company > architecture. > > > > We have a master/slave cluster (Solr 4.10.4) that is awesome. I can > > absolutely see running the slaves as containers. For Solr Cloud? Makes me > > nervous. > > > > wunder > > Walter Underwood > > wun...@wunderwood.org > > http://observer.wunderwood.org/ (my blog) > > > > > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Query terms and the match state

2019-09-09 Thread Scott Stults
ith that > information (as ratio of matched-terms/total-terms) > > *Ask:* Is it possible to get back the terms of the query and the matched > state ? > > I tried > >- debug=query option (with the default select handler) >- with terms in the debug response I could w

Re: BBox question

2019-02-04 Thread Scott Stults
; Sr Engineering Manager, Panamera > > Buenos Aires - Argentina > > Email: fernando.ot...@olx.com > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Query over nested documents with an AND Operator

2019-02-01 Thread Scott Stults
> > Consultant SAP Development Workbench > > > > > > *FIS Informationssysteme und Consulting GmbH *Röthleiner Weg 1 > > 97506 Grafenrheinfeld > > > > P +49 (9723) 9188-667 > > F +49 (9723) 9188-200 > > E j.gels...@fis-gmbh.de > > www.fis

Re: Need to perfom search and group the record on basis of domain,subject,from address and display the count of label i.e inbox,spam

2019-02-01 Thread Scott Stults
e label and label status > should be displayed as percentage. > > > 2. Need to paginate the record along with the implementation 1 > > > Display will be as mentioned below > > > 1. domain name : @ subject:hello from addredd: abcd@i > > inbox percentage : 20% spam percentage : 80% > read percentage : 30% unread percentage : 70% > > 2. domain name : @ subject:hi from addredd: abcd@i > > inbox percentage : 20% spam percentage : 80% > read percentage : 30% unread percentage : 70% > > > 3. domain name : @ subject:where from addredd: abcd@i > > inbox percentage : 20% spam percentage : 80% > read percentage : 30% unread percentage : 70% > > > > -- > Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: PatternReplaceFilterFactory problem

2019-01-28 Thread Scott Stults
ignoreCase="true" synonyms="synonyms.txt"/> > ignoreCase="true"/> > > pattern="([-a-z])\.com" replacement="$1"/> > > protected="protwords.txt"/> > > >

Re: Aggregate functions

2019-01-28 Thread Scott Stults
ion column. Thanks for your quick > help. > > > > Regards > > Pradeep > > +917204007740 > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Region wise query routing with solr

2019-01-28 Thread Scott Stults
uti > -- > Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Active node "kicked out" when starting a new node

2019-01-28 Thread Scott Stults
there any documents about how > SolrCloud manages the cluster behind the scenes? > > > Thanks, > Teddie > > > > > -- > Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Log Statements: Collection, Shard, Replica, and Core Info Missing

2019-01-28 Thread Scott Stults
" > filePattern="<%= @solr_logs %>/solr_metrics.log.%i" > > > > %d{-MM-dd HH:mm:ss.SSS} %-5p (%t) [%X{collection} %X{shard} > %X{replica} %X{core}] %c{1.} %m%n > > > > > > > > > > A

Re: Need help on Solr authorization

2019-01-18 Thread Scott Stults
neValidate(PKIXValidator.java:302)\n\tat > > sun.security.validator.Validator.validate(Validator.java:260)\n\tat > > > sun.security.ssl.X509TrustManagerImpl.validate(X509TrustManagerImpl.java:324)\n\tat > > > sun.security.ssl.X509TrustManagerImpl.checkTrusted(X509TrustManagerImpl.java:229)\n\tat > > > sun.security.ssl.X509TrustManagerImpl.checkServerTrusted(X509TrustManagerImpl.java:124)\n\tat > > > sun.security.ssl.ClientHandshaker.serverCertificate(ClientHandshaker.java:1496)\n\t... > > 53 more\nCaused by: > > sun.security.provider.certpath.SunCertPathBuilderException: unable to > find > > valid certification path to requested target\n\tat > > > sun.security.provider.certpath.SunCertPathBuilder.build(SunCertPathBuilder.java:141)\n\tat > > > sun.security.provider.certpath.SunCertPathBuilder.engineBuild(SunCertPathBuilder.java:126)\n\tat > > java.security.cert.CertPathBuilder.build(CertPathBuilder.java:280)\n\tat > > > sun.security.validator.PKIXValidator.doBuild(PKIXValidator.java:392)\n\t... > > 59 more\n", > > > > "code":500}} > > > > > > > > > > > > Regards, > > > > Sathish. > > > > > > > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: [QA-search] About field setting

2019-01-18 Thread Scott Stults
0" multiValued="true"> > > > > > >ignoreCase="true"/> > > > > > > > > ignoreCase="true" synonyms="synonyms.txt"/> > > > >ignoreCase="true"/> > > > > > > > > > > > > thanks > > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: regarding debugging solr in eclipse

2019-01-18 Thread Scott Stults
server in debug mode? > Thanks. Regards > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: So Many Zookeeper Warnings--There Must Be a Problem

2019-01-03 Thread Scott Stults
ks! > > Joe > > > > -- > Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: So Many Zookeeper Warnings--There Must Be a Problem

2019-01-03 Thread Scott Stults
logs impossible to read, but worse, I assume indicate > that something is wrong. > > Thanks for any help! > > Joe Lerner > > > > -- > Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Excessive resources consumption migrating from Solr 6.6.0 Master/Slave to SolrCloud 6.6.0 (dozen times more resources)

2017-08-28 Thread Scott Stults
tcommit/hardcommit values > (from few seconds to minutes) with no appreciable improvements :( > > Thanks for your reply! > > - Daniel > > 2017-08-25 6:45 GMT+02:00 Scott Stults <sstu...@opensourceconnections.com > >: > > > Hi Dani, > > > > It seems lik

Re: Excessive resources consumption migrating from Solr 6.6.0 Master/Slave to SolrCloud 6.6.0 (dozen times more resources)

2017-08-24 Thread Scott Stults
solr.RunUpdateProcessorFactory" /> < > requestHandler name="/update" class="solr.UpdateRequestHandler" > name= > "defaults"> dedupe > > Thanks for your reply :) > > - Dani > > 2017-08-24 14:49 GMT+02:00 Scott Stults

Re: Excessive resources consumption migrating from Solr 6.6.0 Master/Slave to SolrCloud 6.6.0 (dozen times more resources)

2017-08-24 Thread Scott Stults
e aren't using a load balancer for indexing, but we have all our Rest > Query services through an HAProxy (using 'leastconn' algorithm). The Rest > Query Services performs queries using the CloudSolrClient. > > Thanks for your reply, > if you need any further information don't hes

Re: Excessive resources consumption migrating from Solr 6.6.0 Master/Slave to SolrCloud 6.6.0 (dozen times more resources)

2017-08-23 Thread Scott Stults
s > 18 GB. > > > > *Indexation **(SolrCloud infrastructure)* > > > > The commits *aren’t* triggered by the application described before. The > hardcommit/softcommit interval are configured in Solr: > > > >- *HardCommit:* every 15 minutes (with opensearcher =

Re: solr jetty based auth and distributed solr requests

2017-08-23 Thread Scott Stults
ant links so that i can try it out. > > > Thanks, > Radhakrishnan > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Facet date Range without start and and date

2017-01-12 Thread Scott Stults
ifying start and and > of the range. > Otherwise, is it possible to put in the same request start to min value > and end to max value. > Thank you. > Regards,NKI. > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: regarding extending classes in org.apache.solr.client.solrj.io.stream.metrics package

2017-01-12 Thread Scott Stults
ned classes as protected so that it will be easy to extend > > variables > --- > > longMax > > doubleMax > > columnName > > > and > > methods > > --- > > init > > > > Thanks, > > Radhakrishnan D >

Re: Max length of solr query

2017-01-12 Thread Scott Stults
item_id: ... > (The number of item_id is maybe over than one million) > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: [More Like This] Query building

2016-04-12 Thread Scott Stults
t; >>>>> > "Tyger, tyger burning bright > >>>>> > In the forests of the night, > >>>>> > What immortal hand or eye > >>>>> > Could frame thy fearful symmetry?" > >>>>> > > >>>>> > William Blake - Songs of Experience -1794 England > >>>>> > > >>>>> > >>>>> > >>>>> > >>>>> -- > >>>>> Anshum Gupta > >>>>> > >>>> > >>>> > >>>> > >>>> -- > >>>> -- > >>>> > >>>> Benedetti Alessandro > >>>> Visiting card : http://about.me/alessandro_benedetti > >>>> > >>>> "Tyger, tyger burning bright > >>>> In the forests of the night, > >>>> What immortal hand or eye > >>>> Could frame thy fearful symmetry?" > >>>> > >>>> William Blake - Songs of Experience -1794 England > >>>> > >>> > >>> > >>> > >>> -- > >>> -- > >>> > >>> Benedetti Alessandro > >>> Visiting card : http://about.me/alessandro_benedetti > >>> > >>> "Tyger, tyger burning bright > >>> In the forests of the night, > >>> What immortal hand or eye > >>> Could frame thy fearful symmetry?" > >>> > >>> William Blake - Songs of Experience -1794 England > >>> > >> > >> > >> > >> -- > >> -- > >> > >> Benedetti Alessandro > >> Visiting card : http://about.me/alessandro_benedetti > >> > >> "Tyger, tyger burning bright > >> In the forests of the night, > >> What immortal hand or eye > >> Could frame thy fearful symmetry?" > >> > >> William Blake - Songs of Experience -1794 England > >> > > > > > > > > -- > > -- > > > > Benedetti Alessandro > > Visiting card : http://about.me/alessandro_benedetti > > > > "Tyger, tyger burning bright > > In the forests of the night, > > What immortal hand or eye > > Could frame thy fearful symmetry?" > > > > William Blake - Songs of Experience -1794 England > > > > > > -- > -- > > Benedetti Alessandro > Visiting card : http://about.me/alessandro_benedetti > > "Tyger, tyger burning bright > In the forests of the night, > What immortal hand or eye > Could frame thy fearful symmetry?" > > William Blake - Songs of Experience -1794 England > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Boosts for relevancy (shopping products)

2016-03-19 Thread Scott Stults
random. > > I know it's going to be a case of rounds upon rounds of testing, but maybe > there's a good starting point that will save me some time? > > My initial thoughts right now are to actually just search on the name > field, and maybe the brand (for things like "A

Re: Multi-lingual search

2016-02-02 Thread Scott Stults
/lucene.472066.n3.nabble.com/Multi-lingual-search-tp4254398.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Solr+HDFS

2016-02-02 Thread Scott Stults
after 555448ms > INFO - 2016-01-28 22:25:44.808; [ UNCLASS] > org.apache.solr.util.FSHDFSUtils; recoverLease=false, attempt=11 on > > file=hdfs://nameservice1:8020/solr5.2/UNCLASS/core_node14/data/tlog/tlog.0282933 > after 617168ms > INFO - 2016-01-28 22:26:45.9

Re: upgrade SolrCloud

2016-02-02 Thread Scott Stults
er > node and upgrade it? Or are there caveats to consider? > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Scripting server side

2016-02-02 Thread Scott Stults
execution. > > Best regards, > Vincenzo > > -- > Vincenzo D'Amore > email: v.dam...@gmail.com > skype: free.dev > mobile: +39 349 8513251 > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: plugging an analyzer

2016-02-02 Thread Scott Stults
s > are and how can I trigger their call? Any suggestion that does not imply to > divide the code on Tokenizer/Filters? > > Thank you very much in advance, > Roxana > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: How to achieve exact string match query which includes spaces and quotes

2016-01-13 Thread Scott Stults
ucene" > parser > >> or > >> "edismax" parser. > >> > >> > >> > >> -- > >> View this message in context: > >> > http://lucene.472066.n3.nabble.com/How-to-achieve-exact-string-match-query-which-includes-spaces-and-quotes-tp4250402p4250405.html > >> Sent from the Solr - User mailing list archive at Nabble.com. > >> > > -- > > Regards, > > Binoy Dalal > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Permutations of entries in a multivalued field

2015-12-18 Thread Scott Stults
tations are found > (using e.g. type=text is not an option since a term could occur in a > different entry of the multivalued field) or trigger an alphabetical > sorting of incoming queries. > > Thanks a lot for your feedback, best regards > > Johannes > > -- Scott Stult

Re: query to get parents without childs

2015-12-16 Thread Scott Stults
<toe.al...@gmail.com> wrote: > Hi guys, > > I have few parent index without child, what would wold be the query for > those to get? > > Thanks, > Novin > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Highlighting large documents

2015-12-08 Thread Scott Stults
> > > > > I have tried to increase the value of the hl.maxAnalyzedChars, as the > > > > default value is 51200, and I have documents that are much larger > than > > > > 51200 characters. Although this method works, but, when I increase > this > > &

Re: Highlighting tag problem

2015-12-07 Thread Scott Stults
those results. So > I'm not sure why those results are returned. > > Regards, > Edwin > > > On 4 December 2015 at 01:03, Scott Stults < > sstu...@opensourceconnections.com > > wrote: > > > Edwin, > > > > Is this related to what's described in

Re: Highlighting tag problem

2015-12-03 Thread Scott Stults
ignoreCase="true" expand="false"/> > > > > maxGramSize="15"/> > > > > > > segMode="SEARCH"/> > > > > > > words="org/apache/lucene/analys

Re: Different Similarities for the same field

2015-12-02 Thread Scott Stults
default one on the SAME field. Because there > are no differences for the index. E.g. by creating a SearchComponent to > have something like solr/mySelect for queries with my similarity and the > usual solr/select for the default similarity? > How could I achive this, has anybody a hint? &

Re: Highlighting content field problem when using JiebaTokenizerFactory

2015-11-23 Thread Scott Stults
> > words="org/apache/lucene/analysis/cn/smart/stopwords.txt"/> > >>> > > >>> > > maxGramSize="15"/> > >>> > > > >>> > > > >>> > > > >>> > > >>> > > segMode="SEARCH"/> > >>> > > > >>> > > > >>> > > >>> > > words="org/apache/lucene/analysis/cn/smart/stopwords.txt"/> > >>> > > > >>> > > > >>> > > > >>> > > > >>> > > > >>> > > Here's my solrconfig.xml on the highlighting portion: > >>> > > > >>> > > > >>> > > > >>> > > explicit > >>> > > 10 > >>> > > json > >>> > > true > >>> > > text > >>> > > id, title, content_type, last_modified, url, score > >>> > >>> > > > >>> > > on > >>> > > id, title, content, author, tag > >>> > > true > >>> > > true > >>> > > html > >>> > > 200 > >>> > > true > >>> > > signature > >>> > > true > >>> > > 100 > >>> > > > >>> > > > >>> > > > >>> > > >>> > > class="solr.highlight.BreakIteratorBoundaryScanner"> > >>> > > > >>> > > WORD > >>> > > en > >>> > > SG > >>> > > > >>> > > > >>> > > > >>> > > > >>> > > Meanwhile, I'll take a look at the articles too. > >>> > > > >>> > > Thank you. > >>> > > > >>> > > Regards, > >>> > > Edwin > >>> > > > >>> > > > >>> > > On 20 October 2015 at 11:32, Scott Chu <scott@udngroup.com > >>> <+scott@udngroup.com> > >>> > <+scott@udngroup.com <+scott@udngroup.com>> > >>> > > <+scott@udngroup.com <+scott@udngroup.com> <+ > >>> scott@udngroup.com <+scott@udngroup.com>>>> wrote: > >>> > > > >>> > > > Hi Edwin, > >>> > > > > >>> > > > I didn't use Jieba on Chinese (I use only CJK, very > foundamental, I > >>> > > > know) so I didn't experience this problem. > >>> > > > > >>> > > > I'd suggest you post your schema.xml so we can see how you define > >>> your > >>> > > >>> > > > content field and the field type it uses? > >>> > > > > >>> > > > In the mean time, refer to these articles, maybe the answer or > >>> > workaround > >>> > > > can be deducted from them. > >>> > > > > >>> > > > https://issues.apache.org/jira/browse/SOLR-3390 > >>> > > > > >>> > > > > >>> http://qnalist.com/questions/661133/solr-is-highlighting-wrong-words > >>> > >>> > > > > >>> > > > > http://qnalist.com/questions/667066/highlighting-marks-wrong-words > >>> > > > > >>> > > > Good luck! > >>> > > > > >>> > > > > >>> > > > > >>> > > > > >>> > > > Scott Chu,scott@udngroup.com <+scott@udngroup.com> <+ > >>> scott@udngroup.com <+scott@udngroup.com>> <+ > >>> > scott@udngroup.com <+scott@udngroup.com> <+ > >>> scott@udngroup.com <+scott@udngroup.com>>> > >>> > > > 2015/10/20 > >>> > > > > >>> > > > - Original Message - > >>> > > > *From: *Zheng Lin Edwin Yeo <edwinye...@gmail.com > >>> <+edwinye...@gmail.com> > >>> > <+edwinye...@gmail.com <+edwinye...@gmail.com>> > >>> > > <+edwinye...@gmail.com <+edwinye...@gmail.com> <+ > >>> edwinye...@gmail.com <+edwinye...@gmail.com>>>> > >>> > > > *To: *solr-user <solr-user@lucene.apache.org > >>> <+solr-user@lucene.apache.org> > >>> > <+solr-user@lucene.apache.org <+solr-user@lucene.apache.org>> > >>> > > <+solr-user@lucene.apache.org <+solr-user@lucene.apache.org> <+ > >>> solr-user@lucene.apache.org <+solr-user@lucene.apache.org>>>> > >>> > > >>> > > > *Date: *2015-10-13, 17:04:29 > >>> > > > *Subject: *Highlighting content field problem when using > >>> > > > JiebaTokenizerFactory > >>> > > > > >>> > > > Hi, > >>> > > > > >>> > > > I'm trying to use the JiebaTokenizerFactory to index Chinese > >>> characters > >>> > > in > >>> > > > > >>> > > > Solr. It works fine with the segmentation when I'm using > >>> > > > the Analysis function on the Solr Admin UI. > >>> > > > > >>> > > > However, when I tried to do the highlighting in Solr, it is not > >>> > > > highlighting in the correct place. For example, when I search of > >>> > > 自然環境与企業本身, > >>> > > > it highlight 認為自然環境与企業本身的 > >>> > > > > >>> > > > Even when I search for English character like responsibility, it > >>> > > highlight > >>> > > > *responsibilit*y. > >>> > > > > >>> > > > Basically, the highlighting goes off by 1 character/space > >>> consistently. > >>> > > > > >>> > > > This problem only happens in content field, and not in any other > >>> > fields. > >>> > > > >>> > > > Does anyone knows what could be causing the issue? > >>> > > > > >>> > > > I'm using jieba-analysis-1.0.0, Solr 5.3.0 and Lucene 5.3.0. > >>> > > > > >>> > > > > >>> > > > Regards, > >>> > > > Edwin > >>> > > > > >>> > > > > >>> > > > > >>> > > > - > >>> > > > 未在此訊息中找到病毒。 > >>> > > > 已透過 AVG 檢查 - www.avg.com > >>> > > > 版本: 2015.0.6140 / 病毒庫: 4447/10808 - 發佈日期: 10/12/15 > >>> > > > > >>> > > > > >>> > > > >>> > > > >>> > > > >>> > > - > >>> > > 未在此訊息中找到病毒。 > >>> > > 已透過 AVG 檢查 - www.avg.com > >>> > > 版本: 2015.0.6172 / 病毒庫: 4447/10853 - 發佈日期: 10/19/15 > >>> > > > >>> > > > >>> > > >>> > > >>> > > >>> > - > >>> > 未在此訊息中找到病毒。 > >>> > 已透過 AVG 檢查 - www.avg.com > >>> > 版本: 2015.0.6172 / 病毒庫: 4450/10867 - 發佈日期: 10/21/15 > >>> > > >>> > > >>> > >>> > >>> > >>> - > >>> 未在此訊息中找到病毒。 > >>> 已透過 AVG 檢查 - www.avg.com > >>> 版本: 2015.0.6173 / 病毒庫: 4450/10871 - 發佈日期: 10/22/15 > >>> > >>> > >> > > > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Number of fields in qf & fq

2015-11-20 Thread Scott Stults
>>> http://observer.wunderwood.org/ (my blog) > > > >>> > > > >>> > > > >>>> On Nov 19, 2015, at 3:46 PM, Steven White <swhite4...@gmail.com> > > > wrote: > > > >>>> > > > >>>> Hi

Re: StringIndexOutOfBoundsException using spellcheck and synonyms

2015-11-16 Thread Scott Stults
NOTICE > This e-mail (including any attachments) may contain confidential and/or > privileged information. If you are not the intended recipient or have > received this e-mail in error, please inform the sender immediately and > delete this e-mail (including any attachments) from your computer, and you > must not use, disclose to anyone else or copy this e-mail (including any > attachments), whether in whole or in part. > This e-mail and any reply to it may be monitored for security, legal, > regulatory compliance and/or other appropriate reasons. -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Stopping Solr on Linux when run as a service

2015-11-10 Thread Scott Stults
t; Linux. Now what I don't get is this: what happens when the system is > shutting down? How does Solr knows to shutdown gracefully when there is > noting on that page talks about issuing a "stop" command on system > shutdown? Can someone shed some light on this? Like I said,

Re: Solr Search: Access Control / Role based security

2015-11-09 Thread Scott Stults
rrent security plugins are more towards making Solr apis/resources > secure not towards securing/controlling data during search. > > https://cwiki.apache.org/confluence/display/solr/Authentication+and+Authorization+Plugins > > > Please share your thoughts. > > Thanks, >

Re: Securing field level access permission by filtering the query itself

2015-11-05 Thread Scott Stults
> >>>> are all awesome. > >>>> > >>>> In my implementation has much of the solr query building working on > the > >>>> browser, solr is behind a php server which acts as “proxy” and > doorman, > >>>> filtering at t

Re: Securing field level access permission by filtering the query itself

2015-10-30 Thread Scott Stults
uery itself > whether it be parameters (fl, facet.fields, terms, etc) … or even deny any > request in which fieldname occurs … > > Is there someway someone might obscure a fieldname in a request? > > Kind Regards & thanks in davacne, > Douglas -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Question on index time de-duplication

2015-10-30 Thread Scott Stults
add a special tag (e.g. dedup=true) to 2013 and > > 2014 content, keeping 2015 (the latest release) untouched. During query > > time, I'll add a filter which will exclude contents tagged with "dedup". > > > > Just wondering if this is achievable by perhaps extending

Re: Solr collection alias - how rank is affected

2015-10-27 Thread Scott Stults
ollection alias for > searching on two collections with same schema ? is it affected at all ? > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Solr-collection-alias-how-rank-is-affected-tp4236776.html > Sent from the Solr - User mailing list arch

Re: Does docValues impact termfreq ?

2015-10-26 Thread Scott Stults
unt you can do to improve performance. > >>>>>>>>>>> > >>>>>>>>>>> The term frequency is a number that is used often, so it is > >>>>>>>>>>> > >>>>>>>>>> stored > >>>>> > >>>>>> in > >>>>>>> > >>>>>>>> the index pre-calculated. Perhaps, if your data is not > >>>>>>>>>>> > >>>>>>>>>> changing, > >>>>> > >>>>>> optimising your index would reduce it to one segment, and > >>>>>>>>>>> > >>>>>>>>>> thus > >>> > >>>> might > >>>>>>> > >>>>>>>> ever so slightly speed the aggregation of term frequencies, > >>>>>>>>>>> > >>>>>>>>>> but I > >>>>> > >>>>>> doubt > >>>>>>> > >>>>>>>> it'd make enough difference to make it worth doing. > >>>>>>>>>>> > >>>>>>>>>>> Upayavira > >>>>>>>>>>> > >>>>>>>>>>> On Sat, Oct 24, 2015, at 03:37 PM, Aki Balogh wrote: > >>>>>>>>>>> > >>>>>>>>>>>> Thanks, Jack. I did some more research and found similar > >>>>>>>>>>>> > >>>>>>>>>>> results. > >>>>> > >>>>>> In our application, we are making multiple (think: 50) > >>>>>>>>>>>> > >>>>>>>>>>> concurrent > >>>>> > >>>>>> requests > >>>>>>>>>>>> to calculate term frequency on a set of documents in > >>>>>>>>>>>> > >>>>>>>>>>> "real-time". The > >>>>>>> > >>>>>>>> faster that results return, the better. > >>>>>>>>>>>> > >>>>>>>>>>>> Most of these requests are unique, so cache only helps > >>>>>>>>>>>> > >>>>>>>>>>> slightly. > >>>>> > >>>>>> This analysis is happening on a single solr instance. > >>>>>>>>>>>> > >>>>>>>>>>>> Other than moving to solr cloud and splitting out the > >>>>>>>>>>>> > >>>>>>>>>>> processing > >>>>> > >>>>>> onto > >>>>>>> > >>>>>>>> multiple servers, do you have any suggestions for what > >>>>>>>>>>>> > >>>>>>>>>>> might > >>> > >>>> speed up > >>>>>>> > >>>>>>>> termfreq at query time? > >>>>>>>>>>>> > >>>>>>>>>>>> Thanks, > >>>>>>>>>>>> Aki > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> On Fri, Oct 23, 2015 at 7:21 PM, Jack Krupansky > >>>>>>>>>>>> <jack.krupan...@gmail.com> > >>>>>>>>>>>> wrote: > >>>>>>>>>>>> > >>>>>>>>>>>> Term frequency applies only to the indexed terms of a > >>>>>>>>>>>>> > >>>>>>>>>>>> tokenized > >>>>> > >>>>>> field. > >>>>>>>>>> > >>>>>>>>>>> DocValues is really just a copy of the original source > >>>>>>>>>>>>> > >>>>>>>>>>>> text > >>> > >>>> and is > >>>>>>> > >>>>>>>> not > >>>>>>>>>> > >>>>>>>>>>> tokenized into terms. > >>>>>>>>>>>>> > >>>>>>>>>>>>> Maybe you could explain how exactly you are using term > >>>>>>>>>>>>> > >>>>>>>>>>>> frequency in > >>>>>>> > >>>>>>>> function queries. More importantly, what is so "heavy" > >>>>>>>>>>>>> > >>>>>>>>>>>> about > >>>>> > >>>>>> your > >>>>>>> > >>>>>>>> usage? > >>>>>>>>>>> > >>>>>>>>>>>> Generally, moderate use of a feature is much more > >>>>>>>>>>>>> > >>>>>>>>>>>> advisable to > >>>>> > >>>>>> heavy > >>>>>>>>> > >>>>>>>>>> usage, > >>>>>>>>>>> > >>>>>>>>>>>> unless you don't care about performance. > >>>>>>>>>>>>> > >>>>>>>>>>>>> -- Jack Krupansky > >>>>>>>>>>>>> > >>>>>>>>>>>>> On Fri, Oct 23, 2015 at 8:19 AM, Aki Balogh < > >>>>>>>>>>>>> > >>>>>>>>>>>> a...@marketmuse.com> > >>>>>>> > >>>>>>>> wrote: > >>>>>>>>>>> > >>>>>>>>>>>> Hello, > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> In our solr application, we use a Function Query > >>>>>>>>>>>>>> > >>>>>>>>>>>>> (termfreq) > >>>>> > >>>>>> very > >>>>>>> > >>>>>>>> heavily. > >>>>>>>>>>> > >>>>>>>>>>>> Index time and disk space are not important, but > >>>>>>>>>>>>>> > >>>>>>>>>>>>> we're > >>> > >>>> looking to > >>>>>>> > >>>>>>>> improve > >>>>>>>>>>> > >>>>>>>>>>>> performance on termfreq at query time. > >>>>>>>>>>>>>> I've been reading up on docValues. Would this be a > >>>>>>>>>>>>>> > >>>>>>>>>>>>> way to > >>> > >>>> improve > >>>>>>> > >>>>>>>> performance? > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> I had read that Lucene uses Field Cache for Function > >>>>>>>>>>>>>> > >>>>>>>>>>>>> Queries, so > >>>>>>> > >>>>>>>> performance may not be affected. > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> And, any general suggestions for improving query > >>>>>>>>>>>>>> > >>>>>>>>>>>>> performance > >>>>> > >>>>>> on > >>>>>>> > >>>>>>>> Function > >>>>>>>>>>> > >>>>>>>>>>>> Queries? > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> Thanks, > >>>>>>>>>>>>>> Aki > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > > -- > > Monitoring * Alerting * Anomaly Detection * Centralized Log Management > > Solr & Elasticsearch Support * http://sematext.com/ > > < > https://t.yesware.com/tl/506312808dab13214164f92fbcf5714d3ce38c6b/92f5492fd055692ff7f03b2888be3b50/7a8fd1f72b93af5d79583420b3483a7d?ytl=http%3A%2F%2Fsematext.com%2F > > > > > > > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Highlight with NGram and German S Sharp "ß"

2015-10-20 Thread Scott Stults
class="solr.MappingCharFilterFactory" > mapping="mapping-ISOLatin1Accent.txt"/> > > Jérôme > > > Le 13/10/2015 07:46, Scott Stults a écrit : > >> My guess is that the boundary scanner isn't configured right for your >> highlighter. Try setting the bs.l

Re: Highlighting content field problem when using JiebaTokenizerFactory

2015-10-19 Thread Scott Stults
pace consistently. > > This problem only happens in content field, and not in any other fields. > Does anyone knows what could be causing the issue? > > I'm using jieba-analysis-1.0.0, Solr 5.3.0 and Lucene 5.3.0. > > > Regards, > Edwin > -- Scott Stults | F

Re: Autostart Zookeeper and Solr using scripting

2015-10-19 Thread Scott Stults
ailable in .NET via a > nuget package. Not sure if this could be also implemented to check if a > zookeeper is running. > > Any thoughts on anyone using a script to perform this? > > Regards, > Adrian > > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Highlight with NGram and German S Sharp "ß"

2015-10-12 Thread Scott Stults
> pattern="[\s,;: > \-\']"/> > splitOnNumerics="0" > generateWordParts="1" > generateNumberParts="0" > catenateWords="0" > catenateNumbers="0" > catenateAll="0" > splitOnCaseChange="0" > preserveOriginal="1" > types="wdfftypes.txt" > /> > > > pattern="^(.{20})(.*)?" replacement="$1" replace="all"/> > > > > Is it a problem in our configuration or a known bug ? > Regards > Jérôme > > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: are there any SolrCloud supervisors?

2015-10-12 Thread Scott Stults
r work and > could clean up to share on GitHub if there is much interest. > > I asked in the IRC about a SolrCloud supervisor utility but wanted to > extend that question to this list. are there any more "full featured" > supervisors out there? > > > -renning &

Re: Selective field query

2015-10-12 Thread Scott Stults
n unwanted fields, > but > > >> > these > > >> > appear to have still been returned in results. > > >> > > >> q=ServiceName:(Searched Service) > > >> > > >> That'll look in just one field. > > >> > > >>

Re: Why is Process Total Time greater than Elapsed Time?

2015-09-03 Thread Scott Stults
a Process Total > : Time of 1300ms? Does the Process Total Time add up the amount of time > each > : leaf reader takes, or some other concurrent things? > > > -Hoss > http://www.lucidworks.com/ > -- Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Why is Process Total Time greater than Elapsed Time?

2015-09-03 Thread Scott Stults
>From what I can tell, each component processes the request sequentially. So how can I see an Elapsed Time of 750ms (SolrJ client) and a Process Total Time of 1300ms? Does the Process Total Time add up the amount of time each leaf reader takes, or some other concurrent things? Thank you, Scott

Re: Solr packages in Apache BigTop.

2015-03-09 Thread Scott Stults
, solr, hbase and so on in rpm/deb format, and supply puppet provisioners along with vagrant recipse for testing. -- jay vyas -- Scott Stults | Founder Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Multi words query

2015-02-13 Thread Scott Stults
[id] } I created a text field to copy all the fields to and the query handler request this field rgds, -- View this message in context: http://lucene.472066.n3.nabble.com/Multi-words-query-tp4185625p4185922.html Sent from the Solr - User mailing list archive at Nabble.com. -- Scott

Re: bulk indexing with optimistick lock

2015-02-13 Thread Scott Stults
a way to do this? Regards Sankalp Gupta -- Scott Stults | Founder Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: SpellingQueryConverter and query parsing

2015-01-29 Thread Scott Stults
opening a jira contributing the code, even if what you end up with isn't in a final polished state for general use. James Dyer Ingram Content Group -Original Message- From: Scott Stults [mailto:sstu...@opensourceconnections.com] Sent: Tuesday, January 27, 2015 11:26 AM To: solr-user

SpellingQueryConverter and query parsing

2015-01-27 Thread Scott Stults
Hello! SpellingQueryConverter parses the incoming query in sort of a quick and dirty way with a regular expression. Is there a reason the query string isn't parsed with the _actual_ parser, if one was configured for that type of request? Even better, could the parsed query object be added to the

Re: zkCli zkhost parameter

2014-04-28 Thread Scott Stults
you tried a comma-separated list or are you going by documentation? It should work. -- Mark Miller about.me/markrmiller On April 26, 2014 at 1:03:25 PM, Scott Stults ( sstu...@opensourceconnections.com) wrote: It looks like this only takes a single host as its value, whereas the zkHost

zkCli zkhost parameter

2014-04-26 Thread Scott Stults
It looks like this only takes a single host as its value, whereas the zkHost environment variable for Solr takes a comma-separated list. Shouldn't the client also take a comma-separated list? k/r, Scott

JVM tuning?

2013-11-12 Thread Scott Stults
We've been using a slightly older version of this script to start Solr in server environments: https://github.com/apache/cassandra/blob/trunk/conf/cassandra-env.sh The thing I especially like about it is its ability to dynamically cap memory usage, and the garbage collection log section is a

Re: Thoughts on production deployment?

2013-02-02 Thread Scott Stults
recipe is that it's going to grab whatever version of Jetty was blessed by the distro maintainers on your target platform. -- Scott Stults | Founder Solutions Architect | OpenSource Connections, LLC | 434.409.2780 http://www.opensourceconnections.com

Re: Thoughts on production deployment?

2013-02-01 Thread Scott Stults
Thanks for jumping in guys. I agree the SolrJetty page needs just a little updating -- I commented at the bottom of SOLR-3159 about that. Michael and Paul, are your chef and ant recipes generic enough to share? My next install is going to be on RHEL 6, so I can take a crack at an install script

Thoughts on production deployment?

2013-01-31 Thread Scott Stults
Part of this is a rant, part is a plea to others who've run successful production deployments. Solr is a second-class citizen when it comes to production deployment. Every recipe I've seen (RPM, DEB, chef, or puppet) makes assumptions that in one way or another run afoul of best-practices when

Re: Will SolrCloud always slice by ID hash?

2013-01-07 Thread Scott Stults
Thanks guys. Yeah, separate rolling collections seem like the better way to go. -Scott On Sat, Dec 29, 2012 at 1:30 AM, Otis Gospodnetic otis.gospodne...@gmail.com wrote: https://issues.apache.org/jira/browse/SOLR-4237

Will SolrCloud always slice by ID hash?

2012-12-18 Thread Scott Stults
I'm going to be building a Solr cluster and I want to have a rolling set of slices so that I can keep a fixed number of days in my collection. If I send an update to a particular slice leader, will it always hash the unique key and (probably) forward the doc to another leader? Thank you, Scott

Re: Do Hignlighting + proximity using surround query parser

2012-01-24 Thread Scott Stults
results. Highlighter does not recognize Surround Query. It must be re-written to enable highlighting in o.a.s.search.QParser#getHighlightQuery() method. Not sure this functionality should be added in SOLR-2703 or a separate jira issue. -- Scott Stults | Founder Solutions Architect