Re: CustomScoreProvider Sucks, Need Help

2018-02-23 Thread ~$alpha`
But why when i am using customize sort, with just return it should work fast -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

RE: autosuggestion indexing in a solr cluster

2018-02-23 Thread Deepak Udapudi
Using the solr collection/suggest?suggesttrue URL for populating the index. Regards, Deepak -Original Message- From: Deepak Udapudi [mailto:dudap...@delta.org] Sent: Friday, February 23, 2018 5:42 PM To: solr-user@lucene.apache.org Cc: Balakrishna Sudabathula ;

autosuggestion indexing in a solr cluster

2018-02-23 Thread Deepak Udapudi
Hi all, We are using a Solr cluster. We have Solr configuration for auto-suggestions as shown below. Specialty specialty specialty AnalyzingInfixLookupFactory specialty_suggester_infix_dir DocumentDictionaryFactory

Re: custom unique numeric id

2018-02-23 Thread Clay McDonald
Thank you, Clay > On Feb 23, 2018, at 6:29 PM, Shawn Heisey wrote: > >> On 2/23/2018 2:57 PM, Clay McDonald wrote: >> I'm new to Solr/Lucene and I'd like to know if there is a way to auto-create >> a unique numeric id in a custom field that we can them reference when

Re: custom unique numeric id

2018-02-23 Thread Shawn Heisey
On 2/23/2018 2:57 PM, Clay McDonald wrote: > I'm new to Solr/Lucene and I'd like to know if there is a way to auto-create > a unique numeric id in a custom field that we can them reference when making > calls to the index from Python. It seems to use that using a numeric id would > speed up our

Re: statistics in hitlist

2018-02-23 Thread Joel Bernstein
This is going to be a complex answer because Solr actually now has multiple ways of doing regression analysis as part of the Streaming Expression statistical programming library. The basic documentation is here: https://lucene.apache.org/solr/guide/7_2/statistical-programming.html Here is a

Rename solrconfig.xml

2018-02-23 Thread Zheng Lin Edwin Yeo
Hi, Would like to check, how can we rename solrconfig.xml to something else? For example, I want to rename it to myconfig.xml. Is this possible? I'm using Solr 6.5.1, and planning to upgrade to Solr 7.2.1. Regards, Edwin

custom unique numeric id

2018-02-23 Thread Clay McDonald
Hello, I'm new to Solr/Lucene and I'd like to know if there is a way to auto-create a unique numeric id in a custom field that we can them reference when making calls to the index from Python. It seems to use that using a numeric id would speed up our calls to and fro Solr from our PySpark ML

Re: Indexing timeout issues with SolrCloud 7.1

2018-02-23 Thread Tom Peters
I included the last 25 lines from the logs from each of the five nodes during that time period. I _think_ I'm running into issues with bulking up deleteByQuery. Quick background: we have objects in our system that may have multiple availability windows. So when we index an object, will store

RE: SOLR Score Range Changed

2018-02-23 Thread Hodder, Rick
Classic Similarity helped, but the ranges of values don’t have a min near 0 like back in 4's version Are there other attributes/elements to this factory that could get me back the old functionality? -Original Message- From: Joël Trigalo [mailto:jtrig...@gmail.com] Sent: Friday,

RE: SOLR Score Range Changed

2018-02-23 Thread Hodder, Rick
Hi Shawn, Thanks for your help - I'm still finding my way in the weeds of SOLR. Combining everything into one query is what I'd prefer because as you said, one would think that with everything in the same query, the score would organize everything nicely. >>Assuming you're using the default

Re: Indexing timeout issues with SolrCloud 7.1

2018-02-23 Thread Deepak Goel
Can you please post all the errors? The current error is only for the node 'solr-2d' On 23 Feb 2018 09:42, "Tom Peters" wrote: I'm trying to debug why indexing in SolrCloud 7.1 is having so many issues. It will hang most of the time, and timeout the rest. Here's an

Re: configure jetty to use both http1.1 and H2

2018-02-23 Thread Jeff Dyke
Answering a bit of my own question, the underlying jetty would have to be built with it, and get pushed into its jar directory. I think i'll put nginx in front of this, do a quick proxy forcing 1.1 and move on, but if anyone knows any tricks, it'll be good just for thoroughness of this thread and

Re: configure jetty to use both http1.1 and H2

2018-02-23 Thread Jeff Dyke
Thanks for the tip Jason. I didn't see the -j option there or here https://lucene.apache.org/solr/guide/7_2/solr-control-script-reference.html I'll keep this short, i tried to add it to the init.d script and then interacting directly with the solr binary, but ultimately saw that

Re: statistics in hitlist

2018-02-23 Thread John Smith
Hi Joel, thanks for the answer. I'm not really a stats guy, but the end result of all this is supposed to be obtaining R^2. Is there no way of obtaining this value, then (short of iterating over all the results in the hitlist and calculating it myself)? On Fri, Feb 23, 2018 at 12:26 PM, Joel

Re: Turn on/off query based on a url parameter

2018-02-23 Thread Roopa Rao
Thanks, I got it working as below, features is true or false based on the efi parameter is_var. (Field and value should be substituted with correct names) { "store": "featurestore", "name": "isfeaturematch", "class": "org.apache.solr.ltr.feature.SolrFeature", "params": {

Re: Atomic Updates : Performance Impact

2018-02-23 Thread Uday Jami
Thanks Erick for the useful information. Will keep the below points in mind while designing my solution. Thanks, Uday On Sat, Feb 24, 2018 at 12:47 AM, Erick Erickson wrote: > bq: However if i dont have majority of other column data while doing update > operations, is

Re: Atomic Updates : Performance Impact

2018-02-23 Thread Erick Erickson
bq: However if i dont have majority of other column data while doing update operations, is it better to go with atomic update? I don't understand what you're asking. To use Atomic Updates, _every_ original field (i.e. any field that is _not_ the destination of a copyField directive) must be

Re: configure jetty to use both http1.1 and H2

2018-02-23 Thread Jason Gerlowski
Hi Jeff, I haven't tested your exact use case regarding H/2, but the "bin/solr" startup script has a special "-j" options that can be used to pass arbitrary flags to the underlying Jetty server. If you have options that work with vanilla Jetty, they _should_ work when passed through the

Re: CustomScoreProvider Sucks, Need Help

2018-02-23 Thread Walter Underwood
By “6lac row”, do you mean you are fetching 600,000 results? That will be very, very slow. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Feb 23, 2018, at 9:12 AM, ~$alpha` wrote: > > public class MatchingScoreProvider

Re: StandardTokenizer and splitting on mixedcase strings

2018-02-23 Thread Rick Leir
Dan, Lowercase filter before the tokenizer? Cheers -- Rick On February 23, 2018 6:08:27 AM EST, "Dan ." wrote: >Hi, > >The StandardTokenizerFactory splits strings like 'JavaScript' into >'Java' >and 'Script', but then searches with 'javascript' do not match the >document. >

Re: Object not fetched because its identifier appears to be already in processing

2018-02-23 Thread Rick Leir
Ven, Where do you see that message? Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: At which solr version was "Managed-schema" set as default?

2018-02-23 Thread BlackIce
Thank you, at the current Nutch documentation for Nutch 14 which is build against Solr 6.6.x, it states to delete the "managed-schema.xml" and provide the Nutch specific schema.xml and THEN create the core. This works, it creates the core with all nutch specific fields, but solr then

Re: Atomic Updates : Performance Impact

2018-02-23 Thread Uday Jami
Hello Erick, Thanks for the explanation. However if i dont have majority of other column data while doing update operations, is it better to go with atomic update? And also during the update process, if there is a simultaneous search request on the collection, will there be any lag in response?

Re: StandardTokenizer and splitting on mixedcase strings

2018-02-23 Thread Steve Rowe
Hi Dan, StandardTokenizerFactory does not do this. Maybe you have a filter in your analysis chain that does this? E.g. WordDelimiterFilterFactory has this capability. -- Steve www.lucidworks.com > On Feb 23, 2018, at 6:08 AM, Dan . wrote: > > Hi, > > The

Re: statistics in hitlist

2018-02-23 Thread Joel Bernstein
Typically SSE is the sum of the squared errors of the prediction in a regression analysis. The stats component doesn't perform regression, although it might be a nice feature. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Feb 23, 2018 at 12:17 PM, John Smith

Re: Indexing timeout issues with SolrCloud 7.1

2018-02-23 Thread Rick Leir
Tom I think you are saying that all updates fail? Need to do a bit of troubleshooting. How about queries? What else is in the logs? Cheers -- Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Atomic Updates : Performance Impact

2018-02-23 Thread Erick Erickson
The approximate amount of work will be very close to what it would take Solr to just index the documents from a client. Actually it puts a little _more_ of a load on Solr. In the case you do an Atomic Update, Solr has to 1> fetch all the stored fields from the index 2> construct a Solr document 3>

statistics in hitlist

2018-02-23 Thread John Smith
I'm using solr, and enabling stats as per this page: https://lucene.apache.org/solr/guide/6_6/the-stats-component.html I want to get more stat values though. Specifically I'm looking for r-squared (coefficient of determination). This value is not present in solr, however some of the pieces used

Re: SolrException: Error Instantiating queryParser, com.site.s.CustomQParserPlugin failed to instantiate org.apache.solr.search.QParserPlugin

2018-02-23 Thread ~$alpha`
Resolved by inclding solr on depency using maven -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

CustomScoreProvider Sucks, Need Help

2018-02-23 Thread ~$alpha`
public class MatchingScoreProvider extends CustomScoreProvider { } Issues: 1. CustomScoreProvider works but is too slow. Even when I am writing return 1 on 1st line it's still taking 5 seconds for 6lac row. 2. I added @Slf4j but still not logger not working -- Sent from:

Atomic Updates : Performance Impact

2018-02-23 Thread Uday Jami
Can you please let me know what will be the performance impact of trying to update 120Million records in a collection containing 1 billion records. The collection contains around 30 columns and only one column out of it is updated as part of atomic update. Its not a batch update, the 120 Million

configure jetty to use both http1.1 and H2

2018-02-23 Thread Jeff Dyke
Hi, I've been googling around for a while and can't seem to find an answer to this. Is it possible to have the embedded jetty listen to H/2 as well has HTTP/1.1, mainly i'd like to use this to access it on a private subnet on AWS through HAProxy which is set up to prefer H/2. With base jetty its

Re: At which solr version was "Managed-schema" set as default?

2018-02-23 Thread Shawn Heisey
On 2/23/2018 6:23 AM, BlackIce wrote: I'm reworking some documentation for the Nutch project, and for the sake of correctness and completness could someone tell me at which version did Solr switch over to the "managed-Schema" by default? It was version 5.5. In the versions before that, the

Object not fetched because its identifier appears to be already in processing

2018-02-23 Thread YELESWARAPU, VENKATA BHAN
Information Classification: ** Limited Access Dear Users, While indexing job is running we are seeing the below message for all the objects. Object not fetched because its identifier appears to be already in processing What is the issue and how to resolve this so that indexing can work.

Re: LTR and 'searching' a streaming expression result

2018-02-23 Thread Joel Bernstein
In the scenario you describe above the answer is no. That's because the joins rely on the sort order of the result set and require exporting of the entire result set. Both those requirements will not work with ltr. The search expression though could be used with ltr and the fetch expression,

Re: Spark-Solr -- unresolved dependencies

2018-02-23 Thread Shawn Heisey
On 2/23/2018 4:50 AM, Selvam Raman wrote: spark version - EMR 2.0.0 spark-shell --packages com.lucidworks.spark:spark-solr:3.0.1 when i tired about command, am getting below error :: :: UNRESOLVED DEPENDENCIES ::

Re: Solr not accessible - javax.net.ssl.SSLException

2018-02-23 Thread Shawn Heisey
On 2/22/2018 11:16 PM, protonmail4us wrote: ERROR: Failed to get system information from https://localhost:8282/solr due to: javax.net.ssl.SSLException: Certificate for doesn't match any of the subject alternative names: [*.ishippo.com, ishippo.com] It says the certificate is valid for

Re: SOLR Score Range Changed

2018-02-23 Thread Joël Trigalo
The difference seems due to the fact that default similarity in solr 7 is BM25 while it used to be TF-IDF in solr 4. As you realised, BM25 function is smoother. You can configure schema.xml to use ClassicSimilarity, for instance

Re: Download solr data(only one field) to csv

2018-02-23 Thread Emir Arnautović
Hi Selvam, Using start/rows to download 10M docs is what is called deep paging. You need to either use cursors (https://lucene.apache.org/solr/guide/6_6/pagination-of-results.html ) or export handler

Re: Issue Using JSON Facet API Buckets in Solr 6.6

2018-02-23 Thread Antelmo Aguilar
Hi Yonik, Good to hear you were able to reproduce it. Looking forward for the fix. Will use the version of Solr that works in the meantime. -Antelmo On Thu, Feb 22, 2018 at 5:10 PM, Yonik Seeley wrote: > I've reproduced the issue and opened >

At which solr version was "Managed-schema" set as default?

2018-02-23 Thread BlackIce
hi, I'm reworking some documentation for the Nutch project, and for the sake of correctness and completness could someone tell me at which version did Solr switch over to the "managed-Schema" by default? Thank you very much! RRK

Re: Solrj : ConcurrentUpdateSolrClient based on QueueSize and Time

2018-02-23 Thread Santosh Narayan
Thanks Jason. Hope this can be fixed in the next update of SolrJ. On Thu, Feb 22, 2018 at 10:49 AM, Jason Gerlowski wrote: > My apologies Santosh. I added that comment a few releases back based > on a misunderstanding I've only recently been disabused of. I will >

Download solr data(only one field) to csv

2018-02-23 Thread Selvam Raman
Hi, I have 10 million of record in solr index. I want to download whole record in csv format with one field. I have 20+ fields, but i want to download data with (fl=title) only title field. http://localhost:8983/solr/containerMetadata/select?q=*=external_id_s,container_title_en=csv=true=100

Spark-Solr -- unresolved dependencies

2018-02-23 Thread Selvam Raman
Hi, spark version - EMR 2.0.0 spark-shell --packages com.lucidworks.spark:spark-solr:3.0.1 when i tired about command, am getting below error :: :: UNRESOLVED DEPENDENCIES :: :: ::

StandardTokenizer and splitting on mixedcase strings

2018-02-23 Thread Dan .
Hi, The StandardTokenizerFactory splits strings like 'JavaScript' into 'Java' and 'Script', but then searches with 'javascript' do not match the document. Is there a solr way to prevent StandardTokenizer from splitting mixedcase strings? Cheers, Dan

LTR and 'searching' a streaming expression result

2018-02-23 Thread Gintautas Sulskus
Hi, Is it possible to apply another search to a streaming expression result? E.g. to use leftOuterJoin as a source for search: search( leftOuterJoin( leftOuterJoin(search(), search()) leftOuterJoin(..) ), q=... ) Is it possible to apply LTR to the streaming

Re: Solr Basic Authentication setup issue (password SolrRocks not accepted) on Solr6.1.0/Zkp3.4.6

2018-02-23 Thread Atita Arora
Hi, I tried the same on version 7.0.1 and it works with the same json. However , I remember setting this up for another client who used the same version and they reported similar issues. They later planned an upgrade to resolve this. I would also advice you to look into SOLR-9188

Solr Basic Authentication setup issue (password SolrRocks not accepted) on Solr6.1.0/Zkp3.4.6

2018-02-23 Thread Tarjono, C. A.
Dear All, We are trying to implement basic authentication in our solrcloud implementation. We followed the PDF (for version 6.1.0) as below: 1. Start Solr 2. Created security.json { "authentication":{ "blockUnknown": true,