Re: CursorMarks and 'end of results'

2018-06-21 Thread Chris Hostetter
: the documentation of 'cursorMarks' recommends to fetch until a query returns : the cursorMark that was passed in to a request. : : But that always requires an additional request at the end, so I wonder if I : can stop already, if a request returns less results than requested (num rows). :

Re: Applying streaming expression as a filter in graph traversal expression (gatherNodes)

2018-06-21 Thread Joel Bernstein
Currently the gatherNodes expression can only be filtered by a traditional filter query. I'm curious about the type of expression you are thinking of filtering by? Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Jun 20, 2018 at 1:54 PM, Pratik Patel wrote: > We can limit the scope of

Re: Search streaming expressions returns rows times number of shards docs

2018-06-21 Thread Alfonso Muñoz-Pomer Fuentes
Yes, I specifically was addressing the /select handler, sorry about not mentioning it explicitly. My use case was, originally, with CloudSolrStream in SolrJ, where I could observe the same behaviour and I created the streaming expression in the UI to test if it was SolrJ-specific. > On 21 Jun

Re: Search streaming expressions returns rows times number of shards docs

2018-06-21 Thread Alfonso Muñoz-Pomer Fuentes
Thanks a lot for the clarification. I created a Jira ticket not to lose track of this: https://issues.apache.org/jira/browse/SOLR-12510 > On 21 Jun 2018, at 20:53, Joel Bernstein wrote: > > That is actually the current behavior of the search expression. The initial > use cases from Streaming

Re: Search streaming expressions returns rows times number of shards docs

2018-06-21 Thread Aroop Ganguly
So I think 2 things are being missed here. You should be specifying the qt=“/export” to see all the results. If you do not do that, then the select handler is used by default which gives the default 10-20 rows as result. > On Jun 21, 2018, at 12:53 PM, Joel Bernstein wrote: > > That is

Re: Search streaming expressions returns rows times number of shards docs

2018-06-21 Thread Joel Bernstein
That is actually the current behavior of the search expression. The initial use cases from Streaming Expressions revolved around joins and rollups which really require the entire result set. So the search expression just merged the results from the shards and let the wrapping expression deal with

Re: CURL DELETE BLOB do not working in solr 7.3 cloud

2018-06-21 Thread Jason Gerlowski
Hi Maxence, Yes, unfortunately that's the wrong API to delete an item from the Blob Store. Items in the blob store are deleted like any other Solr document (i.e. either delete-by-id, or delete-by-query). This is mentioned quite obliquely in the Solr Ref Guide here:

Re: Delete By Query issue followed by Delete By Id Issues

2018-06-21 Thread sujatha sankaran
Thanks,Shawn. Our use case is something like this in a batch load of several 1000's of documents,we do a delete first followed by update.Example delete all 1000 docs and send an update request for 1000. What we see is that there are many missing docs due to DBQ re-ordering of the order of

Re: Solr basic auth

2018-06-21 Thread Jan Høydahl
Hi, As I said there is not way to combine multiple authentication plugins at the moment. So your best shot is probably to create your own CustomAuthPlugin where you implement the logic that you need. You can fork the code from BasicAuth and add the logic you need to whitelist the requests you

Search streaming expressions returns rows times number of shards docs

2018-06-21 Thread Alfonso Muñoz-Pomer Fuentes
I’m having a weird issue with the search streaming expressions and I’d like to share it before opening a ticket in Jira, just in case I’m missing something obvious. I’m currently on Solr 7.1 and I have a collection named bioentities split into two shards and no replicas. Whenever I run a query

Re: solr basic authentication

2018-06-21 Thread Christopher Schultz
Dinesh, On 6/21/18 11:40 AM, Dinesh Sundaram wrote: > is there any way to disable basic authentication for particular domain. i > have proxy pass from a domain to solr which is always asking credentials so > wanted to disable basic auth only for that domain. is there any way? I wouldn't

Re: Delete By Query issue followed by Delete By Id Issues

2018-06-21 Thread Shawn Heisey
On 6/21/2018 9:59 AM, sujatha sankaran wrote: > Currently from our business perspective we find that we are left with no > options for deleting docs in a batch load as : > > DBQ+ batch does not work well together > DBI+ custom routing (batch load / normal)would not work as well. I would

no stable results using morelikethis in distributed mode

2018-06-21 Thread guenterh.li...@bluewin.ch
Hi, I realize a weird behaviour I can't explain (so far we are still running in master/slave mode) Requesting the collection I see logs randomly against the two available shards "green_shard1_replica_n1" and "green_shard2_replica_n2" 2018-06-21 15:35:40.970 INFO (qtp1873653341-17) [c:green

Re: Indexing part of Binary Documents and not the entire contents

2018-06-21 Thread Erick Erickson
This may help you get started: https://lucidworks.com/2012/02/14/indexing-with-solrj/ Best, Erick On Thu, Jun 21, 2018 at 8:11 AM, Shawn Heisey wrote: > On 6/20/2018 9:05 AM, neotorand wrote: >> >> I have a specific Requirement where i need to index below things >> >> Meta Data of any document

Re: Delete By Query issue followed by Delete By Id Issues

2018-06-21 Thread sujatha sankaran
Thanks,Shawn. Currently from our business perspective we find that we are left with no options for deleting docs in a batch load as : DBQ+ batch does not work well together DBI+ custom routing (batch load / normal)would not work as well. We are not sure how we can proceed unless we don't

Re: Solr basic auth

2018-06-21 Thread Dinesh Sundaram
thanks for your valuable feedback. I really want to allow this domain without any credentials. i need basic auth only if anyone access the solr url directly. so no option in solr to do that? On Sun, Jun 17, 2018 at 4:18 PM, Jan Høydahl wrote: > Of course, but Dinesh explicitly set

solr basic authentication

2018-06-21 Thread Dinesh Sundaram
Hi, is there any way to disable basic authentication for particular domain. i have proxy pass from a domain to solr which is always asking credentials so wanted to disable basic auth only for that domain. is there any way? Thanks, Dinesh Sundaram.

Re: Trouble using the MIGRATE command in the collections API on solr 7.3.1

2018-06-21 Thread Matthew Faw
Hi Shawn, Thanks for your reply. According to the MIGRATE documentation, the split.key parameter is required, and removing it returns a missing parameter exception. I’ve tried setting the “split.key=DERP_”, and after doing that I still see no documents in the destination collection.

Re: Indexing part of Binary Documents and not the entire contents

2018-06-21 Thread Shawn Heisey
On 6/20/2018 9:05 AM, neotorand wrote: I have a specific Requirement where i need to index below things Meta Data of any document Some parts from the Document that matches some keywords that i configure The first part i am able to achieve through ERH or FilelistEntityProcessor. I am

Re: CloudSolrClient - setDefaultCollection

2018-06-21 Thread Shawn Heisey
On 6/21/2018 5:04 AM, Greenhorn Techie wrote: While indexing, is there going to be any performance benefit to set the collection name first using setDefaultCollection

Re: Trouble using the MIGRATE command in the collections API on solr 7.3.1

2018-06-21 Thread Shawn Heisey
On 6/21/2018 7:08 AM, Matthew Faw wrote: For background, I’m using solr version 7.3.1 and lucene version 7.3.1 I have a solr collection with 2 shards and 3 replicas using the compositeId router. Each solr document has “id” as its unique key, where each id is of format DERP_${X}, where ${X}

Trouble using the MIGRATE command in the collections API on solr 7.3.1

2018-06-21 Thread Matthew Faw
Hello, For background, I’m using solr version 7.3.1 and lucene version 7.3.1 I have a solr collection with 2 shards and 3 replicas using the compositeId router. Each solr document has “id” as its unique key, where each id is of format DERP_${X}, where ${X} is some 24 character alphanumerical

CloudSolrClient - setDefaultCollection

2018-06-21 Thread Greenhorn Techie
Hi, While indexing, is there going to be any performance benefit to set the collection name first using setDefaultCollection (String

Re: Drive Change for Solr Setup

2018-06-21 Thread Rahul Singh
If it’s windows it may be using a tool called NSSM to manage the solr service. Look at windows services and task scheduler and understand if solr services are being managed by windows via services or the task scheduler — or just .batch files. Rahul On Jun 20, 2018, 11:34 AM -0400, Shawn Heisey

Indexing Approach

2018-06-21 Thread solrnoobie
So we have to optimize our current implementation of our indexing. Our current implementation is to index per batch and each batch will have a query call from the database that will return multiple result sets and the application will be responsible in assembling the document based on the result

Re: How to split index more than 2GB in size

2018-06-21 Thread Michael Kuhlmann
Hi Sushant, while this is true in general, it won't hold here. If you split your index, searching on each splitted shard might be a bit faster, but you'll increase search time much more because Solr needs to send your search queries to all shards and then combine the results. So instead of having