Re: _childDocuments_ automatically multivalued field type

2018-07-02 Thread jeebix
Ok, I'll have a look at the link above. Thanks a lot... Best JB -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: _childDocuments_ automatically multivalued field type

2018-07-02 Thread jeebix
Ok, I see what I have to look for, thanks to your reply. I'll adjust the schema and see difference. Thanks. Best JB -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

please unsubscribe

2018-07-02 Thread Karl Hampel

Re: Managed Schemas and Version Control

2018-07-02 Thread Zimmermann, Thomas
Thanks all! I think we will maintain our current approach of hand editing the configs in git and implement something at the shell level to automate the process of running upconfig and performing a core reload.

Override a single value in a Config Set

2018-07-02 Thread Zimmermann, Thomas
Hi, We have several cores with identical configurations with the sole exception being the language of their document sets. I'd like to leverage Config Sets to manage the going forward, but ran into two issues I'm struggling to solve conceptually. Sample Cores: our_documents our_documents_de ou

Re: Solrcloud collection sharding and querying

2018-07-02 Thread Sushant Vengurlekar
We have two collections which are 21G and constantly growing. The index on one of them is also 12G. I am trying to see how sharding can be employed to improve the query performance by adding the route to a shard based on a field in schema.xml. So I am trying to figure out how to split the collectio

Block Join Child Query returns incorrect result

2018-07-02 Thread kristaclaire14
Hi, I'm having a problem in my solr when querying third level child documents. I want to retrieve parent documents that have specific third level child documents. The example data is: [{ "id":"1001" "path":"1.Project", "Project_Title":"Sample Project", "_childDocuments_":[

Re: Solrcloud collection sharding and querying

2018-07-02 Thread Erick Erickson
This seems like an "XY problem". _Why_ do you want to do this? Has your collection outgrown one shard and you feel you have to split it? Sharding should only be used when you can't host your entire collection on a single replica and still get adequate performance. When you do reach that point, the

Re: Solr 7.1.0 - NoNode for /collections

2018-07-02 Thread Shawn Heisey
On 7/2/2018 5:57 PM, Joe Obernberger wrote: > Just to add to this - looks like the only valid replica that is > remaining is a TLOG type, and I suspect that is why it no longer has a > leader.  Poop. A replica of that type (TLOG) should be capable of becoming leader.  The PULL replica type is the

Solrcloud collection sharding and querying

2018-07-02 Thread Sushant Vengurlekar
I want to split a collection based on one field. How do I do it and then query based off that. Ex: collection1. Field to split off col1 Thank you

Re: CDCR Custom Document Routing

2018-07-02 Thread Jay Potharaju
Solr cdcr : https://issues.apache.org/jira/browse/SOLR-12380 deletebyid: https://issues.apache.org/jira/browse/SOLR-8889 Thanks Jay Potharaju On Mon, Jul 2, 2018 at 5:41 PM Jay Potharaju wrote: > Hi Amrit, > I am using a curl command to send a request to solr for deleting > documents. That is

Re: CDCR Custom Document Routing

2018-07-02 Thread Jay Potharaju
Hi Amrit, I am using a curl command to send a request to solr for deleting documents. That is because deleteById does not work for collections setup with implicit routing. curl http:/localhost:8983/solr/test_5_replica2/update/json/ -H 'Content-type:application/json/docs' -d '{ "delete": {"id":"doc

Re: CDCR Custom Document Routing

2018-07-02 Thread Amrit Sarkar
Jay, Can you sample delete command you are firing at the source to understand the issue with Cdcr. On Tue, 3 Jul 2018, 4:22 am Jay Potharaju, wrote: > Hi > The current cdcr setup does not work if my collection uses implicit > routing. > In my testing i found that adding documents works without

Re: Creating single CloudSolrClient object which can be used throughout the application

2018-07-02 Thread Shawn Heisey
On 7/2/2018 7:35 AM, Ritesh Kumar wrote: > I have got a static method which returns CloudSolrClient object if Solr is > running in Cloud mode and HttpSolrClient object otherwise. Declare that method as synchronized, so that multiple usages do not step on each other's toes.  This will also eliminat

Re: Can't recover - HDFS

2018-07-02 Thread Shawn Heisey
On 7/2/2018 1:40 PM, Joe Obernberger wrote: > Hi All - having this same problem again with a large index in HDFS.  A > replica needs to recover, and it just spins retrying over and over > again.  Any ideas?  Is there an adjustable timeout? > > Screenshot: > http://lovehorsepower.com/images/SolrShot

Re: A user defined request handler is failing to fetch the data.

2018-07-02 Thread Shawn Heisey
On 7/2/2018 12:58 AM, Adarsh_infor wrote: > Yes am going to have the shards on 6 different servers which will be later > called in my searchHandler by specifying the shards list. But for that > initially i was testing the filesearch with the single shard which was > suppose to work. I know solr c

Resources for Monitoring Cassandra, Spark, Solr

2018-07-02 Thread Rahul Singh
Folks, We often get questions on monitoring here so I assembled this post with articles from those in the community as well as links to the component tools to give folks a more comprehensive listing. https://blog.anant.us/resources-for-monitoring-datastax-cassandra-spark-solr-performance/ This i

Re: Solr 7.1.0 - NoNode for /collections

2018-07-02 Thread Joe Obernberger
Just to add to this - looks like the only valid replica that is remaining is a TLOG type, and I suspect that is why it no longer has a leader.  Poop. -Joe On 7/2/2018 7:54 PM, Joe Obernberger wrote: Hi - On startup, I'm getting the following error.  The shard had 3 replicas, but none are sel

Solr 7.1.0 - NoNode for /collections

2018-07-02 Thread Joe Obernberger
Hi - On startup, I'm getting the following error.  The shard had 3 replicas, but none are selected as the leader.  I deleted one, and adding a new one back, but that had no effect, and at times the calls would timeout.  I was having the same issue with another shard on the same collection and d

CDCR Custom Document Routing

2018-07-02 Thread Jay Potharaju
Hi The current cdcr setup does not work if my collection uses implicit routing. In my testing i found that adding documents works without any problems. It doesn't seem to work correctly when deleting documents. Is there an alternative to cdcr that would work in cross data center scenario. Setup: 8

Re: Creating single CloudSolrClient object which can be used throughout the application

2018-07-02 Thread Ritesh Kumar
Yes, the client object is closed each time. The bulk indexing service calls the service which, let's say, indexes all the orders from the database. So, a service is called *asynchronously* from within the bulk service which indexes order related data individually for each order. There may be more

Can't recover - HDFS

2018-07-02 Thread Joe Obernberger
Hi All - having this same problem again with a large index in HDFS.  A replica needs to recover, and it just spins retrying over and over again.  Any ideas?  Is there an adjustable timeout? Screenshot: http://lovehorsepower.com/images/SolrShot1.jpg Thank you! -Joe Obernberger

Re: NgramTokenizerFactory question

2018-07-02 Thread Alexandre Rafalovitch
I am not familiar with Lucene method to create analyzer. Perhaps it was already doing just analyzes phase. But here is what the NGram would do to a string of '123456' with just trigrams: 123 234 345 456 So, if you only apply it on the index side, and your query is '2345' - there is no such token i

Re: NgramTokenizerFactory question

2018-07-02 Thread Kudrettin Güleryüz
> 1) if you want face to match interface, you need max value to be at least 4. Can you please explain this a bit more? I am not following this one. Values are set to 3,3 and Solr already matches interface and interfaces when searched for face. In addition to that Solr matches the trigrams of face

Scores with Solr Suggester

2018-07-02 Thread Buckler, Christine
Is it possible to return a score field for Suggester results like it does with standard search? I am looking for the score which quantifies how close of a match between type entered and suggestion result (not the weight associated with the suggestion). Is this possible? Christine Buckler [id:im

Re: _childDocuments_ automatically multivalued field type

2018-07-02 Thread Shawn Heisey
On 7/2/2018 9:18 AM, jeebix wrote: > I don't understand why for example "type_cmd_s" get the field type attribute > "singleValued", but "TTC" or "kits_sans_suite" get "multiValued" attribute ? > Why those field are in the managed-schema and enseigne_s (for example) is > not ? The field named ensei

Re: NgramTokenizerFactory question

2018-07-02 Thread Alexandre Rafalovitch
Two things: 1) if you want face to match interface, you need max value to be at least 4. 2) you probably have the factory symmetrically or on Query analyzer. You probably want it on Index analyzer side only. Otherwise you are trying to match any 3-letter query substring against yoir index. Admin U

Re: _childDocuments_ automatically multivalued field type

2018-07-02 Thread Alexandre Rafalovitch
Because your _s fields must be mapping to the dynamicField definition and are created accordingly in the schema dynamically without needing a special definition for each field. The TTC field you did map explicitly, perhaps with "schemaless" mapping autodiscovery. Which does create specific field d

Re: NgramTokenizerFactory question

2018-07-02 Thread Kudrettin Güleryüz
It is correct that a search string causes following query to be generated: +(field:fac field:ace) Hence the results... However, I fail to see how (fac OR ace) is a relevant query, shouldn't it be +field:fac +field:ace instead? What is the suggested way to change this this behaviour? On Mon, Jul 2

Re: NgramTokenizerFactory question

2018-07-02 Thread Erick Erickson
Take a look at two things: 1> the admin/analysis page. This is probably mostly a sanity check to insure you're seeing what you expect. 2> add debug=query to the query and look at the parsed query. My bet is that the grams are being OR'd together and your search term is effectively fac OR ace

_childDocuments_ automatically multivalued field type

2018-07-02 Thread jeebix
Hello everybody, I have a problem with some field types in the managed-schema generated. First, the data SOLR returned with a standard query : response":{"numFound":365567,"start":0,"docs":[ { "id":"560.561.134676", "parent_i":560, "asso_i":561, "personne_i"

NgramTokenizerFactory question

2018-07-02 Thread Kudrettin Güleryüz
Hi, When using NgramTokenizerFactory with settings min ngram size=3 and max ngram size=3 I get the following behaviour. Assume that search term is, face I expect the results to show documents with strings: * interface or * face or * faceted but not * ace or * fac Why would I get the matches wi

Re: Server refused connection at: http://localhost:xxxx/solr/collectionName

2018-07-02 Thread Erick Erickson
Given your other e-mail I suspect you're not closing the client and creating new ones for every update request. You should simply not run out of connections, your client is most probably incorrect. Best, Erick On Mon, Jul 2, 2018 at 3:38 AM, Ritesh Kumar wrote: > I could get the live Solr nodes

Re: CursorMarks and 'end of results'

2018-07-02 Thread Erick Erickson
OK, that makes sense then. I don't think we've mentioned streaming as an alternative. It has some restrictions (it can only export docValues), and frankly I don't really remember how much of it was in 5.5 so you'll have to check. Streaming is designed exactly to, well, stream the entire result se

Re: Creating single CloudSolrClient object which can be used throughout the application

2018-07-02 Thread Erick Erickson
It's recommended to use one object of course. That said, you should not be having a connection problem just because you create new ones all the time. Are you closing it after you're done with it each time? As to your question about how to reuse the same one, the "singleton pattern" is one solution

Creating single CloudSolrClient object which can be used throughout the application

2018-07-02 Thread Ritesh Kumar
Hello Team, I have got a static method which returns CloudSolrClient object if Solr is running in Cloud mode and HttpSolrClient object otherwise. When running bulk indexing service, this method is called from within the indexing service to get the appropriate client object. Each time, this method

Re: CursorMarks and 'end of results'

2018-07-02 Thread David Frese
Am 29.06.18 um 17:42 schrieb Erick Erickson: bq. It basically cuts down the search time in half in the usual case for us, so it's an important 'feature'. Wait. You mean that the "extra" call to get back 0 rows doubles your query time? That's surprising, tell us more. How many times does your "u

Re: /replication?command=details does not show infos for all replicas on the core

2018-07-02 Thread Arturas Mazeika
Hi Shawn, hi Erick, hi et al., Very nice clarifications indeed. I also looked at the index replication section. In addition to the clarifications in this thread this brought quite some light into the area (and shows that I need to read solrcloud part of the manual more extensively). Thanks a lot i

Running Solr on Aws S3

2018-07-02 Thread Taher Koitawala
Hi All, Has anyone here tried to run solr on S3? I found a page here which describes how you can run solr on S3. I followed the link, however, i get the following exception Class com.amazon.ws

Re: Server refused connection at: http://localhost:xxxx/solr/collectionName

2018-07-02 Thread Ritesh Kumar
I could get the live Solr nodes using this piece of code ZkStateReader zkStateReader = client.getZkStateReader(); ClusterState clusterState = zkStateReader.getClusterState(); Set liveNodes = clusterState.getLiveNodes(); This way, I will be able to send a query to one of the live nodes

Re: Server refused connection at: http://localhost:xxxx/solr/collectionName

2018-07-02 Thread Ritesh Kumar
I did use CloudSolrClient to query or index data. I did not have to check which Solr node is active. The problem I am facing during bulk indexing is that the Zookeeper runs out of connections resulting in Connection Timeout error. How can I get to know in advance the active Solr nodes? Any referen

Re: Server refused connection at: http://localhost:xxxx/solr/collectionName

2018-07-02 Thread Yasufumi Mizoguchi
Hi, I think ZooKeeper can not notice requests to dead nodes, if you send requests to Solr nodes directly. It will be better that asking ZooKeeper which Solr nodes will be running before requesting Solr nodes with CloudSolrClient etc... Thanks, Yasufumi 2018年7月2日(月) 16:49 Ritesh Kumar : > Hello

Server refused connection at: http://localhost:xxxx/solr/collectionName

2018-07-02 Thread Ritesh Kumar
Hello Team, I have two Solr nodes running in cloud mode. I know that we send queries and updates directly to Solr's collection e.g.http://host: port/solr/. Any of the Solr nodes can be used. If the node does not have the collection being queried then the request will be forwarded internally to a S