Re: Solr 6 managed-schema & version control

2016-07-27 Thread John Bickerstaff
Erick - the UI you mention -- something that exists or something that has to be built? (I'm upgrading to version 6 as well and this question is one I'll have to deal with...) On Wed, Jul 27, 2016 at 5:31 PM, Rachid Bouacheria wrote: > Thank you very much Erick, I appreciate

Re: Uploading all files under directory with no extension

2016-07-27 Thread Alexandre Rafalovitch
I believe the extensions are normally used for type guessing. You could try explicitly specifying the files if they are in one directory and you only have non-extension files. Or you could do a find|grep -v|xargs -n command sequence to find whatever you need and feed it to the post script. Regards,
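
The find|grep -v|xargs idea above can also be sketched in Python. This is only a minimal sketch and not something posted in the thread: the helper names `extensionless_files` and `build_post_command` are hypothetical, and the `bin/post` flags (`-c`, `-type`, `-params`) are simply the ones used in the original question. Whether the post tool accepts explicitly listed extensionless files this way is an assumption to verify against your Solr version.

```python
from pathlib import Path


def extensionless_files(root):
    """Return sorted paths of regular files under root whose name has no extension.

    Note: dotfiles such as ".hidden" have an empty suffix and are included too.
    """
    return sorted(
        str(p) for p in Path(root).rglob("*")
        if p.is_file() and p.suffix == ""
    )


def build_post_command(collection, files,
                       content_type="text/tsv", params="separator=%09"):
    """Build an argv list for bin/post naming each file explicitly (hypothetical helper)."""
    return ["./bin/post", "-c", collection,
            "-type", content_type, "-params", params, *files]
```

The resulting list could be handed to `subprocess.run(...)` from the Solr install directory. For very large directories it may be safer to pass the files in batches to stay under the OS argument-length limit.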

Re: Solr 6 managed-schema & version control

2016-07-27 Thread Rachid Bouacheria
Thank you very much Erick, I appreciate your feedback. On Wed, Jul 27, 2016 at 2:24 PM, Erick Erickson wrote: > Using classic schema is perfectly acceptable/reasonable, you can > continue to do so freely (you'll have to change to > ClassicSchemaFactory though). > >

Re: AnalyticsQuery fails on a sharded collection

2016-07-27 Thread Joel Bernstein
The finish() method operates on the search node, not the aggregator node. So whether it's distributed shouldn't affect how it runs. If you can post your code I might be able to see the issue. As far as using a MergeStrategy, I would suggest creating a streaming expression that handles the merge.

Is there a way to filter the results based on weight - SOLR suggester?

2016-07-27 Thread bbarani
Hi, I am using the suggester component in SOLR 5.5.1 and sorting the matching suggestions based on a custom field (lookupCount). The below configuration seems to work fine, but it's returning the matching term even if the weight is set to 0. Is there a way to restrict returning the matching term

AnalyticsQuery fails on a sharded collection

2016-07-27 Thread tedsolr
I'm looking to create a merge strategy for a custom QParserPlugin I have. The plugin works fine on collections with one shard. I was very surprised to see it throw an exception when I ran it against a sharded collection. So my question is a bit of a shot in the dark. I'll first note that the

Re: Solr 6 managed-schema & version control

2016-07-27 Thread Erick Erickson
Using classic schema is perfectly acceptable/reasonable, you can continue to do so freely (you'll have to change to ClassicSchemaFactory though). Also, you can freely edit managed-schema just as you did schema.xml. The "trick" here is that you have to take some care _not_ to issue commands that

Solr 6 managed-schema & version control

2016-07-27 Thread Rachid Bouacheria
Hi All, I am upgrading from solr 4 to 6. In solr 4 I have a schema.xml that is under version control. But solr 6 has the notion of a managed schema that could be modified via a solr api call. This seems great and flexible, but my assumption is that in this case zookeeper becomes the authoritative

Re: SolrCloud create_collection not uploading configs to zookeeper

2016-07-27 Thread Nirav Patel
Got it. That makes sense. Thanks On Wed, Jul 27, 2016 at 9:08 AM, Shawn Heisey wrote: > On 7/26/2016 5:30 PM, Nirav Patel wrote: > > OK, I can see '/configs' directory in Solr UI and under that I can see > > configuration for my 'test' collection. But this all seemed to be >

SolrCloud: Failure to recover on restart following OutOfMemoryError

2016-07-27 Thread Kelly, Frank
Hi All, We have a SolrCloud cluster with 3 Virtual Machines, assigning 4GB to the Java Heap. Recently we added a number of collections to the machine, going from around 80 collections (each with 3 shards x 3 replicas) to 150 collections. We've hit Heap errors. That wasn't the surprise, the

Uploading all files under directory with no extension

2016-07-27 Thread Nirav Patel
I tried using the post tool with the following parameters. Looks like it's not uploading files if they don't have a known extension. ./bin/post -c mycol1 -params "separator=%09" -type text/tsv -filetypes tsv /dev/datascience/pod1/population/baseline/ /usr/java/jdk1.8.0_102//bin/java -classpath

Re: SolrCloud create_collection not uploading configs to zookeeper

2016-07-27 Thread Shawn Heisey
On 7/26/2016 5:30 PM, Nirav Patel wrote: > OK, I can see '/configs' directory in Solr UI and under that I can see > configuration for my 'test' collection. But this all seemed to be > disjointed information. Doc is definitely not clear. And what that > Tree represent anyway where's information for

Re: problems with bulk indexing with concurrent DIH

2016-07-27 Thread Bernd Fehling
If there is a problem in a single index then it might also be in CloudSolr. As far as I could figure out from INFOSTREAM, documents are added to segments and terms are "collected". Duplicate terms are "deleted" (or whatever). These deletes (or whatever) are not concurrent. I have lines like: BD 0

Solr: Block Join Faceting

2016-07-27 Thread Morse, Matthew K.
Sorry for the double post.

Solr: Block Join Faceting

2016-07-27 Thread Morse, Matthew K.
I have a document with multiple different children. Below is the structure: 1 true 11 false 22 6 12 false 11 7276 13 false 100419 14 false 245 2 true

Re: problems with bulk indexing with concurrent DIH

2016-07-27 Thread Erick Erickson
Well, at least it'll be easier to debug in my experience. Simple example. At some point you'll call CloudSolrClient.add(doc list). Comment just that out and you'll be able to isolate whether the issue is querying the be or sending to Solr. Then CloudSolrClient (assuming SolrCloud) has

Re: problems with bulk indexing with concurrent DIH

2016-07-27 Thread Bernd Fehling
So writing some SolrJ doing the same job as the DIH script and using that concurrently will solve my problem? I'm not using Tika. I don't think that DIH is my problem, even if it is not the best solution right now. Nevertheless, you are right, SolrJ has higher performance, but what if I have the

Re: Is it possible to force a Shard Leader change?

2016-07-27 Thread Erick Erickson
The REBALANCELEADERS stuff was put in to deal with 100s of leaders winding up on a single machine in a case where extremely high throughput was required. Until you get into pretty high scale the additional "work" on a leader is minimal. So unless your CPU usage is consistently significantly higher

Re: problems with bulk indexing with concurrent DIH

2016-07-27 Thread Erick Erickson
I'd actually recommend you move to a SolrJ solution or similar. Currently, you're putting a load on the Solr servers (especially if you're also using Tika) in addition to all indexing etc. Here's a sample: https://lucidworks.com/blog/2012/02/14/indexing-with-solrj/ Dodging the question I know,

Re: How to configure solr while having Apostrophes in fields

2016-07-27 Thread Erick Erickson
I'd _strongly_ recommend you become familiar with the admin>>(your core)>>analysis page. It tells you exactly what each filter does to your input and makes it much simpler to answer questions like this. Hover over each of the gray letter pairs (e.g. "SF" will be gray, hover over it and you'll see

problems with bulk indexing with concurrent DIH

2016-07-27 Thread Bernd Fehling
After enhancing the server with SSDs I'm trying to speed up indexing. The server has 16 CPUs and more than 100G RAM. JAVA (1.8.0_92) has 24G. SOLR is 4.10.4. Plain XML data to load is 218G with about 96M records. This will result in a single index of 299G. I tried with 4, 8, 12 and 16 concurrent

How to configure solr while having Apostrophes in fields

2016-07-27 Thread nitin.garg88
When I search for "plato" it returns all records with "plato,platos,plato's". When I search for "platos" it returns all records with "platos,plato's". When I search for "plato's" it returns all records with "platos,plato's". Please suggest how to configure schema.xml. Below is my "text"

RE: Query Solr

2016-07-27 Thread Kostas
There are some examples on the web for this: http://yonik.com/solr/query-syntax/ http://stackoverflow.com/questions/634765/using-or-and-not-in-solr-query If you are using .NET, maybe also try SolrNet. Maybe those help. -Original Message- From: Hardika Catur S

Re: The Query Elevation Component

2016-07-27 Thread Alessandro Benedetti
Hi Ryan, can you explain this? "I'd like the search request to search multiple fields, but only elevate if the query is found in one of the fields." You mean that you want to apply the elevation component only if the user selected a particular field in the query? If I remember well, you have

Re: Solr 5.5.2 mm parameter not working the same

2016-07-27 Thread elisabeth benoit
oh sorry wrote too fast. had to change the defaultOperator to OR. Elisabeth 2016-07-27 10:11 GMT+02:00 elisabeth benoit : > > Hello, > > We are migrating from solr 4.10.1 to solr 5.5.2, and it seems that the mm > parameter is not working the same anymore. > > In fact,

SOLR-7036 - Faster method for group.facet - new patch for trunk

2016-07-27 Thread danny teichthal
Hi, SOLR-7036 introduced a new faster method for group.facet, which uses UnInvertedField. It was patched for version 4.x. Over the last week, my colleague uploaded a new patch that works against the trunk. We would really appreciate it if anyone could take a look at it and give us some feedback about

Solr 5.5.2 mm parameter not working the same

2016-07-27 Thread elisabeth benoit
Hello, We are migrating from solr 4.10.1 to solr 5.5.2, and it seems that the mm parameter is not working the same anymore. In fact, as soon as there is a word not in the index in the query, no matter what mm value I send, I get no answer as if my query is a pure AND query. Does anyone have a

Query Solr

2016-07-27 Thread Hardika Catur S
Hi, I want to create a query across multiple collections in Solr, equivalent to this MySQL query: "SELECT colection1.field_colection1 FROM colection1 WHERE colection1.field_colection1 NOT IN (SELECT colection2.field_colection2 FROM colection2);". But I find it difficult to create that query. Please
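
For reference, the SQL above is an anti-join: it keeps the values of one field in the first collection that never appear in a field of the second. The following is a minimal sketch in plain Python of the result such a query should produce, assuming the field values have already been fetched client-side from both collections. It is not a Solr query, the function name `values_not_in` is hypothetical, and SQL's special NULL handling in `NOT IN` is ignored.

```python
def values_not_in(collection1_values, collection2_values):
    """Return values from the first iterable that are absent from the second.

    Order and duplicates of the first iterable are preserved, like rows
    surviving a WHERE ... NOT IN (...) filter.
    """
    excluded = set(collection2_values)  # set lookup keeps the filter linear
    return [v for v in collection1_values if v not in excluded]
```

For example, `values_not_in(["a", "b", "c"], ["b"])` keeps `"a"` and `"c"`. This is mainly useful as a way to check the output of whatever Solr-side approach is eventually chosen.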