Re: Arabic words search in solr

2017-02-23 Thread Steve Rowe
Hi Mohan, I indexed your 9 examples as simple documents after mapping dynamic field “*_ar” to the “text_ar” field type: - [{"id":"1", "name_ar":"المؤسسة التجارية العمانية"}, {"id":"2", "name_ar":"شركة التأمين الأهلية ش.م.ع.م"}, {"id":"3", "name_ar":"شرطة عمان السلطانية - قيادة شرطة محافظة

Index Segments not Merging

2017-02-23 Thread Jordan Drake
We have solr with the index stored in HDFS. We are running MapReduce jobs to build the index using the MapReduceIndexerTool from Cloudera with the go-live option to merge into our live index. We are seeing an issue where the number of segments in the index never reduces. It continues to grow

Re: Setting Solr data dir isn't really working (6.3.0)

2017-02-23 Thread Walter Underwood
The bug is that the dataDir is /solr/data and the index data is in /apps/solr6/server/solr. Except for the suggest data. No index data should be outside the dataDir, right? wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Feb 23, 2017, at 6:11 PM,

Re: Setting Solr data dir isn't really working (6.3.0)

2017-02-23 Thread Erick Erickson
Not quite sure what you're complaint is. Is it that you've get the index directory under /solr/data and not under, say, /solr/data/tutors? Or that /apps/solr6/server/solr/tutors_shard1_replica5 exists at all? And what's in tutors_shard1_replica5 anyway? Just the core.properties file? Erick On

Setting Solr data dir isn't really working (6.3.0)

2017-02-23 Thread Walter Underwood
I did this in the solrconfig.xml for both collections (tutors and questions). /solr/data I deleted the old collection indexes, reloaded, restarted, and created a new collection for “tutors". And I see this on the disk. [wunder@new-solr-c02.test3]# ls -l /solr/data total 36 drwxr-xr-x 2 bin

Re: Question about best way to architect a Solr application with many data sources

2017-02-23 Thread Joel Bernstein
Alfresco has spent ten+ years building a content management system that follows this basic design: 1) Original bytes (PDF, Word Doc, image file) are stored in a filesystem based content store. 2) Meta-data is stored in a relational database, normalized. 3) Content is transformed to text and

Re: SOLRCloud on 6.4 on Ubuntu

2017-02-23 Thread Erick Erickson
Getting configs up (and down) from solr is done either with zkCli or bin/solr. Personally I find the latter easier if only because it's in a single place. Try bin/solr zk -help and you'll see a bunch of options. Once you do upload the config, you must reload the collection for it to "take". Best,

Subsciption to group

2017-02-23 Thread kurtuluş yılmaz
Hi; I want to be part of solr user group. Can you add me.

Re: SOLRCloud on 6.4 on Ubuntu

2017-02-23 Thread Alexandre Rafalovitch
I don't know which of these you read, so it is a bit of a grab bag. And I haven't reviewed some of them in depth. But hopefully, there is a nugget of gold somewhere in there for you: https://github.com/LucidWorks/solr-scale-tk

Phrase field matches not counting towards minimum match

2017-02-23 Thread dboychuck
Ok let me explain what I am trying to do first since there may be a better approach. Recently I had been trying to increase solr's matching precision by requiring that all of the words in a field match before allowing a match on a field. I am using edismax as my query parser and since it tokenizes

SOLRCloud on 6.4 on Ubuntu

2017-02-23 Thread Pouliot, Scott
I'm trying to find a good beginner level guide to setting up SolrCloud NOT using the example configs that are provided with SOLR. Here are my goals (and the steps I have done so far!): 1. Use an external Zookeeper server a. wget

Re: Interval Facets with JSON

2017-02-23 Thread Tomás Fernández Löbbe
Hi Deniz, Interval Facets is currently not supported with JSON Facets as Tom said. Could you create a Jira issue? On Fri, Feb 10, 2017 at 6:16 AM, Tom Evans wrote: > On Wed, Feb 8, 2017 at 11:26 PM, deniz wrote: > > Tom Evans-2 wrote > >> I

Re: Replicas fail immediately in new collection

2017-02-23 Thread Walter Underwood
I finally figured this out yesterday. Because the jar files have the version in the file name, I had a mix of jars from different versions. Depending on the load order, Solr could get into a situation where it was calling something that didn’t exist. That was mysterious. wunder Walter

Re: Replicas fail immediately in new collection

2017-02-23 Thread Shalin Shekhar Mangar
SOLR-9739 changed the writeStr method to accept a CharSequence from String in 6.4 so my guess is that your classpath has a newer (6.4+) solrj version but an older solr-core jar that cannot find this new method. On Sat, Feb 18, 2017 at 5:16 AM, Walter Underwood

maxwarmingSearchers and memory leak

2017-02-23 Thread SOLR4189
We have maxwarmingSearchers set to 2 and field value cache set to initial size of 64. We saw that by taking a heap dump that our caches consume 70% of the heap size, by looking into the dump we saw that fieldValueCache has 6 occurences of org.apache.solr.util.concurrentCache. When we have