Re: Solr/lucene "planet" + recommendations for blogs to follow

2016-11-30 Thread hairymcclarey
Just noticed that FeatherCast have also put out three podcasts from Apache Big Data Seville about Solr and SolrCloud, these hit my feed on Monday. On Wednesday, November 23, 2016 12:55 PM, Charlie Hull wrote: Hi all, We also blog about various Solr topics at

Re: insert lat/lon from jpeg into solr

2016-11-30 Thread Erick Erickson
I'd consider parsing with Tika outside of Solr, here's a blog on the topic. That gives you the most flexibility. See: https://lucidworks.com/blog/2012/02/14/indexing-with-solrj/ Not quite what you asked for... Best, Erick On Wed, Nov 30, 2016 at 5:36 AM, win harrington

Save the date: ApacheCon Miami, May 15-19, 2017

2016-11-30 Thread Rich Bowen
Dear Apache enthusiast, ApacheCon and Apache Big Data will be held at the Intercontinental in Miami, Florida, May 16-18, 2017. Submit your talks, and register, at http://apachecon.com/ Talks aimed at the Big Data section of the event should go to

How to use the StandardTokenizer with currency

2016-11-30 Thread Vinay B,
Prior discussion at http://stackoverflow.com/questions/40877567/using-standardtokenizerfactory-with-currency I'd like to maintain other aspects of the StandardTokenizer functionality but I'm wondering if to do what I want, the task boils down to be able to instruct the StandardTokenizer not to

Re: How to use the StandardTokenizer with currency

2016-11-30 Thread Steve Rowe
Hi Vinay, You should be able to use a char filter to convert “$” characters into something that will survive tokenization, and then a token filter to convert it back. Something like this (untested):

RE: boost solr results using users behavior

2016-11-30 Thread Srinivasa Meenavalli
Hi Vijay, Map user behaviors and offer intelligent suggestions to a user segments ( user segment group1 ,User segment group2 etc). Define some ranking criteria and index based on above data . Your application layer should identify the user segment group from the user profile and give to

Realtime multi get with different (_route_, fields, etc) for each id

2016-11-30 Thread Dorian Hoxha
Hello searchers, Looks like this is not possible, right ? It means I have to specify all the _route_ in the request, and each shard will try to lookup all the ids internally. Is there a way to specify it ? Like elasticsearch does

Replica document counts out of sync

2016-11-30 Thread Mike Thomsen
In one of our environments, we have an issue where one shard has two replicas with smaller document counts than the third one. This is on Solr 4.10.3 (Cloudera's build). We've found that shutting down the smaller replicas, deleting their data folders and restarting one by one will do the trick of

insert lat/lon from jpeg into solr

2016-11-30 Thread win harrington
I have jpeg files with latitude and longitudein separate fields. When I run the post tool,it stores the lat/lon in separate fields. For geospatial search, Solr wants themcombined into one field with the format'latitude,longitude'. How can I combine lat+lon into one field?

timeAllowed?

2016-11-30 Thread William Bell
When should one use timeAllowed in Solr 5.5.3? How much overhead is involved in it? Does it create a thread and a timer event if you use it? Or is the timer event used even if you don't set it explicitly? We are thinking about using it for all requests. Just set it +10 milliseconds longer than

Recovery failed in SOLR cloud - SOLR 6

2016-11-30 Thread Bharath Kumar
Hi All, We have an issue in production, where we have the SOLR cloud with 3 nodes and in that on one of the node is failing with the below error:- index fetcher error recovery failed 1 of 500 attempts no content received for file :tlog.0105987.1552 We have cdcr logging enabled, and

Optimizing documentdictionary build on solr cloud suggester

2016-11-30 Thread diwakar bhardwaj
I have around 300,000 records to be uploaded on a solr cloud suggester. These records are dynamic i.e. new documents will be added and some document will be deleted in future on a regular basis. The problem I am facing is either: 1. Use FileDictionaryFactory: this method is an operational

Re: timeAllowed?

2016-11-30 Thread Walter Underwood
We use it on all requests. We are running 4.10.4 for most searches. We set a pretty high timeAllowed, either five or thirty seconds, depending on the service. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Nov 30, 2016, at 6:28 PM, William Bell