Just noticed that FeatherCast has also put out three podcasts from Apache Big
Data Seville about Solr and SolrCloud; these hit my feed on Monday.
On Wednesday, November 23, 2016 12:55 PM, Charlie Hull
wrote:
Hi all,
We also blog about various Solr topics at
I'd consider parsing with Tika outside of Solr, here's a blog on the
topic. That gives you the most flexibility. See:
https://lucidworks.com/blog/2012/02/14/indexing-with-solrj/
Not quite what you asked for...
Best,
Erick
On Wed, Nov 30, 2016 at 5:36 AM, win harrington
Dear Apache enthusiast,
ApacheCon and Apache Big Data will be held at the Intercontinental in
Miami, Florida, May 16-18, 2017. Submit your talks, and register, at
http://apachecon.com/ Talks aimed at the Big Data section of the event
should go to
Prior discussion at
http://stackoverflow.com/questions/40877567/using-standardtokenizerfactory-with-currency
I'd like to maintain other aspects of the StandardTokenizer functionality,
but I'm wondering whether, to do what I want, the task boils down to being able to
instruct the StandardTokenizer not to
Hi Vinay,
You should be able to use a char filter to convert “$” characters into
something that will survive tokenization, and then a token filter to convert it
back.
Something like this (untested):
Hi Vijay,
Map user behaviors and offer intelligent suggestions to user segments (user
segment group 1, user segment group 2, etc.).
Define some ranking criteria and index based on the above data. Your application
layer should identify the user segment group from the user profile and give to
Hello searchers,
Looks like this is not possible, right?
It means I have to specify all the _route_ values in the request, and each shard
will try to look up all the ids internally.
Is there a way to specify it?
Like Elasticsearch does
In one of our environments, we have an issue where one shard has two
replicas with smaller document counts than the third one. This is on Solr
4.10.3 (Cloudera's build). We've found that shutting down the smaller
replicas, deleting their data folders and restarting one by one will do the
trick of
I have JPEG files with latitude and longitude in separate fields. When I run the
post tool, it stores the lat/lon in separate fields.
For geospatial search, Solr wants them combined into one field with the
format 'latitude,longitude'.
How can I combine lat+lon into one field?
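One possible approach, sketched here and not taken from the thread, is to merge the two values on the client side before the document is sent to Solr. The field names `latitude`, `longitude`, and `location` below are assumptions chosen for illustration; the real names depend on what the extraction step produces and on the schema.

```python
def combine_lat_lon(doc, lat_field="latitude", lon_field="longitude",
                    out_field="location"):
    """Return a copy of doc with out_field set to 'latitude,longitude'.

    The field names are hypothetical defaults, not names taken from the
    original message or from any particular Solr schema.
    """
    lat = float(doc[lat_field])
    lon = float(doc[lon_field])
    # Reject values outside the valid geographic ranges before indexing.
    if not (-90.0 <= lat <= 90.0 and -180.0 <= lon <= 180.0):
        raise ValueError(f"coordinates out of range: {lat}, {lon}")
    combined = dict(doc)  # leave the caller's document untouched
    combined[out_field] = f"{lat},{lon}"
    return combined


if __name__ == "__main__":
    photo = {"id": "img-1", "latitude": "40.5", "longitude": "-74.25"}
    print(combine_lat_lon(photo)["location"])
```

The combined field can then be indexed into whatever spatial field type the schema defines, while the original separate fields remain available.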
When should one use timeAllowed in Solr 5.5.3?
How much overhead is involved in it? Does it create a thread and a timer
event if you use it? Or is the timer event used even if you don't set it
explicitly?
We are thinking about using it for all requests. Just set it +10
milliseconds longer than
Hi All,
We have an issue in production, where we have a SolrCloud cluster with 3 nodes,
and one of the nodes is failing with the error below:
index fetcher
error recovery failed 1 of 500 attempts
no content received for file :tlog.0105987.1552
We have cdcr logging enabled, and
I have around 300,000 records to be uploaded to a SolrCloud suggester.
These records are dynamic, i.e. new documents will be added and some
documents will be deleted in the future on a regular basis. The problem I am
facing is either:
1. Use FileDictionaryFactory: this method is an operational
We use it on all requests. We are running 4.10.4 for most searches.
We set a pretty high timeAllowed, either five or thirty seconds, depending on
the service.
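For illustration only (this is not part of the original message), a per-service `timeAllowed` could be attached to each request roughly as below. `timeAllowed` is a Solr query parameter given in milliseconds; the base URL, collection name, and helper name are assumptions made up for this sketch.

```python
from urllib.parse import urlencode


def build_select_url(base_url, query, time_allowed_ms):
    """Build a /select URL that carries a timeAllowed limit in milliseconds.

    base_url is a hypothetical collection URL supplied by the caller.
    """
    params = {"q": query, "timeAllowed": int(time_allowed_ms)}
    return f"{base_url}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Five seconds for one service, thirty for another, as described above.
    base = "http://localhost:8983/solr/mycollection"  # assumed URL
    print(build_select_url(base, "solr", 5000))
    print(build_select_url(base, "solr", 30000))
```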
wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/ (my blog)
> On Nov 30, 2016, at 6:28 PM, William Bell