Hi,
I have got a searcher server replicating index from master server. Recently
I have noticed the huge difference in the index size between master and
slave followed by LockObtainFailedException in catalin.out log. When I
debugged the searcher index folder, I could see more that 100 segement_N
: segment number during optimize of index
|
| I have an other question, does the number of segment affect speed for
| update index?
|
| 2012/10/10 jame vaalet jamevaa...@gmail.com
|
| Guys,
| thanks for all the inputs, I was continuing my research to know
| more about
| segments
Monitoring - http://sematext.com/spm/index.html
On Fri, Oct 5, 2012 at 4:27 PM, jame vaalet jamevaa...@gmail.com wrote:
Hi Eric,
I am in a major dilemma with my index now. I have got 8 cores each
around
300 GB in size and half of them are deleted documents in it and above
that
each has
?
thanks
.
On 5 October 2012 05:46, Shawn Heisey s...@elyograg.org wrote:
On 10/4/2012 3:22 PM, jame vaalet wrote:
so imagine i have merged the 150 Gb index into single segment, this would
make a single segment of 150 GB in memory. When new docs are indexed it
wouldn't alter this 150 Gb index
.
Best
Erick
On Fri, Oct 5, 2012 at 1:08 PM, jame vaalet jamevaa...@gmail.com wrote:
hi Shawn,
thanks for the detailed explanation.
I have got one doubt, you said it doesn matter how many segments index
have
but then why does solr has this merge policy which merges segments
frequently
18:29, Otis Gospodnetic otis.gospodne...@gmail.comwrote:
Hi,
Look for the word merge in solrconfig.xml :)
Otis
--
Performance Monitoring - http://sematext.com/spm
On Oct 4, 2012 7:50 AM, jame vaalet jamevaa...@gmail.com wrote:
Hi,
I would like to know the different merge policies
, Oct 4, 2012 at 9:14 AM, jame vaalet jamevaa...@gmail.com wrote:
Thats the first thing i tried, but it had only merge factor and
maxmergedocs in it. We have different merge policies like
LogMergePolicy
http://lucene.apache.org/core/old_versioned_docs/versions/3_4_0/api/core/org/apache
to the index anyway.
Otis
--
Search Analytics - http://sematext.com/search-analytics/index.html
Performance Monitoring - http://sematext.com/spm/index.html
On Thu, Oct 4, 2012 at 4:20 PM, jame vaalet jamevaa...@gmail.com wrote:
Hi,
I was about to do optimize on my index which has got around
hi,
in my use case i have list of key value pairs in each document object, if i
index them as separate index fields then in the result doc object i will get
two arrays corresponding to my keys and values. The problem i face here is
that there wont be any mapping between those keys and values.
do
Original-Nachricht
Datum: Mon, 24 Oct 2011 17:11:49 +0530
Von: jame vaalet jamevaa...@gmail.com
An: solr-user@lucene.apache.org
Betreff: indexing key value pair into lucene solr index
hi,
in my use case i have list of key value pairs in each document object, if
i
index them
hi,
i was wondering if there is any method to get back the term vector list from
solr through solr.NET?
from the source code for SOLR.NET i couldn't notice any term vector parser
in SOLR.NET .
--
-JAME
? Appending
sort=_docid_ asc to the query? When you say
client I assume you're talking SolrJ, and I'm pretty
sure that SolrQuery.setSortField is what you want.
I suppose you could also set this as the default in your
query handler.
Best
Erick
On Tue, Aug 23, 2011 at 4:43 AM, jame vaalet jamevaa
okey, so this is something i was looking for .. the default order of result
docs in lucene\solr ..
and you are right, since i don care about the order in which i get the docs
ideally i shouldn't ask solr to do any sorting on its raw result list ...
though i understand your point, how do i do it as
hi.
Is it possible to say fq=score[1 TO *]
i have tried but solr is throwing error ? can this be done with some other
syntax ?
--
-JAME
On Mon, Aug 22, 2011 at 7:28 AM, jame vaalet jamevaa...@gmail.com wrote:
hi.
Is it possible to say fq=score[1 TO *]
i have tried but solr is throwing error ? can this be done with some
other
syntax ?
--
-JAME
--
-JAME
thanks hoss... thats a real good explanation ..
well, don care about the sort order i just want all of the docs .. and yes
score values may be duplicated which will deteriorate my search
performance...
before going into lucene doc id , i have got creationDate datetime field in
my index which i
#queryResultWindowSize
On Fri, Aug 19, 2011 at 10:34 AM, jame vaalet jamevaa...@gmail.com
wrote:
wiki says *size
The maximum number of entries in the cache.
andqueryResultCache
This cache stores ordered sets of document IDs — the top N results of a
query ordered by some criteria
are reserved for caching ?
can some please give me an answer ?
On 14 August 2011 21:41, Erick Erickson erickerick...@gmail.com wrote:
Yep.
ResultWindowSize in
solrconfig.xml
Best
Erick
On Sun, Aug 14, 2011 at 8:35 AM, jame vaalet jamevaa...@gmail.com
wrote:
thanks erick
hi,
i understand that queryResultCache tag in solrconfig is the one which
determines the cache size of SOLR in jvm.
queryResultCache class=*solr.LRUCache*
size=*${queryResultCacheSize:0}*initialSize
=*${queryResultCacheInitialSize:0}* autowarmCount=*
${queryResultCacheRows:0}* /
out of the
On Fri, Aug 19, 2011 at 8:39 AM, jame vaalet jamevaa...@gmail.com wrote:
hi,
i understand that queryResultCache tag in solrconfig is the one which
determines the cache size of SOLR in jvm.
queryResultCache class=*solr.LRUCache*
size=*${queryResultCacheSize:0}*initialSize
is
100)...
Best
Erick
On Wed, Aug 10, 2011 at 10:58 AM, jame vaalet jamevaa...@gmail.com
wrote:
when you say queryResultCache, does it only cache n number of result for
the
last one query or more than one queries?
On 10 August 2011 20:14, simon mtnes...@gmail.com wrote:
Worth
pretty much have to try it and see.
Best
Erick
On Sun, Aug 14, 2011 at 5:42 AM, jame vaalet jamevaa...@gmail.com wrote:
speaking about pagesizes, what is the optimum page size that should be
retrieved each time ??
i understand it depends upon the data you are fetching back fromeach hit
a map of queries
and result document IDs. The number of doc IDs cached for
each query is controlled by queryResultWindowSize in
solrconfig.xml
Best
Erick
On Sun, Aug 14, 2011 at 8:35 AM, jame vaalet jamevaa...@gmail.com wrote:
thanks erick ... that means it depends upon the memory allocated
or after first
query the result will be cached somewhere for the subsequent pages ?
JAME VAALET
-by-page by incrementing the start parameter as you
indicated.
You can keep from re-executing the search by setting your
queryResultCache
appropriately, but this affects all searches so might be an issue.
Best
Erick
On Wed, Aug 10, 2011 at 9:09 AM, jame vaalet jamevaa...@gmail.com
wrote
How do u write solr query to mention proximity between two phrases
dance jockey should appear within 10 words before video jokey
(dance jockey) (video jockey)~10
This isn't working fine . can some one suggest a way ?
-JAME
hi,
while searching for word SOLR in
highlighting in solr can be manipulated
with frag-size =10 .
how is the fragment decided ? how many characters are taken before the world
SOLR and after the word SOLR ?
jame
hi
when u highlight and get back snippet fragments , can you over write the
default hl.regex.pattern through url .
can some quote an example url of that sort ?
what if i make pass hl.slop=0 will this stop considering regex pattern at
all ?
--
-JAME
hi ,
i just wanna be clear in the concepts of core and shard ?
a single core is an index with same schema , is this wat core really is ?
can a single core contain two separate indexes with different schema in it ?
Is a shard refers to a collection of index in a single physical machine
?can a
hi ,
i just wanna be clear in the concepts of core and shard ?
a single core is an index with same schema , is this wat core really is ?
can a single core contain two separate indexes with different schema in it ?
Is a shard refers to a collection of index in a single physical machine
?can a
hi,
is there any performance degradation (response time etc ) if the index has
document content text stored in it (stored=true)?
-JAME
set in
solrconfig.xml, and the other are the heap/GC settings I should set for
tomcat. Anything else?
Cheers
François
On Jul 8, 2011, at 4:08 AM, jame vaalet wrote:
hi,
is there any performance degradation (response time etc ) if the index
has
document content text stored
just in this set of documents
rather than the entire index ? if yes what should be query to limit search into
this subset ?
Regards,
JAME VAALET
Software Developer
EXT :8108
Capital IQ
Thanks.
But does this range query just limit the universe logically or does it have any
mechanism to limit this physically as well .Do we leverage time factor by using
the range query ?
Regards,
JAME VAALET
-Original Message-
From: shashi@gmail.com [mailto:shashi@gmail.com
core will it solve the problem ?
I have also read about multiple ports for listening request from different apps
, can this be used.
Regards,
JAME VAALET
-Original Message-
From: Pierre GOSSE [mailto:pierre.go...@arisem.com]
Sent: Tuesday, July 05, 2011 3:52 PM
To: solr-user
But incase the website docs contribute around 50 % of the entire docs , why to
recreate the indexes . don't you think its redundancy ?
Can two web apps (solr instances ) share a single index file to search on it
without interfering each other
Regards,
JAME VAALET
Software Developer
EXT :8108
Hi,
What would be the maximum size of a single SOLR index file for resulting in
optimum search time ?
In case I have got to index all the documents in my repository (which is in TB
size) what would be the ideal architecture to follow , distributed SOLR ?
Regards,
JAME VAALET
Software
37 matches
Mail list logo