I have a SolrCloud setup with 220 million records. They are split into 2
shards without any replicas. I have not changed any caching, and every setting
is the default. In one case I have to return the top 5 candidates from
Solr. The response time is approximately 50 seconds, which is too
Hi All,
I am trying to detect the language of the business name field and the
address field. I am using Solr's LangDetect (the Google library), not Tika. It
works OK in most cases, but in some it detects the language
wrongly. For example, the document - OrgName: EXPLOITS VALLEY
HIGHGREENWOOD,
I agree with you, Jack. But please note that this filter still
works perfectly fine. Only in one case, where all the words are
Latin, is the language detected as German. My question is why and
how?
If it works perfectly for the other docs, what in this case is
Hi Chris,
I would like to moderate, and you can use the mail ID vibhoren...@gmail.com
for this purpose.
Regards,
Vibhor Jaiswal
--
View this message in context:
http://lucene.472066.n3.nabble.com/Seeking-New-Moderators-for-solr-user-lucene-tp4096447p4096448.html
Sent from the Solr - User
Hi All,
I have a big index of 256 GB. Right now it is on one physical box with 256 GB
of RAM. I am planning to virtualize it onto 8 boxes with 32 GB of RAM
each. Will MMap still work under these conditions?
Vibhor Jaiswal
Yes, Jack! I think Solr is still hot in the market and there is little material
available. So a price tag of $10 will be OK.
--
View this message in context:
http://lucene.472066.n3.nabble.com/The-book-Solr-4-x-Deep-Dive-Early-Access-Release-1-tp4072118p4072123.html
Sent from the Solr - User mailing
Hi All,
I have been facing problems with fuzzy queries. For example, if I query
((FIELDNAME1:FRANK~0.80) AND (FIELDNAME1:INDIANO~0.80))^0.80, the parsed
query changes my distance to ~0, etc. In other cases the
distance is changed in the parsed query to ~1 or ~2. Can anyone tell
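A possible explanation for the ~0, ~1 and ~2 values, offered as a hedged sketch rather than a confirmed diagnosis: in Lucene 4.x the query parser, as far as I recall, converts a legacy fractional similarity such as ~0.80 into a whole number of edits, capped at 2, based on the term length. The Python below imitates that conversion (modelled on my recollection of FuzzyQuery.floatToEdits). The names float_to_edits and to_float32 are illustrative and not part of any Solr API.

```python
import struct

# Assumption: Lucene's Levenshtein automata support at most 2 edits.
MAX_EDITS = 2


def to_float32(value):
    """Round a Python float to 32-bit precision, as a Java float would hold it."""
    return struct.unpack("f", struct.pack("f", value))[0]


def float_to_edits(min_similarity, term_len):
    """Convert a legacy fuzzy similarity into a number of edits.

    Values >= 1 are treated as an explicit edit distance (capped at MAX_EDITS).
    Values between 0 and 1 are scaled by the term length and truncated.
    """
    sim = to_float32(min_similarity)
    if sim >= 1.0:
        return int(min(sim, MAX_EDITS))
    if sim == 0.0:
        return 0
    return min(int((1.0 - sim) * term_len), MAX_EDITS)


if __name__ == "__main__":
    for term in ("FRANK", "INDIANO"):
        print(term, "~0.80 ->", "~%d" % float_to_edits(0.80, len(term)))
```

Under this model, FRANK~0.80 becomes ~0 because (1 - 0.80) * 5 falls just below 1 in floating point and is truncated, while INDIANO~0.80 becomes ~1. Writing the edit distance explicitly (for example FRANK~1) would sidestep the conversion.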
Hi,
Below is what I am passing to Solr. Only the fuzzy part is creating a
problem for me:
{!lucene q.op=OR}(TRIGRAM_FIELDNAME1:(FRANK INDIANO))^0.82 OR pkey: OR
((FIELDNAME1:FRANK~0.80) AND (FIELDNAME1:INDIANO~0.80))^0.80
Also, the field type has whitespace, synonym, lowercase and stopwords in
Hi Mark,
What you said is correct. But if I do an AND operation, the fuzzy
query still doesn't work. Surprisingly, OR works perfectly, but that does not
give me what I want.
{!lucene q.op=OR}(TRIGRAM_FIELDNAME1:(FRANK INDIANO))^0.82 OR pkey: OR
((FIELDNAME1:FRANK~2)* AND
Hi Erik,
In my case I have to calculate a custom value depending on the retrieved
candidates. This will be done for each document, so my choice will be a
DocTransformer.
Let's say that in this case I need to include a Java class which does the
computation. How do I tie that to the DocTransformer?
Solr
Hi,
There seems to be a problem with the querying. My query is like this:
(NORM_BUS_NME:TEST1\\ TEST)^35.44 OR (TRIGRAM_NORM_BUS_NME:TEST1 TEST)
Individually, the NORM_BUS_NME:TEST1\\ TEST query returns 1 result and
TRIGRAM_NORM_BUS_NME:TEST1 TEST
returns 355 results, but after I do an OR operation the
Krupansky
-Original Message-
From: vibhoreng04
Sent: Sunday, June 02, 2013 3:04 PM
To:
solr-user@.apache
Subject: Individual Query Works fine but OR Query not worning
Hi,
There seems to be a problem with the querying. My query is like this:
(NORM_BUS_NME:TEST1\\ TEST)^35.44
Hi All,
I have a requirement where I need to retrieve the candidates from Solr,
do some calculation on the basis of the search results, and return the
calculated values along with the Solr document.
I am planning to use custom response handlers for this.
Can anybody guide me on what will be the
Hi Erick and Markus,
Any idea on this? Can we resolve this with group-by queries?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Nested-Facets-and-distributed-shard-system-tp4065847p4066583.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi All,
I have a use case where I need to search like this:
Apple Corporation Limited should create pairs like Apple
Corporation, Corporation Apple, Corporation Limited, Limited Corporation.
Below is the filter I am using:
fieldType name=text_shingle class=solr.TextField
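To make the desired output concrete, here is a minimal Python sketch of the pairing described above: each pair of adjacent words is emitted in both orders. It only illustrates the expected tokens and is not how Solr's shingle filter is implemented. The function name bidirectional_pairs is hypothetical.

```python
def bidirectional_pairs(text):
    """Return every adjacent word pair in both the original and reversed order."""
    tokens = text.split()
    pairs = []
    for left, right in zip(tokens, tokens[1:]):
        pairs.append(f"{left} {right}")   # forward order, e.g. "Apple Corporation"
        pairs.append(f"{right} {left}")   # reversed order, e.g. "Corporation Apple"
    return pairs


if __name__ == "__main__":
    for pair in bidirectional_pairs("Apple Corporation Limited"):
        print(pair)
```

Note that a standard two-word shingle only produces the forward pairs; the reversed pairs are the extra requirement stated in the question.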
Hi Sawmya,
Were you able to resolve your problem?
If not, check the field type in the Solr schema. It should be text if you are
tokenising and searching.
--
View this message in context:
http://lucene.472066.n3.nabble.com/indexing-but-not-able-to-search-tp3144695p3787592.html
Sent from the Solr - User
Hi,
I implemented TextProfileSignature dedupe as suggested, but here is something
weird which I came across while implementing it.
I am testing it with two documents and trying to index them.
Please see the content below:
Content starts Here
I bought a Toyota Camry in 2007. After driven
Hi Lance,
This is out of context, but I am still asking you the question.
I implemented TextProfileSignature dedupe as suggested, but here is something
weird which I came across while implementing it.
I am testing it with two documents and trying to index them.
Please see the content below:
Content
Hi,
I am doing dedup for my Solr instance, which works on the content and URL
fields. My question is: if I want to eliminate the records which match 80%
or 90% in the content field, how should I proceed?
I have already changed my solrconfig.xml and have changed the
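As an illustration of what "80% matching" could mean, the sketch below compares two documents by the Jaccard similarity of their word shingles and flags them as near-duplicates above a threshold. This is a swapped-in, generic technique and not what Solr's dedupe signature classes do internally. All names (word_shingles, jaccard, is_near_duplicate) and the shingle size of 3 are assumptions chosen for the example.

```python
def word_shingles(text, size=3):
    """Return the set of overlapping word n-grams (shingles) in the text."""
    words = text.lower().split()
    if len(words) < size:
        # Short texts yield a single shingle, or none when the text is empty.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets: size of intersection over size of union."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def is_near_duplicate(doc_a, doc_b, threshold=0.8, size=3):
    """True when the shingle overlap of the two documents meets the threshold."""
    similarity = jaccard(word_shingles(doc_a, size), word_shingles(doc_b, size))
    return similarity >= threshold


if __name__ == "__main__":
    first = "I bought a car and drove it for many years without trouble"
    second = "I bought a car and drove it for many years without problems"
    print(jaccard(word_shingles(first), word_shingles(second)))
    print(is_near_duplicate(first, second, threshold=0.8))
```

A pairwise comparison like this does not scale to a large index on its own; it is only meant to show how a percentage threshold can be defined.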
Hi iorixxx,
Thanks for the quick update. I hope I can take it from here!
Regards,
Vibhor
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-to-run-the-solr-dedup-for-the-document-which-match-80-or-match-almost-tp3614239p3614253.html
Sent from the Solr - User mailing list
Hi Shashi,
That's correct! But I need something for index-time comparison. Can cosine
compare the already indexed documents with the incrementally
indexed files?
Regards,
Vibhor