Re: [Announce] Solr 3.5 with RankingAlgorithm 1.3, NRT support

2011-12-27 Thread Dmitry Kan
Hello Nagendra, Congratulations on the new release! In terms of downloading: does one need to be registered on the site do download the bundle? The download links lead to http://solr-ra.tgels.org/solr-ra.jsp. Regards, Dmitry Kan On Tue, Dec 27, 2011 at 4:30 PM, Nagendra Nagarajayya

[Solr 3.5] Facets and stats become a lot slower during concurrent inserts

2011-12-27 Thread Lyuba Romanchuk
Hi, I test facets and stats in Solr 3.5 and I see that queries are running a lot slower during inserts into index with more than 15M documents . If I stop to insert new documents facet/stats queries run 10-1000 times faster than with concurrent inserts. I don't see this degradation in Lucene.

How to run the solr dedup for the document which match 80% or match almost.

2011-12-27 Thread vibhoreng04
Hi, I am doing dedup for my solr instance which works on the content and the url fields.My question is if I want to eliminate the records which are 80% matching or 90% matching in the content field then how I should proceed for that? Already I have changed my solrconfig.xml and have changed the

Re: Solr - Mutivalue field search on different elements

2011-12-27 Thread Gora Mohanty
On Tue, Dec 27, 2011 at 6:11 PM, meghana meghana.rav...@amultek.com wrote: Hi iorixxx, I have changed my multiValued field to single value filed.. and now my field appears as below - 1s: This is very nice day. 3s: Christmas is about come and christmas

[Announce] Solr 3.5 with RankingAlgorithm 1.3, NRT support

2011-12-27 Thread Nagendra Nagarajayya
Hi! I am very excited to announce the availability of Solr 3.5 with RankingAlgorithm 1.3 (NRT support). The performance to add 1 million docs in NRT to the MBArtists index with 1 concurrent request thread executing *:* is about 5000 docs in 498 ms. The query performance is about 168K query

Re: Solr - Mutivalue field search on different elements

2011-12-27 Thread meghana
Hi iorixxx, I have changed my multiValued field to single value filed.. and now my field appears as below - 1s: This is very nice day. 3s: Christmas is about come and christmas 4s:preparation is just on - but by doing this, i

Re: Configuring Replication

2011-12-27 Thread Ahson Iqbal
Hi Ahmet Thank you for your response both of the following urls http://localhost:8983 http://localhost:8983/solr are working and also it is not a multi core setup. Regards Ahsan From: Ahmet Arslan iori...@yahoo.com To: solr-user@lucene.apache.org; Ahson

Re: How can I check if a more complex query condition matched?

2011-12-27 Thread Ahmet Arslan
I have a more complex query condition like this: (city:15 AND country:60)^4 OR city:15^2 OR country:60^2 What I want to achive with this query is basically if a document has city = 15 AND country = 60 it is more important then another document which only has city = 15 OR country = 60

Re: Solr - Mutivalue field search on different elements

2011-12-27 Thread Ahmet Arslan
I have changed my multiValued field to single value filed.. and now my field appears as below - 1s: This is very nice day. 3s: Christmas is about come and christmas 4s:preparation is just on - but by doing this, i

Re: [Solr 3.5] Facets and stats become a lot slower during concurrent inserts

2011-12-27 Thread Yonik Seeley
On Tue, Dec 27, 2011 at 10:43 AM, Lyuba Romanchuk lyuba.romanc...@gmail.com wrote: I test facets and stats in Solr 3.5 and I see that queries are running a lot slower during inserts into index with more than 15M documents . Are you also doing commits (or have autocommit enabled)? The first time

Re: Configuring Replication

2011-12-27 Thread Ahson Iqbal
Hi Ahmet Same issue, one more thing i am using solr 1.4.1 with tomcat 7.0 Regards Ahsan From: Ahmet Arslan iori...@yahoo.com To: solr-user@lucene.apache.org; Ahson Iqbal mianah...@yahoo.com Sent: Tuesday, December 27, 2011 2:51 PM Subject: Re: Configuring

Re: How to run the solr dedup for the document which match 80% or match almost.

2011-12-27 Thread vibhoreng04
Hi iorixxx, Thanks for the quick update.I hope I can take it from here ! Regards, Vibhor -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-run-the-solr-dedup-for-the-document-which-match-80-or-match-almost-tp3614239p3614253.html Sent from the Solr - User mailing list

Re: Solr - Mutivalue field search on different elements

2011-12-27 Thread meghana
Hi iorixxx, Sorry for confusion in my question... yes , 1s, 3s, 4s are part of my field value.. i have my data in this format. and the field is non-multivalued field (single valued). so as PositionIncrementGap is only work for multivalued field , in my search i always have to apply slop in my

Re: How to run the solr dedup for the document which match 80% or match almost.

2011-12-27 Thread Ahmet Arslan
I am doing dedup for my solr instance which works on the content and the url fields.My question is if I want to eliminate the records which are 80% matching or 90% matching in the content field then how I should proceed for that? Already I have changed my solrconfig.xml and have changed

Re: [Announce] Solr 3.5 with RankingAlgorithm 1.3, NRT support

2011-12-27 Thread Nagendra Nagarajayya
Yes, you will need to register to download the bundle or the war file. Regards, Nagendra Nagarajayya http://solr-ra.tgels.org https://service.mail.com/dereferrer/?target=http%3A%2F%2Fsolr-ra.tgels.orglang=en http://rankingalgorithm.tgels.org

Re: [Solr 3.5] Facets and stats become a lot slower during concurrent inserts

2011-12-27 Thread Lyuba Romanchuk
autoCommit is disabled in solrconfig.xml and I use SolrServer::addBeans(beans, 100) for inserts. I need to insert new documents continually in high rate with concurrent running queries. Best regards, Lyuba On Tue, Dec 27, 2011 at 6:15 PM, Yonik Seeley yo...@lucidimagination.comwrote: On Tue,

Re: solr keep old docs

2011-12-27 Thread Alexander Aristov
Hi I am not using database. All needed data is in solr index that's why I want to skip excessive checks. I will check DIH but not sure if it helps. I am fluent with Java and it's not a problem for me to write a class or so but I want to check first maybe there are any ways (workarounds) to

Re: Configuring Replication

2011-12-27 Thread Erick Erickson
I suspect you haven't enabled the replication handler in solrconfig.xml. Look in solrconfig.xml for a line like: requestHandler name=/replication class=solr.ReplicationHandler by default, I believe it's commented out. Have you uncommented it? Best Erick On Tue, Dec 27, 2011 at 5:38 AM, Ahson

Re: Configuring Replication

2011-12-27 Thread Ahmet Arslan
Thank you for your response both of the following urls http://localhost:8983 http://localhost:8983/solr are working and also it is not a multi core setup. What happens when you use class=solr.ReplicationHandler instead of class=org.apache.solr.ReplicationHandler in your

How can I check if a more complex query condition matched?

2011-12-27 Thread Max
I have a more complex query condition like this: (city:15 AND country:60)^4 OR city:15^2 OR country:60^2 What I want to achive with this query is basically if a document has city = 15 AND country = 60 it is more important then another document which only has city = 15 OR country = 60

Re: solr keep old docs

2011-12-27 Thread Erick Erickson
Mikhail is right as far as I know, the assumption built into Solr is that duplicate IDs (when uniqueKey is defined) should trigger the old document to be replaced. what is your system-of-record? By that I mean what does your SolrJ program do to send data to Solr? Is there any way you could just

Re: How to run the solr dedup for the document which match 80% or match almost.

2011-12-27 Thread Shashi Kant
You can also look at cosine similarity (or related metrics) to measure document similarity. On Tue, Dec 27, 2011 at 6:51 AM, vibhoreng04 vibhoren...@gmail.com wrote: Hi iorixxx, Thanks for the quick update.I hope I can take it from here ! Regards, Vibhor -- View this message in

Re: Storing only unique terms in index

2011-12-27 Thread Chris Hostetter
: I have catchall text field, and use it for searching.This field : stores the non-unique terms. For example, this field stores the : following terms:test test searchIs it possible to store non-unique : terms in the following way: term|number of terms, i.e. test|2 : search? : I guess it should

Re: [Solr 3.5] Facets and stats become a lot slower during concurrent inserts

2011-12-27 Thread Chris Hostetter
: autoCommit is disabled in solrconfig.xml and I use : SolrServer::addBeans(beans, 100) for inserts. have you looked at the javadocs for that method? https://lucene.apache.org/solr/api/org/apache/solr/client/solrj/SolrServer.html#addBean%28java.lang.Object,%20int%29 public UpdateResponse

Re: VelocityResponseWriter's future

2011-12-27 Thread Jan Høydahl
Hi, I think a /browse type of experience is crucial for newcomers to quickly get familiar with Solr. Whether it's Velocity based, AJAX based or another technology is less important. I personally like VRW and frequently use it as the first step in prototyping in a project. I've also contributed

Using sort_values (fsv=true parameter) and Field Collapsing (group=true) at the same time

2011-12-27 Thread Jose Aguilar
Hi all, I am using Solr 4.0 trunk with the Field Collapsing feature (http://wiki.apache.org/solr/FieldCollapsing) and I notice that when used at the same time as the fsv=true parameter, the sort_values in the response is gone. I haven't found much information about the fsv parameter, so I

Re: Looking for a good commit/merge strategy

2011-12-27 Thread Jan Høydahl
Have a look at http://wiki.apache.org/solr/NearRealtimeSearch which will help you (in TRUNK/4.0) with an efficient in-memory handling of NRT changes. Combine this with CommitWithin for persisting to disk: http://wiki.apache.org/solr/CommitWithin. -- Jan Høydahl, search solution architect

Re: Custom content extractor for Solr Cell

2011-12-27 Thread Jan Høydahl
Hi John, See discussion about the issue of indexing contents of ZIP files: https://issues.apache.org/jira/browse/SOLR-2416 Depending on your use case, you may be able to write a Tika parser which handles your specific case, such as uncompressing a GZIP file and using AutoDetect on its

Re: lower score for synonyms

2011-12-27 Thread Jan Høydahl
Hi, Also see discussion in https://issues.apache.org/jira/browse/LUCENE-3130 for possible future way to do this with one field. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 6. des. 2011, at 13:47, Marc SCHNEIDER wrote:

Re: best practice to introducing singletons inside of Solr (IoC)

2011-12-27 Thread Mikhail Khludnev
Colleagues, Don't hesitate to emit your opinion. Please! Regards On Wed, Dec 21, 2011 at 11:06 PM, Mikhail Khludnev mkhlud...@griddynamics.com wrote: Hello, I need to introduce several singletons inside of Solr and make them available for my own SearchHandlers, Components, and even

Re: How to run the solr dedup for the document which match 80% or match almost.

2011-12-27 Thread vibhoreng04
Hi Shashi, That's correct !But I need something for index time comparision.Can cosine compare from the already indexed documents and compare the incrementally indexed files ? Regards, Vibhor -- View this message in context:

Re: Solr - Mutivalue field search on different elements

2011-12-27 Thread meghana
i can't delete 1s ,2s ...etc from my field value , i have to keep text in this format... so i'll apply slop in my search to do my needed search done. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Mutivalue-field-search-on-different-elements-tp3604213p3615816.html Sent

hl.boundaryScanner and hl.bs.chars

2011-12-27 Thread meghana
Hi all , i seen hl.boundaryScanner and hl.bs.chars parameters in solr highlighting feature. but i didn't get its meaning exactly , what its use and how can i use it in my search? My need is something like ,i want to set my fragment to start and end from special character / string that i can

Re: solr keep old docs

2011-12-27 Thread Alexander Aristov
I get docs from external sources and the only place I keep them is solr index. I have no a database or other means to track indexed docs (my personal oppinion is that it might be a huge headache). Some docs might change slightly in there original sources but I don't need that changes. In fact I

Re: hl.boundaryScanner and hl.bs.chars

2011-12-27 Thread Koji Sekiguchi
(11/12/28 15:29), meghana wrote: Hi all , i seen hl.boundaryScanner and hl.bs.chars parameters in solr highlighting feature. but i didn't get its meaning exactly , what its use and how can i use it in my search? My need is something like ,i want to set my fragment to start and end from special

Custom Solr FunctionQuery Error

2011-12-27 Thread Parvin Gasimzade
Hi all, I have created custom Solr FunctionQuery in Solr 3.4. I extended ValueSourceParser, ValueSource, Query and QParserPlugin classes. I set the name parameter as graph inside GraphQParserPlugin class. But when try to search i got an error. Search queries are