Re: Solr boost relevancy

2012-05-27 Thread Gau
Hi Lori, Yeah. I thought exactly of the same solution. Use a copy field and boost the relevancy of the the exact match. But my question is more broad here. For eg, if i have a synonym for James as Jim, Games, Jimmy, Jameson And if I normalize the tf, norm, etc factors to 1, on searching for

Re: Solr boost relevancy

2012-05-27 Thread Walter Underwood
Solr automatically scales the scores of fuzzy matches by their distance from an exact match. So, you don't have to change anything. wunder On May 26, 2012, at 11:52 PM, Gau wrote: Hi Lori, Yeah. I thought exactly of the same solution. Use a copy field and boost the relevancy of the the

Re: Solr boost relevancy

2012-05-27 Thread Gau
Wait, I thought the fuzzy match is invoked with a ~. I am not invoking any ~ but expanding my query terms with the synonyms at query time. So from what I understarnd, when I query for James, internally, Solr would expand using synonym search to James, Jim, Games, Jameson. So I guess, the original

[Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Nagendra Nagarajayya
Hi! I am very excited to announce the availability of Solr 3.6 with RankingAlgorithm 1.4.2. This NRT supports now works with both RankingAlgorithm and Lucene. The insert/update performance should be about 5000 docs in about 490 ms with the MbArtists Index. RankingAlgorithm 1.4.2 has

Re: [Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Darren Govoni
Hi, Have you tested this with a billion documents? Darren On Sun, 2012-05-27 at 07:24 -0700, Nagendra Nagarajayya wrote: Hi! I am very excited to announce the availability of Solr 3.6 with RankingAlgorithm 1.4.2. This NRT supports now works with both RankingAlgorithm and Lucene. The

Re: [Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Nagendra Nagarajayya
Actually, RankingAlgorithm 1.4.2 has been scaled to more than 2 billion docs. With RankingAlgorithm 1.4.3, using the parameters age=latestdocs=number feature, you can retrieve the NRT inserted documents in milliseconds from such a huge index improving query and faceting performance and using

Re: [Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Radim Kolar
My company is thinking to buy search algorithm from famous expert in searching Petr Hejl - http://www.milionovastranka.net/ but i see RankingAlgorithm has fantastic results too and looking at its reference page it even powers sites like oracle.com and ebay.com.

Re: [Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Yonik Seeley
On Sun, May 27, 2012 at 11:57 AM, Radim Kolar h...@filez.com wrote: but i see RankingAlgorithm has fantastic results too and looking at its reference page it even powers sites like oracle.com and ebay.com. What reference page are you referring to? -Yonik http://lucidimagination.com

Re: [Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Radim Kolar
What reference page are you referring to? http://tgels.com/wiki/en/Sites_using/downloaded_RankingAlgorithm_or_Solr-RA

Re: [Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Yonik Seeley
On Sun, May 27, 2012 at 12:42 PM, Radim Kolar h...@filez.com wrote: What reference page are you referring to? http://tgels.com/wiki/en/Sites_using/downloaded_RankingAlgorithm_or_Solr-RA Ah, ok sites using/downloaded So someone with a .oracle email / domain checked it out - that certainly

Re: [Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Nagendra Nagarajayya
Solr with RankingAlgorithm is free (for test/development/production). Here is the license page http://tgels.com/docs/SoftwareLicenseAgreement.txt. RankingAlgorithm is also free ( for test/development/production). Regards, - NN On 5/27/2012 8:57 AM, Radim Kolar wrote: My company is thinking

Re: [Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Darren Govoni
I think people on this list would be more interested in your approach to scaling 2 billion documents than modifying solr/lucene scoring (which is already top notch). So given that, can you share any references or otherwise substantiate good performance with 2 billion documents? Thanks. On Sun,

Rolling partitions with solr shards

2012-05-27 Thread avenka
Is there a simple way to get solr to maintain shards as rolling partitions by date, e.g., the last day's documents in one shard, the week before yesterday in the next shard, the month before that in the next shard, and so on? I really don't need querying to be fast on the entire index, but it is

Re: [Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Nagendra Nagarajayya
Let me see if I can reference the performance with RankingAlgorithm. Regards, Nagendra Nagarajayya http://solr-ra.tgels.org http://rankingalgorithm.tgels.org On 5/27/2012 10:06 AM, Darren Govoni wrote: I think people on this list would be more interested in your approach to scaling 2

Re: Why is Solr still shipped with Jetty 6 / switching to Jetty 8?

2012-05-27 Thread Lance Norskog
The Solr 3.x branch ended development with 3.6. So, there won't be a 3.7 and there's no point doing infrastructure work like this. On Fri, May 25, 2012 at 9:23 AM, William Bell billnb...@gmail.com wrote: Let's just wait until SOLR 4.0 is out in a couple months. On Fri, May 25, 2012 at 9:06 AM,

Re: Solr 4.0 Distributed Concurrency Control Mechanism?

2012-05-27 Thread Lance Norskog
Please start separate threads for separate q On Fri, May 25, 2012 at 6:30 AM, Nicholas Ball nicholas.b...@nodelay.com wrote: Hey all, I have another question with regards to this thread. Does anyone know what the state is of the rollback command in 4.0 and how it works with both; replicas

Re: Solr 4.0 Distributed Concurrency Control Mechanism?

2012-05-27 Thread Lance Norskog
Please start separate threads for separate questions- you have more chances of getting an answer. On Sun, May 27, 2012 at 1:55 PM, Lance Norskog goks...@gmail.com wrote: Please start separate threads for separate q On Fri, May 25, 2012 at 6:30 AM, Nicholas Ball nicholas.b...@nodelay.com

Tika ExtractingRequestHandler and field postprocessing

2012-05-27 Thread Raphaël
Hi, I use Tika through the Solr ExtractingRequestHandler and I face a very common use case namely: postprocessing fields from Tika in order to normalize their values or override them with explicitly passed literal values. With exception of some vagues statements about ContentHandler, I failed to

Re: [Announce] Solr 3.6 with RankingAlgorithm 1.4.2 - NRT support

2012-05-27 Thread Li Li
yes, I am also interested in good performance with 2 billion docs. how many search nodes do you use? what's the average response time and qps ? another question: where can I find related paper or resources of your algorithm which explains the algorithm in detail? why it's better than google

Re: Tika ExtractingRequestHandler and field postprocessing

2012-05-27 Thread Jack Krupansky
You can create your own update processor that gets control between the output of Tika and the indexing of the document. See: http://wiki.apache.org/solr/UpdateRequestProcessor -- Jack Krupansky -Original Message- From: Raphaël Sent: Sunday, May 27, 2012 6:24 PM To: