Index-time boosting is not something very common and raises issues if you want to omitNorms in Solr.
In Solr DisMax you can use a bq (boost query) to boost site:example.com^10. All results that match the boost query receive a ^10 boost. This is only client side. You can also abuse the boost field Nutch is writing. By default this is 1.0f. You can write a simple scoring filter or even an indexing filter that check's the site field for your site and sets the boost field accordingly. On Wednesday 31 August 2011 15:30:08 Johan Svensson wrote: > I guess this is the solution. Though, I have been trying to implement this > the whole afternoon with no success. I have a field "site" in my > scheme.xml, stored and indexed. I'm using nutch -solrindex to tell solr to > index what nutch has crawled. How can I tell nutch to tell solr to boost > all documents with the value "www.example.com" of the "site" field? An > example would be perfect for a loser like myself. I've googled all the > Internets over and over. > > 2011/8/31 Gora Mohanty <[email protected]> > > > On Wed, Aug 31, 2011 at 2:51 PM, Johan Svensson > > > > <[email protected]> wrote: > > > Thank you! This looks interesting. However, I wonder if it really can > > > > solve > > > > > this problem. No part of the search query is by necessary means part of > > > > the > > > > > domain name. Let's say for example that we search for "foobar". On > > > www.example.com/page42.html this word is found, as well for lots of > > > > pages > > > > > with different names at blog.example.com/. Can you apply boosting magic > > > > for > > > > > the hit at www.example.com although the search term is not a part of > > > the url? > > > > Presumably, you know the domain name from which the > > document originates at indexing time. If so, you can use > > index-time boosting: > > http://wiki.apache.org/solr/SolrRelevancyFAQ#index-time_boosts > > E.g., this can be used to boost all documents from www.example.com > > over those from blog.example.com. > > > > Regards, > > Gora -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

