;>>>> On Wed, Feb 3, 2010 at 6:56 AM, Mike Polzin
>>>>> wrote:
>>>>>
>>>>>> I am working on building a web search engine and I would like to build
>>>>>> a
>>>>>&g
ng on building a web search engine and I would like to build
>>>>> a
>>>>> reults page similar to what Google does. The functionality I am
>>>>> looking
>>>>> to
>>>>> include is what I refer to a "rolling up" site
e (defined by its base URL) has many relevent hits on
>>>> various
>>>> pages for the searches keywords, that site is only shown once in the
>>>> results
>>>> listing with a link to the most relevent hit on that site. What I do not
>>>&g
sults
>>> listing with a link to the most relevent hit on that site. What I do not
>>> want is to have one site dominate a search results page.
>>>
>>> Does it make sense to just do the search, get the hits list and then
>>> programatically remove the results w
arch
>> criteria, are not as relevent? Is there a way to do this through queries?
>>
>> Thanks in advance!
>>
>> Mike
>>
>>
>>
>
>
--
View this message in context:
http://old.nabble.com/Limiting-search-result-for-web-search-engine-tp27430155p274
s in the RamIndex based on hits for the search terms once. In
> other terms you will filter the documents based on maximum term
> similarity. Is it make sense for you?
>
>
>
>
> -
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: j
Mike Polzin wrote:
I am working on building a web search engine and I would like to build a reults page similar to what Google does. The functionality I am looking to include is what I refer to a "rolling up" sites, meaning that even if a particular site (defined by its base URL) has many relevent
Hi Mike,
Not really through queries, but you may do this by writing a custom
collector. You'd need some supporting data structure to mark/hash the
occurrence of a domain in your result set.
--
Anshum Gupta
Naukri Labs!
http://ai-cafe.blogspot.com
The facts expressed here belong to everybody, the
I am working on building a web search engine and I would like to build a reults
page similar to what Google does. The functionality I am looking to include is
what I refer to a "rolling up" sites, meaning that even if a particular site
(defined by its base URL) has many relevent hits on various