Hi Timo!

Thanks a lot! Now I have a clear understanding of this file. This article
helps a lot too: http://searchenginewatch.com/showPage.html?page=2156061

Thanks again!

On 8/11/06, Timo Scheuer < [EMAIL PROTECTED]> wrote:

Hi,

> Could anyone explain to me what exactly the common-terms.utf8 file does? I
> don't understand the real functionality of this file...

During indexing (and also during searching) the common terms are used to
form n-grams, which makes search faster for common words such as articles.
It is an alternative to using stop words: instead of dropping the common
words, n-grams keep them by appending them to the following word. This
approach increases the selectivity.
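
A rough sketch of the idea in Python, only as an illustration and not the
actual indexing code: the term list, the function name common_grams and the
"-" separator are all assumptions made up for this example. Each common word
is kept as its own token and is also joined to the word that follows it.

```python
# Hypothetical sample list; a real common-terms file would supply these.
COMMON_TERMS = {"the", "a", "of", "to"}

def common_grams(text, common=COMMON_TERMS):
    """Return the words of text, plus a bigram for every common word
    joined to the word that follows it."""
    words = text.lower().split()
    tokens = []
    for i, word in enumerate(words):
        tokens.append(word)
        # A common word followed by another word also yields a combined
        # token, which matches far fewer documents than the word alone.
        if word in common and i + 1 < len(words):
            tokens.append(word + "-" + words[i + 1])
    return tokens

print(common_grams("the quick fox"))
# prints ['the', 'the-quick', 'quick', 'fox']
```

A query for "the quick" can then look up the rarer token "the-quick"
instead of scanning every document that contains "the".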


Cheers,
Timo.




--
Lourival Junior
Universidade Federal do Pará
Curso de Bacharelado em Sistemas de Informação
http://www.ufpa.br/cbsi
Msn: [EMAIL PROTECTED]
