Re: [sqlite] Preventing certain query keywords from getting stemmed

2016-05-30 Thread Dan Kennedy

On 05/30/2016 12:00 AM, Abhinav Upadhyay wrote:

Hi,

While running queries, sometimes there are technical keywords which
shouldn't be stemmed by the tokenizer. For example, if I query for
"lfs" (which is a file system), the porter stemmer converts it to
"lf", which matches many other unrelated keywords in the corpus (such
as ASCII LF or some other acronyms).

I'm wondering if there is an option to tell the tokenizer not to stem
certain keywords and take them as they are?


No way to do that with any of the built-in tokenizers for either fts3/4 
or fts5. The only way would be to write a custom tokenizer.


  https://www.sqlite.org/fts5.html#section_7_1
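
To illustrate the kind of logic such a custom tokenizer would need, here is
a minimal sketch in Python. It is not the FTS5 tokenizer API (that is a C
interface, described at the link above); it only shows the idea of checking
each token against a list of protected keywords before stemming. The names
PROTECTED, toy_stem and tokenize are hypothetical, and toy_stem is a
stand-in that covers only the plural-suffix step of the Porter algorithm,
which is the step that turns "lfs" into "lf".

```python
import re

# Hypothetical list of keywords that must reach the index unstemmed.
PROTECTED = frozenset({"lfs", "nfs", "zfs"})


def toy_stem(token):
    """Stand-in for a real Porter stemmer: plural-suffix step (1a) only."""
    if token.endswith("sses"):
        return token[:-2]      # "classes" -> "class"
    if token.endswith("ies"):
        return token[:-2]      # "queries" -> "queri"
    if token.endswith("ss"):
        return token           # "class" stays "class"
    if token.endswith("s"):
        return token[:-1]      # "lfs" -> "lf"
    return token


def tokenize(text, stem=toy_stem, protected=PROTECTED):
    """Split text into lowercase alphanumeric tokens and stem each one,
    except tokens found in the protected set, which pass through as-is."""
    tokens = []
    for raw in re.findall(r"[A-Za-z0-9]+", text):
        token = raw.lower()
        tokens.append(token if token in protected else stem(token))
    return tokens


if __name__ == "__main__":
    print(tokenize("mounts the LFS files"))
    print(tokenize("lfs", protected=frozenset()))
```

The same rule has to apply to both the indexed text and the query text,
otherwise a protected "lfs" in a query would never match a stemmed "lf" in
the index (or the other way round). Putting the check inside the tokenizer
takes care of that, since the table's tokenizer is run on both.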

Dan.




-
Abhinav
___
sqlite-users mailing list
sqlite-users@mailinglists.sqlite.org
http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users




[sqlite] Preventing certain query keywords from getting stemmed

2016-05-29 Thread Abhinav Upadhyay
Hi,

While running queries, sometimes there are technical keywords which
shouldn't be stemmed by the tokenizer. For example, if I query for
"lfs" (which is a file system), the porter stemmer converts it to
"lf", which matches many other unrelated keywords in the corpus (such
as ASCII LF or some other acronyms).

I'm wondering if there is an option to tell the tokenizer not to stem
certain keywords and take them as they are?

-
Abhinav