RE: Grammatical tenses Stemming in SOLR

2018-09-21 Thread Markus Jelsma
-coded either within the algorithm, which it is not, or outside by for example a StemmerOverrideFilter. Regards, Markus -Original message- > From:aishwarya > Sent: Friday 21st September 2018 10:38 > To: solr-user@lucene.apache.org > Subject: Grammatical tenses Stemming in SO

Grammatical tenses Stemming in SOLR

2018-09-21 Thread aishwarya
1 down vote favorite i want to know which stemming filter factory can be used to fetch all the possible tenses of a stem word. example : if "run" is the search word -> it has to fetch results for all files involving run , running , runs , ran. also the vice-versa --> whichever tense of a word

Re: Advice on Stemming in Solr

2017-11-04 Thread Zheng Lin Edwin Yeo
Hi Emir, We are looking at the configuration, to try to adjust the rules to suit our use case. Regards, Edwin On 3 November 2017 at 16:24, Emir Arnautović wrote: > Hi Edwin, > Hunspell is configurable, language independent library and you can define > any

Re: Advice on Stemming in Solr

2017-11-03 Thread Emir Arnautović
Hi Edwin, Hunspell is configurable, language independent library and you can define any morphology rules. It’s beed there for a while and I would not be surprised if someone already adjusted english rules to suite you case. Thanks, Emir -- Monitoring - Log Management - Alerting - Anomaly

Re: Advice on Stemming in Solr

2017-11-02 Thread Zheng Lin Edwin Yeo
Hi Emir, We are looking to change to HunspellStemFilterFactory. This has a dictionary file containing words and applicable flags, and an affix file that specifies how these flags will control spell checking. Probably we can control it from those files in HunspellStemFilterFactory? Regards, Edwin

Re: Advice on Stemming in Solr

2017-11-02 Thread Emir Arnautović
Hi Edwin, It seems that it would be best if you do not apply *ing stemming rule at all. The first idea is to trick stemmer and replace any word that ends with ing to some nonexisting char combination e.g. ‘wqx’. You can use solr.PatternReplaceFilterFactory to do that. You can switch it back

Re: Advice on Stemming in Solr

2017-11-01 Thread Zheng Lin Edwin Yeo
Hi Emir, We do have quite alot of words that should not be stemmed. Currently, the KStemFilterFactory are stemming all the non-English words that end with "ing" as well. There are quite alot of places and names which ends in "ing", and all these are being stemmed as well, which leads to an

Re: Advice on Stemming in Solr

2017-11-01 Thread Emir Arnautović
Hi Edwin, If the number of words that should not be stemmed is not high you could use KeywordMarkerFilterFactory to flag those words as keywords and it should prevent stemmer from changing them. Depending on what you want to achieve, you might not be able to avoid using stemmer at indexing

Advice on Stemming in Solr

2017-11-01 Thread Zheng Lin Edwin Yeo
Hi, We are currently using KStemFilterFactory in Solr, but we found that it is actually doing stemming on non-English words like "ximenting", which it stem to "ximent". This is not what we wanted. Another option is to use the HunspellStemFilterFactory, but there are some English words like

Re: Stemming with SOLR

2016-12-18 Thread Lasitha Wattaladeniya
Thank you all for the replies. I am considering the suggestions On 17 Dec 2016 01:50, "Susheel Kumar" wrote: > To handle irregular nouns ( > http://www.ef.com/english-resources/english-grammar/ > singular-and-plural-nouns/), > the simplest way is handle them using

Re: Stemming with SOLR

2016-12-16 Thread Susheel Kumar
To handle irregular nouns ( http://www.ef.com/english-resources/english-grammar/singular-and-plural-nouns/), the simplest way is handle them using StemOverriderFactory. The list is not so long. Or otherwise go for commercial solutions like basistech etc. as Alex suggested oR you can customize

Re: Stemming with SOLR

2016-12-15 Thread Alexandre Rafalovitch
If you need the full fidelity solution taking care of multiple edge-cases, it could be worth looking at commercial solutions. http://www.basistech.com/ has one, including a free-level SAAS plan. Regards, Alex. http://www.solr-start.com/ - Resources for Solr users, new and experienced

Re: Stemming with SOLR

2016-12-15 Thread Lasitha Wattaladeniya
Hi all, Thanks for the replies, @eric, ahmet : since those stemmers are logical stemmers it won't work on words such as caught, ran and so on. So in our case it won't work @susheel : Yes I thought about it but problems we have is, the documents we index are some what large text, so copy

Re: Stemming with SOLR

2016-12-15 Thread Susheel Kumar
We did extensive comparison in the past for Snowball, KStem and Hunspell and there are cases where one of them works better but not other or vice-versa. You may utilise all three of them by having 3 different fields (fieldTypes) and during query, search in all of them. For some of the cases where

Re: Stemming with SOLR

2016-12-15 Thread Ahmet Arslan
Hi, KStemFilter returns legitimate English words, please use it. Ahmet On Thursday, December 15, 2016 6:17 PM, Lasitha Wattaladeniya wrote: Hello devs, I'm trying to develop this indexing and querying flow where it converts the words to its original form (lemmatization).

Re: Stemming with SOLR

2016-12-15 Thread Erick Erickson
What about things like PorterStemFilterFactory, EnglishMinimalStemFilterFactory and the like? Best, Erick On Thu, Dec 15, 2016 at 7:16 AM, Lasitha Wattaladeniya wrote: > Hello devs, > > I'm trying to develop this indexing and querying flow where it converts the > words to its

Stemming with SOLR

2016-12-15 Thread Lasitha Wattaladeniya
Hello devs, I'm trying to develop this indexing and querying flow where it converts the words to its original form (lemmatization). I was doing bit of research lately but the information on the internet is very limited. I tried using hunspellfactory but it doesn't convert the word to it's

Re: Stemming in Solr

2009-03-20 Thread Chris Hostetter
: Can someone please let me know how to implement stemming in solr. I am : particularly looking of the changes, I might need to do in the config files : and also if I need to use some already supplied libraries/factories etc etc. i would start by searching the wiki and email archives

Stemming in Solr

2009-03-13 Thread dabboo
Hi, Can someone please let me know how to implement stemming in solr. I am particularly looking of the changes, I might need to do in the config files and also if I need to use some already supplied libraries/factories etc etc. It would be a great help. Thanks, Amit Garg -- View this message

Stemming in Solr

2009-03-04 Thread dabboo
Hi, I am trying to implement stemming in solr. If user searches for walk then all the records which have walk, walking, walks, walked etc should display. Please suggest. Thanks, Amit Garg -- View this message in context: http://www.nabble.com/Stemming-in-Solr-tp22328850p22328850.html Sent

Re: Stemming in Solr

2009-03-04 Thread Lukáš Vlček
Hi, did you check Snowball stemmers (http://snowball.tartarus.org/)? Regards, Lukas On Wed, Mar 4, 2009 at 1:12 PM, dabboo ag...@sapient.com wrote: Hi, I am trying to implement stemming in solr. If user searches for walk then all the records which have walk, walking, walks, walked etc

Re: Stemming in Solr

2009-03-04 Thread Lukáš Vlček
, 2009 at 1:12 PM, dabboo ag...@sapient.com wrote: Hi, I am trying to implement stemming in solr. If user searches for walk then all the records which have walk, walking, walks, walked etc should display. Please suggest. Thanks, Amit Garg -- View this message in context: http