[ 
https://issues.apache.org/jira/browse/OPENNLP-1236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16749871#comment-16749871
 ] 

ASF GitHub Bot commented on OPENNLP-1236:
-----------------------------------------

maximestein commented on issue #345: OPENNLP-1236:  Add arabic and greek 
stemmers support
URL: https://github.com/apache/opennlp/pull/345#issuecomment-456778410
 
 
   Thanks for your review. All the documentation about generating the stemmers 
can be found on the [snowball 
website](http://snowballstem.org/runtime/use.html). 
   
   The steps are:
   - Download [Snowball](https://github.com/snowballstem/snowball)
   - Build the Snowball compiler
   - Call the Snowball compiler on the ```.sbl``` stemmer file of your choice 
among [those 
available](https://github.com/snowballstem/snowball/tree/master/algorithms). 
The ```-java``` option is used to generate Java code.
   
   However, the stemmer files in this PR are slightly different from the output 
of the Snowball compiler. The ```methodname``` and ```methodobject``` arguments 
have been added to the ```Among``` objects instantiations, as you will find out 
that those are lacking from the Snowball compiler output.
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


> Add support for Arabic and Greek stemmers
> -----------------------------------------
>
>                 Key: OPENNLP-1236
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1236
>             Project: OpenNLP
>          Issue Type: Improvement
>            Reporter: Maxime Steinmetz
>            Priority: Major
>             Fix For: 1.9.1
>
>
> The arabic and greek Snowball stemmers are now available 
> (https://github.com/snowballstem/snowball/tree/master/algorithms) and it 
> would be nice to add support for those two
>  
> This would require:
>  * Converting the .sbl files into Java code and adding it to the stemmer 
> folder
>  * Updating relevant classes to support the new stemmers
>  * Adding a tests for the new stemmers



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to