[
https://issues.apache.org/jira/browse/OPENNLP-912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17790947#comment-17790947
]
ASF GitHub Bot commented on OPENNLP-912:
----------------------------------------
rzo1 commented on PR #390:
URL: https://github.com/apache/opennlp/pull/390#issuecomment-1831371994
> I would like to better understand the origins of the rules used. Does
> there need to be license attribution?
@jzonthemtn It looks like this "golden-rules.txt" comes from
https://github.com/diasks2/pragmatic_segmenter#the-golden-rules (at least, if
we trust the textual description). It is also available for some other
languages:
https://s3.amazonaws.com/tm-town-nlp-resources/golden_rules.txt
The library itself, including this content, is MIT-licensed, so there is no
compliance issue, but we would need to attribute it accordingly.
> Add a rule based sentence detector
> ----------------------------------
>
> Key: OPENNLP-912
> URL: https://issues.apache.org/jira/browse/OPENNLP-912
> Project: OpenNLP
> Issue Type: Improvement
> Reporter: Jörn Kottmann
> Priority: Major
> Labels: help-wanted
>
> It would be nice to offer a simpler rule-based sentence detector; in some
> languages this might work rather well.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)