[ 
https://issues.apache.org/jira/browse/OPENNLP-1531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17801477#comment-17801477
 ] 

Bruno P. Kinoshita commented on OPENNLP-1531:
---------------------------------------------

The list of abbreviations seems to be the same between European and Brazilian 
Portuguese. Can't say if the African or Creole Portuguese follow the same list, 
but will leave that for a follow-up issue.

> Add Portuguese abbreviation dictionary
> --------------------------------------
>
>                 Key: OPENNLP-1531
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1531
>             Project: OpenNLP
>          Issue Type: Improvement
>    Affects Versions: 2.3.1
>            Reporter: Bruno P. Kinoshita
>            Priority: Minor
>
> Similar to the addition in OPENNLP-570 and OPENNLP-1526, an abbreviation 
> dictionary for Portuguese sentence detection and tokenisation might be 
> beneficial.
> Aims:
>  - Create and add a new file {{abb_PT.xml}} to _opennlp-tools/lang/pt_
>  - Add basic set of test cases
> Other:
>  - Confirm if European/Brazilian/African/Creole Portuguese have the same 
> abbreviations or if we need different languages...



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
