[ 
https://issues.apache.org/jira/browse/OPENNLP-402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16413635#comment-16413635
 ] 

Luca commented on OPENNLP-402:
------------------------------

Support for the CoNLL-2002 format is currently broken: the reader expects 3 
columns, but the CoNLL-2002 format has only 2 columns 
(https://www.clips.uantwerpen.be/conll2002/ner/data/000README).
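
To illustrate the column mismatch, here is a minimal sketch of a reader that 
tolerates both layouts. It is written in Python for brevity and is not OpenNLP 
code: the name parse_conll02_line is hypothetical, and it assumes that fields 
are whitespace-separated, that the token is the first field, and that the 
entity tag is the last one.

```python
def parse_conll02_line(line):
    """Return (token, tag) for a data line, or None for a blank line.

    Assumption: the token is the first whitespace-separated field and the
    entity tag is the last one, so 2-column and 3-column lines both parse.
    """
    fields = line.split()
    if not fields:
        # Blank lines separate sentences in CoNLL-style files.
        return None
    if len(fields) not in (2, 3):
        raise ValueError(f"expected 2 or 3 columns, got {len(fields)}: {line!r}")
    return fields[0], fields[-1]
```

Reading the tag from the last field instead of a fixed index means a missing 
middle column no longer causes a failure.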

 

> CLI tools and formats refactored
> --------------------------------
>
>                 Key: OPENNLP-402
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-402
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Command Line Interface, Formats
>    Affects Versions: tools-1.5.3
>            Reporter: Aliaksandr Autayeu
>            Assignee: Joern Kottmann
>            Priority: Major
>              Labels: patch
>             Fix For: tools-1.5.3
>
>         Attachments: 0016-CLI-tools-and-formats-refactored.patch, 
> 0017-added-direct-format-support-to-CLI-tools-and-formats.patch, 
> open-nlp-cli-package.png, opennlp-cmdline-package-class-structure.png
>
>
> Proposed patch refactors CLI tools and simplifies the code by introducing 
> hierarchy and removing a lot of code duplication. It also introduces better 
> error and help messages, including help for formats and listing available 
> formats in various tools, which are now able to work with formats directly. 
> This, in turn, eliminates the need to keep converted files on disk.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
