Package: mecab
Version: 0.996-6
Severity: minor
Tags: upstream
Forwarded: https://github.com/taku910/mecab/pull/50

Manpage was generated with mecab of 0.98pre1 binary.

It best to be regenerated.

More over, it should be more than mere help2man output.

I have created pull request to upstream:
  https://github.com/taku910/mecab/pull/50

Basically, this updates manpage to as attached.

Now -O "" is highlighted !

-- System Information:
Debian Release: buster/sid
  APT prefers testing
  APT policy: (500, 'testing'), (10, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 4.19.0-2-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), 
LANGUAGE=en_US:en (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages mecab depends on:
ii  libc6                                 2.28-7
ii  libgcc1                               1:8.2.0-20
ii  libmecab2                             0.996-6
ii  libstdc++6                            8.2.0-20
ii  mecab-ipadic                          2.7.0-20070801+main-2
ii  mecab-jumandic                        7.0-20130310-7
ii  mecab-jumandic-utf8 [mecab-jumandic]  7.0-20130310-7

mecab recommends no packages.

mecab suggests no packages.

-- no debconf information
.\" DO NOT MODIFY THIS FILE!  It was generated by help2man 1.47.8.
.TH MECAB "1" "February 2019" "mecab of 0.996" "User Commands"
.SH NAME
mecab \- Yet Another Part\-of\-Speech and Morphological Analyzer
.SH SYNOPSIS
.B mecab
[\fI\,options\/\fR] \fI\,files\/\fR
.SH DESCRIPTION
Mecab is a morphological analysis system.  It reads continuous text sentences
such as Japanese ones from the standard input, segments them into morpheme
sequences, and outputs them to the standard output with many additional pieces
of information (pronunciation, semantic information, etc).
.SH OPTIONS
.TP
\fB\-r\fR, \fB\-\-rcfile\fR=\fI\,FILE\/\fR
use FILE as resource file
.TP
\fB\-d\fR, \fB\-\-dicdir\fR=\fI\,DIR\/\fR
set DIR  as a system dicdir
.TP
\fB\-u\fR, \fB\-\-userdic\fR=\fI\,FILE\/\fR
use FILE as a user dictionary
.TP
\fB\-l\fR, \fB\-\-lattice\-level\fR=\fI\,INT\/\fR
lattice information level (DEPRECATED)
.TP
\fB\-D\fR, \fB\-\-dictionary\-info\fR
show dictionary information and exit
.TP
\fB\-O\fR, \fB\-\-output\-format\-type\fR=\fI\,TYPE\/\fR
set output format type (SEE OUTPUT FORMAT)
.TP
\fB\-a\fR, \fB\-\-all\-morphs\fR
output all morphs(default false)
.TP
\fB\-N\fR, \fB\-\-nbest\fR=\fI\,INT\/\fR
output N best results (default 1)
.TP
\fB\-p\fR, \fB\-\-partial\fR
partial parsing mode (default false)
.TP
\fB\-m\fR, \fB\-\-marginal\fR
output marginal probability (default false)
.TP
\fB\-M\fR, \fB\-\-max\-grouping\-size\fR=\fI\,INT\/\fR
maximum grouping size for unknown words (default 24)
.TP
\fB\-F\fR, \fB\-\-node\-format\fR=\fI\,STR\/\fR
use STR as the user\-defined node format
.TP
\fB\-U\fR, \fB\-\-unk\-format\fR=\fI\,STR\/\fR
use STR as the user\-defined unknown node format
.TP
\fB\-B\fR, \fB\-\-bos\-format\fR=\fI\,STR\/\fR
use STR as the user\-defined beginning\-of\-sentence format
.TP
\fB\-E\fR, \fB\-\-eos\-format\fR=\fI\,STR\/\fR
use STR as the user\-defined end\-of\-sentence format
.TP
\fB\-S\fR, \fB\-\-eon\-format\fR=\fI\,STR\/\fR
use STR as the user\-defined end\-of\-NBest format
.TP
\fB\-x\fR, \fB\-\-unk\-feature\fR=\fI\,STR\/\fR
use STR as the feature for unknown word
.TP
\fB\-b\fR, \fB\-\-input\-buffer\-size\fR=\fI\,INT\/\fR
set input buffer size (default 8192)
.TP
\fB\-P\fR, \fB\-\-dump\-config\fR
dump MeCab parameters
.TP
\fB\-C\fR, \fB\-\-allocate\-sentence\fR
allocate new memory for input sentence
.TP
\fB\-t\fR, \fB\-\-theta\fR=\fI\,FLOAT\/\fR
set temparature parameter theta (default 0.75)
.TP
\fB\-c\fR, \fB\-\-cost\-factor\fR=\fI\,INT\/\fR
set cost factor (default 700)
.TP
\fB\-o\fR, \fB\-\-output\fR=\fI\,FILE\/\fR
set the output file name
.TP
\fB\-v\fR, \fB\-\-version\fR
show the version and exit.
.TP
\fB\-h\fR, \fB\-\-help\fR
show this help and exit.
.SH "OUTPUT FORMAT"

The default output format and the selectable output formats from the
\fB\-O\fR option argument are defined in the resource file.
There are few special hard coded formats.

.TP
\fB""\fR (null string)
disable format setting of resource file.  This is required to set user\-defined
format from the command line.
.TP
\fBwakati\fR
output each node separated by a space
.TP
\fBdump\fR
dump all node data in one line 
.TP
\fBnone\fR
no output

.PP
See <https://taku910.github.io/mecab/format.html> for details of format
definition.

.SH DICTIONARY

See 
.IP \(bu 4
<https://taku910.github.io/mecab/learn.html>
.IP \(bu 4
<https://taku910.github.io/mecab/dic-detail.html>
.PP
for details of preparation and updating of the mecab dictionary.

.SH EXAMPLE

Output reading in KataKana with installed and fully configured UniDic.

    $ mecab -O "" -F"%pS%f[9]" -U"%M" -E"\\n" <input_file

Output writing in Hiragana with installed and fully configured UniDic.

    $ mecab -O "" -F"%pS%f[6]" -U"%M" -E"\\n" <input_file \\
.br
      |nkf --hiragana

Output reading in KataKana with installed and fully configured Ipadic.

    $ mecab -O yomi <input_file

.SH "COPYRIGHT
Copyright(C) 2001\-2012 Taku Kudo
.br
Copyright(C) 2004\-2008 Nippon Telegraph and Telephone Corporation
.SH "SEE ALSO"
Full documentation at: <https://taku910.github.io/mecab/>

Reply via email to