Repository: incubator-joshua-site
Updated Branches:
  refs/heads/asf-site ccc928165 -> b34fdd117
cleared up description for language packs

Project: http://git-wip-us.apache.org/repos/asf/incubator-joshua-site/repo
Commit: http://git-wip-us.apache.org/repos/asf/incubator-joshua-site/commit/afcbef14
Tree: http://git-wip-us.apache.org/repos/asf/incubator-joshua-site/tree/afcbef14
Diff: http://git-wip-us.apache.org/repos/asf/incubator-joshua-site/diff/afcbef14

Branch: refs/heads/asf-site
Commit: afcbef14f1a8c517e8b53cf7207a976c0c6e3fcf
Parents: 43424bf
Author: Matt Post <[email protected]>
Authored: Tue May 19 11:12:18 2015 -0400
Committer: Matt Post <[email protected]>
Committed: Tue May 19 11:12:31 2015 -0400

----------------------------------------------------------------------
 6.0/bundle.md | 24 +++++++++++++-----------
 1 file changed, 13 insertions(+), 11 deletions(-)
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/incubator-joshua-site/blob/afcbef14/6.0/bundle.md
----------------------------------------------------------------------
diff --git a/6.0/bundle.md b/6.0/bundle.md
index cc3b8a9..eb2ff5a 100644
--- a/6.0/bundle.md
+++ b/6.0/bundle.md
@@ -44,9 +44,14 @@ very large, so that it takes a long time to load and to
 filtered against the tuning and testing data in the pipeline, and
 these filtered models will be what is listed in the source config
 files. However, when exporting a model for use as a language pack, you
-need to export the full model instead of the filtered one. The `--tm`
-line is used to accomplish this; it takes two arguments: the first
-identifies the TM's owner, and the second, the updated path.
+need to export the full model instead of the filtered one. The
+`--tm` parameter is used to accomplish this; it takes an argument
+specifying the path to the full model. Multiple `--tm` parameters can
+be passed; they will replace TMs found in the config file in order. If
+you would additionally like the large model to be
+[packed](packing.html) (which reformats it so that it can be quickly
+loaded at run time), you can use `--pack-tm` instead. You can only
+pack one TM, but typically there is only one anyway.
 
 Here is an example invocation:
 
@@ -63,14 +68,11 @@ Here is an example invocation:
 The copy config options tell the decoder to present just the
 single-best (`-top-n 0`) translated output string that has been
 heuristically capitalized (`-output-format %S`), to not append `_OOV`
-to OOVs (`-mark-oovs false`), and to use the grammar
-`/path/to/rundir/grammar.gz` as the main translation grammar. See
-[this page](decoder.html) for a longer list of decoder options.
-
-The `--pack-tm` option tells the run bundler to
-[pack the grammar](packing.html), which can take some time. Since it
-is the first of any `--[pack-]tm` arguments, it applies to the first
-TM encountered in the config file.
+to OOVs (`-mark-oovs false`), and to use the translation model
+`/path/to/rundir/grammar.gz` as the main translation model, packing it
+before placing it in the bundle. Note that these arguments to
+`--copy-config` are the default, so you could leave this off entirely.
+See [this page](decoder.html) for a longer list of decoder options.
 
 A new directory `language-pack-YYYY-MM-DD` will be created along with
 a README and a number of support files.
