Hi Folks,
So I've been plodding away again and feel i am very close to generating my
first language pack, however I've arrived at the following fankle!!!
If I run a pipeline from start to finish it fails at the 'test-bundle-1'
phase as below stating " [Errno 2] No such file or directory:
'/usr/local/joshua_resources/russian_experiments/exp3/tune/joshua.config.final'"

lmcgibbn@LMC-056430 /usr/local/joshua_resources/russian_experiments/exp3 $
/usr/local/incubator-joshua/bin/pipeline.pl  --rundir . --type hiero
--corpus
/usr/local/joshua_resources/russian_experiments/data/commoncrawl.ru-en
--tune
/usr/local/joshua_resources/russian_experiments/data/commoncrawl.ru-en.tune
--test
/usr/local/joshua_resources/russian_experiments/data/commoncrawl.ru-en.test
--source en --target ru --readme "Experiment 3 Run 1 of ru --> en model
training" --aligner berkeley --hadoop-mem 10g --tmp
/usr/local/hadoop-2.5.2/hadoop_tmp_dir
[train-copy-and-filter] cached, skipping...
[train-tokenize-en] cached, skipping...
[train-tokenize-ru] cached, skipping...
[train-trim] cached, skipping...
[train-lowercase-en] cached, skipping...
[train-lowercase-ru] cached, skipping...
[train-vocab-en] cached, skipping...
[train-vocab-ru] cached, skipping...
[tune-copy-and-filter] cached, skipping...
[tune-tokenize-en] cached, skipping...
[tune-tokenize-ru] cached, skipping...
[tune-lowercase-en] cached, skipping...
[tune-lowercase-ru] cached, skipping...
[tune-vocab-en] cached, skipping...
[tune-vocab-ru] cached, skipping...
[test-copy-and-filter] cached, skipping...
[test-tokenize-en] cached, skipping...
[test-tokenize-ru] cached, skipping...
[test-lowercase-en] cached, skipping...
[test-lowercase-ru] cached, skipping...
[test-vocab-en] cached, skipping...
[test-vocab-ru] cached, skipping...
[lm-sort-uniq] cached, skipping...
[kenlm] cached, skipping...
[compile-kenlm] cached, skipping...
[glue-tune] cached, skipping...
[tune-bundle] cached, skipping...
[mert-1] rebuilding...

dep=/usr/local/joshua_resources/russian_experiments/exp3/data/tune/corpus.en

dep=/usr/local/joshua_resources/russian_experiments/exp3/tune/joshua.config
[CHANGED]
  dep=tune/model/grammar.gz.packed/slice_00000.source

dep=/usr/local/joshua_resources/russian_experiments/exp3/tune/joshua.config.final
[NOT FOUND]
  cmd=/usr/local/incubator-joshua/scripts/training/run_tuner.py
/usr/local/joshua_resources/russian_experiments/exp3/data/tune/corpus.en
/usr/local/joshua_resources/russian_experiments/exp3/data/tune/corpus.ru
--tunedir /usr/local/joshua_resources/russian_experiments/exp3/tune --tuner
mert --decoder
/usr/local/joshua_resources/russian_experiments/exp3/tune/decoder_command
--decoder-config
/usr/local/joshua_resources/russian_experiments/exp3/tune/joshua.config
--decoder-output-file
/usr/local/joshua_resources/russian_experiments/exp3/tune/output.nbest
--decoder-log-file
/usr/local/joshua_resources/russian_experiments/exp3/tune/joshua.log
--iterations 10 --metric 'BLEU 4 closest'
  took 27 seconds (27s)
[test-bundle-1] rebuilding...

dep=/usr/local/joshua_resources/russian_experiments/exp3/tune/joshua.config.final
[NOT FOUND]
  dep=grammar.gz

dep=/usr/local/joshua_resources/russian_experiments/exp3/test/1/model/joshua.config
  cmd=/usr/local/incubator-joshua/scripts/support/run_bundler.py --force
--symlink --absolute --verbose -T /usr/local/hadoop-2.5.2/hadoop_tmp_dir
/usr/local/joshua_resources/russian_experiments/exp3/tune/joshua.config.final
/usr/local/joshua_resources/russian_experiments/exp3/test/1/model
--copy-config-options '-top-n 300 -pop-limit 5000 -output-format "%i ||| %s
||| %f ||| %c" -mark-oovs false' --pack-tm grammar.gz --tm
/usr/local/joshua_resources/russian_experiments/exp3/data/tune/grammar.glue
  JOB FAILED (return code 2)
ERROR:root:ERROR: argument config: can't open
'/usr/local/joshua_resources/russian_experiments/exp3/tune/joshua.config.final':
[Errno 2] No such file or directory:
'/usr/local/joshua_resources/russian_experiments/exp3/tune/joshua.config.final'

However, if I run the pipeline with the --first-step test flag, then I get
the following where the 'test-bundle-1' phase executes and completes
flawlessly however the pipeline then goes on to die at the 'test-decode-1'
phase!!!

lmcgibbn@LMC-056430 /usr/local/joshua_resources/russian_experiments/exp3 $
/usr/local/incubator-joshua/bin/pipeline.pl  --rundir . --type hiero
--corpus
/usr/local/joshua_resources/russian_experiments/data/commoncrawl.ru-en
--tune
/usr/local/joshua_resources/russian_experiments/data/commoncrawl.ru-en.tune
--test
/usr/local/joshua_resources/russian_experiments/data/commoncrawl.ru-en.test
--source en --target ru --readme "Experiment 3 Run 1 of ru --> en model
training" --aligner berkeley --hadoop-mem 10g --tmp
/usr/local/hadoop-2.5.2/hadoop_tmp_dir --first-step test --grammar
/usr/local/joshua_resources/russian_experiments/exp3/grammar.gz
--joshua-mem 10g
[train-copy-and-filter] cached, skipping...
[train-tokenize-en] cached, skipping...
[train-tokenize-ru] cached, skipping...
[train-trim] cached, skipping...
[train-lowercase-en] cached, skipping...
[train-lowercase-ru] cached, skipping...
[train-vocab-en] cached, skipping...
[train-vocab-ru] cached, skipping...
[tune-copy-and-filter] cached, skipping...
[tune-tokenize-en] cached, skipping...
[tune-tokenize-ru] cached, skipping...
[tune-lowercase-en] cached, skipping...
[tune-lowercase-ru] cached, skipping...
[tune-vocab-en] cached, skipping...
[tune-vocab-ru] cached, skipping...
[test-copy-and-filter] cached, skipping...
[test-tokenize-en] cached, skipping...
[test-tokenize-ru] cached, skipping...
[test-lowercase-en] cached, skipping...
[test-lowercase-ru] cached, skipping...
[test-vocab-en] cached, skipping...
[test-vocab-ru] cached, skipping...
[glue-test] cached, skipping...
[test-bundle-1] rebuilding...

dep=/usr/local/incubator-joshua/scripts/training/templates/tune/joshua.config
  dep=/usr/local/joshua_resources/russian_experiments/exp3/grammar.gz

dep=/usr/local/joshua_resources/russian_experiments/exp3/test/1/model/joshua.config
[CHANGED]
  cmd=/usr/local/incubator-joshua/scripts/support/run_bundler.py --force
--symlink --absolute --verbose -T /usr/local/hadoop-2.5.2/hadoop_tmp_dir
/usr/local/incubator-joshua/scripts/training/templates/tune/joshua.config
/usr/local/joshua_resources/russian_experiments/exp3/test/1/model
--copy-config-options '-top-n 300 -pop-limit 5000 -output-format "%i ||| %s
||| %f ||| %c" -mark-oovs false' --pack-tm
/usr/local/joshua_resources/russian_experiments/exp3/grammar.gz --tm
/usr/local/joshua_resources/russian_experiments/exp3/data/test/grammar.glue
  took 5372 seconds (1h29m32s)
[test-decode-1] rebuilding...

dep=/usr/local/joshua_resources/russian_experiments/exp3/data/test/corpus.en
[CHANGED]

dep=/usr/local/joshua_resources/russian_experiments/exp3/test/decoder_command
[CHANGED]

dep=/usr/local/joshua_resources/russian_experiments/exp3/test/1/model/joshua.config
[CHANGED]

dep=/usr/local/joshua_resources/russian_experiments/exp3/test/1/model/grammar.gz.packed/slice_00000.source
[CHANGED]
  dep=/usr/local/joshua_resources/russian_experiments/exp3/test/output
[CHANGED]

cmd=/usr/local/joshua_resources/russian_experiments/exp3/test/decoder_command
  JOB FAILED (return code 1)

I need to ask the question, using current (todays) master branch with
Python 3, has anyone managed to build a language pack? I seem to have
encountered several small niggling issues and this is the most recent of
them.
Thanks in advance for any guidance here.
Lewis

-- 
http://home.apache.org/~lewismc/
@hectorMcSpector
http://www.linkedin.com/in/lmcgibbney

Reply via email to