hello....I have installed hadoop, MGIZA and chaski succesfully following
http://geek.kyloo.net/software/doku.php/chaski:overview tutorial.
anyways....I am running through a series of errors on training.

1. step 2 -->In preparing for word aligments i get two errors. I do not know
which file in HDFS is looking for:
    10/12/20 22:46:36 INFO waprep.WordAlignPreprocDriver: Step 1. Merging
corpus onto HDFS
    10/12/20 22:46:36 ERROR waprep.WordAlignUtils: File is not on HDFS, try
local
    10/12/20 22:46:36 ERROR waprep.WordAlignUtils: File is not on HDFS, try
local
    10/12/20 22:46:36 INFO waprep.WordAlignPreprocDriver: Step 1. Merging
corpus onto HDFS Completed
    10/12/20 22:46:36 INFO waprep.WordAlignPreprocDriver: Step 2. Building
Dictionary

2. could this be the reason why my tables for moses return empty? I get 9
fatal errors in each running alignment:

   10/12/20 23:05:23 INFO walign.TrainingSequence: Running alignment 2
10/12/20 23:05:24 WARN mapred.JobClient: Use GenericOptionsParser for
parsing the arguments. Applications should implement Tool for the same.
10/12/20 23:05:24 INFO input.FileInputFormat: Total input paths to process :
1
10/12/20 23:06:04 INFO utils.HDFSDirInputStream: Loading next file
hdfs://localhost:54310/user/roberto/Chask-HDFS/training/S2T/step2/result/part-r-00000
10/12/20 23:06:19 WARN mapred.JobClient: Use GenericOptionsParser for
parsing the arguments. Applications should implement Tool for the same.
10/12/20 23:06:19 INFO input.FileInputFormat: Total input paths to process :
1
10/12/20 23:06:49 INFO utils.HDFSDirInputStream: Loading next file
hdfs://localhost:54310/user/roberto/Chask-HDFS/training/S2T/step2/result/part-r-00000
10/12/20 23:06:49 WARN walign.TrainingSequence: Training failed at step 2
trying to recover by reducing memory limit and regenerate ctrl files
10/12/20 23:06:50 INFO utils.HDFSDirInputStream: Loading next file
hdfs://localhost:54310/user/roberto/Chask-HDFS/dict/src/part-r-00000
10/12/20 23:06:50 INFO utils.HDFSDirInputStream: Loading next file
hdfs://localhost:54310/user/roberto/Chask-HDFS/dict/tgt/part-r-00000
10/12/20 23:06:51 INFO waprep.WordAlignUtils: Finished reading, current
split 00000001 Estimated entries 489
10/12/20 23:06:52 WARN mapred.JobClient: Use GenericOptionsParser for
parsing the arguments. Applications should implement Tool for the same.
10/12/20 23:06:52 INFO input.FileInputFormat: Total input paths to process :
1
10/12/20 23:06:52 WARN mapred.JobClient: Use GenericOptionsParser for
parsing the arguments. Applications should implement Tool for the same.
10/12/20 23:06:52 INFO input.FileInputFormat: Total input paths to process :
1
10/12/20 23:07:39 WARN mapred.JobClient: Use GenericOptionsParser for
parsing the arguments. Applications should implement Tool for the same.
10/12/20 23:07:39 INFO input.FileInputFormat: Total input paths to process :
1
10/12/20 23:08:14 INFO utils.HDFSDirInputStream: Loading next file
hdfs://localhost:54310/user/roberto/Chask-HDFS/training/S2T/step2/result/part-r-00000
10/12/20 23:08:14 FATAL walign.TrainingSequence: Training failed after 3
retries, we will not continue

this is the error i get on:

robe...@aaron:~$ hadoop dfs -cat
Chask-HDFS/training/S2T/step0/result/part-r-00000
00000001    SUCCESS

robe...@aaron:~$ hadoop dfs -cat
Chask-HDFS/training/S2T/step1/result/part-r-00000
cat: File does not exist: Chask-HDFS/training/S2T/step1/result/part-r-00000

robe...@aaron:~$ hadoop dfs -cat
Chask-HDFS/training/S2T/step2/result/part-r-00000
00000001    Failed on copying
/user/roberto/Chask-HDFS/training/S2T/step1/model/t.model to tmp/t.model
#NL#

it sems like is not creating or storing the next file to be use for the next
alignment.

please help...i have been in this situation for tooooo long
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to