Re: [Moses-support] kbmira died with SIGABRT when tuning

Barry Haddow Wed, 20 Jan 2016 02:24:08 -0800

Hi Dingyuan

What platform are you running on? I could not reproduce your error on 
Ubuntu 12.04, and valgrind is clean,


cheers - Barry

On 19/01/16 16:31, Barry Haddow wrote:
> Hi Dingyuan
>
> I ran for over 200 iterations and saw no problem. I tried with your LANG
> and LANGUAGE settings (I don't have the right packages for the other
> settings) and still saw no failure.
>
> Maybe it is a random pointer/memory problem like you suggested. I have
> started running your model with valgrind, but nothing so far,
>
> cheers - Barry
>
> On 19/01/16 14:26, Dingyuan Wang wrote:
>> Hi Barry,
>>
>> It usually hits an error in about 1~10 iterations on my laptop. I don't
>> know what triggers that, so it may be a probability problem.
>>
>> Disabling xml-input won't help. I think I should use verbose output.
>>
>> My locale settings is:
>>
>> LANG=zh_CN.UTF-8
>> LANGUAGE=zh_CN.UTF-8:zh_TW.UTF-8:zh_HK.utf8:en_US.utf8
>> LC_CTYPE="zh_CN.UTF-8"
>> LC_NUMERIC="zh_CN.UTF-8"
>> LC_TIME="zh_CN.UTF-8"
>> LC_COLLATE="zh_CN.UTF-8"
>> LC_MONETARY="zh_CN.UTF-8"
>> LC_MESSAGES="zh_CN.UTF-8"
>> LC_PAPER="zh_CN.UTF-8"
>> LC_NAME="zh_CN.UTF-8"
>> LC_ADDRESS="zh_CN.UTF-8"
>> LC_TELEPHONE="zh_CN.UTF-8"
>> LC_MEASUREMENT="zh_CN.UTF-8"
>> LC_IDENTIFICATION="zh_CN.UTF-8"
>> LC_ALL=
>>
>> 在 2016年01月19日 19:20, Barry Haddow 写道:
>>> Hi Dingyuan
>>>
>>> I have your script and model running, but so far it has not reported any
>>> errors. It's at iteration 27, and I'm using the latest Moses from git.
>>>
>>> How long should I expect it to run before it hits an error? Could it be
>>> affected by the locale setting?
>>>
>>> Have you tried running without xml-input to see if you still have the
>>> problem?
>>>
>>> cheers - Barry
>>>
>>> On 19/01/16 05:43, Dingyuan Wang wrote:
>>>> Hi Barry,
>>>>
>>>> I've uploaded the model:
>>>> https://mega.nz/#!UsVSBCBJ!e5IATFvLqrCb5zhmDekLn8NOGw4PSD9RRQLGQeKEvNY
>>>>
>>>> To test the model, I included a script 'repeatnbest.sh' which runs moses
>>>> repeatedly until encoding error occurs.
>>>>
>>>> The file run7.best100.out and run7.out in the archive is the last run
>>>> that produces the error.
>>>>
>>>> It seems that it is WordTranslationFeature that causes the problem.
>>>>
>>>> 在 2016年01月19日 00:03, Barry Haddow 写道:
>>>>> Hi Dingyuan
>>>>>
>>>>> Something is going wrong with the construction or outputting of feature
>>>>> names, and it looks like it's WordTranslationFeature that's the problem.
>>>>> Does the problem go away if you do not use word translation features?
>>>>>
>>>>> If you could make available a model that reproduces the nbest list
>>>>> construction then I would have a chance to debug it,
>>>>>
>>>>> cheers - Barry
>>>>>
>>>>> On 18/01/16 15:32, Dingyuan Wang wrote:
>>>>>> Hi Barry,
>>>>>>
>>>>>> I've checked all the models and corpora with the script, without
>>>>>> finding
>>>>>> any encoding problem.
>>>>>>
>>>>>> I also find that all such errors in nbest list occurs only in the
>>>>>> feature list (3 different samples), without affecting translation
>>>>>> result. Therefore, the phrase table or training corpus may not be the
>>>>>> problem.
>>>>>>
>>>>>> 在 2016年01月18日 23:04, Barry Haddow 写道:
>>>>>>> Hi Dingyuan
>>>>>>>
>>>>>>> Are these encoding errors present in your phrase table? Are they
>>>>>>> present
>>>>>>> in your training corpus? Since they appear in the word translation
>>>>>>> features, and you are using a shortlist, are they in the shortlist
>>>>>>> files
>>>>>>> in the model directory? (These have names with "topn" in them afaik).
>>>>>>>
>>>>>>> File-system errors are unlikely, and for the most part Moses treats
>>>>>>> text
>>>>>>> as byte strings so encoding errors usually trace back to the source
>>>>>>> text.
>>>>>>>
>>>>>>> cheers - Barry
>>>>>>>
>>>>>>> On 18/01/16 14:56, Dingyuan Wang wrote:
>>>>>>>> Hi Barry,
>>>>>>>>
>>>>>>>> "The ones starting with the "@"" are due to corrupted bytes in the
>>>>>>>> nbest
>>>>>>>> list.
>>>>>>>>
>>>>>>>> This kind of corruption occurs from time to time. I wonder if it
>>>>>>>> comes
>>>>>>>> from memory errors or filesystem failure or some kind of
>>>>>>>> pointer/encoding problem in moses.
>>>>>>>>
>>>>>>>> I've written a script to find such corrupted lines:
>>>>>>>>
>>>>>>>> https://gist.github.com/gumblex/0d9d0848b435e4f9818f
>>>>>>>>
>>>>>>>> 在 2016年01月18日 20:42, Barry Haddow 写道:
>>>>>>>>> Hi Dingyuan
>>>>>>>>>
>>>>>>>>> The extractor expects feature names to contain an underscore (not
>>>>>>>>> sure
>>>>>>>>> exactly why) but some of yours don't, and Moses skips them,
>>>>>>>>> interpreting
>>>>>>>>> their values as extra dense features.
>>>>>>>>>
>>>>>>>>> The attached screenshot shows my view of the offending names. The
>>>>>>>>> ones
>>>>>>>>> starting with the "@" are the problem. So it does look like the
>>>>>>>>> nbest
>>>>>>>>> list is corrupted. Can you run the decoder on just that sentence, to
>>>>>>>>> create an uncompressed version of the nbest list?
>>>>>>>>>
>>>>>>>>> cheers - Barry
>>>>>>>>>
>>>>>>>>> On 18/01/16 12:02, Dingyuan Wang wrote:
>>>>>>>>>> Hi Barry,
>>>>>>>>>>
>>>>>>>>>> Attached is the zgrep result.
>>>>>>>>>> I found that in the middle of line 61 a few bytes are corrupted. Is
>>>>>>>>>> that
>>>>>>>>>> a moses problem or my memory has a problem?
>>>>>>>>>>
>>>>>>>>>> I also checked other files using iconv, they are all OK in UTF-8.
>>>>>>>>>>
>>>>>>>>>> 在 2016年01月18日 19:32, Barry Haddow 写道:
>>>>>>>>>>> Hi Dingyuan
>>>>>>>>>>>
>>>>>>>>>>> Yes, that's very possible. The error could be in extracting
>>>>>>>>>>> features.dat
>>>>>>>>>>> from the nbest list. Are you able to post the nbest list? Or at
>>>>>>>>>>> least
>>>>>>>>>>> the entries for sentence 16?
>>>>>>>>>>>
>>>>>>>>>>> Run something like
>>>>>>>>>>>
>>>>>>>>>>> zgrep "^16 " tuning/tmp.1/run7.best100.out.gz
>>>>>>>>>>>
>>>>>>>>>>> cheers - Barry
>>>>>>>>>>>
>>>>>>>>>>> On 18/01/16 11:24, Dingyuan Wang wrote:
>>>>>>>>>>>> Hi Barry,
>>>>>>>>>>>>
>>>>>>>>>>>> I have rerun the ems after the first email, and then posted the
>>>>>>>>>>>> recent
>>>>>>>>>>>> results, so the line changed.
>>>>>>>>>>>>
>>>>>>>>>>>> I just use the latest code, and the EMS script. Pretty much are
>>>>>>>>>>>> default
>>>>>>>>>>>> settings. The EMS setting is:
>>>>>>>>>>>>
>>>>>>>>>>>> sparse-features = "target-word-insertion top 50,
>>>>>>>>>>>> source-word-deletion
>>>>>>>>>>>> top 50, word-translation top 50 50, phrase-length"
>>>>>>>>>>>>
>>>>>>>>>>>> I suspect there is something unexpected in the extractor.
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>> 在 2016年01月18日 19:03, Barry Haddow 写道:
>>>>>>>>>>>>> Hi Dingyuan
>>>>>>>>>>>>>
>>>>>>>>>>>>> In fact it is not the sparse features nor the Asian characters
>>>>>>>>>>>>> that
>>>>>>>>>>>>> are
>>>>>>>>>>>>> the problem. The offending line has 17 dense features, yet your
>>>>>>>>>>>>> model
>>>>>>>>>>>>> has 14 dense features.
>>>>>>>>>>>>>
>>>>>>>>>>>>> The string "1 1 1" appears directly after the language model
>>>>>>>>>>>>> feature in
>>>>>>>>>>>>> line 1694, in your attachment, adding the extra 3 features. Note
>>>>>>>>>>>>> that
>>>>>>>>>>>>> this is not the line you mentioned in your earlier email.
>>>>>>>>>>>>>
>>>>>>>>>>>>> I have no idea why there are extra features. Have you made
>>>>>>>>>>>>> changes to
>>>>>>>>>>>>> any of the core Moses features?
>>>>>>>>>>>>>
>>>>>>>>>>>>> best wishes
>>>>>>>>>>>>> Barry
>>>>>>>>>>>>>
>>>>>>>>>>>>> The offending line:
>>>>>>>>>>>>> what():  Error in line "-5.44027 0 0 -5.34901 0 0 0 -224.872 1 1
>>>>>>>>>>>>> 1 -39
>>>>>>>>>>>>> 18 -26.2331 -40.6736 -44.3698 -82.5072 WT_，~，=3 WT_：~：=1
>>>>>>>>>>>>> WT_“~“=1
>>>>>>>>>>>>> WT_”~”=1 WT_曰~说=1 PL_s3=5 PL_3,2=2 PL_3,3=3 PL_2,3=4 PL_t3=7
>>>>>>>>>>>>> PL_s1=5
>>>>>>>>>>>>> PL_1,2=2 PL_1,1=3 PL_t1=4 PL_2,2=3 PL_t2=7 PL_s2=8 PL_2,1=1 WT_
>>>>>>>>>>>>> 有~有=1
>>>>>>>>>>>>> WT_！~！=1 WT_其~的=1 WT_其~他=1 WT_不~也=1 WT_不~没=1 WT_而~
>>>>>>>>>>>>> 而=1
>>>>>>>>>>>>> WT_而~
>>>>>>>>>>>>> 却=1 WT_祖逖~逖=1 WT_祖逖~祖=1 WT_逖~祖=1 WT_逖~逖=1 WT_大~大
>>>>>>>>>>>>> 江=1
>>>>>>>>>>>>> WT_者~
>>>>>>>>>>>>> 的=1 WT_者~人=1 WT_江~大江=1 WT_渡~渡过=1 WT_复~又=1 WT_余~有=1
>>>>>>>>>>>>> WT_
>>>>>>>>>>>>> 誓~发
>>>>>>>>>>>>> 誓=1 WT_楫~木=1 WT_江~长江=1 WT_击~击=1 WT_将~带领=1 WT_济~成
>>>>>>>>>>>>> 功=1
>>>>>>>>>>>>> WT_中
>>>>>>>>>>>>> 原~中原=1 WT_清~廓清=1 WT_如~像=1 WT_楫~戢=1 WT_能~能=1 WT_中~中
>>>>>>>>>>>>> 流=1 WT_
>>>>>>>>>>>>> 流~中流=1 WT_部曲~部下=1 " of ...
>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>> On 18/01/16 10:37, Dingyuan Wang wrote:
>>>>>>>>>>>>>> Hi,
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> I've attached that. The line number is 1694.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> 在 2016年01月18日 16:43, Barry Haddow 写道:
>>>>>>>>>>>>>>> Hi Dingyuan
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Is it possible to attach the features.dat file that is
>>>>>>>>>>>>>>> causing the
>>>>>>>>>>>>>>> error? Almost certainly Moses is failing to parse the line
>>>>>>>>>>>>>>> because of
>>>>>>>>>>>>>>> the Asian characters in the feature names,
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> cheers - Barry
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> On 16/01/16 15:58, Dingyuan Wang wrote:
>>>>>>>>>>>>>>>> I ran
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> ~/software/moses/bin/kbmira -J 75  --dense-init run7.dense
>>>>>>>>>>>>>>>> --sparse-init
>>>>>>>>>>>>>>>> run7.sparse-weights  --ffile run1.features.dat --ffile
>>>>>>>>>>>>>>>> run2.features.dat
>>>>>>>>>>>>>>>> --ffile run3.features.dat --ffile run4.features.dat --ffile
>>>>>>>>>>>>>>>> run5.features.dat --ffile run6.features.dat --ffile
>>>>>>>>>>>>>>>> run7.features.dat
>>>>>>>>>>>>>>>> --scfile run1.scores.dat --scfile run2.scores.dat --scfile
>>>>>>>>>>>>>>>> run3.scores.dat --scfile run4.scores.dat --scfile
>>>>>>>>>>>>>>>> run5.scores.dat
>>>>>>>>>>>>>>>> --scfile run6.scores.dat --scfile run7.scores.dat -o
>>>>>>>>>>>>>>>> /tmp/mert.out
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> in the tuning/tmp.1 directory, which will certainly
>>>>>>>>>>>>>>>> replicate the
>>>>>>>>>>>>>>>> error.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> 在 2016年01月16日 23:42, Hieu Hoang 写道:
>>>>>>>>>>>>>>>>> The mert script prints out every command it runs. You
>>>>>>>>>>>>>>>>> should be
>>>>>>>>>>>>>>>>> able to
>>>>>>>>>>>>>>>>> replicate the error by running the last command
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> On 16 Jan 2016 14:18, "Dingyuan Wang" <[email protected]
>>>>>>>>>>>>>>>>> <mailto:[email protected]>> wrote:
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>              Sorry, but I can't reliably replicate the same
>>>>>>>>>>>>>>>>> problem
>>>>>>>>>>>>>>>>> when
>>>>>>>>>>>>>>>>> running
>>>>>>>>>>>>>>>>>              TUNING_tune.1 alone. There is no character '_' in
>>>>>>>>>>>>>>>>> the test
>>>>>>>>>>>>>>>>> set
>>>>>>>>>>>>>>>>> or top50
>>>>>>>>>>>>>>>>>              list.
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>              I'm using sparse-features =
>>>>>>>>>>>>>>>>> "target-word-insertion
>>>>>>>>>>>>>>>>> top 50,
>>>>>>>>>>>>>>>>>              source-word-deletion top 50, word-translation
>>>>>>>>>>>>>>>>> top 50
>>>>>>>>>>>>>>>>> 50,
>>>>>>>>>>>>>>>>> phrase-length"
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>              I've attached some related files from EMS and the
>>>>>>>>>>>>>>>>> EMS
>>>>>>>>>>>>>>>>> config.
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>       
>>>>>>>>>>>>>>>>> https://mega.nz/#!xs0SFKxL!M_RTBp1JGX24-b4xlYYLP-bLXKiC_Sl-p96x55avAB4
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>              在 2016年01月16日 02:45, Hieu Hoang 写道:
>>>>>>>>>>>>>>>>>              > could you make your model files available for
>>>>>>>>>>>>>>>>> download so I
>>>>>>>>>>>>>>>>> can
>>>>>>>>>>>>>>>>>              > replicate this problem.
>>>>>>>>>>>>>>>>>              >
>>>>>>>>>>>>>>>>>              > it seems like you're using a feature
>>>>>>>>>>>>>>>>> function with
>>>>>>>>>>>>>>>>> sparse
>>>>>>>>>>>>>>>>> scores. I
>>>>>>>>>>>>>>>>>              > think the character '_' must be escaped.
>>>>>>>>>>>>>>>>>              >
>>>>>>>>>>>>>>>>>              >
>>>>>>>>>>>>>>>>>              > On 12/01/16 04:00, Dingyuan Wang wrote:
>>>>>>>>>>>>>>>>>              >> Hi all,
>>>>>>>>>>>>>>>>>              >>
>>>>>>>>>>>>>>>>>              >> I'm using EMS for doing experiments. Every
>>>>>>>>>>>>>>>>> time the
>>>>>>>>>>>>>>>>> kbmira
>>>>>>>>>>>>>>>>> died with
>>>>>>>>>>>>>>>>>              >> SIGABRT when turning on one direction, while
>>>>>>>>>>>>>>>>> tuning
>>>>>>>>>>>>>>>>> on the
>>>>>>>>>>>>>>>>> opposite
>>>>>>>>>>>>>>>>>              >> direction (same config and test set) was
>>>>>>>>>>>>>>>>> successful.
>>>>>>>>>>>>>>>>>              >>
>>>>>>>>>>>>>>>>>              >> The mert.log (stderr) shows follows:
>>>>>>>>>>>>>>>>>              >>
>>>>>>>>>>>>>>>>>              >>
>>>>>>>>>>>>>>>>>              >> kbmira with c=0.01 decay=0.999 no_shuffle=0
>>>>>>>>>>>>>>>>>              >> Initialising random seed from system clock
>>>>>>>>>>>>>>>>>              >> Found 15323 initial sparse features
>>>>>>>>>>>>>>>>>              >> ....terminate called after throwing an
>>>>>>>>>>>>>>>>> instance of
>>>>>>>>>>>>>>>>>              >> 'MosesTuning::FileFormatException'
>>>>>>>>>>>>>>>>>              >>    what():  Error in line "-4.51933 0 0
>>>>>>>>>>>>>>>>> -6.09733
>>>>>>>>>>>>>>>>> 0 0 0
>>>>>>>>>>>>>>>>> -121.556 2
>>>>>>>>>>>>>>>>>              -20 12
>>>>>>>>>>>>>>>>>              >> -31.6201 -38.5211 -26.5112 -60.6166 WT_，~，=2
>>>>>>>>>>>>>>>>> WT_？~？=1
>>>>>>>>>>>>>>>>> PL_s1=4
>>>>>>>>>>>>>>>>>              >> PL_s3=1 PL_3,3=1 PL_2,2=3 PL_1,2=1 PL_2,1=3
>>>>>>>>>>>>>>>>> PL_t1=6
>>>>>>>>>>>>>>>>> PL_t2=4
>>>>>>>>>>>>>>>>> PL_t3=2
>>>>>>>>>>>>>>>>>              >> PL_2,3=1 PL_s2=7 PL_1,1=3 WT_未~没有=1 WT_何~
>>>>>>>>>>>>>>>>> 怎么=1
>>>>>>>>>>>>>>>>> WT_何~
>>>>>>>>>>>>>>>>> 能=1
>>>>>>>>>>>>>>>>>              WT_方~正
>>>>>>>>>>>>>>>>>              >> 在=1 WT_又~还=1 WT_君~您=2 WT_趣~向=1 WT_
>>>>>>>>>>>>>>>>> 趣~奔=1
>>>>>>>>>>>>>>>>> WT_有~
>>>>>>>>>>>>>>>>> 没有=1
>>>>>>>>>>>>>>>>> WT_
>>>>>>>>>>>>>>>>>              往~去=1
>>>>>>>>>>>>>>>>>              >> WT_官~官员=1 WT_假~借=1 WT_檄~檄文=1 WT_文~
>>>>>>>>>>>>>>>>> 文告=1
>>>>>>>>>>>>>>>>> WT_上~上
>>>>>>>>>>>>>>>>> 级=1 WT_为~
>>>>>>>>>>>>>>>>>              >> 呢=1 WT_在~正在=1 " of run7.features.dat
>>>>>>>>>>>>>>>>>              >> Aborted
>>>>>>>>>>>>>>>>>              >>
>>>>>>>>>>>>>>>>>              >>
>>>>>>>>>>>>>>>>>              >> I think since run7.scores.dat is generated by
>>>>>>>>>>>>>>>>> some
>>>>>>>>>>>>>>>>> scripts, I
>>>>>>>>>>>>>>>>>              wouldn't
>>>>>>>>>>>>>>>>>              >> be responsible for making the bad format. Last
>>>>>>>>>>>>>>>>> time it
>>>>>>>>>>>>>>>>> also
>>>>>>>>>>>>>>>>> died, I
>>>>>>>>>>>>>>>>>              >> removed the likely offending line in the test
>>>>>>>>>>>>>>>>> set, but
>>>>>>>>>>>>>>>>> this time
>>>>>>>>>>>>>>>>>              another
>>>>>>>>>>>>>>>>>              >> line appears.
>>>>>>>>>>>>>>>>>              >>
>>>>>>>>>>>>>>>>>              >> --
>>>>>>>>>>>>>>>>>              >> Dingyuan Wang
>>>>>>>>>>>>>>>>>              >>
>>>>>>>>>>>>>>>>> _______________________________________________
>>>>>>>>>>>>>>>>>              >> Moses-support mailing list
>>>>>>>>>>>>>>>>>              >> [email protected]
>>>>>>>>>>>>>>>>> <mailto:[email protected]>
>>>>>>>>>>>>>>>>>              >>
>>>>>>>>>>>>>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>>>>>>>>>>>>>>              >
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>              --
>>>>>>>>>>>>>>>>>              Dingyuan Wang (gumblex)
>>>>>>>>>>>>>>>>>
>


-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Re: [Moses-support] kbmira died with SIGABRT when tuning

Reply via email to