Hi Hwidong

I ran your model and it segfaults, as you said. There is a bug that is
triggered by the word 'X'.

I've checked in the fix:
    
http://mosesdecoder.svn.sourceforge.net/viewvc/mosesdecoder?view=rev&revision=2643

Thanks for the phrase table.

Hieu

Hwidong Na wrote:
> Hi Hieu,
>
> Here are the ini file (run1.moses.ini) and a fuller trace of the output
> (log). The rule table used in decoding is not very large, so you can
> download it (http://home.postech.ac.kr/~leona/rule-table.gz). It
> should help you reproduce the error.
>
> It seems strange to me too, since I do not see the X problem with
> the sample data (http://www.statmt.org/moses/download/sample-models.tgz).
>
> Best regards,
> 2009-12-30 (Wed), 09:18 +0000, Hieu Hoang:
>   
>> the X problem is strange. I can't reproduce the error, e.g. for the input
>>    X the man bought the house
>> or with an X in the phrase table
>>    [X] [X] ||| X ||| hello ||| ||| 0.17
>>
>> can you send me the ini file and a fuller trace of the output?
>>
>> thx
>> hieu
>>
>> Hwidong Na wrote:
>>     
>>> Hi all.
>>>
>>> I mean the decoder "moses_chart" and the script "mert-moses-new.pl" in
>>> the mt3_chart branch. (I follow "Compiling Chart Decoder" in
>>> http://www.statmt.org/moses/?n=Development.GetStarted)
>>>
>>> Thanks to John, I tried the option "--no-filter-phrase-table" and it works
>>> on a small corpus. With that option there seems to be no difference
>>> between "mert-moses.pl" and "mert-moses-new.pl", i.e. "mert-moses.pl"
>>> also supports the option.
>>>
>>> There is another problem during tree-based decoding. If a source
>>> sentence contains the word "X", then "moses_chart" gives a segmentation
>>> fault, as follows.
>>>  
>>>         Translating: <s> X 1 の 好ましい 範囲 は 、 15 〜 30 % まで の 深 さ で ある 。 </s>  
>>> ||| [0,0]=X (1) [0,1]=X (1) [0,2]=X (1) [0,3]=X (1) [0,4]=X (1) 
>>> ...ellipsis...
>>>         [0..0]=1 [1..1]=2 [2..2]=50 [3..3]=200 ...ellipsis... [0..1]=2 
>>> [1..2]=Segmentation fault
>>>         Exit code: 139
>>>         The decoder died. CONFIG WAS -w 0.000000 -lm 0.500000 -d 0.500000  
>>>
>>> It does not crash when I replace "X" with another word such as "-X-".
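
The workaround described above (swapping a standalone "X" token for "-X-" before decoding) could be applied as a preprocessing step on the input. The following is only a minimal sketch under assumptions: the function name `escape_token` and the whitespace tokenisation are hypothetical and not part of Moses; only the "X" to "-X-" substitution comes from the message above.

```python
def escape_token(line, token="X", replacement="-X-"):
    """Replace every whitespace-delimited occurrence of `token` in `line`.

    Hypothetical helper: only exact, standalone tokens are replaced, so
    words that merely contain the letter (e.g. "XML") are left alone.
    """
    return " ".join(replacement if tok == token else tok for tok in line.split())


if __name__ == "__main__":
    # Beginning of the input sentence from the trace above.
    print(escape_token("X 1 の 好ましい 範囲 は"))
```

The same substitution would have to be undone on the output side if the token is expected to pass through to the translation.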
>>>
>>> 2009-12-29 (Tue), 21:00 +0000, Hieu Hoang:
>>>   
>>>       
>>>> You mean the decoder in the mt3_chart branch?
>>>>
>>>> I'm not sure the 
>>>>    mert-moses-new.pl 
>>>> works yet. I think only the 
>>>>    mert-moses.pl
>>>> is working.
>>>>
>>>> also, make sure you're using the script from the mt3_chart branch;
>>>> there are small, niggly differences.
>>>>
>>>> 2009/12/29 John Morgan <[email protected]>
>>>>         There is an option --no-filter-phrase-table for
>>>>         mert-moses-new.pl.
>>>>         
>>>>         
>>>>         
>>>>         On 12/29/09, Hwidong Na <[email protected]> wrote:
>>>>         > I also have a problem with tuning a tree-based system.
>>>>         >
>>>>         > I used a parallel text without syntactic annotation to train the
>>>>         > tree-based system, so the extracted rule table has only the single
>>>>         > non-terminal [X] on both the source and target side (Japanese-to-English).
>>>>         >
>>>>         >         $ gzip -dc <path-to-model>/rule-table.gz | sort -k8 -t'|' -nr | head -n5
>>>>         >         [X] [X] ||| 高 開口 数 による 高 解像度 の ||| with high resolution by means of a high numerical aperture |||  ||| 1 2.07446e-05 1 2.2417e-08 2.718 ||| 0.0232558 0.0232558
>>>>         >         [X] [X] ||| 高 開口 数 による 高 [X] ||| with high [X] high numerical aperture ||| 5-2 ||| 1 0.000225055 0.5 0.000282326 2.718 ||| 0.0232558 0.0465116
>>>>         >         [X] [X] ||| 高 開口 数 による 高 [X] ||| with high [X] a high numerical aperture ||| 5-2 ||| 1 0.000225055 0.5 1.64576e-05 2.718 ||| 0.0232558 0.0465116
>>>>         >         [X] [X] ||| 高 開口 数 [X] 解像度 の ||| [X] resolution by means of a high numerical aperture ||| 3-0 ||| 1 0.00589926 1 5.15592e-07 2.718 ||| 0.0232558 0.0232558
>>>>         >         [X] [X] ||| 高 開口 数 [X] の ||| [X] of a high numerical aperture ||| 3-0 ||| 1 0.0503172 1 0.000170465 2.718 ||| 0.0232558 0.0232558
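
To inspect rule-table entries like the ones above programmatically, each line can be split on the "|||" delimiter. This is a rough sketch, not Moses code: the helper name `split_rule` is hypothetical, and reading the six fields as labels, source, target, alignment, scores and counts is an assumption based only on the sample output above.

```python
def split_rule(line):
    """Split one rule-table line on the '|||' delimiter and strip each field."""
    return [field.strip() for field in line.rstrip("\n").split("|||")]


# Last rule from the sample above, joined back onto a single line.
rule = ("[X] [X] ||| 高 開口 数 [X] の ||| [X] of a high numerical aperture "
        "||| 3-0 ||| 1 0.0503172 1 0.000170465 2.718 ||| 0.0232558 0.0232558")

fields = split_rule(rule)
# Assumed position of the score field (fifth field in the sample).
scores = [float(s) for s in fields[4].split()]

if __name__ == "__main__":
    for index, field in enumerate(fields):
        print(index, repr(field))
    print(scores)
```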
>>>>         >
>>>>         >
>>>>         > Passing "moses_chart" to "mert-moses-new" then goes wrong.
>>>>         > The first step of decoding is filtering the translation table. The
>>>>         > filtering script prints:
>>>>         >
>>>>         >         filtering <path-to-model>/rule-table.gz -> <path-to-tuning>/filtered/phrase-table.0-0.1...
>>>>         >         0 of 550276 phrases pairs used (0.00%) - note: max length 10
>>>>         >         filtering <path-to-model>/glue-grammar -> <path-to-tuning>/filtered/phrase-table.0-0.2...
>>>>         >         0 of 3 phrases pairs used (0.00%) - note: max length 10
>>>>         >
>>>>         > This looks wrong. How can I filter correctly (or should I just not
>>>>         > filter)?
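
A filtering run that keeps nothing, as in the output above, can be caught automatically before tuning continues. The sketch below is an illustration under assumptions: the function name `filter_kept_nothing` is hypothetical, and the pattern is derived only from the two summary lines quoted above, not from the filtering script itself.

```python
import re

# Matches summary lines such as
# "0 of 550276 phrases pairs used (0.00%) - note: max length 10".
SUMMARY = re.compile(r"(\d+) of (\d+) phrases? pairs used")


def filter_kept_nothing(log_line):
    """Return True if a summary line reports that no entries survived filtering."""
    match = SUMMARY.search(log_line)
    if match is None:
        return False  # not a summary line
    used, total = int(match.group(1)), int(match.group(2))
    return total > 0 and used == 0


if __name__ == "__main__":
    line = "0 of 550276 phrases pairs used (0.00%) - note: max length 10"
    if filter_kept_nothing(line):
        print("filtering removed every entry; consider --no-filter-phrase-table")
```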
>>>>         >
>>>>         >> From: Philipp Koehn <pko...@...>
>>>>         >> Subject: Re: tuning tree-based models
>>>>         >> Newsgroups: gmane.comp.nlp.moses.user
>>>>         >> Date: 2009-12-28 23:56:11 GMT (12 hours and 2 minutes ago)
>>>>         >>
>>>>         >> Hi,
>>>>         >>
>>>>         >> this should work... Does the moses process generate a proper n-best list
>>>>         >> file?
>>>>         >> There may be something wrong with running the decoder.
>>>>         >>
>>>>         >> Regarding the section "non-terminals" in the moses.ini file - don't worry,
>>>>         >> this is just a list of special non-terminals that are used for unknown
>>>>         >> words etc.
>>>>         >>
>>>>         >> -phi
>>>>         >>
>>>>         >> On Mon, Dec 28, 2009 at 2:37 PM, John Morgan
>>>>         >> <johnjosephmor...@...> wrote:
>>>>         >> > Hello All,
>>>>         >> > I want to tune a tree-based trained system.  I pass the moses_chart
>>>>         >> > decoder to the mert-moses-new.perl script.  The script seems to make 1
>>>>         >> > decoding run producing features.dat and scores.dat files, but it
>>>>         >> > writes an empty extract.out file.  Then the script stops.
>>>>         >> > I also noticed that the train-moses.perl script only writes out a "X"
>>>>         >> > under  [non-terminals] in the moses.ini file.  I have several
>>>>         >> > non-terminals on the input side.
>>>>         >> > Thanks,
>>>>         >> >
>>>>         >> >
>>>>         >> > --
>>>>         >> > Regards,
>>>>         >> > John J Morgan
>>>>         >> > _______________________________________________
>>>>         >> > Moses-support mailing list
>>>>         >> > moses-supp...@...
>>>>         >> > http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>         >> >
>>>>         > --
>>>>         > Hwidong Na <[email protected]>
>>>>         > POSTECH
>>>>         
>>>>         
>>>>         
>>>>         --
>>>>         Regards,
>>>>         John J Morgan
>>
>>
>>
>>     
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
