Hello Hieu,

Thanks for your reply.

Just to reconfirm your answers:

>Phrase-table and lexical reordering- no problem. LM - only ken LM. There
is no visual studio project for IRSTLM

1. I can build the binarized language model, binarized phrase table and
binarized lexical reordering table, all in *linux* and directly use them
with the moses decoder compiled in *native* *windows*, and it should work
fine right?

2. If I want to build the binarized phrase table and binarized lexical
reordering table in *native windows*, I can still use GIZA. And if I want
to build the binarized language model in *native windows*, KenLM has a
visual studio project for it?

Thank you
Krishna


On Fri, Oct 5, 2012 at 4:31 PM, Hieu Hoang <[email protected]> wrote:

>  hi krishna
>
>
> On 04/10/2012 18:16, krishna nanda wrote:
>
> Hello Hieu,
>
>  I have some questions on getting moses to work on native windows:
>
>  1. I see that under other builds, there is a moses visual studio
> project. Is that project up to date?
>
> i'm not sure. I was told it was working about a month ago. It should be
> reasonably up-to-date
>
>
>  2. From this thread:
> http://comments.gmane.org/gmane.comp.nlp.moses.user/6991
> I understand that moses scripts for training/tuning have not been ported
> to native windows
>
> correct. It requires cygwin
>
>  .
>
>  And from here: http://www.statmt.org/moses/?n=Moses.FAQ#ntoc9
> I understand that moses decoder works fine in native windows.
>
> The visual studio project compiles the decoder natively. See Q1.
>
>
> So, if I build a binarized language model, binarized phrase table and
> binarized reordering table in linux, would it be ok to use these binarized
> models with the decoder in native windows?
>
> Phrase-table and lexical reordering- no problem. LM - only ken LM. There
> is no visual studio project for IRSTLM
>
>
>  3. Lastly, I also saw in the above thread that you are porting KenLM to
> windows. That would be very helpful, as I was looking to build binarized
> language models in native windows.
>
> it's included.
> NB - Both KenLM and IRSTLM uses memory mapping. Therefore, to use binary
> LM bigger than 2GB, a 64-bit OS has to be used.
> NB2. Cygwin is 32-bit, even on 64-bit windows.
>
>
>  Thank you Hieu
> Krishna
>
>
>  On Mon, Sep 24, 2012 at 7:31 AM, krishna nanda 
> <[email protected]>wrote:
>
>> Hello Hieu,
>>
>>  It was more out of curiosity. Since when we build moses, we do specify
>> one of the four language models (Ken/IRST/SRI/Rand), I was wondering how
>> easy it is to modify moses to accept some other language model.
>>
>>  When you say: "The Moses framework is allow anyone to write their own
>> LM wrapper so that it can be used in the decoder"
>>
>>  do you mean if I do have an ARPA file created from some language model,
>> I can use it with moses like how I use IRSTLM with moses, just that the
>> ARPA file is not created using IRSTLM.
>>
>>  I am experimenting creating a small ARPA file from raw n gram data and
>> feeding it to moses. I have n grams (1,2,3) and their probabilities and
>> backoff weights. I formatted them as required by ARPA (
>> http://www.speech.sri.com/projects/srilm/manpages/ngram-format.5.html)
>>
>>  But, when I try to binarize the ARPA file in moses, I get the following
>> error:
>> *
>> *
>> *"The context of every 3-gram should appear as a 2-gram"*
>>
>>  Since the 2 grams and 3 grams are extracted from the same data, I was
>> not sure why the above error message would not be true. I traced the error
>> to the file "lm/search_hashed.cc". The ARPA formatting in itself seems ok
>> to me. I am not sure what I might be missing.
>>
>>  Thanks for your time Hieu
>>  Krishna
>>
>>
>>   On Tue, Sep 18, 2012 at 5:31 PM, Hieu Hoang 
>> <[email protected]>wrote:
>>
>>>  glad to hear that it's working with cygwin. If anyone out there who's
>>> willing to occasionally test Moses on cygwin and report any problems, I
>>> will be very grateful.
>>>
>>> I'm curious, what is the benefit of the MS Web LM and MITLM? The Moses
>>> framework is allow anyone to write their own LM wrapper so that it can be
>>> used in the decoder. If these other LM have advantages, it'll be good to
>>> incorporate them.
>>>
>>>
>>> On 18/09/2012 04:36, krishna nanda wrote:
>>>
>>> Hello Hieu,
>>>
>>>  Thanks for your reply. Yes, I managed to run moses in cygwin without
>>> any problems. No changes were required.
>>>
>>>  I have another question:
>>> I saw that moses has support for Ken/IRST/SRI/Rand Language models.
>>> But is there an easy way to consume other language models like the
>>> Microsoft web language model or MITLM in moses?
>>>
>>>  Thank you
>>> Krishna
>>>
>>>
>>>
>>>  On Fri, Aug 24, 2012 at 10:01 PM, Hieu Hoang 
>>> <[email protected]>wrote:
>>>
>>>>  the last 2 number (4 31) are not scores and are ignored. They are
>>>> counts that was used to calculate the probabilities.
>>>>
>>>> there's no option to calculate joint probabilities. i suppose you need
>>>> to calculate p(s) or p(t) which can be done, but may require a lot of
>>>> memory. try it yourself and add it to moses if it works.
>>>>
>>>> so you managed to run moses in cygwin all the way to getting a bleu
>>>> score? was there anything you need to change?
>>>>
>>>> On 23/08/2012 16:57, krishna nanda wrote:
>>>>
>>>> Hello Hoang and Barry,
>>>>
>>>>  Thanks a lot for your reply. I was able to install moses and run it
>>>> in cygwin. I have some quick questions:
>>>>
>>>>  I found the format (contents) of the phrase table here:
>>>> http://www.statmt.org/moses/?n=FactoredTraining.ScorePhrases
>>>> according to which, there are 5 scores for a phrase pair.
>>>>
>>>>  But, the phrase table I generated from the news corpus has 7 scores
>>>> for a phrase pair like this:
>>>> ,au cours de ||| ,as of ||| .25 2.36556e-06 0.0333581 0.00128335 2.718
>>>> ||| 4 31
>>>> I was not clear on the above format.
>>>>
>>>>  Secondly, is there an option to also generate joint probabilities of
>>>> phrase pairs?
>>>>
>>>>  Thank you
>>>> Krishna
>>>>
>>>>
>>>>  On Sun, Aug 19, 2012 at 11:12 PM, Hieu Hoang <[email protected]
>>>> > wrote:
>>>>
>>>>>  running Moses on cygwin should be the same as running it on linux or
>>>>> mac. If you have any problems, please get back to us.
>>>>>
>>>>> the document you pointed to is old now, i've changed the website to
>>>>> reflect that.
>>>>>
>>>>> On 17/08/2012 08:45, krishna nanda wrote:
>>>>>
>>>>>  Hello,
>>>>>
>>>>>  I am looking to install Moses in Cygwin. However, I found the
>>>>> document on the website under "windows installation" to be not up to date:
>>>>> http://www.statmt.org/moses/?n=Development.GetStarted
>>>>>
>>>>>  It still uses "regenerate-makefiles.sh", which is not there in the
>>>>> latest source from github. Is there an updated version?
>>>>>
>>>>>  Thank you
>>>>> Krishna
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>   _______________________________________________
>>>>> Moses-support mailing 
>>>>> [email protected]http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>>
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Moses-support mailing list
>>>>> [email protected]
>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>>
>>>>>
>>>>
>>>>
>>>
>>>
>>
>
>
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to