Thanks Amittai, I’ll check it out.
> On Jan 23, 2016, at 1:10 PM, amittai axelrod <amit...@umiacs.umd.edu> wrote:
> 
> whoops, forgot link. see "class-based models" section in:
> http://kheafield.com/code/kenlm/estimation/
> 
> ~amittai
> 
> 
> On 1/23/16 13:08, amittai axelrod wrote:
>>> The reason for using Witten-Bell was because
>>> Kneser-Ney wasn’t able to cope up with the counts being generated for
>>> coarse language models.
>> 
>> that is indeed an annoyance with kndiscount. however, you can now try
>> using "--discount_fallback" with kenlm. it works for me, even with tens
>> of classes.
>> 
>> cheers,
>> ~amittai
>> 
>> On 1/23/16 07:11, Jasneet Sabharwal wrote:
>>> Thanks Ken & Hieu,
>>> 
>>> I’ll give KenLM a try. The reason for using Witten-Bell was because
>>> Kneser-Ney wasn’t able to cope up with the counts being generated for
>>> coarse language models. Sp, I’ll train my LM using SRILM with ngram
>>> order 8 and WB smoothing and use KenLM with  order 8 in Moses.
>>> 
>>> Best,
>>> Jasneet
>>>> On Jan 23, 2016, at 3:38 AM, Kenneth Heafield <mo...@kheafield.com
>>>> <mailto:mo...@kheafield.com>> wrote:
>>>> 
>>>> Hi,
>>>> 
>>>> You can compile with --max-kenlm-order=8 or change the setting in the
>>>> Eclipse files.
>>>> 
>>>> The ARPA file format is interchangeable.  You can build an ARPA using
>>>> SRILM and Witten-Bell (though Bob Moore once called me out at a
>>>> conference for suggesting that as an alternative to Kneser-Ney) then
>>>> load with KenLM.
>>>> 
>>>> Kenneth
>>>> 
>>>> On 01/23/2016 05:39 AM, Jasneet Sabharwal wrote:
>>>>> Thanks Hieu.
>>>>> 
>>>>> I’m using the eclipse project for development. I followed your video to
>>>>> set it up and I have linked the srilm and irstlm installations in the
>>>>> root directory of mosesdecoder. I first tried to compile the project,
>>>>> but neither the SRILM nor the IRSTLM LM cpp files get compiled. So, I
>>>>> added LM_IRST and included "${workspace_loc}/../../irstlm/include” path
>>>>> in the C/C++ Build settings of the project. But I still cannot compile
>>>>> IRST.cpp.
>>>>> 
>>>>> The reason I’m not using the included KenLM is because my new feature
>>>>> function requires an 8-gram language model with witten bell smoothing,
>>>>> which is provided by SRILM. As, IRSTLM can use SRILM generated language
>>>>> models, so I decided to call IRSTLM code inside my feature function to
>>>>> get the score for a phrase.
>>>>> 
>>>>> Any pointers on how can I debug the eclipse project with IRSTLM/SRILM?
>>>>> 
>>>>> Best,
>>>>> Jasneet
>>>>> 
>>>>> PS: When I compile the whole project using "./bjam -j4
>>>>> —with-boost=<absolute path to boost> —with-cmph=<absolute path to cmph>
>>>>> —with-irstlm=<absolute path to irstlm>”, it successfully compiles
>>>>> without any errors.
>>>>> 
>>>>> 
>>>>>> On Jan 19, 2016, at 4:39 PM, Hieu Hoang <hieuho...@gmail.com
>>>>>> <mailto:hieuho...@gmail.com>
>>>>>> <mailto:hieuho...@gmail.com>> wrote:
>>>>>> 
>>>>>> I believe Nadir Durrani's OSM uses KenLM inside it. You can look in
>>>>>>  moses/FF/OSM-Feature
>>>>>> for tips
>>>>>> 
>>>>>> On 20/01/16 00:31, Jasneet Sabharwal wrote:
>>>>>>> Thanks Hieu.
>>>>>>> 
>>>>>>> One last question. What do you think is the best way to load the
>>>>>>> SRILM language model inside my custom feature function and to get a
>>>>>>> score for a string that my feature function created?
>>>>>>> 
>>>>>>> Best,beli
>>>>>>> Jasneet
>>>>>>>> On Jan 17, 2016, at 3:45 AM, Hieu Hoang
>>>>>>>> <<mailto:hieuho...@gmail.com>hieuho...@gmail.com
>>>>>>>> <mailto:hieuho...@gmail.com>> wrote:
>>>>>>>> 
>>>>>>>> 
>>>>>>>> 
>>>>>>>> On 17/01/16 04:05, Jasneet Sabharwal wrote:
>>>>>>>>> Thanks Hieu,
>>>>>>>>> 
>>>>>>>>> I had subscribed to the mailing list and I’m getting the digest,
>>>>>>>>> but not sure why my email went for your approval. When I get the
>>>>>>>>> alignments from GetAlignTerm(), the index of the source word is
>>>>>>>>> relative? To get the index in the source sentence, I’m assuming
>>>>>>>>> that I would need to get the starting position of the source words
>>>>>>>>> from CurrSourceWordsRange().GetStartPos() from current hypothesis
>>>>>>>>> and offset the source alignment index with that value?
>>>>>>>> yep. And to get the index in the target sentence, use
>>>>>>>> GetCurrTargetWordsRange().GetStartPos()
>>>>>>>>> 
>>>>>>>>> Regards,
>>>>>>>>> Jasneet
>>>>>>>>>> On Jan 15, 2016, at 3:43 AM, Hieu Hoang <hieuho...@gmail.com
>>>>>>>>>> <mailto:hieuho...@gmail.com>> wrote:
>>>>>>>>>> 
>>>>>>>>>> please subscribe to the Moses mailing list before posting to it.
>>>>>>>>>> You can subscribe here:
>>>>>>>>>> http://mailman.mit.edu/mailman/admin/moses-support
>>>>>>>>>> To answer you question - the target phrase has a method called
>>>>>>>>>>  GetAlignTerm()
>>>>>>>>>> that contains the alignment for terminals. This comes from the
>>>>>>>>>> phrase-table, and ultimately from the word alignment.
>>>>>>>>>> 
>>>>>>>>>> -------- Forwarded Message --------
>>>>>>>>>> Subject:Moses-support post from jasneet.sabhar...@sfu.ca
>>>>>>>>>> <mailto:jasneet.sabhar...@sfu.ca> requires
>>>>>>>>>> approval
>>>>>>>>>> Date:Wed, 13 Jan 2016 23:36:50 -0500
>>>>>>>>>> From:moses-support-ow...@mit.edu
>>>>>>>>>> <mailto:moses-support-ow...@mit.edu>
>>>>>>>>>> To:moses-support-ow...@mit.edu
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> As list administrator, your authorization is requested for the
>>>>>>>>>> following mailing list posting:
>>>>>>>>>> 
>>>>>>>>>>   List:    Moses-support@mit.edu
>>>>>>>>>>   From:    jasneet.sabhar...@sfu.ca
>>>>>>>>>>   Subject: Getting alignments for current hypothesis in phrase
>>>>>>>>>> based model
>>>>>>>>>>   Reason:  Post by non-member to a members-only list
>>>>>>>>>> 
>>>>>>>>>> At your convenience, visit:
>>>>>>>>>> 
>>>>>>>>>>   http://mailman.mit.edu/mailman/admindb/moses-support
>>>>>>>>>> 
>>>>>>>>>> to approve or deny the request.
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> <ForwardedMessage.eml><ForwardedMessage.eml>
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> _______________________________________________
>>>>>>>>> Moses-support mailing list
>>>>>>>>> Moses-support@mit.edu <mailto:Moses-support@mit.edu>
>>>>>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>>>>> 
>>>>>>>> --
>>>>>>>> Hieu Hoang
>>>>>>>> http://www.hoang.co.uk/hieu
>>>>>>> 
>>>>>> 
>>>>>> --
>>>>>> Hieu Hoang
>>>>>> http://www.hoang.co.uk/hieu
>>>>> 
>>>>> 
>>>>> 
>>>>> _______________________________________________
>>>>> Moses-support mailing list
>>>>> Moses-support@mit.edu <mailto:Moses-support@mit.edu>
>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>> 
>>>> _______________________________________________
>>>> Moses-support mailing list
>>>> Moses-support@mit.edu <mailto:Moses-support@mit.edu>
>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>> 
>>> 
>>> 
>>> _______________________________________________
>>> Moses-support mailing list
>>> Moses-support@mit.edu
>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>> 
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support


_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to