hi mohammad
On 17 October 2014 21:45, Mohammad Salameh <[email protected]> wrote: > Thanks Hieu, > I wan to exclude the <s> because I want to translate chunks of source > sentences with one model, and then add them and their score as extra > feature to a phrase table of a different model. > So I don't want the sentence boundaries to be involved in the translation. > I understand. Moses doesn't allow you to exclude <s>, however, if you don't want the score for this, then maybe you should write a feature function to subtract it from the score. Or modify an existing language model to not score <s> > > Also, I trained a factored system with --translation-factors 0-0,1. The > training process ended successfully and I do not see any error with the > training.out file. > But the tuning and decoding is ending up with Segmentation Fault error > when loading the phrase table and when it reaches 3% when loading. > I have attached the mert.out. > Would it be possible to know the reason, or which phrases in the phrase > table is causing the interruption in loading? > Can you also send the moses.ini file you used, and the EXACT command you executed. > Thanks, > Salameh > > > > > > > On Fri, Oct 17, 2014 at 12:57 PM, Hieu Hoang <[email protected]> wrote: > >> sorry, must have missed your email. Answers below >> >> On 16/10/14 20:21, Mohammad Salameh wrote: >> >> Hi, >> any answer to the above questions, >> Thanks, >> Salameh >> >> On Fri, Oct 10, 2014 at 10:11 AM, Mohammad Salameh <[email protected]> >> wrote: >> >>> Hi >>> I have few questions on how Moses system works >>> >>> 1) would it be possible to do a factored translation where factors >>> appear in the output but do not be part of the translation process. For >>> example, I have English surface form on source side and Arabic surface and >>> their stems on the target side. I want to translate from English surface >>> form to Arabic surface, but also see the stems accompanying the surface >>> forms in the output. >>> I have tried setting --translation-factors 0-0 , but only ended up >>> with the Arabic surface forms in the output. >>> >> I'm not sure what you mean by 'not be part of the translation >> process'. If you want to see the stem in the output but you don't want it >> in the translation table, then there needs to be some process that generate >> the stem, given the target word. Moses has a crude solution - it is called >> the generation step. >> >> >>> >>> 2) when translating sentences with moses , I assume that moses adds >>> the sentence boundary markers <s> </s> automatically. Would it be possible >>> to exclude these from the translation. I need to get translation scores for >>> chunks of input sentences which does not involve scores generated based on >>> <s> and </s> from LM or phrase table. >>> >> Yes, it include <s> </s>. No, you can't exclude these from the >> translation process. >> >> I'm curious to know why you want to exclude these >> >> >>> 3) I added additional phrases to the phrase table. Should the phrase >>> table be sorted again and is it enough to do "LC_ALL=C sort " on the PT to >>> be used properly ? >>> >> Yes, it needs to be sorted again. You must also make sure that the new >> phrases are not duplicates of existing phrases >> >> >>> Thanks >>> >>> _______________________________________________ >>> Moses-support mailing list >>> [email protected] >>> http://mailman.mit.edu/mailman/listinfo/moses-support >>> >>> >> >> >> _______________________________________________ >> Moses-support mailing >> [email protected]http://mailman.mit.edu/mailman/listinfo/moses-support >> >> >> > -- Hieu Hoang Research Associate University of Edinburgh http://www.hoang.co.uk/hieu
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
