Thanks Hieu,
I wan to exclude the <s> because I want to translate chunks of source
sentences  with one model, and then add them  and their score as extra
feature to a phrase table of a different model.
So I don't want the sentence boundaries to be involved in the translation.

Also, I trained a factored system with  --translation-factors 0-0,1. The
training process ended successfully and I do not see any error with the
training.out file.
But the tuning and decoding is ending up with Segmentation Fault error when
loading the phrase table and when it reaches 3% when loading.
I have attached the mert.out.
Would it be possible to know the reason, or which phrases in the phrase
table is causing the interruption in loading?
Thanks,
Salameh






On Fri, Oct 17, 2014 at 12:57 PM, Hieu Hoang <[email protected]> wrote:

>  sorry, must have missed your email. Answers below
>
> On 16/10/14 20:21, Mohammad Salameh wrote:
>
> Hi,
> any answer to the above questions,
> Thanks,
> Salameh
>
> On Fri, Oct 10, 2014 at 10:11 AM, Mohammad Salameh <[email protected]>
> wrote:
>
>> Hi
>> I have few questions on how Moses system works
>>
>>  1) would it be possible to do a factored translation where factors
>> appear in the output but do not be part of the translation process. For
>> example, I have English surface form on source side and  Arabic surface and
>> their stems on the target side. I want to translate from English surface
>> form to Arabic surface, but also see the stems accompanying the surface
>> forms in the output.
>>  I have tried setting --translation-factors 0-0 , but only ended up with
>> the Arabic surface forms in the output.
>>
>   I'm not sure what you mean by 'not be part of the translation process'.
> If you want to see the stem in the output but you don't want it in the
> translation table, then there needs to be some process that generate the
> stem, given the target word. Moses has a crude solution - it is called the
> generation step.
>
>
>>
>>  2) when translating sentences with moses , I assume that moses adds the
>> sentence boundary markers <s> </s> automatically. Would it be possible to
>> exclude these from the translation. I need to get translation scores for
>> chunks of input sentences which does not involve scores generated based on
>> <s> and </s> from LM or phrase table.
>>
>   Yes, it include <s> </s>. No, you can't exclude these from the
> translation process.
>
> I'm curious to know why you want to exclude these
>
>
>>  3) I added additional phrases to the phrase table. Should the phrase
>> table be sorted again and is it enough to do "LC_ALL=C sort " on the PT to
>> be used properly ?
>>
>   Yes, it needs to be sorted again. You must also make sure that the new
> phrases are not duplicates of existing phrases
>
>
>>  Thanks
>>
>> _______________________________________________
>> Moses-support mailing list
>> [email protected]
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>
>
> _______________________________________________
> Moses-support mailing 
> [email protected]http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
>

Attachment: mert.out
Description: Binary data

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to