Hi,

Are you planning to use binary domain indicator features? I'm not sure
whether a sparse feature function for this is currently implemented. If
you're working with a small set of domains, you can employ dense
indicators instead (domain-features = "indicator" in EMS). You'll have
to re-extract the phrase table, though. Or process it with a script to
add dense indicator values to the scores field.

I believe that there might also be some bug in the extraction pipeline
when both domain-features = "sparse indicator" and score-settings =
"--GoodTuring" are active in EMS. At least it caused me trouble a couple
of weeks ago. However, I must admit that I didn't investigate it further
at that point.

Anyway, the bottom line is that I recommend re-extracting with dense
indicators.

But let me know what you find regarding a sparse implementation.

Cheers,
Matthias


On Thu, 2015-07-16 at 00:48 +0100, jian zhang wrote:
> Hi,
> 
> 
> Is the sparse features at phrase table, like
> 
> 
> 
> das Haus ||| the house ||| 0.8 0.5 0.8 0.5 2.718 ||| 0-0 1-1 ||| 5000
> 5000 2500 ||| dom_europarl 1
> 
> 
> 
> still supported? If yes, what should I set to the ini file based on
> the example above?
> 
> 
> Thank,
> 
> 
> Jian
> 
> 
> -- 
> Jian Zhang
> Centre for Next Generation Localisation (CNGL)
> Dublin City University
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support



-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to