Hi,
It would be interesting to know more about how to auto-trim a
monolingual dictionary to the words present in the bidix. It would be
very useful for my work on Norwegian Bokmål (nb) to Swedish (sv),
and, of course, for "correcting" the pair Danish (da) to Swedish (sv).
I've thought about commenting out offending entries with some clever
script and/or keeping a full dix in parallel. I don't want to lose the
full dictionaries, as I hope the bidix will gradually grow.
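
A minimal sketch of the "full dix in parallel" idea, assuming the usual
dix XML layout (monodix entries as <e lm="..."> inside <section>, bidix
pairs as <e><p><l>..</l><r>..</r></p></e>). The function names are
hypothetical, and matching is by lemma only: part-of-speech tags are
ignored, so this is only a rough approximation of real trimming. The
full dictionary is left untouched on disk; the trimmed tree would be
written to a separate file.

```python
import xml.etree.ElementTree as ET


def side_text(elem):
    """Return the surface text of an <l>, <r> or <i> element.

    <b/> stands for a space; <s n="..."/> tags contribute no text.
    """
    parts = []

    def walk(node):
        if node.tag == "b":
            parts.append(" ")
        if node.text:
            parts.append(node.text)
        for child in node:
            walk(child)
            if child.tail:
                parts.append(child.tail)

    walk(elem)
    return "".join(parts)


def bidix_lemmas(bidix_root, side="l"):
    """Collect the lemmas on one side of the bidix ("l" or "r").

    Identity entries (<i>) count for both sides.
    """
    lemmas = set()
    for section in bidix_root.iter("section"):
        for node in section.iter():
            if node.tag in (side, "i"):
                lemmas.add(side_text(node))
    return lemmas


def trim_monodix(mono_root, keep):
    """Remove monodix entries whose lm attribute is not in `keep`.

    Entries without an lm attribute are kept, to stay on the safe side.
    Returns the number of removed entries.
    """
    removed = 0
    for section in mono_root.iter("section"):
        for entry in list(section.findall("e")):
            lemma = entry.get("lm")
            if lemma is not None and lemma not in keep:
                section.remove(entry)
                removed += 1
    return removed
```

With real files one would load both dictionaries with ET.parse(), call
trim_monodix(mono.getroot(), bidix_lemmas(bidix.getroot())) and save the
result with mono.write() under a new (hypothetical) name such as
trimmed.dix. Note that ElementTree drops XML comments when parsing, so
the trimmed copy is for compiling, not for editing.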

BTW, I'm impressed by your work: taking on such a complicated task and
managing to accomplish it. Apertium is apparently very useful for
understanding small languages, for which statistical approaches would
probably be out of the question.

Yours,
Per Tunedal

On Wed, Sep 19, 2012, at 17:56, Kevin Brubeck Unhammer wrote:
--snip--
> 
> The bilingual dictionary has 21243 entries (plus 43890 proper nouns),
> the sme monolingual _should_ be auto-trimmed to those entries (it seemed
> testvoc clean, but that's currently a bit hard to test with HFST
> analysers). The nob dictionary is copied over from nn-nb and is
> generation-only, so currently untrimmed with 98427 entries.
> 
---snip---
> 
> 
> -- 
> Kevin Brubeck Unhammer
> 
> GPG: 0x766AC60C
> 
> 
> _______________________________________________
> Apertium-stuff mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
