Hi,

It would be interesting to know more about how to auto-trim a monolingual dictionary to the words present in the bidix. That would suit my work on Norwegian Bokmål (nb) to Swedish (sv) very well, and of course it would also help with "correcting" the Danish (da) to Swedish (sv) pair. I've thought of commenting out the offending entries with some clever script and/or keeping a full dix in parallel. I don't want to lose the full dictionaries, as I hope the bidix will gradually grow.
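
A minimal sketch of such a script, to make the idea concrete. It assumes the usual .dix XML layout (monodix entries as `<e lm="...">` inside `<section>`, bidix lemmas as the text of `<l>`/`<r>`, with `<b/>` standing for a space), and it matches on lemma only, ignoring part of speech. The function names `bidix_lemmas` and `trim_monodix` and the sample data are made up for illustration; this is not how Apertium itself trims.

```python
import xml.etree.ElementTree as ET

# Tiny made-up dictionaries, just to exercise the functions below.
BIDIX = """<dictionary>
  <section id="main" type="standard">
    <e><p><l>hus<s n="n"/></l><r>hus<s n="n"/></r></p></e>
    <e><p><l>i<b/>dag<s n="adv"/></l><r>i<b/>dag<s n="adv"/></r></p></e>
  </section>
</dictionary>"""

MONODIX = """<dictionary>
  <section id="main" type="standard">
    <e lm="hus"><i>hus</i><par n="hus__n"/></e>
    <e lm="bil"><i>bil</i><par n="bil__n"/></e>
    <e lm="i dag"><i>i<b/>dag</i><par n="i_dag__adv"/></e>
  </section>
</dictionary>"""


def lemma_of(side_elem):
    """Rebuild the lemma text of an <l> or <r> element (<b/> becomes a space)."""
    parts = [side_elem.text or ""]
    for child in side_elem:
        if child.tag == "b":
            parts.append(" ")
        # Text following a child element, e.g. "dag" after <b/>.
        parts.append(child.tail or "")
    return "".join(parts)


def bidix_lemmas(bidix_xml, side="l"):
    """Collect the lemmas found on one side ('l' or 'r') of a bidix."""
    root = ET.fromstring(bidix_xml)
    return {lemma_of(elem) for elem in root.iter(side)}


def trim_monodix(monodix_xml, lemmas):
    """Return (trimmed XML, removed lemmas); the input string is left untouched."""
    root = ET.fromstring(monodix_xml)
    removed = []
    for section in root.iter("section"):
        for entry in list(section.findall("e")):
            lm = entry.get("lm")
            # Entries without an lm attribute are kept as they are.
            if lm is not None and lm not in lemmas:
                section.remove(entry)
                removed.append(lm)
    return ET.tostring(root, encoding="unicode"), removed
```

Because the trimmed dictionary is written out as a separate copy, the full monodix stays intact and the trim can simply be re-run whenever the bidix grows.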

BTW, I'm impressed by your work: taking on such a complicated task and managing to accomplish it. Apparently, Apertium is very useful for understanding small languages. Statistical approaches would probably be out of the question.

Yours,
Per Tunedal

On Wed, Sep 19, 2012, at 17:56, Kevin Brubeck Unhammer wrote:
--snip--
> The bilingual dictionary has 21243 entries (plus 43890 proper nouns),
> the sme monolingual _should_ be auto-trimmed to those entries (it seemed
> testvoc clean, but that's currently a bit hard to test with HFST
> analysers). The nob dictionary is copied over from nn-nb and is
> generation-only, so currently untrimmed with 98427 entries.
> ---snip---
>
> --
> Kevin Brubeck Unhammer
>
> GPG: 0x766AC60C

_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
