Per Tunedal <[email protected]>
writes:

> Hi,
>
> On Thu, Aug 9, 2012, at 23:23, Trosterud Trond wrote:
>> 
>> Per Tunedal kirjoitti 9. aug. 2012 kello 20:21:
>> > Tihomir has told before that he plans to start developing a constraint
>> > grammar for Swedish.
>> 
>> Good. Again: 
>> - Are there open resources?
>> - Could something be ported from Norwegian? (perhaps only indirectly).
>> 
>> >> Yes, a production system (say, I want to translate a sv article to nn on
>> >> Wikipedia) (…)
>> 
>> > Yes, that was the scenario I first had in mind. But it would break if
>> > there is a need for a constraint grammar, wouldn't it? And then there
>> > wont be any use left for the Apertium-translation.
>> 
>> Well. Since a handful of rules will remove most ambiguities, what is left
>> will be partly disambiguated. And how bad this is for MT needs to be
>> seen. So it will not break. It will only be more problematic, and the
>> result will be poorer.
>
> Mikel Artetxe has explained that the OmegaT plug-in doesn't work for
> language pairs that depends on programs that aren't a part of
> lttoolbox-java. Six language pairs depend on the Constraint Grammar
> package and are thus excluded, one of them is apertium-nn-nb. But sv-da
> doesn't use any constraint grammar, thus I concluded that sv-nb (Norsk
> bokmål) wouldn't need one either. And would come to real use, by real
> translators, using OmegaT. If the pair cannot be used, I don't see any
> need to develop it.

In any case a CG could be added later, as an option for those who aren't
using OmegaT or Android.


[...]

>> > What kind of resources do I need?
>> 
>> For 1: swetwol :-) But it seems there are resources in Gothenburg:
>> 
>> http://www.cse.chalmers.se/alumni/markus/FM/
>> http://www.cse.chalmers.se/alumni/markus/FM/download/swedish.lexicon
>> 
>> This might even work.
>
> As an input for transfer rules or for a potential constraint grammar?

The .lexicon file might be used to enlarge sv.dix. However, is-sv.sv.dix
should already be big enough to get a pair started.

(What's the license on that FM stuff anyway?)

>> >> As for 2, Lexin might be one resource. I am on Euralex in Oslo right now,
>> >> and will ask around.
>> > Fine! Besides, what's Lexin?
>> 
>> Lexicon för invandrare, http://lexin.nada.kth.se/lexin/
>
> As a native Swede, I don't see any need for this.

Your machine translation system, however, is not a native Swede, and
might have a need to know that e.g. "katt" is a noun.

But it doesn't seem that Lexin is free software:

    Går lexikonen att ladda ner?
    Nej. Däremot kan man ladda ner Folkets lexikon, som ersätter engelska 
Lexin, men enbart i xml-format. 
    http://lexin.nada.kth.se/lexin/#about=1;main=3;


-- 
Kevin Brubeck Unhammer

Sent using my emacs


------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to