Hi Keld, I haven't decided yet on the corpus to work on. My initial idea was to do some web crawling after texts in the two languages involved. I believe there is a tool available. Maybe at Uppsala University? Anyone who has any idea on how to proceed?
I thought there was a pair SV - EN to work on, thus I intended to quickly set up a minor working system (maybe limited to a restraint domain). Now, when there isn't any such pair, nor any DA - EN pair, I will have to think it over once more. My main interest is French, a language completely different from Swedish. If I would work on the pair SV - FR instead, I will have to consider using an other tool than Apertium. I will have a look at Matxin and MOSES (statistical). Is it possible to use dictionaries and other resources from Apertium with Matxin? Yours, Per Tunedal On Wed, Apr 4, 2012, at 16:00, [email protected] wrote: > On Wed, Apr 04, 2012 at 03:35:39PM +0200, Per Tunedal wrote: -- snip -- > > > > 4. The similarity of Swedish and Norwegian leads me to believe that the > > transfer rules could be somewhat similar for the two languages, although > > Swedish and Danish are considered more similar. --snip-- > > > I suppose some kind of cooperation would be beneficial if I start to > > work on the pair SE - EN. Maybe splitting up the list of words needing > > transfer rules between us. Maybe I can reuse your bilingual dictionary > > (translating Norwegian to Swedish) and you can reuse entries I add to > > mine (translating Swedish to Norwegian). > > We do have a sv-da module with quite some grammar encoded. > > What data are you working on for Swedish? I have been looking at some > data from Gothenburg. > SALDO: http://spraakbanken.gu.se/resurs/saldo > I have some plans/hopes for a da-en module. > > I also like the idea of reusing sv and nb/nn resources and grammar. > > > > I might as well cooperate with the developers of the DA - EN pair, by > > the same reasons. > > I am not sure there is a da-en pair. > > > Maybe it would be easier to start with the SE - NO pair, building on the > > SE - DA? There are great similarities between the Norwegian bokm??l > > (written language variant) and Danish. But I haven't any real knowledge > > of neither Danish, nor of Norwegian :-( > > > > 5. English is quite distant from Swedish and Norwegian. Would it be more > > fruitful to use Matxin instead of Apertium? What's the difference? > > > > Yours, > > Per Tunedal > > --snip-- > > best regards > keld > --snip-- ------------------------------------------------------------------------------ Better than sec? Nothing is better than sec when it comes to monitoring Big Data applications. Try Boundary one-second resolution app monitoring today. Free. http://p.sf.net/sfu/Boundary-dev2dev _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
