Hi,
the transliteration seems straight forward.
I suppose just passing over a name unchanged between Swedish and Danish
would be easy? The national characters are written differently, though.
Could they be added to the character sets or would that complicate
things? It seams a bit odd to transliterate them (Ärlig - Ærlig, Östen -
Østen).

If regexps for names would slow down Apertium, I suppose the same
applies to numbers. Something smarter might be useful.

What exactly are the pardefs in the pair is-sv supposed to do? I'm
always lost when working with regexps :-(

   <pardef n="persons">
      <!-- Ásta Árnadóttir, Ásta Á. Árnadóttir, Ásta Eva Árnardóttir -->
      <e><re>[A-ZÞÁÐÉÍÓÚÝÖÅ][a-záðéíóúýöå]+
      ([A-ZÞÁÐÉÍÓÚÝÖÅ][a-záðéíóúýöå]+|[A-ZÞÁÐÉÍÓÚÝÖÅ])?.?
      ?[A-ZÞÁÐÉÍÓÚÝÖÅ][a-záðéíóúýöå]+d</re><p><l></l><r></r></p><par
      n="Ásta_Árnad/óttir__np"/></e>
      <!-- Davíð Oddsson, Davíð D. Oddsson, Davíð Gunnar Oddson -->
      <e><re>[A-ZÞÁÐÉÍÓÚÝÖÅ][a-záðéíóúýöå]+
      ([A-ZÞÁÐÉÍÓÚÝÖÅ][a-záðéíóúýöå]+|[A-ZÞÁÐÉÍÓÚÝÖÅ])?.?
      ?[A-ZÞÁÐÉÍÓÚÝÖÅ][a-záðéíóúýöå]+ss</re><p><l></l><r></r></p><par
      n="Almar_Þórarinss/on__np"/></e>
      <e><re>[A-ZÞÁÐÉÍÓÚÝÖÅ][a-záðéíóúýöå]+
      ([A-ZÞÁÐÉÍÓÚÝÖÅ][a-záðéíóúýöå]+|[A-ZÞÁÐÉÍÓÚÝÖÅ])?.?
      ?[A-ZÞÁÐÉÍÓÚÝÖÅ][a-záðéíóúýöå]+s</re><p><l></l><r></r></p><par
      n="Snorri_Guðjohns/en__np"/></e>
    </pardef>

Yours,
Per Tunedal


On Sat, Jun 1, 2013, at 13:48, Francis Tyers wrote:
> Here, I've documented the feature.
> 
> http://wiki.apertium.org/wiki/Transliteration
> 
> Fran
> 
> El ds 01 de 06 de 2013 a les 13:39 +0200, en/na Per Tunedal va escriure:
> > Hi again,
> > Yes, that's exactly what I am driving at. You have just explained that
> > the regexps slows down the transducer. Apparently rules are better added
> > some other way.
> > Yours,
> > Per Tunedal
> > 
> > On Sat, Jun 1, 2013, at 13:33, Francis Tyers wrote:
> > > I don't understand: you suggest that defining and creating a new module
> > > for proper names would be far easier than using something that already
> > > exists ? 
> > > 
> > > Fran
> > > 
> > > El ds 01 de 06 de 2013 a les 13:30 +0200, en/na Per Tunedal va escriure:
> > > > Hi,
> > > > thanks. I will try this out when I'm less busy.
> > > > What about the possibility to make some kind of add-on to Apertium to
> > > > handle proper names? It should be far easier than the already present
> > > > finite state transducer for transliteration, wouldn't it?
> > > > Yours,
> > > > Per Tunedal
> > > > 
> > > > On Fri, May 31, 2013, at 15:06, Jimmy O'Regan wrote:
> > > > > On 30 May 2013 18:47, Francis Tyers <[email protected]> wrote:
> > > > > > El dj 30 de 05 de 2013 a les 19:42 +0200, en/na Per Tunedal va 
> > > > > > escriure:
> > > > > >> The most difficult part would be to find the names. Perhaps 
> > > > > >> someone has
> > > > > >> any ideas?
> > > > > >
> > > > > > In Icelandic--English, regular expressions are used. See e.g. 
> > > > > > pardefs
> > > > > > for "persons" and "lastnames" in is.dix
> > > > > >
> > > > > > This is not altogether recommended though, as regular expressions 
> > > > > > slow
> > > > > > down your transducer. What you could do is use them on a large 
> > > > > > corpus
> > > > > > and then mass-add the ones after superficial checking.
> > > > > 
> > > > > Census data is easy to find, gazetteers for NER are easy to find,
> > > > > en.wiktionary has categories for names
> > > > > (http://en.wiktionary.org/wiki/Category:Surnames_by_language
> > > > > http://en.wiktionary.org/wiki/Category:Male_given_names_by_language
> > > > > http://en.wiktionary.org/wiki/Category:Female_given_names_by_language),
> > > > > as do en.wikipedia (http://en.wikipedia.org/wiki/Category:Surnames
> > > > > http://en.wikipedia.org/wiki/Category:Given_names), da.wikipedia
> > > > > (http://da.wikipedia.org/wiki/Kategori:Efternavne
> > > > > http://da.wikipedia.org/wiki/Kategori:Fornavne), and sv.wikipedia
> > > > > (http://sv.wikipedia.org/wiki/Kategori:Efternamn
> > > > > http://sv.wikipedia.org/wiki/Kategori:Förnamn), and Europarl has
> > > > > speaker annotation which contains the name of the speaker.
> > > > > 
> > > > > -- 
> > > > > <Sefam> Are any of the mentors around?
> > > > > <jimregan> yes, they're the ones trolling you
> > > > > 
> > > > > ------------------------------------------------------------------------------
> > > > > Get 100% visibility into Java/.NET code with AppDynamics Lite
> > > > > It's a free troubleshooting tool designed for production
> > > > > Get down to code-level detail for bottlenecks, with <2% overhead.
> > > > > Download for free and get started troubleshooting in minutes.
> > > > > http://p.sf.net/sfu/appdyn_d2d_ap2
> > > > > _______________________________________________
> > > > > Apertium-stuff mailing list
> > > > > [email protected]
> > > > > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
> > > > 
> > > > ------------------------------------------------------------------------------
> > > > Get 100% visibility into Java/.NET code with AppDynamics Lite
> > > > It's a free troubleshooting tool designed for production
> > > > Get down to code-level detail for bottlenecks, with <2% overhead.
> > > > Download for free and get started troubleshooting in minutes.
> > > > http://p.sf.net/sfu/appdyn_d2d_ap2
> > > > _______________________________________________
> > > > Apertium-stuff mailing list
> > > > [email protected]
> > > > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
> > > 
> > > 
> > > 
> > > 
> > > ------------------------------------------------------------------------------
> > > Get 100% visibility into Java/.NET code with AppDynamics Lite
> > > It's a free troubleshooting tool designed for production
> > > Get down to code-level detail for bottlenecks, with <2% overhead.
> > > Download for free and get started troubleshooting in minutes.
> > > http://p.sf.net/sfu/appdyn_d2d_ap2
> > > _______________________________________________
> > > Apertium-stuff mailing list
> > > [email protected]
> > > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
> > 
> > ------------------------------------------------------------------------------
> > Get 100% visibility into Java/.NET code with AppDynamics Lite
> > It's a free troubleshooting tool designed for production
> > Get down to code-level detail for bottlenecks, with <2% overhead.
> > Download for free and get started troubleshooting in minutes.
> > http://p.sf.net/sfu/appdyn_d2d_ap2
> > _______________________________________________
> > Apertium-stuff mailing list
> > [email protected]
> > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
> 
> 
> 
> 
> ------------------------------------------------------------------------------
> Get 100% visibility into Java/.NET code with AppDynamics Lite
> It's a free troubleshooting tool designed for production
> Get down to code-level detail for bottlenecks, with <2% overhead.
> Download for free and get started troubleshooting in minutes.
> http://p.sf.net/sfu/appdyn_d2d_ap2
> _______________________________________________
> Apertium-stuff mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff

------------------------------------------------------------------------------
Get 100% visibility into Java/.NET code with AppDynamics Lite
It's a free troubleshooting tool designed for production
Get down to code-level detail for bottlenecks, with <2% overhead.
Download for free and get started troubleshooting in minutes.
http://p.sf.net/sfu/appdyn_d2d_ap2
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to