In the transliteration transducer you don't use regular expressions. Fran
El ds 01 de 06 de 2013 a les 13:39 +0200, en/na Per Tunedal va escriure: > Hi again, > Yes, that's exactly what I am driving at. You have just explained that > the regexps slows down the transducer. Apparently rules are better added > some other way. > Yours, > Per Tunedal > > On Sat, Jun 1, 2013, at 13:33, Francis Tyers wrote: > > I don't understand: you suggest that defining and creating a new module > > for proper names would be far easier than using something that already > > exists ? > > > > Fran > > > > El ds 01 de 06 de 2013 a les 13:30 +0200, en/na Per Tunedal va escriure: > > > Hi, > > > thanks. I will try this out when I'm less busy. > > > What about the possibility to make some kind of add-on to Apertium to > > > handle proper names? It should be far easier than the already present > > > finite state transducer for transliteration, wouldn't it? > > > Yours, > > > Per Tunedal > > > > > > On Fri, May 31, 2013, at 15:06, Jimmy O'Regan wrote: > > > > On 30 May 2013 18:47, Francis Tyers <[email protected]> wrote: > > > > > El dj 30 de 05 de 2013 a les 19:42 +0200, en/na Per Tunedal va > > > > > escriure: > > > > >> The most difficult part would be to find the names. Perhaps someone > > > > >> has > > > > >> any ideas? > > > > > > > > > > In Icelandic--English, regular expressions are used. See e.g. pardefs > > > > > for "persons" and "lastnames" in is.dix > > > > > > > > > > This is not altogether recommended though, as regular expressions slow > > > > > down your transducer. What you could do is use them on a large corpus > > > > > and then mass-add the ones after superficial checking. > > > > > > > > Census data is easy to find, gazetteers for NER are easy to find, > > > > en.wiktionary has categories for names > > > > (http://en.wiktionary.org/wiki/Category:Surnames_by_language > > > > http://en.wiktionary.org/wiki/Category:Male_given_names_by_language > > > > http://en.wiktionary.org/wiki/Category:Female_given_names_by_language), > > > > as do en.wikipedia (http://en.wikipedia.org/wiki/Category:Surnames > > > > http://en.wikipedia.org/wiki/Category:Given_names), da.wikipedia > > > > (http://da.wikipedia.org/wiki/Kategori:Efternavne > > > > http://da.wikipedia.org/wiki/Kategori:Fornavne), and sv.wikipedia > > > > (http://sv.wikipedia.org/wiki/Kategori:Efternamn > > > > http://sv.wikipedia.org/wiki/Kategori:Förnamn), and Europarl has > > > > speaker annotation which contains the name of the speaker. > > > > > > > > -- > > > > <Sefam> Are any of the mentors around? > > > > <jimregan> yes, they're the ones trolling you > > > > > > > > ------------------------------------------------------------------------------ > > > > Get 100% visibility into Java/.NET code with AppDynamics Lite > > > > It's a free troubleshooting tool designed for production > > > > Get down to code-level detail for bottlenecks, with <2% overhead. > > > > Download for free and get started troubleshooting in minutes. > > > > http://p.sf.net/sfu/appdyn_d2d_ap2 > > > > _______________________________________________ > > > > Apertium-stuff mailing list > > > > [email protected] > > > > https://lists.sourceforge.net/lists/listinfo/apertium-stuff > > > > > > ------------------------------------------------------------------------------ > > > Get 100% visibility into Java/.NET code with AppDynamics Lite > > > It's a free troubleshooting tool designed for production > > > Get down to code-level detail for bottlenecks, with <2% overhead. > > > Download for free and get started troubleshooting in minutes. > > > http://p.sf.net/sfu/appdyn_d2d_ap2 > > > _______________________________________________ > > > Apertium-stuff mailing list > > > [email protected] > > > https://lists.sourceforge.net/lists/listinfo/apertium-stuff > > > > > > > > > > ------------------------------------------------------------------------------ > > Get 100% visibility into Java/.NET code with AppDynamics Lite > > It's a free troubleshooting tool designed for production > > Get down to code-level detail for bottlenecks, with <2% overhead. > > Download for free and get started troubleshooting in minutes. > > http://p.sf.net/sfu/appdyn_d2d_ap2 > > _______________________________________________ > > Apertium-stuff mailing list > > [email protected] > > https://lists.sourceforge.net/lists/listinfo/apertium-stuff > > ------------------------------------------------------------------------------ > Get 100% visibility into Java/.NET code with AppDynamics Lite > It's a free troubleshooting tool designed for production > Get down to code-level detail for bottlenecks, with <2% overhead. > Download for free and get started troubleshooting in minutes. > http://p.sf.net/sfu/appdyn_d2d_ap2 > _______________________________________________ > Apertium-stuff mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/apertium-stuff ------------------------------------------------------------------------------ Get 100% visibility into Java/.NET code with AppDynamics Lite It's a free troubleshooting tool designed for production Get down to code-level detail for bottlenecks, with <2% overhead. Download for free and get started troubleshooting in minutes. http://p.sf.net/sfu/appdyn_d2d_ap2 _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
