Hi again,
Yes, that's exactly what I am driving at. You have just explained that
the regexps slows down the transducer. Apparently rules are better added
some other way.
Yours,
Per Tunedal

On Sat, Jun 1, 2013, at 13:33, Francis Tyers wrote:
> I don't understand: you suggest that defining and creating a new module
> for proper names would be far easier than using something that already
> exists ? 
> 
> Fran
> 
> El ds 01 de 06 de 2013 a les 13:30 +0200, en/na Per Tunedal va escriure:
> > Hi,
> > thanks. I will try this out when I'm less busy.
> > What about the possibility to make some kind of add-on to Apertium to
> > handle proper names? It should be far easier than the already present
> > finite state transducer for transliteration, wouldn't it?
> > Yours,
> > Per Tunedal
> > 
> > On Fri, May 31, 2013, at 15:06, Jimmy O'Regan wrote:
> > > On 30 May 2013 18:47, Francis Tyers <[email protected]> wrote:
> > > > El dj 30 de 05 de 2013 a les 19:42 +0200, en/na Per Tunedal va escriure:
> > > >> The most difficult part would be to find the names. Perhaps someone has
> > > >> any ideas?
> > > >
> > > > In Icelandic--English, regular expressions are used. See e.g. pardefs
> > > > for "persons" and "lastnames" in is.dix
> > > >
> > > > This is not altogether recommended though, as regular expressions slow
> > > > down your transducer. What you could do is use them on a large corpus
> > > > and then mass-add the ones after superficial checking.
> > > 
> > > Census data is easy to find, gazetteers for NER are easy to find,
> > > en.wiktionary has categories for names
> > > (http://en.wiktionary.org/wiki/Category:Surnames_by_language
> > > http://en.wiktionary.org/wiki/Category:Male_given_names_by_language
> > > http://en.wiktionary.org/wiki/Category:Female_given_names_by_language),
> > > as do en.wikipedia (http://en.wikipedia.org/wiki/Category:Surnames
> > > http://en.wikipedia.org/wiki/Category:Given_names), da.wikipedia
> > > (http://da.wikipedia.org/wiki/Kategori:Efternavne
> > > http://da.wikipedia.org/wiki/Kategori:Fornavne), and sv.wikipedia
> > > (http://sv.wikipedia.org/wiki/Kategori:Efternamn
> > > http://sv.wikipedia.org/wiki/Kategori:Förnamn), and Europarl has
> > > speaker annotation which contains the name of the speaker.
> > > 
> > > -- 
> > > <Sefam> Are any of the mentors around?
> > > <jimregan> yes, they're the ones trolling you
> > > 
> > > ------------------------------------------------------------------------------
> > > Get 100% visibility into Java/.NET code with AppDynamics Lite
> > > It's a free troubleshooting tool designed for production
> > > Get down to code-level detail for bottlenecks, with <2% overhead.
> > > Download for free and get started troubleshooting in minutes.
> > > http://p.sf.net/sfu/appdyn_d2d_ap2
> > > _______________________________________________
> > > Apertium-stuff mailing list
> > > [email protected]
> > > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
> > 
> > ------------------------------------------------------------------------------
> > Get 100% visibility into Java/.NET code with AppDynamics Lite
> > It's a free troubleshooting tool designed for production
> > Get down to code-level detail for bottlenecks, with <2% overhead.
> > Download for free and get started troubleshooting in minutes.
> > http://p.sf.net/sfu/appdyn_d2d_ap2
> > _______________________________________________
> > Apertium-stuff mailing list
> > [email protected]
> > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
> 
> 
> 
> 
> ------------------------------------------------------------------------------
> Get 100% visibility into Java/.NET code with AppDynamics Lite
> It's a free troubleshooting tool designed for production
> Get down to code-level detail for bottlenecks, with <2% overhead.
> Download for free and get started troubleshooting in minutes.
> http://p.sf.net/sfu/appdyn_d2d_ap2
> _______________________________________________
> Apertium-stuff mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff

------------------------------------------------------------------------------
Get 100% visibility into Java/.NET code with AppDynamics Lite
It's a free troubleshooting tool designed for production
Get down to code-level detail for bottlenecks, with <2% overhead.
Download for free and get started troubleshooting in minutes.
http://p.sf.net/sfu/appdyn_d2d_ap2
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to