El dt 13 de 04 de 2010 a les 15:03 +0530, en/na Vineet Chaitanya va
escriure:
> 
> 
> On Tue, Apr 13, 2010 at 2:25 PM, Francis Tyers <[email protected]>
> wrote:
>         El dt 13 de 04 de 2010 a les 11:47 +0530, en/na Vineet
>         Chaitanya va
>         escriure:
>         >
>         >
>         > On Tue, Apr 13, 2010 at 8:06 AM, Francis Tyers
>         <[email protected]>
>         > wrote:
>         >         Is this the validated output ? I notice some
>         diagnostics
>         >         remaining (e.g.
>         >         # and @).
>         >
>         >
>         >    It is a "validated output":).
>         
>         
>         Ok.
>         
>         >    These cases are considered harmless!
>         
>         
>         Oh, we don't consider a package for release until there are no
>         diagnostics left (this is 'testvoc').
>         
>    We are dealing with unrestricted texts. What do you do for proper
> nouns? They are practically unlimited.

There are a couple of options, either:

1) We use a regular expression to match common proper noun sequences,
e.g. Mr. [A-Z]. [A-Z][a-z]+ etc.
2) We pass them through as unknown words, the '*' is not considered a
diagnostic.

Fran



------------------------------------------------------------------------------
Download Intel&#174; Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to