El dt 13 de 04 de 2010 a les 15:03 +0530, en/na Vineet Chaitanya va escriure: > > > On Tue, Apr 13, 2010 at 2:25 PM, Francis Tyers <[email protected]> > wrote: > El dt 13 de 04 de 2010 a les 11:47 +0530, en/na Vineet > Chaitanya va > escriure: > > > > > > On Tue, Apr 13, 2010 at 8:06 AM, Francis Tyers > <[email protected]> > > wrote: > > Is this the validated output ? I notice some > diagnostics > > remaining (e.g. > > # and @). > > > > > > It is a "validated output":). > > > Ok. > > > These cases are considered harmless! > > > Oh, we don't consider a package for release until there are no > diagnostics left (this is 'testvoc'). > > We are dealing with unrestricted texts. What do you do for proper > nouns? They are practically unlimited.
There are a couple of options, either: 1) We use a regular expression to match common proper noun sequences, e.g. Mr. [A-Z]. [A-Z][a-z]+ etc. 2) We pass them through as unknown words, the '*' is not considered a diagnostic. Fran ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
