No problem! The multiwords might not be all that frequent, but the transfer notes really are. At least in dealing with Wikipedia, we could gain a lot by being able to properly translate "El 1994, ..." and by fixing the translation of va + haver + de + inf (e.g. 'va haver de alçar-se').
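To make the two patterns concrete, here is a rough sketch in Python. It is not Apertium transfer-rule syntax, just a plain regex illustration of what the rules would need to match. The function names are made up for this example, only the third person singular auxiliary "va" is covered, and the infinitive is captured without being translated.

```python
import re
from typing import List

# Sentence-initial "El <year>," in Catalan, which should come out as
# "In <year>," in English.
YEAR_RE = re.compile(r"^El (\d{4}),")

# Periphrastic past of "haver de" + infinitive, e.g. "va haver de alçar-se".
# "de" is written "d'" before a vowel in running text, so both are accepted.
HAD_TO_RE = re.compile(r"\bva haver (?:de |d')(\S+)")


def fix_year(sentence: str) -> str:
    """Rewrite a sentence-initial 'El 1994,' as 'In 1994,' (hypothetical helper)."""
    return YEAR_RE.sub(r"In \1,", sentence)


def find_had_to(sentence: str) -> List[str]:
    """Return the infinitives that follow 'va haver de' (hypothetical helper)."""
    return HAD_TO_RE.findall(sentence)
```

A real fix would live in the pair's transfer rules and work on analysed lexical units rather than surface strings; the sketch only shows which spans the rules have to recognise.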
If you need example sentences for any of the notes, please feel free to ask. I don't have them handy, but a few seconds of searching the corpus should turn up good examples.

Fran

On Sun, 9 Oct 2011 at 23:59 +0200, Gema Ramírez-Sánchez wrote:
> Fran,
>
> at Prompsit we usually do some improvements to this pair for UA and
> UPV and UOC maintenance processes and your notes are really helpful
>
> Thanks!
>
> Gema.
>
>
> On Mon, Oct 3, 2011 at 12:15 PM, Francis Tyers <[email protected]> wrote:
> > Hey all,
> >
> > In preparing my test corpus for experiments with lexical selection, I've
> > done a ~30,000 word evaluation of Catalan->English, and have come up
> > with the following list of observations:
> >
> > https://apertium.svn.sourceforge.net/svnroot/apertium/branches/apertium-en-ca/dev/observations.ca-en.txt
> >
> > They include multiwords, transfer rules, missing morphology, and lexical
> > rules. But mainly multiwords.
> >
> > I don't have time to make the changes, but maybe someone on the list is
> > interested. The current error rate is around 45% according to my
> > calculations -- but the texts haven't been properly checked yet.
> >
> > Fran
> >
> > PS. I have started to write a page for discussion of the lexical
> > selection module here:
> >
> > http://wiki.apertium.org/wiki/Constraint-based_lexical_selection_module
> >
> > I would appreciate input on the talk page.

_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
