Sí, tens raó. To solve this we really need to take in more context. The
'by' in this case would probably disambiguate. An almost certainly with
the 'is'. I think there are _very_ few examples where

  is ... PAST by  

should be chosen over

  is ... PP by

Incidentally, if anyone is interested, I've added a Google Summer of
Code idea to the ideas page for 'Rule-based finite-state
disambiguation'. 

Fran

El dl 07 de 02 de 2011 a les 12:12 +0100, en/na Mireia va escriure:
> Hello Fran,
> 
> I've struggled a lot with the English tagger and didn't obtain good
> results... 
> Think about sentences like "Wikimedia, among other participants,
> organized a photo contest ..."
> 
> Mireia
> 
> 
> El dl 07 de 02 de 2011 a les 09:28 +0000, en/na Francis Tyers va
> escriure:
> > Some questions:
> > 
> > O: Wiki Loves Monuments is a public photo contest around monuments, 
> >    organized by Wikimedia chapters.
> > 
> > G: Wiki estima Monuments és un concurs de fotografia al voltant de 
> >    monuments públics, organitzats per capítols, l'enciclopèdia lliure.
> > 
> > A: *Wiki Estima els monuments és una competició de foto pública
> >    al voltant de monuments, va organitzar per *Wikimedia capítols.
> > 
> > The tagger is taking the past for 'organised' instead of the participle,
> > leading to 'va organitzar' instead of 'organitzada'. Could we perhaps
> > add a prefer rule for the tagger for a comma followed by PAST|PP to
> > choose PP ?
> > 
> > I think the other errors can't be fixed, e.g. 
> > 
> > 1) "concurs de fotografia pública" sounds better than "competició de
> >     foto pública" to me (non-native speaker) -- kind of a big,
> >     infrequent multiword though.
> > 2) 'around' is difficult, it even sounds weird to me in English here, 
> >    the meaning is 'relating to' or 'about'. But working that out would
> >    probably require semantics or some fancy lexical stuff
> > 3) chapter, as in part of an organisation is translated as 'capítol'. 
> >    Of all the translations of 'capítol' in Catalan, I think this isn't
> >    one of them. Again, difficult/impossible to resolve.
> > 
> > On the other hand, there is another nice example of Google messing up
> > proper nouns:
> > 
> >  Wikimedia chapters. --> l'enciclopèdia lliure.
> > 
> > Regards,
> > 
> > Fran
> > 
> > 
> > ------------------------------------------------------------------------------
> > The modern datacenter depends on network connectivity to access resources
> > and provide services. The best practices for maximizing a physical server's
> > connectivity to a physical network are well understood - see how these
> > rules translate into the virtual world? 
> > http://p.sf.net/sfu/oracle-sfdevnlfb
> > _______________________________________________
> > Apertium-stuff mailing list
> > Apertium-stuff@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/apertium-stuff
> 
> 
> 
> ------------------------------------------------------------------------------
> The modern datacenter depends on network connectivity to access resources
> and provide services. The best practices for maximizing a physical server's
> connectivity to a physical network are well understood - see how these
> rules translate into the virtual world? 
> http://p.sf.net/sfu/oracle-sfdevnlfb
> _______________________________________________
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff



------------------------------------------------------------------------------
The modern datacenter depends on network connectivity to access resources
and provide services. The best practices for maximizing a physical server's
connectivity to a physical network are well understood - see how these
rules translate into the virtual world? 
http://p.sf.net/sfu/oracle-sfdevnlfb
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to