Re: No rules, but by example

2014-08-19 Thread R.J. Baars
Postags are a challenge. There are so many words having that amount of postags, it will be hard to get those really wel determined. I will spend some time in the disambiguator, for assigning postags as deleting ambiguous ones where possible. First we have to establish a better postagging system

Re: No rules, but by example

2014-08-19 Thread Daniel Naber
On 2014-08-19 13:43, R.J. Baars wrote: > hand some; wrong; handsome > I hand some tools to; correct > It is a very compact way of defining very simple rules. I see, but the thing is that these rules probably won't stay simple for long. What if you want to add "you hand some tools", "we hand som

Re: No rules, but by example

2014-08-19 Thread R.J. Baars
What I mean is just making a list of token groups, good and bad. I'll try a different example: hand some; wrong; handsome I hand some tools to; correct Another one: bene; wrong;been nota bene;correct It is a very compact way of defining very simple rules. I encounter rules that work fine, but

Re: No rules, but by example

2014-08-19 Thread Daniel Naber
On 2014-08-19 09:14, R.J. Baars wrote: > bed : ok > bed english : not ok => bad english > For some types of errors, I think it works better then current > rule/exception type of check. I'm not sure I understand: do you suggest a different (more compact) way to write down simple rules, or do you

No rules, but by example

2014-08-19 Thread R.J. Baars
I think a good addition for LT would be to have a general rule, just acting on tokens, a bit like srx does wit letters. bed : ok bed english : not ok => bad english A mechanism, that lets the longer token list overrule the shorter one. This would create the option to add found errors empirically.