Re: [lingu-dev] compound words

2006-07-03 Thread Bjoern JACKE
Hi Simon, On 2006-07-03 at 20:01 +0200 Simon Brouwer sent off: Thanks for this useful explanation! I will take your recommendations to heart when implementing compounding in the Dutch spell checker files. Did you do the checking manually, or did you use some software for this? I did the checki

Re: [lingu-dev] compound words

2006-07-03 Thread ge
BJ wrote: >Ispell and aspell both don't have sufficient support for >agglutinative languages with complex compound word rules, >unfortunately. Just for the sake of fairness and to be precise: Ispell is the first spell checker, that introduced the powerful affix concept especially for a

Re: [lingu-dev] compound words

2006-07-03 Thread Simon Brouwer
Hi Bjoern, Bjoern JACKE schreef: On 2006-06-30 at 21:17 +0200 Daniel Naber sent off: On Freitag 30 Juni 2006 11:42, Simon Brouwer wrote: It might an idea to identify problem cases by running the list of known-good words through the suggestion mechanism, and making a list of all the variations

Re: [lingu-dev] compound words

2006-07-03 Thread Bjoern JACKE
On 2006-07-03 at 15:16 +0200 [EMAIL PROTECTED] sent off: BJ wrote: Cases like this and cases like Daniel mentions have to be put into a blacklist which has to be flagged with hunspell's FORBIDDENWORD flag. I found recently, that aspell can not (yet) handle "FORBIDDENWORD" flag or analogous

[lingu-dev] compound words

2006-07-03 Thread eleonora46
BJ wrote: > Cases like this > and cases like Daniel mentions have to be put into a blacklist which > has to be flagged with hunspell's FORBIDDENWORD flag. I found recently, that aspell can not (yet) handle "FORBIDDENWORD" flag or analogous mechanizm. Of course, ispell cannot either. This also

Re: [lingu-dev] compound words

2006-07-03 Thread Bjoern JACKE
On 2006-06-30 at 21:17 +0200 Daniel Naber sent off: On Freitag 30 Juni 2006 11:42, Simon Brouwer wrote: It might an idea to identify problem cases by running the list of known-good words through the suggestion mechanism, and making a list of all the variations that are accepted (only) using the

Re: [lingu-dev] Compound words

2006-07-01 Thread Jancs
Citēju ge <[EMAIL PROTECTED]>: > (Probably if I were them, I would do the same). Therefore their spell checker > remains rather static and contains generally much less real (not generated) > words, than the language, it checks.. yes, that's true. Additionally, it seems they are going the way of g

[lingu-dev] Compound words

2006-07-01 Thread ge
Hello, Simon, > Also, Dutch spell checking in MS Word, which is generally considered > quite good, appears to use mechanical compounding as it accepts certain > contrived, nonsensical compounds. This at least suggests that it is not > an *obviously* bad idea, for Dutch at least. Yes, this is also

Re: [lingu-dev] compound words

2006-06-30 Thread Simon Brouwer
Hi Daniel, Daniel Naber schreef: On Freitag 30 Juni 2006 11:42, Simon Brouwer wrote: It might an idea to identify problem cases by running the list of known-good words through the suggestion mechanism, and making a list of all the variations that are accepted (only) using the mechanical com

Re: [lingu-dev] Compound words

2006-06-30 Thread Simon Brouwer
Hi Eleonora, ge schreef: Dear Simon, I think *that* would be throwing away the child. In languages like Dutch it is so easy to form a new but perfectly valid word by compounding, it is impossible to include all the possible combinations in a word list. << All is impossible, I agree. But

Re: [lingu-dev] compound words

2006-06-30 Thread Daniel Naber
On Freitag 30 Juni 2006 11:42, Simon Brouwer wrote: > It might an idea to identify problem cases by running the list of > known-good words through the suggestion mechanism, and making a list of > all the variations that are accepted (only) using the mechanical > compound mechanism. This list could

[lingu-dev] Compound words

2006-06-30 Thread ge
Dear Simon, >> I think *that* would be throwing away the child. In languages like Dutch it is so easy to form a new but perfectly valid word by compounding, it is impossible to include all the possible combinations in a word list. << All is impossible, I agree. But it is not impossible to find

Re: [lingu-dev] compound words

2006-06-30 Thread Simon Brouwer
Hi Eleonora ge wrote: You are right, I found that there is very often just one more or less character, that makes the mechanically compounded word senseless and erroneous. However, this would cause the elimination of a lot of potentially good words, therefore this needs to be verified. Maybe th

[lingu-dev] compound words

2006-06-30 Thread ge
On Friday 30 June 2006 09:52 am, Simon Brouwer wrote: > > The shorter the words, the more catastrophic the error rate. > > It might then be a good idea if the spell checker would reject guessed > compounds below a certain minimum length (configurable in the affix file). Yes, I know, that aspell al

Re: [lingu-dev] compound words

2006-06-30 Thread Simon Brouwer
Hi Eleonora, ge wrote: Dear All, The investigation below will be interesting for those, whose language actively uses compound words (Hungarian, German, Dutch, Swedish, ...) I now closed the investigation of Hungarian compound words creation. The results are in http://tkltrans.sourceforge.

[lingu-dev] compound words

2006-06-29 Thread ge
Dear All, The investigation below will be interesting for those, whose language actively uses compound words (Hungarian, German, Dutch, Swedish, ...) I now closed the investigation of Hungarian compound words creation. The results are in http://tkltrans.sourceforge.net/tklspell/compound.htm#c

[lingu-dev] compound words

2005-08-21 Thread ge
Dear Bram, My study about compound words might be interesting for you: http://tkltrans.sourceforge.net/tklspell/compound.htm Regards, Eleonora Hello spell checking colleagues, As you probably know I have added spell checking facilities to Vim.  I'm using the word lists in Myspell compatible for

Re: [lingu-dev] compound words

2005-08-17 Thread Kevin Scannell
Ar Luan 15 Lúnasa 2005 03:03 pm, scríobh Bram Moolenaar: > I'm hoping that we can agree on a way to define compound words better. >... > I found an outline of a format in the online Aspell documentation. But > it's not finished. Let me propose something, then we can discuss it, > try it out and

[lingu-dev] compound words

2005-08-16 Thread Bram Moolenaar
Hello spell checking colleagues, As you probably know I have added spell checking facilities to Vim. I'm using the word lists in Myspell compatible format as input. I hope we can continue sharing the word lists, since that is were most effort is spent. Spell checking in Vim works very well now