Hi,
Thesaurus files (along with dictionaries) require the convention that 1 char = 1 byte.
So you need to choose an 8-bit encoding (not UTF-8 or unicode) that OOo and the Thesaurus code actually understands.
There are a number of these. One can be found by opening the current Greek dictionary and examining the "Set" line of the .aff file.
Hope this helps,
Kevin
On May 10, 2005, at 9:44 AM, Petros V wrote:
Dear friends,
We are trying to create the Greek thesaurus for OOo. At first we thought that the problem comes only from the awk script (about the big endian words etc). Thanks to Daniel Naber we found out that the script of Pavel Janik is working well but only when the two files (wordlist and trimthess) contain only non-greek characters. So the problem is related with awk and how it recognises the greek characters. Does anybody know how we can solve this problem?
For more information I must report that the expiriments are taking place on Win XP, with the two files saved with ANSI. If they are saved with UTF-8,Unicode, Unicode Big Endian then the OOo hangs up.
Yours sincerely, Petros Velonis
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: dev- [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
