El 17/07/13 14:33, Andras Timar escribió: > Hi Ricardo, > > On Wed, Jul 17, 2013 at 1:42 PM, Ricardo Palomares Martínez > <[email protected]> wrote: >> This should be fixed now. In case other teams are using the same >> source of thesaurus than us, the encoding problem was already in the >> file downloaded from: >> >> http://openthes-es.berlios.de/ >> >> (it has equivalent URLs for English and German). As the file came >> wrong, and I'm new with this matters, I thought it was normal. It >> should be fixed now (I have a PHP script that fixes that and other >> problems with the built files in that site). >> >> The new files have been uploaded again to: >> >> http://forja.rediris.es/frs/?group_id=341 >> > > Sorry, I found something again. After letter 'í' there is always a > soft hyphen character (0xAD). I think it is a mistake.
No problem, I'm really sorry to cause you so much trouble and I'm very grateful for your patience and diagnostics. It is a sum of encoding problems: the original file from openthes-es.berlios.de is wrongly encoded and carries on errors in the syntax, as you let us know with version 0.6. I wrote a PHP script to fix the syntax errors, but PHP handles the encoding based only on the encoding of the PHP script itself. Finally, it seems that some characters are displayed differently on Linux than the regular ANSI set I found in this page: http://www.alanwood.net/demos/ansi.html This time I've revised the PHP script, so instead of using something like: $text = str_replace("³", "ó", $text); I've used: $text = str_replace(chr(0xC3) .chr(0xB3), chr(0xF3), $text); for every escape sequence I've been able to find. I must admit that I still can't get synonym suggestions for words that I know for certain that are included in the thesaurus, but I didn't get any with version 0.6, either. Anyway, the files are again uploaded at the usual place: http://forja.rediris.es/frs/?group_id=341 Thanks again. I hope to have got it right this time. :-) -- Ricardo Palomares (RickieES) Diaspora: https://diasp.eu/u/rickiees Skype: rickie0341971 Jabber: [email protected] -- To unsubscribe e-mail to: [email protected] Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/ Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette List archive: http://listarchives.libreoffice.org/global/l10n/ All messages sent to this list will be publicly archived and cannot be deleted
