> X-Mailer: MessagingEngine.com Webmail Interface > Date: Wed, 09 May 2012 20:51:45 +0200 > From: Per Tunedal <[email protected]> > To: [email protected] > Reply-To: [email protected] > Subject: [Apertium-stuff] Install on Debian server > > Hi, > I've just installed Apertium on an old box. First I tried Ubuntu but was > caught in DNS-problems. I couldn't use apt-get. > Now I have tried Debian instead. No networks problems. Apt-get works > fine. > > Apertium is installed from SVN and is working. I have followed the > instructions in the Wiki for Ubuntu, as Ubuntu is based on Debian ( > http://wiki.apertium.org/wiki/Apertium_on_Ubuntu). > > But: > > 1. special characters aren't recognised > > eg. the example "echo "J'ai deux frères" | apertium fr-es gives an error > on "frères". > > 2. I prepared installing apertium-service but was stuck on the > dependencies. > > a) libboost-system-dev was not found when I tried to install with > apt-get > > b) how do I proceed to install the other dependencies: liblttoolbox3, > libapertium3, libtextcat0 and libapertiumcombine1? > The wiki ( http://wiki.apertium.org/wiki/Apertium-service ) isn't very > explicit. > > Yours, > Per Tunedal >
Do you use a UTF-8 charset ? As to me, I am used to work with ISO-8859-15 charset. For several things like dumping files with both hexa code and characters values, it is more simple if there is one byte for one character. My free software sources are written with ISO-8859-15 charset and it is very simple to make a conversion between ISO-8859-1 and UTF-8 to permit them working with the two charsets. For ISO-8859-15 the main difference is the euro character. But for Apertium, I choosed to make a UTF-8 partition. The reason was, as dictionaries are written in UTF-8, not to break anything by using another charset. But in fact, I made 2 versions of Debian 6, and only UTF-8 version works for using Apertium. On Debian 6, the main difference is in : /etc/default/locale : something like LANG="fr_FR.UTF-8" instead of LANG="fr_FR" (that can be LANG="sv_SV.UTF-8" instead of LANG="sv_SV") /etc/default/console-setup CHARMAP="UTF-8" instead of something like CHARMAP="ISO-8859-15" and some equivalent things for graphic mode. With ISO-8859-15 charset, your example gives a serie of lines Warning: unsupported locale, fallback to "C" followed by the result : tengo dos *fr?@?tre With UTF-8 charset, it works tengo dos hermanos Note : If you choosed English language by default, there may be the problem because English language does not need more the 7 bits ASCII characters to be written. So, UTF-8 is useless in that case. -------------------------------- Bernard Chardonneau (France) Phone : [33] 1 64 90 87 04 (from Sept to June except holidays) GSM phone : [33] 6 49 95 13 95 (french scholl holidays, C zone) Multilingual websites for my free softwares : http://libremail.free.fr and http://libremail.tuxfamily.org http://cyloop.tuxfamily.org (mainly translated with Apertium) My general website (in french only) http://bech.free.fr ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
