> X-Mailer: MessagingEngine.com Webmail Interface
> Date: Wed, 09 May 2012 20:51:45 +0200
> From: Per Tunedal <[email protected]>
> To: [email protected]
> Reply-To: [email protected]
> Subject: [Apertium-stuff] Install on Debian server
>
> Hi,
> I've just installed Apertium on an old box. First I tried Ubuntu but was
> caught in DNS-problems. I couldn't use apt-get.
> Now I have tried Debian instead. No networks problems. Apt-get works
> fine.
>
> Apertium is installed from SVN and is working. I have followed the
> instructions in the Wiki for Ubuntu, as Ubuntu is based on Debian (
> http://wiki.apertium.org/wiki/Apertium_on_Ubuntu).
>
> But:
>
> 1. special characters aren't recognised
>
> eg. the example "echo "J'ai deux frères" | apertium fr-es gives an error
> on "frères".
>
> 2. I prepared installing apertium-service but was stuck on the
> dependencies.
>
> a) libboost-system-dev was not found when I tried to install with
> apt-get
>
> b) how do I proceed to install the other dependencies: liblttoolbox3,
> libapertium3, libtextcat0 and libapertiumcombine1?
> The wiki ( http://wiki.apertium.org/wiki/Apertium-service ) isn't very
> explicit.
>
> Yours,
> Per Tunedal
>

Are you using a UTF-8 charset?

As for me, I am used to working with the ISO-8859-15 charset. For several
tasks, such as dumping files with both hex codes and character values, it is
simpler when each character is exactly one byte.

My free software sources are written in ISO-8859-15, and it is very simple
to convert between ISO-8859-1 and UTF-8 so that they work with both
charsets. The main difference between ISO-8859-15 and ISO-8859-1 is the
euro character.
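
To make the one-byte-per-character point concrete, here is a minimal Python
sketch comparing how the same text is encoded in the three charsets. It is
not taken from the sources mentioned above; the sample word and the variable
names are only illustrations.

```python
# Compare the byte representations of the same text in three charsets.
# The sample word and the variable names are illustrative assumptions.
word = "frères"

utf8_bytes = word.encode("utf-8")          # 'è' takes two bytes: 0xC3 0xA8
latin9_bytes = word.encode("iso-8859-15")  # 'è' takes one byte: 0xE8
latin1_bytes = word.encode("iso-8859-1")   # same bytes as ISO-8859-15 here

# Character count, then byte counts in ISO-8859-15 and UTF-8.
print(len(word), len(latin9_bytes), len(utf8_bytes))

# The euro sign exists in ISO-8859-15 (byte 0xA4) but not in ISO-8859-1,
# where the same byte is the generic currency sign.
euro_latin9 = "€".encode("iso-8859-15")
same_byte_in_latin1 = euro_latin9.decode("iso-8859-1")

try:
    "€".encode("iso-8859-1")
    euro_fits_latin1 = True
except UnicodeEncodeError:
    euro_fits_latin1 = False
```

In a single-byte charset the byte offset in a hex dump equals the character
offset, which is what makes such dumps easier to read.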

But for Apertium, I chose to set up a UTF-8 partition. The reason was that,
since the dictionaries are written in UTF-8, I did not want to break
anything by using another charset. In fact, I made two installations of
Debian 6, and only the UTF-8 one works with Apertium.

On Debian 6, the main differences are in:
/etc/default/locale: something like LANG="fr_FR.UTF-8" instead of LANG="fr_FR"
(in your case that could be LANG="sv_SE.UTF-8" instead of LANG="sv_SE")

/etc/default/console-setup
CHARMAP="UTF-8" instead of something like CHARMAP="ISO-8859-15"

and some equivalent settings for graphical mode.
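
A quick way to see which charset an environment asks for is to look at the
locale variables. The Python sketch below is only an illustration: the helper
names are hypothetical, and it assumes the usual POSIX precedence (LC_ALL,
then LC_CTYPE, then LANG) and the language_TERRITORY.codeset@modifier naming
form. It reads the variables only; it does not check that the locale is
actually installed.

```python
import os


def requested_codeset(env):
    """Return the codeset named by the locale variables, or None.

    Hypothetical helper: follows the precedence LC_ALL > LC_CTYPE > LANG
    and treats an empty variable as unset.
    """
    for var in ("LC_ALL", "LC_CTYPE", "LANG"):
        value = env.get(var, "")
        if value:
            name = value.split("@", 1)[0]     # drop a modifier such as @euro
            if "." in name:
                return name.split(".", 1)[1]  # e.g. "UTF-8"
            return None                       # no explicit codeset given
    return None


def requests_utf8(env):
    """True if the selected locale variable explicitly names UTF-8."""
    codeset = requested_codeset(env)
    return codeset is not None and codeset.replace("-", "").lower() == "utf8"


if __name__ == "__main__":
    print(requests_utf8(os.environ))
```

With LANG="fr_FR.UTF-8" the check returns True; with LANG="fr_FR" it returns
False, because no codeset is named.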

With the ISO-8859-15 charset, your example gives a series of lines
Warning: unsupported locale, fallback to "C"

followed by the result :
tengo dos *fr?@?tre

With the UTF-8 charset, it works:
tengo dos hermanos
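
One plausible reading of the failing case is a charset mismatch: the terminal
sends ISO-8859-15 bytes while the dictionaries expect UTF-8. This is an
assumption, not something checked against Apertium's code, and the Python
sketch below only illustrates the mismatch itself, with illustrative names.

```python
# The same word as an ISO-8859-15 terminal and a UTF-8 terminal would send it.
word = "frères"
latin9_input = word.encode("iso-8859-15")  # 'è' is the single byte 0xE8
utf8_input = word.encode("utf-8")          # 'è' is the two bytes 0xC3 0xA8

# A strict UTF-8 decoder rejects the ISO-8859-15 bytes, because 0xE8
# announces a multi-byte sequence that never follows.
try:
    latin9_input.decode("utf-8")
    latin9_is_valid_utf8 = True
except UnicodeDecodeError:
    latin9_is_valid_utf8 = False

# A lenient decoder substitutes U+FFFD for the byte it cannot interpret,
# so the word no longer matches any dictionary entry.
lossy = latin9_input.decode("utf-8", errors="replace")

print(latin9_is_valid_utf8)
print(lossy == word)
```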

Note: If you chose English as the default language, that may be the cause of
the problem, because English needs no more than 7-bit ASCII characters to be
written. So UTF-8 is unnecessary in that case and may not have been enabled.



--------------------------------
Bernard Chardonneau (France)
Phone : [33] 1 64 90 87 04 (from Sept to June except holidays)
GSM phone : [33] 6 49 95 13 95 (French school holidays, C zone)

Multilingual websites for my free software:
http://libremail.free.fr and http://libremail.tuxfamily.org
http://cyloop.tuxfamily.org (mainly translated with Apertium)

My general website (in French only)
http://bech.free.fr
