On Sat, Feb 26, 2011 at 11:07:55PM +0000, Oleksandr Gavenko wrote:
> 
> In that way we can convert .texi to UTF-8 and make replace for chars:
> 
>   © ==> ©
>   “ ==> “
>   ” ==> ”
>   ‘ ==> ‘
>   ’ ==> ’
>   ⇒ ==> →
> 
> I think sources contain more entities that require rewriting.

Ok.  I think those conversions to unicode characters are already right,
then.  Just need to ensure that they are done like that.  You can have a 
look to tp/Texinfo/Convert/Unicode.pm for example, there are tables for
accented characters commands, and also unicode_map for commands.  There
is also code to convert `̀` and '' to the corresponding unicode and then
utf8 characters:

    $text =~ s/---/\x{2014}/g;
    $text =~ s/--/\x{2013}/g;
    $text =~ s/``/\x{201C}/g;
    $text =~ s/''/\x{201D}/g;

-- 
Pat


Reply via email to