Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars

2009-05-19 Thread Kęstutis Biliūnas
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

close 525275
thanks

Justin B Rye rašė:
 Kęstutis Biliūnas wrote:
   Notruf [noËtruËf] (n) , s.(m )
  emergency call
 It seems that the reason that in the translation transcriptions are used
 character MODIFIER LETTER TRIANGULAR COLON 'ː' (UTF-8: 0xCB 0x90),
 instead of a simple symbol COLONY ':' (0x3A).
 
 [ː] is the correct length-mark character for use in a phonetic
 transcription, and it's only one of many IPA symbols that require
 those wordlists to be in UTF-8

Yes, you are right. It appears this is a real truth. Thank you.

 ... The problem is that the bugreporter is
 using a legacy ISO-8859-1 charmap.  Dict might fail more gracefully,
 but it's got to fail.

Not, it does not fails, and outputs the correct information, only the
terminal is unable to show. This is very similar to the old bug of the
recode program (see #321437).

So I closed this bug report. Timo, I think you need to change the locale
and charmap or blame recode program.

Best wishes,
- --
Kęstutis Biliūnas ke...@kaunas.init.lt
http://kebil.ghost.lt  |  GnuPG-Key ID: F6E7A452


-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.9 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAkoSarQACgkQFXHEz/bnpFJ9bgCgnFGECQCAR9GYQt/4jF35BjjS
ljcAnjg0tnGdQFlndIwqb7lxhnEsqtUa
=DbWw
-END PGP SIGNATURE-



-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars

2009-05-19 Thread Timo Juhani Lindfors
Kęstutis Biliūnas ke...@kaunas.init.lt writes:
 ... The problem is that the bugreporter is
 using a legacy ISO-8859-1 charmap.  Dict might fail more gracefully,
 but it's got to fail.

 Not, it does not fails, and outputs the correct information, only the
 terminal is unable to show. This is very similar to the old bug of the

Hmm, isn't it a bug to blindly send data to terminal in encoding X
when locale says that encoding Y should be used? For example some
programs forcibly send ANSI escape sequences without checking (through
termcap I believe) if the terminal supports colors or not and this is
considered a bug (#479877, #448059, #503053).

I'd prefer that dict used the locale system to figure out what
encoding can be used and then either 1) printed an error message about
unsupported locale or 2) simply ran the output through isprint() or
something.

I'll consider filing a bug against recode too when I understand the
issue better.



--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars

2009-05-19 Thread kebil
2009 Gegužė 19, 1:41 pm, Ant Timo Juhani Lindfors rašė:
 Kęstutis Biliūnas ke...@kaunas.init.lt writes:
 ... The problem is that the bugreporter is
 using a legacy ISO-8859-1 charmap.  Dict might fail more gracefully,
 but it's got to fail.

 Not, it does not fails, and outputs the correct information, only the
 terminal is unable to show. This is very similar to the old bug of the

 Hmm, isn't it a bug to blindly send data to terminal in encoding X
 when locale says that encoding Y should be used? For example some
 programs forcibly send ANSI escape sequences without checking (through
 termcap I believe) if the terminal supports colors or not and this is
 considered a bug (#479877, #448059, #503053).

 I'd prefer that dict used the locale system to figure out what
 encoding can be used and then either 1) printed an error message about
 unsupported locale or 2) simply ran the output through isprint() or
 something.

Yes, you are right. Therefore, try the command 'dictl notruf'.





-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars

2009-05-19 Thread Timo Juhani Lindfors
kebil ke...@kaunas.init.lt writes:
 Yes, you are right. Therefore, try the command 'dictl notruf'.

Ooh, I did not know that this even existed, thanks!

How about mentioning this in the man page of 'dict'? And also falling
back to dictl if terminal does not support utf-8?





-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars

2009-04-25 Thread Justin B Rye
From: Justin B Rye j...@xibalba.demon.co.uk
To: Kęstutis Biliūnas ke...@kaunas.init.lt
Cc: Timo Juhani Lindfors timo.lindf...@iki.fi
Bcc: 
Subject: Re: Bug#525275: dict-freedict-deu-eng: description of notruf
contains non-printable chars
Reply-To: 
In-Reply-To: 49f234dd.2030...@kaunas.init.lt

Kęstutis Biliūnas wrote:
   Notruf [noËtruËf] (n) , s.(m )
  emergency call
 
 It seems that the reason that in the translation transcriptions are used
 character MODIFIER LETTER TRIANGULAR COLON 'ː' (UTF-8: 0xCB 0x90),
 instead of a simple symbol COLONY ':' (0x3A).

[ː] is the correct length-mark character for use in a phonetic
transcription, and it's only one of many IPA symbols that require
those wordlists to be in UTF-8 - compare the entry for Geschenkkorb
[gəʃɛŋkɔrp] (n) , s.(m ).  The problem is that the bugreporter is
using a legacy ISO-8859-1 charmap.  Dict might fail more gracefully,
but it's got to fail.
-- 
JBR with qualifications in linguistics, experience as a Debian
sysadmin, and probably no clue about this particular package



-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars

2009-04-24 Thread Kęstutis Biliūnas
Hi Timo,

Timo Juhani Lindfors rašė:
 Package: dict-freedict-deu-eng
 Version: 1.3-4
 Severity: normal
 
 Steps to reproduce:
 1) dict notruf
 2) echo foo
 

 Actual results:
 2) nothing is printed to screen
 

 2) The problematic character seems to be 0x90. If I remove it I can
 see the description properly:
 
 lindi1:~$ dict notruf | sed $'s/\x90//g'   
 1 definition found
 
From German-English Freedict dictionary [fd-deu-eng]:
 
   Notruf [noËtruËf] (n) , s.(m )
  emergency call

It seems that the reason that in the translation transcriptions are used
character MODIFIER LETTER TRIANGULAR COLON 'ː' (UTF-8: 0xCB 0x90),
instead of a simple symbol COLONY ':' (0x3A).

Thanks for the bug report.
-- 
Kęstutis Biliūnas ke...@kaunas.init.lt
http://kebil.ghost.lt  |  GnuPG-Key ID: F6E7A452



-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars

2009-04-23 Thread Timo Juhani Lindfors
Package: dict-freedict-deu-eng
Version: 1.3-4
Severity: normal

Steps to reproduce:
1) dict notruf
2) echo foo

Expected results:
2) foo is printed to screen

Actual results:
2) nothing is printed to screen

More info:
1) It seems that the description of the word contains non-printable
characters that confuse my terminal emulator (xterm):

$ dict notruf | hexdump -C
  31 20 64 65 66 69 6e 69  74 69 6f 6e 20 66 6f 75  |1 definition fou|
0010  6e 64 0a 0a 46 72 6f 6d  20 47 65 72 6d 61 6e 2d  |nd..From German-|
0020  45 6e 67 6c 69 73 68 20  46 72 65 65 64 69 63 74  |English Freedict|
0030  20 64 69 63 74 69 6f 6e  61 72 79 20 5b 66 64 2d  | dictionary [fd-|
0040  64 65 75 2d 65 6e 67 5d  3a 0a 0a 20 20 4e 6f 74  |deu-eng]:..  Not|
0050  72 75 66 20 5b 6e 6f cb  90 74 72 75 cb 90 66 5d  |ruf [noË.truË.f]|
0060  20 28 6e 29 20 2c 20 73  2e 28 6d 20 29 0a 20 20  | (n) , s.(m ).  |
0070  20 20 20 65 6d 65 72 67  65 6e 63 79 20 63 61 6c  |   emergency cal|
0080  6c 0a 20 20 0a|l.  .|
0085

2) The problematic character seems to be 0x90. If I remove it I can
see the description properly:

lindi1:~$ dict notruf | sed $'s/\x90//g'   
1 definition found

From German-English Freedict dictionary [fd-deu-eng]:

  Notruf [noËtruËf] (n) , s.(m )
 emergency call
  
lindi1:~$ echo foo
foo

3) The output is not UTF-8 either since

$ dict notruf | recode utf-8..latin1
1 definition found

From German-English Freedict dictionary [fd-deu-eng]:

  Notruf [norecode: Invalid input in step `UTF-8..ISO-8859-1'




-- System Information:
Debian Release: squeeze/sid
  APT prefers unstable
  APT policy: (500, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.26-1-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_US, LC_CTYPE=en_US.ISO-8859-1 (charmap=ISO-8859-1)
Shell: /bin/sh linked to /bin/bash

dict-freedict-deu-eng depends on no packages.

dict-freedict-deu-eng recommends no packages.

Versions of packages dict-freedict-deu-eng suggests:
ii  dict   1.11.1+dfsg-1 Dictionary Client
ii  dictd  1.11.1+dfsg-1 Dictionary Server

-- no debconf information



--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org