Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 close 525275 thanks Justin B Rye rašė: Kęstutis Biliūnas wrote: Notruf [noËtruËf] (n) , s.(m ) emergency call It seems that the reason that in the translation transcriptions are used character MODIFIER LETTER TRIANGULAR COLON 'ː' (UTF-8: 0xCB 0x90), instead of a simple symbol COLONY ':' (0x3A). [ː] is the correct length-mark character for use in a phonetic transcription, and it's only one of many IPA symbols that require those wordlists to be in UTF-8 Yes, you are right. It appears this is a real truth. Thank you. ... The problem is that the bugreporter is using a legacy ISO-8859-1 charmap. Dict might fail more gracefully, but it's got to fail. Not, it does not fails, and outputs the correct information, only the terminal is unable to show. This is very similar to the old bug of the recode program (see #321437). So I closed this bug report. Timo, I think you need to change the locale and charmap or blame recode program. Best wishes, - -- Kęstutis Biliūnas ke...@kaunas.init.lt http://kebil.ghost.lt | GnuPG-Key ID: F6E7A452 -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.9 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iEYEARECAAYFAkoSarQACgkQFXHEz/bnpFJ9bgCgnFGECQCAR9GYQt/4jF35BjjS ljcAnjg0tnGdQFlndIwqb7lxhnEsqtUa =DbWw -END PGP SIGNATURE- -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars
Kęstutis Biliūnas ke...@kaunas.init.lt writes: ... The problem is that the bugreporter is using a legacy ISO-8859-1 charmap. Dict might fail more gracefully, but it's got to fail. Not, it does not fails, and outputs the correct information, only the terminal is unable to show. This is very similar to the old bug of the Hmm, isn't it a bug to blindly send data to terminal in encoding X when locale says that encoding Y should be used? For example some programs forcibly send ANSI escape sequences without checking (through termcap I believe) if the terminal supports colors or not and this is considered a bug (#479877, #448059, #503053). I'd prefer that dict used the locale system to figure out what encoding can be used and then either 1) printed an error message about unsupported locale or 2) simply ran the output through isprint() or something. I'll consider filing a bug against recode too when I understand the issue better. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars
2009 Gegužė 19, 1:41 pm, Ant Timo Juhani Lindfors rašė: Kęstutis Biliūnas ke...@kaunas.init.lt writes: ... The problem is that the bugreporter is using a legacy ISO-8859-1 charmap. Dict might fail more gracefully, but it's got to fail. Not, it does not fails, and outputs the correct information, only the terminal is unable to show. This is very similar to the old bug of the Hmm, isn't it a bug to blindly send data to terminal in encoding X when locale says that encoding Y should be used? For example some programs forcibly send ANSI escape sequences without checking (through termcap I believe) if the terminal supports colors or not and this is considered a bug (#479877, #448059, #503053). I'd prefer that dict used the locale system to figure out what encoding can be used and then either 1) printed an error message about unsupported locale or 2) simply ran the output through isprint() or something. Yes, you are right. Therefore, try the command 'dictl notruf'. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars
kebil ke...@kaunas.init.lt writes: Yes, you are right. Therefore, try the command 'dictl notruf'. Ooh, I did not know that this even existed, thanks! How about mentioning this in the man page of 'dict'? And also falling back to dictl if terminal does not support utf-8? -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars
From: Justin B Rye j...@xibalba.demon.co.uk To: Kęstutis Biliūnas ke...@kaunas.init.lt Cc: Timo Juhani Lindfors timo.lindf...@iki.fi Bcc: Subject: Re: Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars Reply-To: In-Reply-To: 49f234dd.2030...@kaunas.init.lt Kęstutis Biliūnas wrote: Notruf [noËtruËf] (n) , s.(m ) emergency call It seems that the reason that in the translation transcriptions are used character MODIFIER LETTER TRIANGULAR COLON 'ː' (UTF-8: 0xCB 0x90), instead of a simple symbol COLONY ':' (0x3A). [ː] is the correct length-mark character for use in a phonetic transcription, and it's only one of many IPA symbols that require those wordlists to be in UTF-8 - compare the entry for Geschenkkorb [gəʃɛŋkɔrp] (n) , s.(m ). The problem is that the bugreporter is using a legacy ISO-8859-1 charmap. Dict might fail more gracefully, but it's got to fail. -- JBR with qualifications in linguistics, experience as a Debian sysadmin, and probably no clue about this particular package -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars
Hi Timo, Timo Juhani Lindfors rašė: Package: dict-freedict-deu-eng Version: 1.3-4 Severity: normal Steps to reproduce: 1) dict notruf 2) echo foo Actual results: 2) nothing is printed to screen 2) The problematic character seems to be 0x90. If I remove it I can see the description properly: lindi1:~$ dict notruf | sed $'s/\x90//g' 1 definition found From German-English Freedict dictionary [fd-deu-eng]: Notruf [noËtruËf] (n) , s.(m ) emergency call It seems that the reason that in the translation transcriptions are used character MODIFIER LETTER TRIANGULAR COLON 'ː' (UTF-8: 0xCB 0x90), instead of a simple symbol COLONY ':' (0x3A). Thanks for the bug report. -- Kęstutis Biliūnas ke...@kaunas.init.lt http://kebil.ghost.lt | GnuPG-Key ID: F6E7A452 -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#525275: dict-freedict-deu-eng: description of notruf contains non-printable chars
Package: dict-freedict-deu-eng Version: 1.3-4 Severity: normal Steps to reproduce: 1) dict notruf 2) echo foo Expected results: 2) foo is printed to screen Actual results: 2) nothing is printed to screen More info: 1) It seems that the description of the word contains non-printable characters that confuse my terminal emulator (xterm): $ dict notruf | hexdump -C 31 20 64 65 66 69 6e 69 74 69 6f 6e 20 66 6f 75 |1 definition fou| 0010 6e 64 0a 0a 46 72 6f 6d 20 47 65 72 6d 61 6e 2d |nd..From German-| 0020 45 6e 67 6c 69 73 68 20 46 72 65 65 64 69 63 74 |English Freedict| 0030 20 64 69 63 74 69 6f 6e 61 72 79 20 5b 66 64 2d | dictionary [fd-| 0040 64 65 75 2d 65 6e 67 5d 3a 0a 0a 20 20 4e 6f 74 |deu-eng]:.. Not| 0050 72 75 66 20 5b 6e 6f cb 90 74 72 75 cb 90 66 5d |ruf [noË.truË.f]| 0060 20 28 6e 29 20 2c 20 73 2e 28 6d 20 29 0a 20 20 | (n) , s.(m ). | 0070 20 20 20 65 6d 65 72 67 65 6e 63 79 20 63 61 6c | emergency cal| 0080 6c 0a 20 20 0a|l. .| 0085 2) The problematic character seems to be 0x90. If I remove it I can see the description properly: lindi1:~$ dict notruf | sed $'s/\x90//g' 1 definition found From German-English Freedict dictionary [fd-deu-eng]: Notruf [noËtruËf] (n) , s.(m ) emergency call lindi1:~$ echo foo foo 3) The output is not UTF-8 either since $ dict notruf | recode utf-8..latin1 1 definition found From German-English Freedict dictionary [fd-deu-eng]: Notruf [norecode: Invalid input in step `UTF-8..ISO-8859-1' -- System Information: Debian Release: squeeze/sid APT prefers unstable APT policy: (500, 'unstable') Architecture: amd64 (x86_64) Kernel: Linux 2.6.26-1-amd64 (SMP w/4 CPU cores) Locale: LANG=en_US, LC_CTYPE=en_US.ISO-8859-1 (charmap=ISO-8859-1) Shell: /bin/sh linked to /bin/bash dict-freedict-deu-eng depends on no packages. dict-freedict-deu-eng recommends no packages. Versions of packages dict-freedict-deu-eng suggests: ii dict 1.11.1+dfsg-1 Dictionary Client ii dictd 1.11.1+dfsg-1 Dictionary Server -- no debconf information -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org