Your message dated Sun, 27 Apr 2008 09:02:42 +0000
with message-id <[EMAIL PROTECTED]>
and subject line Bug#470200: fixed in uni2ascii 4.7-1
has caused the Debian Bug report #470200,
regarding unicode ligatures to ASCII
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [EMAIL PROTECTED]
immediately.)


-- 
470200: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=470200
Debian Bug Tracking System
Contact [EMAIL PROTECTED] with problems
--- Begin Message ---
Package: uni2ascii
Version: 4.4-1
Severity: minor

I would like to discuss today the Unicodes
¯ ’“”− ff fi fl ffi ...
that is
00AF 2019 201C 201D 2212 FB00 FB01 FB02 FB03 ...

You see, I noticed them when I used pdftotext on
http://www.cs.ucr.edu/~anirban/Anir-networking07.pdf
and then tired to read the results on my ASCII PDA.

I wish pdftotext had a flag to make the output ASCII.

Anyway, even uni2ascii -ydpxef wouldn't get all of them into ASCII.
The ligatures remained -- but turned into 0x codes. (P.S., I wish
there was one flag to "give me best ASCII", lest one ponder the man
page too long.) Also apparently there is no way to get uni2ascii to
not turn what it can't deal with to 0x codes, and let sail thru for
some other filter to complete the job.

Now turning to pstotext, whose man page says "pstotext deals better
with punctuation and ligatures." Not in this case.

Now turning to Text::Unidecode: sorry: mangled ligatures.

Anyways, I ended up having to write by hand:

#!/usr/bin/perl
use strict;
use warnings;
while (<>) {
    s/¯/_/g; #just a guess
    s/’/'/g;
    s/“/"/g;
    s/”/"/g;
    s/−/-/g;
    s/ff/ff/g;
    s/fi/fi/g;
    s/fl/fl/g;
    s/ffi/ffi/g;
    s/ffl/ffl/g;
    s/ſt/ft/g;
    s/st/st/g;
    print;
}



--- End Message ---
--- Begin Message ---
Source: uni2ascii
Source-Version: 4.7-1

We believe that the bug you reported is fixed in the latest version of
uni2ascii, which is due to be installed in the Debian FTP archive:

uni2ascii_4.7-1.diff.gz
  to pool/main/u/uni2ascii/uni2ascii_4.7-1.diff.gz
uni2ascii_4.7-1.dsc
  to pool/main/u/uni2ascii/uni2ascii_4.7-1.dsc
uni2ascii_4.7-1_i386.deb
  to pool/main/u/uni2ascii/uni2ascii_4.7-1_i386.deb
uni2ascii_4.7.orig.tar.gz
  to pool/main/u/uni2ascii/uni2ascii_4.7.orig.tar.gz



A summary of the changes between this version and the previous one is
attached.

Thank you for reporting the bug, which will now be closed.  If you
have further comments please address them to [EMAIL PROTECTED],
and the maintainer will reopen the bug report if appropriate.

Debian distribution maintenance software
pp.
Kartik Mistry <[EMAIL PROTECTED]> (supplier of updated uni2ascii package)

(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing [EMAIL PROTECTED])


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Format: 1.8
Date: Sun, 27 Apr 2008 14:10:03 +0530
Source: uni2ascii
Binary: uni2ascii
Architecture: source i386
Version: 4.7-1
Distribution: unstable
Urgency: low
Maintainer: Kartik Mistry <[EMAIL PROTECTED]>
Changed-By: Kartik Mistry <[EMAIL PROTECTED]>
Description: 
 uni2ascii  - UTF-8 to 7-bit ASCII and vice versa converter
Closes: 470200
Changes: 
 uni2ascii (4.7-1) unstable; urgency=low
 .
   * New upstream release
     + Added missing characters to -p and -x options, Fixes Unicode
       ligatures to ASCII issues (Closes: #470200)
   * debian/docs:
     + Removed AUTHORS file from list as there is single author only
       from upstream. It can be added when more authors will be added.
Checksums-Sha1: 
 6281ba74fd582e9ef7e625cee9461489da4a8fe7 1011 uni2ascii_4.7-1.dsc
 d810b2ecd0fee53b637ab2fc6e353633ace499ed 143344 uni2ascii_4.7.orig.tar.gz
 449b8082a870421ad80a74bfc739abea1711cd47 3376 uni2ascii_4.7-1.diff.gz
 310b7c949c9a9f85e14a5948c652a924ed20191f 53304 uni2ascii_4.7-1_i386.deb
Checksums-Sha256: 
 ccaa3ce86932a1df3920e7edc912857d30db2759ad90c2e66c4f436dbfabf52d 1011 
uni2ascii_4.7-1.dsc
 561f03c55fbef7fd5c631637721c5228c5406327a2c94989563c67d07ce8429a 143344 
uni2ascii_4.7.orig.tar.gz
 74af0275e232b4c790f630e0662c07642622d1c75d6558f409ba2e4aa45bf8d3 3376 
uni2ascii_4.7-1.diff.gz
 2e08916a4c19ee15b18ed3c4dc7b9a796ee4d0510d6eb32a016499276c093b0b 53304 
uni2ascii_4.7-1_i386.deb
Files: 
 a906608a7b3732836191aa138555c2b5 1011 text optional uni2ascii_4.7-1.dsc
 6a5a8c43bd02447710024d39937303d3 143344 text optional uni2ascii_4.7.orig.tar.gz
 dddf90f2ef784cb07de8e580380616ef 3376 text optional uni2ascii_4.7-1.diff.gz
 de83fca7d320c1c87f6650d8402ed711 53304 text optional uni2ascii_4.7-1_i386.deb

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)

iD8DBQFIFD63OU3FkQ7XBOoRAiltAKDxl3+soIJ5kWU6dgjL9qW+R3YwbgCfaQk1
YfDlbr2RHJzh31RF3RF/PSI=
=uDYW
-----END PGP SIGNATURE-----



--- End Message ---

Reply via email to