Package: python2.6-minimal
Version: 2.6.6-8+b1
Severity: normal
Tags: upstream

The problem lies in the following definitions in textwrap.py:

wordsep_simple_re = re.compile(r'(\s+)')
wordsep_re = re.compile(
    r'(\s+|'                                  # any whitespace
    r'[^\s\w]*\w+[^0-9\W]-(?=\w+[^0-9\W])|'   # hyphenated words
    r'(?<=[\w\!\"\'\&\.\,\?])-{2,}(?=\w))')   # em-dash

self.wordsep_re_uni = re.compile(self.wordsep_re.pattern, re.U)
self.wordsep_simple_re_uni = re.compile(
    self.wordsep_simple_re.pattern, re.U)


Since in unicode mode \s matches any whitespace - even non-breakable
one - the regexp generates false positives.

As a workaround I have replaced \s by normal space, but of course this
breaks textwrap if the replace_whitespace option is set to false...

-- System Information:
Debian Release: 6.0.1
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'stable')
Architecture: i386 (i686)

Kernel: Linux 2.6.32-5-686 (SMP w/1 CPU core)
Locale: LANG=de_AT.UTF-8, LC_CTYPE=de_AT.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages python2.6-minimal depends on:
ii  libc6                   2.11.2-10        Embedded GNU C Library: Shared lib
ii  libssl0.9.8             0.9.8o-4squeeze1 SSL shared libraries
ii  zlib1g                  1:1.2.3.4.dfsg-3 compression library - runtime

Versions of packages python2.6-minimal recommends:
ii  python2.6                     2.6.6-8+b1 An interactive high-level object-o

Versions of packages python2.6-minimal suggests:
pn  binfmt-support                <none>     (no description available)

-- no debconf information



-- 
To UNSUBSCRIBE, email to [email protected]
with a subject of "unsubscribe". Trouble? Contact [email protected]

Reply via email to