Package: grep
Version: 2.12-2
Severity: normal

Dear Maintainer,
regexp quantifiers \< \> \b \B \w \W give wrong results with unicode
words.

example:
$ echo "я" | grep -q "\<я"; echo $?
1

i wrote small test script (see attachm.).
it's result:
    \<x x\> \bx x\b x\B \Bx \w  \W
b:   0   0   0   0   1   1   0   1
я:   1   1   1   1   0   0   1   0
Σ:   1   1   1   1   0   0   1   0
ä:   1   1   1   1   0   0   1   0

-- System Information:
Debian Release: 7.1
  APT prefers stable
  APT policy: (500, 'stable')
Architecture: i386 (i686)

Kernel: Linux 3.2.0-4-686-pae (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages grep depends on:
ii  dpkg          1.16.10
ii  install-info  4.13a.dfsg.1-10
ii  libc6         2.13-38

grep recommends no packages.

Versions of packages grep suggests:
ii  libpcre3  1:8.30-5

-- no debconf information

Attachment: s
Description: application/shellscript

Reply via email to