URL:
<http://savannah.gnu.org/bugs/?23814>
Summary: Result wrong in multibyte locale
Project: grep
Submitted by: noritnk
Submitted on: 金曜日 2008年07月11日 at 05:44
Category: None
Severity: 3 - Normal
Item Group: None
Status: None
Privacy: Public
Assigned to: None
Open/Closed: Open
Discussion Lock: Any
_______________________________________________________
Details:
If a part of multibyte character on a line matches with pattern,
and other part of same line, which isn't a part of multibyte
character, matches with pattern again, no its line is returned.
testcase
^^^^^^^^
test.txt : text file written in EUC-JP.
byte sequence a4b3 a4b3 a4b3 b3a4 b3a4 000a (little endian)
pattern.txt : pattern file written in EUC-JP.
byte sequence b3a4 b3a4 000a (little endian)
$ expor LANG=ja_JP.eucJP
$ grep -f pattern.txt test.txt
Last 4 bytes in the text matches with the pattern,
but not returned.
_______________________________________________________
File Attachments:
-------------------------------------------------------
Date: 金曜日 2008年07月11日 at 05:44 Name: test.txt Size: 11B By:
noritnk
<http://savannah.gnu.org/bugs/download.php?file_id=16096>
-------------------------------------------------------
Date: 金曜日 2008年07月11日 at 05:44 Name: pattern.txt Size: 5B By:
noritnk
<http://savannah.gnu.org/bugs/download.php?file_id=16097>
-------------------------------------------------------
Date: 金曜日 2008年07月11日 at 05:44 Name: grep-2.5.3-mbr.diff Size:
1kB By: noritnk
<http://savannah.gnu.org/bugs/download.php?file_id=16098>
_______________________________________________________
Reply to this item at:
<http://savannah.gnu.org/bugs/?23814>
_______________________________________________
Message sent via/by Savannah
http://savannah.gnu.org/