Not very sure whether I can send attach file or not. In this mail I've sent an attached file (file name is sample.big5), in which it contains 4 Traditional Chinese words encoded in BIG5. There is no white space between these 4 words.
The content looks like "中文測試". The env I use is GNU/ Debian Lenny; kernel 2.6.27.8; gcc version 4.3.2 (Debian 4.3.2-1.1); wc (GNU coreutils) 6.10; LANG=en_US.UTF-8 (Other locale settings e.g. LC_CYTPE are all en_US.UTF-8) Please let me know if the file fails to attached or needs to upload to somewhere else. Thanks for your help, --- On Sat, 7/2/09, Pádraig Brady <[email protected]> wrote: > From: Pádraig Brady <[email protected]> > Subject: Re: Enhancement request to wc > To: [email protected] > Cc: [email protected] > Date: Saturday, 7 February, 2009, 8:00 AM > Neo Anderson wrote: > > Hi > > > > I read the page at > http://www.gnu.org/software/coreutils/, saying that the > enhancement request can go through this mailing list. > > > > My request is that I would like wc can also count > multi bytes characters e.g Chinese Big5 correctly. > > > > Please let me know if any additional information > required. > > We're starting work on general multibyte support for > coreutils, > but `wc` should already be pretty good in the regards. > > Could you provide a small example Big5 encoded file > and expected output. Also it would help if you provided > the version of sort, your operating system version > and what locale you;re using. > > thanks, > Pádraig. > > > _______________________________________________ > Bug-coreutils mailing list > [email protected] > http://lists.gnu.org/mailman/listinfo/bug-coreutils
sample.big5
Description: Binary data
_______________________________________________ Bug-coreutils mailing list [email protected] http://lists.gnu.org/mailman/listinfo/bug-coreutils
