I understand that. I would just like to ask whether it is possible to add a new option, e.g. -utf8, so that wc can count words made up of wide (multibyte) characters.
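
To illustrate the difference I have in mind, here is a rough Python sketch. It is not wc code: the function names are made up for this mail, and treating each CJK ideograph as one "word" is only my assumption about what such an option might do. It compares the POSIX rule quoted below (words are delimited by white space) with a per-character count on the sample text.

```python
# Hypothetical illustration only; none of this reflects how wc is implemented.

def posix_word_count(text: str) -> int:
    """Count words the POSIX way: non-empty runs delimited by white space."""
    return len(text.split())


def cjk_char_count(text: str) -> int:
    """Count characters in the CJK Unified Ideographs block (U+4E00..U+9FFF).

    Assumption: the proposed option would treat each such character as a word.
    """
    return sum(1 for ch in text if "\u4e00" <= ch <= "\u9fff")


# The sample from the attached file: four characters, no white space.
sample_bytes = "中文測試".encode("big5")
sample_text = sample_bytes.decode("big5")

print("bytes:", len(sample_bytes))
print("POSIX words:", posix_word_count(sample_text))
print("CJK characters:", cjk_char_count(sample_text))
```

The white-space rule sees the whole string as one word, while the per-character count sees each ideograph separately, which is the number I was hoping wc could report.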

--- On Sun, 8/2/09, Jim Meyering <[email protected]> wrote:

> From: Jim Meyering <[email protected]>
> Subject: Re: Enhancement request to wc
> To: "Neo Anderson" <[email protected]>
> Cc: "Pádraig Brady" <[email protected]>, [email protected]
> Date: Sunday, 8 February, 2009, 3:01 PM
>
> Neo Anderson <[email protected]> wrote:
> > Not very sure whether I can send attach file or not. In this mail I've
> > sent an attached file (file name is sample.big5), in which it contains
> > 4 Traditional Chinese words encoded in BIG5. There is no white space
> > between these 4 words.
> >
> > The content looks like "中文測試".
>
> Thanks for the details, but POSIX[1] requires that wc count that as
> a single word:
>
>   The wc utility shall consider a word to be a non-zero-length string of
>   characters delimited by white space.
>
> [1] http://www.opengroup.org/onlinepubs/000095399/utilities/wc.html

_______________________________________________
Bug-coreutils mailing list
[email protected]
http://lists.gnu.org/mailman/listinfo/bug-coreutils
