CVSROOT: /webcvs/grep
Module name: grep
Changes by: Jim Meyering <meyering> 18/12/30 01:24:22

Index: html_node/Environment-Variables.html
===================================================================
RCS file: /webcvs/grep/grep/manual/html_node/Environment-Variables.html,v
retrieving revision 1.28
retrieving revision 1.29
diff -u -b -r1.28 -r1.29
--- html_node/Environment-Variables.html 10 Feb 2017 04:43:38 -0000 1.28
+++ html_node/Environment-Variables.html 30 Dec 2018 06:24:22 -0000 1.29
@@ -2,7 +2,7 @@
 <html>
 <!-- This manual is for grep, a pattern matching engine.
 
-Copyright (C) 1999-2002, 2005, 2008-2017 Free Software Foundation,
+Copyright (C) 1999-2002, 2005, 2008-2018 Free Software Foundation,
 Inc.
 
 Permission is granted to copy, distribute and/or modify this document
@@ -11,16 +11,16 @@
 Invariant Sections, with no Front-Cover Texts, and with no Back-Cover
 Texts. A copy of the license is included in the section entitled
 "GNU Free Documentation License". -->
-<!-- Created by GNU Texinfo 6.3, http://www.gnu.org/software/texinfo/ -->
+<!-- Created by GNU Texinfo 6.5, http://www.gnu.org/software/texinfo/ -->
 <head>
-<title>GNU Grep 3.0: Environment Variables</title>
+<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+<title>Environment Variables (GNU Grep 3.3)</title>
 
-<meta name="description" content="GNU Grep 3.0: Environment Variables">
-<meta name="keywords" content="GNU Grep 3.0: Environment Variables">
+<meta name="description" content="Environment Variables (GNU Grep 3.3)">
+<meta name="keywords" content="Environment Variables (GNU Grep 3.3)">
 <meta name="resource-type" content="document">
 <meta name="distribution" content="global">
 <meta name="Generator" content="makeinfo">
-<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
 <link href="index.html#Top" rel="start" title="Top">
 <link href="Index.html#Index" rel="index" title="Index">
 <link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
@@ -306,6 +306,8 @@
 <dd><a name="index-LC_005fALL-environment-variable-2"></a>
 <a name="index-LC_005fCTYPE-environment-variable"></a>
 <a name="index-LANG-environment-variable-2"></a>
+<a name="index-encoding-error"></a>
+<a name="index-null-character"></a>
 <p>These variables specify the locale for the <code>LC_CTYPE</code> category,
 which determines the type of characters,
 e.g., which characters are whitespace.
@@ -313,6 +315,17 @@
 text is encoded in UTF-8, ASCII, or some other encoding.
 In the ‘<samp>C</samp>’ or ‘<samp>POSIX</samp>’ locale, all characters are encoded as
 a single byte and every byte is a valid character.
+In more-complex encodings such as UTF-8, a sequence of multiple bytes
+may be needed to represent a character, and some bytes may be encoding
+errors that do not contribute to the representation of any character.
+POSIX does not specify the behavior of <code>grep</code> when patterns or
+input data contain encoding errors or null characters, so portable
+scripts should avoid such usage. As an extension to POSIX, GNU
+<code>grep</code> treats null characters like any other character.
+However, unless the <samp>-a</samp> (<samp>--binary-files=text</samp>) option
+is used, the presence of null characters in input or of encoding
+errors in output causes GNU <code>grep</code> to treat the file as binary
+and suppress details about matches. See <a href="File-and-Directory-Selection.html#File-and-Directory-Selection">File and Directory Selection</a>.
 </p>
 </dd>
 <dt><code>LANGUAGE</code></dt>
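
The paragraph added in this revision uses two terms, "encoding error" and "null character", that may be easier to follow with concrete bytes. The following is a minimal Python sketch of what those terms mean for UTF-8 data. It is not how GNU grep decides that a file is binary: the helper name inspect_bytes is hypothetical, and Latin-1 is used only as a stand-in for a single-byte encoding in which every byte is a valid character (it is not the C locale itself).

```python
def inspect_bytes(data: bytes) -> dict:
    """Hypothetical helper: report null bytes and UTF-8 encoding errors in data.

    Illustrates the terminology only; it does not reproduce grep's own checks.
    """
    try:
        data.decode("utf-8")
        encoding_error = False
    except UnicodeDecodeError:
        # At least one byte does not contribute to any valid character.
        encoding_error = True
    return {"has_null": b"\x00" in data, "encoding_error": encoding_error}


# One character (e with acute accent) needs a sequence of two bytes in UTF-8.
e_acute = "\u00e9".encode("utf-8")
print(len(e_acute), inspect_bytes(e_acute))

# The byte 0xFF never occurs in well-formed UTF-8, so this is an encoding error.
print(inspect_bytes(b"caf\xff"))

# A null byte is well-formed UTF-8 (U+0000), but it is the other case the text names.
print(inspect_bytes(b"a\x00b"))

# In a single-byte encoding such as Latin-1, all 256 byte values decode successfully.
print(len(bytes(range(256)).decode("latin-1")))
```

Reporting the two conditions separately mirrors the wording of the new paragraph, which distinguishes null characters in input from encoding errors in output.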
