https://github.com/python/cpython/commit/bb7a6d470702a19fc4385a676c90ea941d46888d commit: bb7a6d470702a19fc4385a676c90ea941d46888d branch: 3.11 author: Miss Islington (bot) <[email protected]> committer: terryjreedy <[email protected]> date: 2024-03-19T11:59:52-04:00 summary:
[3.11] gh-107607: Update comment about utf-8 BOM being ignored (GH-107858) (#117015) (cherry picked from commit 7f64ae30ddc22577ce4101ce0b6601b3548b036f) Co-authored-by: Terry Jan Reedy [email protected] files: M Doc/reference/lexical_analysis.rst diff --git a/Doc/reference/lexical_analysis.rst b/Doc/reference/lexical_analysis.rst index 65d9de5534a627..76acd300134219 100644 --- a/Doc/reference/lexical_analysis.rst +++ b/Doc/reference/lexical_analysis.rst @@ -96,10 +96,9 @@ which is recognized also by GNU Emacs, and :: which is recognized by Bram Moolenaar's VIM. -If no encoding declaration is found, the default encoding is UTF-8. In -addition, if the first bytes of the file are the UTF-8 byte-order mark -(``b'\xef\xbb\xbf'``), the declared file encoding is UTF-8 (this is supported, -among others, by Microsoft's :program:`notepad`). +If no encoding declaration is found, the default encoding is UTF-8. If the +implicit or explicit encoding of a file is UTF-8, an initial UTF-8 byte-order +mark (b'\xef\xbb\xbf') is ignored rather than being a syntax error. If an encoding is declared, the encoding name must be recognized by Python (see :ref:`standard-encodings`). The _______________________________________________ Python-checkins mailing list -- [email protected] To unsubscribe send an email to [email protected] https://mail.python.org/mailman3/lists/python-checkins.python.org/ Member address: [email protected]
