#1744: treat byte order mark as zero-width whitespace
--------------------------------+-------------------------------------------
Reporter: igloo | Owner:
Type: feature request | Status: new
Priority: normal | Milestone:
Component: Compiler (Parser) | Version: 6.8
Severity: normal | Keywords:
Difficulty: Unknown | Os: Unknown
Testcase: | Architecture: Unknown
--------------------------------+-------------------------------------------
The U+FEFF ZERO WIDTH NO-BREAK SPACE Unicode character, better known as
BYTE ORDER MARK (BOM), currently gives a lexical error:
{{{
$ printf '\xEF\xBB\xBF\nz = "str"\n' > z.hs
$ ghci z.hs
GHCi, version 6.8.0.20070927: http://www.haskell.org/ghc/ :? for help
Loading package base ... linking ... done.
z.hs:1:0: lexical error at character '\65279'
Failed, modules loaded: none.
Prelude> Leaving GHCi.
}}}
The character is only in categories Other and Format, not Space, but I
think we should lex it as whitespace anyway (with zero width for the
purposes of the layout rule). Ideally Haskell' would do likewise.
--
Ticket URL: <http://hackage.haskell.org/trac/ghc/ticket/1744>
GHC <http://www.haskell.org/ghc/>
The Glasgow Haskell Compiler_______________________________________________
Glasgow-haskell-bugs mailing list
[email protected]
http://www.haskell.org/mailman/listinfo/glasgow-haskell-bugs