Re: Use of Unicode in Python 2.5 source code literals

2009-05-03 Thread Uncle Bruce
On May 3, 7:37 am, Matt Nordhoff wrote: > Uncle Bruce wrote: > -- I think I've figured it out! What I was trying to do was to enter the literal strings directly into the IDLE interpreter. The IDLE interpreter will not accept high codepoints directly. However, when I put a defined function in

Re: Use of Unicode in Python 2.5 source code literals

2009-05-03 Thread Matt Nordhoff
Uncle Bruce wrote: > I'm working with Python 2.5.4 and the NLTK (Natural Language > Toolkit). I'm an experienced programmer, but new to Python. > > This question arose when I tried to create a literal in my source code > for a Unicode codepoint greater than 255. (I also posted this > question in

Re: Use of Unicode in Python 2.5 source code literals

2009-05-03 Thread Steven D'Aprano
On Sun, 03 May 2009 03:43:27 -0700, Uncle Bruce wrote: > Based on some experimenting I've done, I suspect that the support for > Unicode literals in ANY encoding isn't really accurate. What seems to > happen is that there must be an 8-bit mapping between the set of Unicode > literals and what can