[issue12742] Add support for CESU-8 encoding

2011-08-29 Thread Adal Chiriliuc

Adal Chiriliuc  added the comment:

It's an internal web API at the place I work for.

To be able to use it from Python in some form, I did an workaround in which I 
just stripped everything outside BMP:

# replace characters outside BMP with 'REPLACEMENT CHARACTER' (U+FFFD)
def cesu8_to_utf8(text):
result = ""
index = 0
length = len(text)
while index < length:
if text[index] < "\xf0":
result += text[index]
index  += 1
else:
result += "\xef\xbf\xbd"  # u"\ufffd".encode("utf8")
index  += 4
return result

Now that I look at the workaround again, I'm not even sure it's about CESU-8 
(it strips Unicode chars encoded to 4 bytes, not 2 pairs of 3 bytes surrogates).

However I can see why there would be little interest in adding this encoding.

--

___
Python tracker 
<http://bugs.python.org/issue12742>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue12742] Add support for CESU-8 encoding

2011-08-12 Thread Adal Chiriliuc

New submission from Adal Chiriliuc :

CESU-8 is identical with UTF-8 except that it has a different encoding format 
for surrogate characters.

http://en.wikipedia.org/wiki/CESU-8

It is used by some web APIs.

--
components: Unicode
messages: 141958
nosy: adalx
priority: normal
severity: normal
status: open
title: Add support for CESU-8 encoding
type: feature request
versions: Python 3.4

___
Python tracker 
<http://bugs.python.org/issue12742>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue2857] Add "java modified utf-8" codec

2011-08-12 Thread Adal Chiriliuc

Adal Chiriliuc  added the comment:

Python does have other "weird" encodings like bz2 or rot13.

Beside, batteries included :)

--

___
Python tracker 
<http://bugs.python.org/issue2857>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue2857] add codec for java modified utf-8

2011-05-10 Thread Adal Chiriliuc

Adal Chiriliuc  added the comment:

I use the hachoir Python package to parse Java .class files and extract the 
strings from them and having support for Java modified UTF-8 would have been 
nice.

--
nosy: +adalx

___
Python tracker 
<http://bugs.python.org/issue2857>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue7863] platform module doesn't detect Windows 7

2010-02-05 Thread Adal Chiriliuc

New submission from Adal Chiriliuc :

Running python 32 bit on Windows 7 64 bit:
>>> import platform
>>> platform.platform()
'Windows-post2008Server-6.1.7600'

Should be corrected to display
'Windows-7-6.1.7600'

Fix:
>elif maj == 6:
>if min == 0:
># ..
>release = 'Vista'
>else:
>if productType == VER_NT_WORKSTATION:
>release = 'Vista'
>else:
>release = '2008Server'
>elif min == 1:
>release = '7'
>else:
>release = 'post2008Server'

--
components: Library (Lib)
messages: 98920
nosy: adal
severity: normal
status: open
title: platform module doesn't detect Windows 7
versions: Python 2.6

___
Python tracker 
<http://bugs.python.org/issue7863>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue6900] Sub-optimal "Locate" button behaviour in Windows CHM file

2009-12-21 Thread Adal Chiriliuc

Adal Chiriliuc  added the comment:

This seem to have been broken when the help file style was changed (from
2.5 to 2.6).

I'm also bothered by this issue, since you cannot locate easily now
related modules.

For example, in the old help version I would go in the index to urllib,
display the module, then click Locate and I would then see all the
Internet related modules.

Now I have to search them through the Contents page.

--
nosy: +adalx

___
Python tracker 
<http://bugs.python.org/issue6900>
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue1546] Win32 Platform SDK conflict

2007-12-03 Thread Adal Chiriliuc

Adal Chiriliuc added the comment:

Sorry, I just saw that this was fixed on head :(

__
Tracker <[EMAIL PROTECTED]>
<http://bugs.python.org/issue1546>
__
___
Python-bugs-list mailing list 
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue1546] Win32 Platform SDK conflict

2007-12-03 Thread Adal Chiriliuc

New submission from Adal Chiriliuc:

The WRITE_RESTRICTED macro defined in structmember.h (value 4) conflicts
with the WRITE_RESTRICTED macro defined in winnt.h (value 8) in the
newer Win32 Platform SDK (such as the one included in Visual Studio 2008).

It generates a warning and could potentially be dangerous as the two
values are different.

--
components: Interpreter Core
messages: 58113
nosy: adal
severity: normal
status: open
title: Win32 Platform SDK conflict
type: compile error
versions: Python 2.6

__
Tracker <[EMAIL PROTECTED]>
<http://bugs.python.org/issue1546>
__
___
Python-bugs-list mailing list 
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com