Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread Chris Angelico
On Fri, May 29, 2015 at 6:05 PM, anatoly techtonik techto...@gmail.com wrote: On Wed, May 27, 2015 at 9:52 PM, anatoly techtonik techto...@gmail.com wrote: And the short answer is that we need unicode because we are printing this information to the stdout, and stdout is opened in text mode at

Subscribe to get an answer vs automatic CC Was: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread anatoly techtonik
On Wed, May 27, 2015 at 3:57 PM, Laura Creighton l...@openend.se wrote: Chris Angelico apparently has a problem with cc'd people who aren't on the list. I thought that CC in this case works automatically? If that's not the case, then I'll be annoyed by this too. So, thanks for CCing. =) Also,

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread anatoly techtonik
On Wed, May 27, 2015 at 3:57 PM, Laura Creighton l...@openend.se wrote: --- Forwarded Message Return-Path: python-list-bounces+lac=openend...@python.org Received: from mail.python.org (mail.python.org [82.94.164.166]) by theraft.openend.se (8.14.4/8.14.4/Debian-4) with ESMTP id

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread Laura Creighton
In a message of Fri, 29 May 2015 11:05:07 +0300, anatoly techtonik writes: Added Mailman to my suxx tracker: https://github.com/techtonik/suxx-tracker#mailman You are damning the wrong piece of software -- this is not a problem with mailman; mailman doesn't care at all what software you use to

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread Laura Creighton
Do you know about the codecs module? reading http://pymotw.com/2/codecs/ may be useful if this is new to you. Have you read https://www.python.org/dev/peps/pep-0293/ ? Will backslashreplace do what you want? Laura -- https://mail.python.org/mailman/listinfo/python-list

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread Ian Kelly
On Fri, May 29, 2015 at 2:05 AM, anatoly techtonik techto...@gmail.com wrote: Added Mailman to my suxx tracker: https://github.com/techtonik/suxx-tracker#mailman What a useless tool. Instead of tiredly complaining that things suck, why not take some initiative to make them better? I'm curious

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread Mark Lawrence
On 29/05/2015 11:02, Ian Kelly wrote: On Fri, May 29, 2015 at 2:05 AM, anatoly techtonik techto...@gmail.com wrote: Added Mailman to my suxx tracker: https://github.com/techtonik/suxx-tracker#mailman What a useless tool. Instead of tiredly complaining that things suck, why not take some

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread Jon Ribbens
On 2015-05-29, Ian Kelly ian.g.ke...@gmail.com wrote: On Fri, May 29, 2015 at 2:05 AM, anatoly techtonik techto...@gmail.com wrote: Added Mailman to my suxx tracker: https://github.com/techtonik/suxx-tracker#mailman What a useless tool. Instead of tiredly complaining that things suck, why

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread anatoly techtonik
On Fri, May 29, 2015 at 11:41 AM, Laura Creighton l...@openend.se wrote: In a message of Fri, 29 May 2015 11:05:07 +0300, anatoly techtonik writes: Added Mailman to my suxx tracker: https://github.com/techtonik/suxx-tracker#mailman You are damning the wrong piece of software -- this is not a

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread anatoly techtonik
On Fri, May 29, 2015 at 2:39 PM, Laura Creighton l...@openend.se wrote: Do you know about the codecs module? reading http://pymotw.com/2/codecs/ may be useful if this is new to you. Does that work for Python 2 and Python 3? Have you read https://www.python.org/dev/peps/pep-0293/ ? No.

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-29 Thread Ian Kelly
On Fri, May 29, 2015 at 4:44 AM, Jon Ribbens jon+use...@unequivocal.co.uk wrote: On 2015-05-29, Ian Kelly ian.g.ke...@gmail.com wrote: On Fri, May 29, 2015 at 2:05 AM, anatoly techtonik techto...@gmail.com wrote: Added Mailman to my suxx tracker:

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing)

2015-05-27 Thread anatoly techtonik
On Wed, May 27, 2015 at 2:35 PM, Laura Creighton l...@openend.se wrote: I am missing something. Why do you need unicode at all? Why can you not just keep your binary data as binary data? Good question. From the SCons code I see that we need unicode, because we switched to io.StringIO which is

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing)

2015-05-27 Thread Laura Creighton
I am missing something. Why do you need unicode at all? Why can you not just keep your binary data as binary data? I feel like I must be missing something obvious here ... Laura -- https://mail.python.org/mailman/listinfo/python-list

Fwd: Lossless bulletproof conversion to unicode (backslashing)

2015-05-27 Thread anatoly techtonik
Hi. This was labelled offtopic in python-ideas, so I edited and forwarded it here. Please CC as I am not subscribed. In short, I need a bulletproof way to convert from anything to unicode. This requires some kind of escaping to go forward and back. Some helper function like u2b() (unicode to

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing)

2015-05-27 Thread Chris Angelico
On Wed, May 27, 2015 at 9:52 PM, anatoly techtonik techto...@gmail.com wrote: And the short answer is that we need unicode because we are printing this information to the stdout, and stdout is opened in text mode at least on Windows, and without explicit conversion, Python will try to decode

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing)

2015-05-27 Thread anatoly techtonik
On Wed, May 27, 2015 at 2:47 PM, anatoly techtonik techto...@gmail.com wrote: On Wed, May 27, 2015 at 2:35 PM, Laura Creighton l...@openend.se wrote: I am missing something. Why do you need unicode at all? Why can you not just keep your binary data as binary data? Good question. From the

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing)

2015-05-27 Thread Steven D'Aprano
On Wed, 27 May 2015 09:15 pm, anatoly techtonik wrote: Hi. This was labelled offtopic in python-ideas, so I edited and forwarded it here. Please CC as I am not subscribed. In short, I need a bulletproof way to convert from anything to unicode. This requires some kind of escaping to

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing) (fwd)

2015-05-27 Thread Laura Creighton
Chris Angelico apparently has a problem with cc'd people who aren't on the list. python-list is very quiet these days, so if you subscribe it won't be drinking from the firehose. And you can always turn off delivery when you are done. Or you can just go read the archives:

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing)

2015-05-27 Thread random832
On Wed, May 27, 2015, at 07:15, anatoly techtonik wrote: The solution is to have a filter preprocess the binary string to escape all non-unicode symbols so that the following lossless transformation becomes possible: binary -> escaped utf-8 string -> unicode -> binary I want to know if
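One way to get a lossless bytes-to-text-to-bytes round trip in Python 3 is the surrogateescape error handler from PEP 383. This is a sketch of that option, not necessarily what the thread settled on. The name u2b() comes from the original post; b2u() and the sample data are assumptions made for illustration.

```python
def b2u(data: bytes) -> str:
    """Decode bytes to text; undecodable bytes become lone surrogates."""
    return data.decode("utf-8", errors="surrogateescape")


def u2b(text: str) -> bytes:
    """Reverse b2u(): lone surrogates turn back into the original bytes."""
    return text.encode("utf-8", errors="surrogateescape")


# Hypothetical input mixing valid UTF-8 with bytes that are not valid UTF-8.
raw = b"valid \xc3\xa9 and invalid \xff\xfe"
text = b2u(raw)
restored = u2b(text)
print(restored == raw)

# Every possible byte value survives the round trip.
all_bytes = bytes(range(256))
print(u2b(b2u(all_bytes)) == all_bytes)
```

The trade-off is that the intermediate text contains lone surrogates, which a strict encoder rejects, so it may still need escaping (for example with backslashreplace) before it is printed.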

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing)

2015-05-27 Thread random832
On Wed, May 27, 2015, at 07:47, anatoly techtonik wrote: because Python 3 doesn't have non-unicode StringIO That's actually not true - the non-unicode equivalent is BytesIO. However, it's probably not actually what you want, if the point is to display the filenames to the user. --
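A short sketch of the distinction drawn above, assuming Python 3; the file name is a made-up example. io.BytesIO stores bytes untouched, while io.StringIO accepts only text.

```python
import io

# BytesIO keeps arbitrary bytes exactly as written, valid UTF-8 or not.
byte_buf = io.BytesIO()
byte_buf.write(b"name-\xff.txt\n")
stored = byte_buf.getvalue()
print(stored)

# StringIO refuses bytes outright in Python 3.
text_buf = io.StringIO()
try:
    text_buf.write(b"name-\xff.txt\n")
    rejected = False
except TypeError:
    rejected = True
print(rejected)
```

As the reply notes, keeping the data as bytes only postpones the decoding question if the names are eventually shown to the user.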

Re: Fwd: Lossless bulletproof conversion to unicode (backslashing)

2015-05-27 Thread Terry Reedy
On 5/27/2015 7:15 AM, anatoly techtonik wrote: Hi. This was labelled offtopic in python-ideas, so I edited and forwarded it here. Please CC as I am not subscribed. I am not subscribed either, but I do not need or want CCs. python-list is mirrored at news.gmane.org where you can access it as