Steven D'Aprano wrote:
On Thu, 25 Dec 2008 11:00:18 +0100, Stef Mientki wrote:
hello,
Is there a function to remove escape characters from a string ?
(preferable all escape characters except "\n").
Can you explain what you mean? I can think of at least four alternatives:
I have the following kind of strings,
the funny "þ" is ASCII character 254, used as a separator character
[FSM]
Counts = "1þ11þ16" ==> 1,11,16
Init1 = "1þ\BCtrl" ==> 1,Ctrl
State5 = "8þ\BJUMP_COMPL\b\n>PCWrite = 1\n>PCSource = 10"
==> 8, JUMP_COMPL\n>PCWrite = 1\n>PCSource = 10
Seeing and testing all your answers, with great solutions that I've
never seen before,
knowing nothing of escape sequences (I'm a windows guy ;-)
I now see that the characters I need to remove, like \B and \b are
not "official" escape sequences.
So in this case the best (easiest to understand) method is a few replace
statements:
s = s.replace ( '\b', '' ).replace( '\B', '' )
Nevertheless, thank you all for the other examples,
cheers,
Stef
(1) Remove literal escape sequences (backslash-char):
"abc\\t\\ad" => "abcd"
r"abc\t\ad" => "abcd"
(2) Replace literal escape sequences with the character they represent:
"abc\\t\\ad" => "abc\t\ad"
(3) Remove characters generated by escape sequences:
"abc\t\ad" => "abcd"
"abc" => "abc" but "a\x62c" => "ac"
This is likely to be impossible without deep magic.
(4) Remove so-called binary characters which are typically inserted using
escape sequences:
"abc\t\ad" => "abcd"
"abc" => "abc" but "a\x62c" => "abc"
This is probably the easiest, assuming you have bytes instead of unicode.
import string
table = string.maketrans('', '')
delchars =''.join(chr(n) for n in range(32))
s = string.translate(s, table, delchars)
--
http://mail.python.org/mailman/listinfo/python-list