Re: converting html escape sequences to unicode characters

2004-12-10 Thread Craig Ringer
On Fri, 2004-12-10 at 08:36, harrelson wrote: I have a list of about 2500 html escape sequences (decimal) that I need to convert to utf-8. Stuff like: I'm pretty sure this somewhat horrifying code does it, but is probably an example of what not to do: escapeseq = '#48708;' uescape = (\\u%x

Re: converting html escape sequences to unicode characters

2004-12-10 Thread Craig Ringer
On Fri, 2004-12-10 at 16:09, Craig Ringer wrote: On Fri, 2004-12-10 at 08:36, harrelson wrote: I have a list of about 2500 html escape sequences (decimal) that I need to convert to utf-8. Stuff like: I'm pretty sure this somewhat horrifying code does it, but is probably an example of

converting html escape sequences to unicode characters

2004-12-09 Thread harrelson
I have a list of about 2500 html escape sequences (decimal) that I need to convert to utf-8. Stuff like: #48708; #54665; #44592; #47196; #48372; #45244; #44144; #50640; #50836; #45236; #47732; #44552; #51060; #50620; #47560; #51648; #51104; Anyone know what the decimal is representing? It

Re: converting html escape sequences to unicode characters

2004-12-09 Thread Kent Johnson
harrelson wrote: I have a list of about 2500 html escape sequences (decimal) that I need to convert to utf-8. Stuff like: #48708; #54665; #44592; #47196; #48372; #45244; #44144; #50640; #50836; #45236; #47732; #44552; #51060; #50620; #47560; #51648; #51104; Anyone know what the decimal is