Xavier Mor-Mur wrote:
I am writing an SMTP client to send HTML email from my application.
The HTML is normally a document saved in HTML format. If it contains images, their tags have the SRC attribute encoded in a way I can't figure out. In the example at the end you can see HTML generated using OpenOffice 3.1. There is only one image, and the document name is "Sin título 1.html", which is used to name the inserted images.
I don't know if this happens with MS-Word or other word processors.

<IMG SRC="Sin%20t%C3%ADtulo%201_html_m82e68f1.jpg" ....
The image on disk is "Sin título 1_html_m82e68f1.jpg".
Spaces are replaced by %20,
but í is replaced by %C3%AD, whereas in UTF-8 it is %ED.
No, it is the other way around: %C3%AD is the percent-encoded UTF-8 form of that accented i (the two bytes C3 AD), while %ED is its representation in ISO-8859-1 (the single byte ED).
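
To make the two encodings concrete, here is a minimal sketch. It uses Python purely for illustration (it is not the Delphi code under discussion), and the variable names are made up for this example. It percent-encodes the same character under each charset:

```python
from urllib.parse import quote

# U+00ED, LATIN SMALL LETTER I WITH ACUTE (written as an escape so the
# example does not depend on the encoding of this message)
ch = "\u00ed"

# The same character as raw bytes in each charset
utf8_bytes = ch.encode("utf-8")          # two bytes: C3 AD
latin1_bytes = ch.encode("iso-8859-1")   # one byte: ED

# Percent-encoding just writes each byte as %XX
utf8_quoted = quote(ch, encoding="utf-8")
latin1_quoted = quote(ch, encoding="iso-8859-1")

print(utf8_bytes.hex(), utf8_quoted)
print(latin1_bytes.hex(), latin1_quoted)
```

So the SRC attribute written by OpenOffice is consistent with a UTF-8 file name that was then URL-encoded.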


Special characters are coded using 2 bytes, and I need to know how to decode them so I can access the image and embed it into the email.
UrlDecode doesn't work, as it returns "Ã-" (where this - is a virtual dash).
It does work: it returns a UTF-8 encoded string. You have to decode that string further to get the ISO-8859-1 value you are looking for.
UTF8Decode is there for this.
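
The two-step decode can be sketched as follows. Again this is Python used only as an illustration of the idea, not the UrlDecode/UTF8Decode calls themselves, and the variable names are hypothetical. The first step undoes the percent-encoding and yields bytes; the second step interprets those bytes as UTF-8. Reading the bytes as ISO-8859-1 instead reproduces the "Ã" plus soft hyphen result reported above:

```python
from urllib.parse import unquote_to_bytes

# SRC value taken from the quoted IMG tag
src = "Sin%20t%C3%ADtulo%201_html_m82e68f1.jpg"

# Step 1: undo the percent-encoding; the result is still UTF-8 *bytes*
raw = unquote_to_bytes(src)

# Misreading those bytes as ISO-8859-1 gives "Ã" followed by U+00AD
# (a soft hyphen), which is the garbled result seen in the question
wrong = raw.decode("iso-8859-1")

# Step 2: decode the bytes as UTF-8 to recover the real file name
right = raw.decode("utf-8")

print(repr(wrong))
print(repr(right))
```

If the file system API expects ISO-8859-1 (or another ANSI code page), the correctly decoded name can then be re-encoded to that charset, where í becomes the single byte ED.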