On 24/3/06 4:42 AM, "A. Pagaltzis" <[EMAIL PROTECTED]> wrote:
>> I'm getting the data by scraping an html page, so I'm expecting
>> it to be acceptable html code, including html entities.
>
> Then decode the entities to a Unicode string and emit the feed as
> Unicode. Simplest thing that will work reliably.

I figured as much. Oh well, now to track down a list of html entities and
their corresponding unicodes ...

e.
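
For what it's worth, here is a minimal sketch of the decode-then-emit step suggested above. The thread does not name a language, so Python 3.4 or later is an assumption, and the function names `entities_to_unicode` and `feed_text` are made up for illustration. The standard library already ships the entity table (`html.entities.name2codepoint`), so there should be no need to hunt one down by hand.

```python
import html
from html.entities import name2codepoint   # maps entity name -> Unicode code point
from xml.sax.saxutils import escape


def entities_to_unicode(scraped: str) -> str:
    """Decode named, decimal and hex HTML entities into Unicode characters."""
    return html.unescape(scraped)


def feed_text(scraped: str) -> str:
    """Decode HTML entities, then escape only what XML itself requires.

    After decoding, the text is plain Unicode; escape() re-encodes just
    '&', '<' and '>' so the result is safe as XML element content.
    """
    return escape(entities_to_unicode(scraped))


if __name__ == "__main__":
    sample = "caf&eacute; &#8364;5 &amp; up"
    print(feed_text(sample))
    # Looking up a single entry in the table, if the raw mapping is wanted.
    print(hex(name2codepoint["eacute"]))
```

The feed would then be written out in a Unicode encoding such as UTF-8, so the only entities left in it are the handful XML defines itself.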