HTML and character entities in RSS

Richard Gaskin Fri, 26 Feb 2010 17:52:06 -0800

Looking for standards among RSS feeds is like looking for standards inWindows GUI designs: everyone knows they're published somewhere, but noone takes the time to read 'em. ;)

I've been parsing a bunch of RSS files, and man, what a wild west ofweirdness it is.

For example, most of the RSS specs I've read suggest that all data isplan text, with HTML allowable only when marked as CDATA.

But I've seen feeds that do that backwards, and some that have somestrings containing character entities flagged as CDATA with othercontaining entities that aren't flagged -- in the same feed!

By what rule should I know when to translate data from characterentities back to plain old ASCII?

Browsers seem to handle the mish-mash rather well; wish I were asgraceful at handling all the inconsistencies I'm finding.


--
 Richard Gaskin
 Fourth World
 Rev training and consulting: http://www.fourthworld.com
 Webzine for Rev developers: http://www.revjournal.com
 revJournal blog: http://revjournal.com/blog.irv
_______________________________________________
use-revolution mailing list
use-revolution@lists.runrev.com
Please visit this url to subscribe, unsubscribe and manage your subscription 
preferences:
http://lists.runrev.com/mailman/listinfo/use-revolution

HTML and character entities in RSS

Reply via email to