I just wanted to report that I've tried TagSoup and at first glance it seems to be doing exactly what I want - this is great! Instead of using a SAXParserFactory I'm now using the SAXFactoryImpl class in TagSoup to instantiate a new SAXParser. I will need to look it over a bit more but it just parsed through a copyright symbol without any complaints!!
On Mar 1, 7:32 am, StefanK <[email protected]> wrote: > In my experience, the problem is in many cases in the character > encoding used in the feed. If the feed is encoded using ISO-8859-1 > encoding (which is what CNN top stories appears to use), and you are > trying to read it using the default UTF-8 encoding some symbols will > come as invalid and break the parser. The only viable solution is to > manually detect the encoding before trying to parse and then construct > the input stream given to the parser with the correct encoding. This > is what I end up doing for BeyondPod in both Windows Mobile and > Android platforms and this solved large set of parsing issues. > Welcome to the bizarre world of RSS parsing. > > Stefan > > On Mar 1, 10:03 am, grennis <[email protected]> wrote: > > > OK, thanks all. I didn't realize the problem was as pervasive as it > > is. I'm presenting a limited set of feeds so I'm hoping the scrub > > approach will work. --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Android Developers" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/android-developers?hl=en -~----------~----~----~----~------~----~------~--~---

