The TagSoup HTML parser, which is very forgiving and very customizable, is compiled into the Android runtime and used by the TextView component to parse HTML. Unfortunately it is not exposed by the SDK, but you can access it via introspection, with the caveat that there is a slight chance that in some future version of android it will go away or be replaced with something else.
On Jan 28, 4:31 pm, Frank Weiss <[email protected]> wrote: > I've recommended nekohtml and several people have reported using it > successfully on Android. > > On Jan 28, 2010 3:04 PM, "Marc Petit-Huguenin" <[email protected]> wrote: > > On 01/28/2010 02:31 PM, Allison Inouye wrote: > I am trying to parse an HTML > document that is missin... > I was able to parse badly written HTML (is there another kind?) as XML by > using > JTidy (not on Android so YMMV): > > import org.w3c.tidy.*; > > Tidy tidy = new Tidy(); > tidy.setXmlOut(true); > tidy.setShowWarnings(false); > tidy.setQuiet(true); > tidy.parseDOM(connection.getInputStream(), null); > > -- You received this message because you are subscribed to the Google Groups > "Android Developers" ... -- You received this message because you are subscribed to the Google Groups "Android Developers" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/android-developers?hl=en

