[android-developers] Re: How to parse html with saxparser (or other solution)

2010-01-04 Thread tlegras
nekohtml is now working fine for me, so I probably won't switch :) But thank you for the link, it is a goldmine :) I find the documentation is missing such snippets. On 2 Jan, 23:03, jwei512 jwei...@gmail.com wrote: Another one you could try is HTML Cleaner (http://

[android-developers] Re: How to parse html with saxparser (or other solution)

2010-01-03 Thread tlegras
OK, I got it. It seems the problem is that their xercesMinimal.jar does not work. I tried it in a non-Android Java project and had the same problem. With the full Xerces jar I can parse my HTML page even though it has several errors in it. Too bad the full Xerces jar is 1.2 MB :( Seems like a bug from

[android-developers] Re: How to parse html with saxparser (or other solution)

2010-01-03 Thread jwei512
Another one you could try is HTML Cleaner (http://htmlcleaner.sourceforge.net/). I've already made a few applications that reference this library, and it even supports XPath for parsing the HTML source. If you'd like to see some code snippets, let me know and I can show you some. - jwei

[android-developers] Re: How to parse html with saxparser (or other solution)

2010-01-02 Thread Kumar Bibek
I guess you need to use a special HTML parser. Since HTML pages are often not well-formed and not XML-compliant, using an XML parser will not serve your purpose. Search for third-party libraries. Thanks and Regards, Kumar Bibek On Jan 2, 8:05 pm, tlegras tleg...@gmail.com wrote: Happy new
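
To illustrate the point about well-formedness, here is a minimal sketch that uses only the standard javax.xml.parsers SAX API. The class name WellFormedCheck and the helper isWellFormedXml are hypothetical names chosen for this example, and the two markup strings are made-up inputs, not the page from the thread. It shows a strict XML parser rejecting HTML with an unclosed tag while accepting the same markup once every tag is closed.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper class, for illustration only.
public class WellFormedCheck {

    // Returns true if a plain XML SAX parser accepts the markup,
    // false if it reports a parse (well-formedness) error.
    static boolean isWellFormedXml(String markup) {
        try {
            SAXParser parser = SAXParserFactory.newInstance().newSAXParser();
            // DefaultHandler ignores all content events and rethrows fatal errors.
            parser.parse(new InputSource(new StringReader(markup)), new DefaultHandler());
            return true;
        } catch (SAXException e) {
            // Raised for malformed input, e.g. a mismatched closing tag.
            return false;
        } catch (Exception e) {
            // ParserConfigurationException or IOException: unrelated to the markup itself.
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        // Typical HTML: <br> is never closed, which is not valid XML.
        String sloppyHtml = "<html><body><p>Hello<br></body></html>";
        // The same content with every tag closed.
        String strictXhtml = "<html><body><p>Hello<br/></p></body></html>";

        System.out.println("sloppy HTML accepted: " + isWellFormedXml(sloppyHtml));
        System.out.println("strict XHTML accepted: " + isWellFormedXml(strictXhtml));
    }
}
```

An HTML-tolerant parser such as the ones suggested in this thread is meant to repair this kind of input before handing it to the application, which is why a plain SAX parser alone is not enough here.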

Re: [android-developers] Re: How to parse html with saxparser (or other solution)

2010-01-02 Thread Frank Weiss
Check nekohtml. I haven't tried it on Android yet, but I have used it several times on desktop for screee On Jan 2, 2010 8:36 AM, Kumar Bibek coomar@gmail.com wrote: I guess you need to use a special HTML parser. Since HTML pages are not well-formed and are not XML-compliant, using an XML parser

[android-developers] Re: How to parse html with saxparser (or other solution)

2010-01-02 Thread tlegras
OK, thanks. I am trying nekohtml and currently trying to make it run, but with the minimal sample code (so using only the provided xercesMinimal.jar) I got this exception in my parse() function: E/AndroidRuntime( 765): Uncaught handler: thread Thread-10 exiting due to uncaught exception