Re: [libxml-devel] Request to add HTML parsing

Yann Klis Tue, 18 Jul 2006 09:33:31 -0700

If you'd like to do HTML parsing, you'd be better off using RubyfulSoup 
http://www.crummy.com/software/RubyfulSoup/ which is a Ruby port of 
Python's BeautifulSoup, and IMHO better suited to HTML parsing than 
libxml.


++

yk

Mark Thomas wrote:
> I'm switching to Ruby from Perl, and currently I do all my HTML parsing with
> Perl's XML::LibXML. Applying XPath to parse HTML is extremely powerful and
> fast, fast, fast in libxml.
>
> Can you add that feature to the Ruby one? I think it would be easy to do;
> it's just a flag on the parser, which tells libxml to create a DOM from
> HTML instead of XML, and all the XML methods then magically work on the
> HTML!
>
> So it should be really low hanging fruit. Sweet, delicious fruit.
>
> Please consider it!
>
> Thanks,
> - Mark.
>
> _______________________________________________
> libxml-devel mailing list
> [email protected]
> http://rubyforge.org/mailman/listinfo/libxml-devel
>   
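
For illustration, the XPath-over-a-DOM style that Mark describes might look
roughly like the sketch below. It uses REXML from Ruby's standard library as a
stand-in, not the libxml binding, because the HTML parser flag he asks for is
the feature under discussion. REXML is an XML parser, so the sample page (a
made-up example, not taken from this thread) has to be well-formed; real-world
HTML usually is not, which is exactly why a forgiving HTML parser is wanted.

```ruby
require 'rexml/document'

# Hypothetical sample page. It is deliberately well-formed, because REXML
# parses XML only and would reject typical tag-soup HTML.
PAGE = <<~HTML
  <html>
    <body>
      <h1>Links</h1>
      <ul>
        <li><a href="http://example.com/a">First</a></li>
        <li><a href="http://example.com/b">Second</a></li>
      </ul>
    </body>
  </html>
HTML

# Build a document tree from the markup.
doc = REXML::Document.new(PAGE)

# XPath query: collect the href attribute of every <a> element in the tree.
hrefs = REXML::XPath.match(doc, '//a').map { |a| a.attributes['href'] }

# XPath query: text content of the first <h1> element.
heading = REXML::XPath.first(doc, '//h1').text

puts hrefs
puts heading
```

The point of the request is that the same kind of query would run unchanged on
a tree built by libxml's HTML parser, with libxml handling the malformed
markup that REXML cannot.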

