If you'd like to do HTML parsing, you'd better use RubyfoulSoup http://www.crummy.com/software/RubyfulSoup/ which is a port of BeautifulSoup to Python, and IMHO better adapted to HTML parsing than libxml.
++ yk Mark Thomas a écrit : > I'm switching to Ruby from Perl, and currently I do all my HTML parsing in > perl's XML::LibXML. Applying XPath to parse HTML is extremely powerful and > fast, fast, fast in libxml. > > Can you add that feature to the Ruby one? I think it would be easy to do; > it's just a flag on the parser, which tells libxml to create a DOM from > HTML instead of XML, and all the XML methods then magically work on the > HTML! > > So it should be really low hanging fruit. Sweet, delicious fruit. > > Please consider it! > > Thanks, > - Mark. > > _______________________________________________ > libxml-devel mailing list > [email protected] > http://rubyforge.org/mailman/listinfo/libxml-devel > _______________________________________________ libxml-devel mailing list [email protected] http://rubyforge.org/mailman/listinfo/libxml-devel
