Tangentially, Dubai Airport has just changed its Web site in a manner that makes it far harder to parse; among other things, it replaces the names of airlines with little GIF logos. Oh yes, and where once each flight was a tr with class "data-row", now only 50% of them are, and the others are trs with no class attribute and some random colour value instead. Grrr.
This was the result:
output = [[td.string or td.img["src"] for td in tr.findAll(True) if td.img or
td.string] for tr in soup.findAll('tr', bgcolor=lambda value: value in
('White', '#F7F7DE'))]
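For anyone who would rather not squint at that one-liner, here is the same idea unrolled. It is only a rough sketch: it uses the standard library's html.parser instead of BeautifulSoup so that it runs with no extra installs, and it assumes (as above) that flight rows are exactly the trs whose bgcolor is 'White' or '#F7F7DE'. The names FlightRowParser and parse_flights, and the sample markup with its logo paths and flight numbers, are made up for illustration and are not taken from the real page.

```python
from html.parser import HTMLParser

# Assumption from the message above: flight rows are the <tr> elements
# whose bgcolor attribute is one of these two values.
ROW_COLOURS = {"White", "#F7F7DE"}


class FlightRowParser(HTMLParser):
    """Collect one list per flight row: each cell's text, or the
    logo's src when the cell holds only an image."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished flight rows
        self._row = None   # cells of the current flight row, else None
        self._cell = None  # state of the current <td>, else None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "tr":
            # Start collecting only if the row colour matches.
            self._row = [] if attrs.get("bgcolor") in ROW_COLOURS else None
            self._cell = None
        elif tag == "td" and self._row is not None:
            self._cell = {"text": [], "src": None}
        elif tag == "img" and self._cell is not None:
            self._cell["src"] = attrs.get("src")

    def handle_data(self, data):
        if self._cell is not None:
            self._cell["text"].append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._cell is not None:
            text = "".join(self._cell["text"]).strip()
            value = text or self._cell["src"]  # fall back to the logo URL
            if value:
                self._row.append(value)
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def parse_flights(html):
    parser = FlightRowParser()
    parser.feed(html)
    return parser.rows


# Made-up sample markup, not copied from the airport's site.
sample = """
<table>
  <tr class="header"><td>Airline</td><td>Flight</td></tr>
  <tr bgcolor="White"><td><img src="logos/ek.gif"></td><td>EK 001</td></tr>
  <tr bgcolor="#F7F7DE"><td><img src="logos/ba.gif"></td><td>BA 106</td></tr>
</table>
"""
print(parse_flights(sample))
```

Unlike the comprehension, this looks only at td cells rather than every tag inside the row, which seemed the safer reading of what is wanted.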
Cthulhu fhtagn!
--
The only thing worse than e-mail disclaimers...is people who send e-mail to
lists complaining about them
