Thanks for the hint. The XML-RPC service is great, but I want some general techniques for extracting news information from ordinary HTML pages.
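To make concrete the kind of thing I mean, here is a rough sketch of rule-driven field extraction in Python. It is only an illustration under assumptions: the rule format (field name mapped to a tag and a class), the class names "headline" and "date", and the sample page are all made up, and it relies on nothing beyond the standard library's html.parser module.

```python
from html.parser import HTMLParser


class FieldExtractor(HTMLParser):
    """Collect the text of every element matching a (tag, class) rule."""

    def __init__(self, rules):
        super().__init__()
        # rules maps a field name to a (tag, css_class) pair.
        # This rule format is hypothetical, invented for this sketch.
        self.rules = rules
        self.fields = {name: [] for name in rules}
        self._current = None  # name of the field being captured, if any
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if self._current is not None:
            return  # already inside a matched element; keep collecting text
        classes = (dict(attrs).get("class") or "").split()
        for name, (want_tag, want_class) in self.rules.items():
            if tag == want_tag and want_class in classes:
                self._current = name
                self._buffer = []
                break

    def handle_data(self, data):
        if self._current is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        # Limitation: an element nested inside another element with the
        # same tag name would end the capture early.
        if self._current is not None and tag == self.rules[self._current][0]:
            text = " ".join("".join(self._buffer).split())
            self.fields[self._current].append(text)
            self._current = None


def extract(html, rules):
    """Return {field name: [text, ...]} for the given page and rules."""
    parser = FieldExtractor(rules)
    parser.feed(html)
    parser.close()
    return parser.fields


# Made-up page and rules, purely for demonstration.
SAMPLE = """
<html><body>
<div class="story">
  <h2 class="headline">Example <b>headline</b></h2>
  <span class="date">2003-01-01</span>
  <p>Body text goes here.</p>
</div>
</body></html>
"""
RULES = {"title": ("h2", "headline"), "date": ("span", "date")}

if __name__ == "__main__":
    print(extract(SAMPLE, RULES))
```

Something like this works only while the site keeps its markup stable, which is why I am hoping for a more general technique than hand-written rules per page.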
Currently I'm looking at a script-based approach found at: http://www.namo.com/products/handstory/manual/hsceditor/ The user writes a simple template that extracts certain fields from a web page. Unfortunately, it is not open source, so I cannot look inside the black box. :-(

Zhang Le
--
http://mail.python.org/mailman/listinfo/python-list