On May 25, 6:51 am, Jia Lu [EMAIL PROTECTED] wrote:
Hi all,
I'm trying to parse HTML with the re module.
html = """
<TABLE BORDER=1 cellspacing=0 cellpadding=2>
<TR>
<TH nowrap>DATA1</TH><TH nowrap>DATA2</HT><TH nowrap>DATA3</HT><TH>DATA4</TH>
</TR>
<TR><TD>DATA5</TD><TD>DATA6</TD><TD>DATA7</TD><TD>DATA8</TD></TR>
</TABLE>
"""
I want to get DATA1-8 from that string. (DATA may not be English
* Jia Lu (25 May 2007 04:51:35 -0700)
> I'm trying to parse HTML with the re module.
> [...]
> Can anyone tell me how to do it with regular expressions in Python?
Just don't. Use an HTML parser like BeautifulSoup.
--
http://mail.python.org/mailman/listinfo/python-list
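
A minimal sketch of the parser-based approach suggested above. It uses the standard library's html.parser module (the Python 3 name for HTMLParser) rather than BeautifulSoup, so it runs without installing anything. The CellCollector class name is made up for illustration, and the markup is reconstructed from the original post, including its mistyped </HT> closing tags.

```python
from html.parser import HTMLParser


class CellCollector(HTMLParser):
    """Collect the text of every <th> and <td> cell (illustrative helper)."""

    def __init__(self):
        super().__init__()
        self.in_cell = False
        self.cells = []

    def handle_starttag(self, tag, attrs):
        # Tag names arrive lowercased, so <TH> and <th> are treated alike.
        if tag in ("th", "td"):
            self.in_cell = True
            self.cells.append("")

    def handle_endtag(self, tag):
        # The mistyped </HT> never matches here; the next <TH> simply
        # starts a new cell, so the data is still separated correctly.
        if tag in ("th", "td"):
            self.in_cell = False

    def handle_data(self, data):
        if self.in_cell:
            self.cells[-1] += data


html = """
<TABLE BORDER=1 cellspacing=0 cellpadding=2>
<TR>
<TH nowrap>DATA1</TH><TH nowrap>DATA2</HT><TH nowrap>DATA3</HT><TH>DATA4</TH>
</TR>
<TR><TD>DATA5</TD><TD>DATA6</TD><TD>DATA7</TD><TD>DATA8</TD></TR>
</TABLE>
"""

parser = CellCollector()
parser.feed(html)
parser.close()
print(parser.cells)
```

The parser only tracks whether it is inside a cell, so it does not care about attribute order, letter case, or the stray closing tags that would trip up a hand-written pattern.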
Thorsten Kampe wrote:
>> I'm trying to parse HTML with the re module.
> Just don't. Use an HTML parser like BeautifulSoup.
Or HTMLParser/htmllib. Of course, you can mix those with re; it'll be
easier than using re alone.
--
|\/|55: Mattia Gentilini e 55 = log2(che_palle_sta_storia) (by mezzo)
|/_| ETICS project at CNAF, INFN, Bologna, Italy
|\/| www.getfirefox.com
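
One way to read "mix those with re", sketched under assumptions: let the parser pull out the cell text, then apply a regular expression to that text only. The CellCollector class, the extract_cells helper, and the follow-up task of pulling the trailing number out of each cell are all hypothetical and do not come from the thread.

```python
import re
from html.parser import HTMLParser


class CellCollector(HTMLParser):
    """Collect the text of every <th> and <td> cell (illustrative helper)."""

    def __init__(self):
        super().__init__()
        self.in_cell = False
        self.cells = []

    def handle_starttag(self, tag, attrs):
        if tag in ("th", "td"):
            self.in_cell = True
            self.cells.append("")

    def handle_endtag(self, tag):
        if tag in ("th", "td"):
            self.in_cell = False

    def handle_data(self, data):
        if self.in_cell:
            self.cells[-1] += data


def extract_cells(markup):
    """Return the text of each table cell in markup (hypothetical helper)."""
    parser = CellCollector()
    parser.feed(markup)
    parser.close()
    return parser.cells


row = "<TR><TD>DATA5</TD><TD>DATA6</TD><TD>DATA7</TD><TD>DATA8</TD></TR>"
cells = extract_cells(row)

# The pattern now describes only the cell text, never the markup itself.
trailing_number = re.compile(r"(\d+)$")
numbers = [int(m.group(1)) for m in map(trailing_number.search, cells) if m]
print(numbers)
```

Splitting the work this way keeps the regular expression short, because it never has to account for tags, attributes, or line breaks inside the HTML.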