"softwindow" <[EMAIL PROTECTED]> wrote in message
news:[EMAIL PROTECTED]
> it is difficult to get all URL's in a page
Is this really so hard?:
-
from pyparsing import Literal,Suppress,CharsNotIn,CaselessLiteral,\
Word,dblQuotedString,alphanums,SkipTo,makeHTMLTags
import
It is difficult to get all URL's in a page.
You can use the sgmllib module to parse HTML files
and get the standard href values.
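
A rough sketch of that approach, in case it helps. It assumes a current Python 3, where sgmllib is no longer in the standard library, so html.parser.HTMLParser stands in for it here. The names HrefCollector and extract_hrefs and the sample HTML are made up for illustration, not taken from any posted code:

```python
from html.parser import HTMLParser


class HrefCollector(HTMLParser):
    """Collect the href value of every <a> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # Tag and attribute names arrive lowercased; attrs is a list
        # of (name, value) pairs, and value is None for bare attributes.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.hrefs.append(value)


def extract_hrefs(html_text):
    """Return the href values found in html_text, in document order."""
    parser = HrefCollector()
    parser.feed(html_text)
    parser.close()
    return parser.hrefs


# Made-up sample input, only for demonstration.
sample = '<p><a href="http://www.python.org/">Python</a> and <A HREF="/docs">docs</A></p>'
print(extract_hrefs(sample))
```

This only looks at href on anchor tags. URLs in img src, link, or script tags would need extra branches in handle_starttag.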
--
http://mail.python.org/mailman/listinfo/python-list
"defcon8" <[EMAIL PROTECTED]> wrote in message
news:[EMAIL PROTECTED]
> How do I get all the URL's in a page?
>
pyparsing comes with a simple example that does this, too.
-- Paul
Download pyparsing at http://sourceforge.net/projects/pyparsing
--
Thanks
--
Use HTMLParser or a regular expression.
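
For the regular expression route, something like the following might do as a quick sketch. It assumes the href values are quoted, and the pattern, the find_hrefs name, and the sample string are my own illustration rather than anything from the standard library:

```python
import re

# Matches href="..." or href='...', case-insensitively.
# Assumption: attribute values are quoted; unquoted values are skipped.
HREF_RE = re.compile(r'href\s*=\s*(["\'])(.*?)\1', re.IGNORECASE | re.DOTALL)


def find_hrefs(html_text):
    """Return every quoted href value in html_text, in order of appearance."""
    return [match.group(2) for match in HREF_RE.finditer(html_text)]


# Made-up sample input, only for demonstration.
sample = '<a href="http://example.com/a">A</a> <A HREF=\'/b\'>B</A>'
print(find_hrefs(sample))
```

A pattern like this also matches href text inside comments or scripts, so a real parser is the safer choice for messy pages.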
--
How do I get all the URL's in a page?
--