dongdong [EMAIL PROTECTED] wrote:
using web browser can get page's content formally, but when use
urllib2.open(http://tech.163.com/2004w11/12732/2004w11_1100059465339.html;).read()
the result is
htmlheadMETA HTTP-EQUIV=REFRESH
CONTENT=0;URL=http://tech.163.com/04/1110/12/14QUR2BR0009159H.html;
dongdong wrote:
using web browser can get page's content formally, but when use
urllib2.open(http://tech.163.com/2004w11/12732/2004w11_1100059465339.html;).read()
the result is
htmlheadMETA HTTP-EQUIV=REFRESH
CONTENT=0;URL=http://tech.163.com/04/1110/12/14QUR2BR0009159H.html;
META
oh~~~! offer my thanks to Tim Roberts and all persons above!
I see now, it's the different url causes!
contents can only be got from the later (real ) url.
I made a mistick not to look at the different urls taking effect.
--
http://mail.python.org/mailman/listinfo/python-list
dongdong [EMAIL PROTECTED] writes:
oh~~~! offer my thanks to Tim Roberts and all persons above!
I see now, it's the different url causes!
contents can only be got from the later (real ) url.
I made a mistick not to look at the different urls taking effect.
If you use
using web browser can get page's content formally, but when use
urllib2.open(http://tech.163.com/2004w11/12732/2004w11_1100059465339.html;).read()
the result is
htmlheadMETA HTTP-EQUIV=REFRESH
CONTENT=0;URL=http://tech.163.com/04/1110/12/14QUR2BR0009159H.html;
META http-equiv=Pragma
dongdong wrote:
using web browser can get page's content formally, but when use
urllib2.open(http://tech.163.com/2004w11/12732/2004w11_1100059465339.html;).read()
the result is
htmlheadMETA HTTP-EQUIV=REFRESH
CONTENT=0;URL=http://tech.163.com/04/1110/12/14QUR2BR0009159H.html;
META
yeah,u r right, the page uses chinese.(I'm a chinese too.^_^,)
using urllib2.urlopen('').read(),I can't get the contents
between 'body' and '/body' ,the reason isn't the chinese encoding
but the 'no-cache' set,I think.
I want to get the contents between
can you find the problem
dongdong wrote:
using web browser can get page's content formally, but when use
urllib2.open(http://tech.163.com/2004w11/12732/2004w11_1100059465339.html;).read()
the result is
htmlheadMETA HTTP-EQUIV=REFRESH
CONTENT=0;URL=http://tech.163.com/04/1110/12/14QUR2BR0009159H.html;
This line