Re: Can't get the real contents from page in internet as the tag no-cache

2006-03-23 Thread Tim Roberts
dongdong [EMAIL PROTECTED] wrote: using a web browser I can get the page's content normally, but when I use urllib2.urlopen("http://tech.163.com/2004w11/12732/2004w11_1100059465339.html").read() the result is <html><head><META HTTP-EQUIV=REFRESH CONTENT="0;URL=http://tech.163.com/04/1110/12/14QUR2BR0009159H.html">

Re: Can't get the real contents from page in internet as the tag no-cache

2006-03-23 Thread Diez B. Roggisch
dongdong wrote: using a web browser I can get the page's content normally, but when I use urllib2.urlopen("http://tech.163.com/2004w11/12732/2004w11_1100059465339.html").read() the result is <html><head><META HTTP-EQUIV=REFRESH CONTENT="0;URL=http://tech.163.com/04/1110/12/14QUR2BR0009159H.html"> <META

Re: Can't get the real contents from page in internet as the tag no-cache

2006-03-23 Thread dongdong
Oh! My thanks to Tim Roberts and everyone above! I see now: the different URLs are the cause. The contents can only be fetched from the later (real) URL. My mistake was not noticing that a different URL was taking effect.
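
The point about the second URL can be illustrated with a small sketch. It is not code from the thread: it assumes Python 3 (where urllib2 has become urllib.request), and the names MetaRefreshParser and find_meta_refresh are made up for this example. It pulls the target URL out of a META refresh tag like the one quoted in the question, so that the real page can be requested next.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class MetaRefreshParser(HTMLParser):
    """Record the target URL of the first <meta http-equiv="refresh"> tag."""

    def __init__(self):
        super().__init__()
        self.refresh_url = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.refresh_url is not None:
            return
        attr = dict(attrs)  # HTMLParser lowercases tag and attribute names
        if (attr.get("http-equiv") or "").lower() != "refresh":
            return
        # The content attribute looks like "0;URL=http://host/real.html".
        _delay, _sep, rest = (attr.get("content") or "").partition(";")
        key, _eq, target = rest.strip().partition("=")
        if key.strip().lower() == "url" and target:
            self.refresh_url = target.strip().strip("'\"")


def find_meta_refresh(html, base_url):
    """Return the absolute refresh target found in html, or None."""
    parser = MetaRefreshParser()
    parser.feed(html)
    if parser.refresh_url is None:
        return None
    # Resolve relative targets against the URL the page was fetched from.
    return urljoin(base_url, parser.refresh_url)
```

A caller would fetch the first URL (for instance with urllib.request.urlopen), decode the body, pass it to find_meta_refresh, and, if the result is not None, fetch that URL instead. The page's character encoding still has to be handled separately when decoding.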

Re: Can't get the real contents from page in internet as the tag no-cache

2006-03-23 Thread John J. Lee
dongdong [EMAIL PROTECTED] writes: Oh! My thanks to Tim Roberts and everyone above! I see now: the different URLs are the cause. The contents can only be fetched from the later (real) URL. My mistake was not noticing that a different URL was taking effect. If you use

Can't get the real contents from page in internet as the tag no-cache

2006-03-22 Thread dongdong
Using a web browser I can get the page's content normally, but when I use urllib2.urlopen("http://tech.163.com/2004w11/12732/2004w11_1100059465339.html").read() the result is <html><head><META HTTP-EQUIV=REFRESH CONTENT="0;URL=http://tech.163.com/04/1110/12/14QUR2BR0009159H.html"> <META http-equiv=Pragma

Re: Can't get the real contents from page in internet as the tag no-cache

2006-03-22 Thread Kent Johnson
dongdong wrote: using a web browser I can get the page's content normally, but when I use urllib2.urlopen("http://tech.163.com/2004w11/12732/2004w11_1100059465339.html").read() the result is <html><head><META HTTP-EQUIV=REFRESH CONTENT="0;URL=http://tech.163.com/04/1110/12/14QUR2BR0009159H.html"> <META

Re: Can't get the real contents from page in internet as the tag no-cache

2006-03-22 Thread dongdong
Yeah, you are right, the page uses Chinese. (I'm Chinese too. ^_^) Using urllib2.urlopen('').read(), I can't get the contents between '<body>' and '</body>'. I think the reason isn't the Chinese encoding but the 'no-cache' setting. I want to get the contents between those two tags. Can you find the problem?

Re: Can't get the real contents from page in internet as the tag no-cache

2006-03-22 Thread I V
dongdong wrote: using a web browser I can get the page's content normally, but when I use urllib2.urlopen("http://tech.163.com/2004w11/12732/2004w11_1100059465339.html").read() the result is <html><head><META HTTP-EQUIV=REFRESH CONTENT="0;URL=http://tech.163.com/04/1110/12/14QUR2BR0009159H.html"> This line