2008/9/8 Tony Godshall <[EMAIL PROTECTED]>:
> I haven't done this but I can speculate that you need to
> have wget identify itself as firefox.

When I read this, I thought it looked promising, but it doesn't work.
I tried sending exactly the user-agent string firefox is sending and
still got a page from yahoo that clearly indicates yahoo thinks I'm
not logged in.

/Don

>
> Quote from man wget...
>
>       -U agent-string
>       --user-agent=agent-string
>           Identify as agent-string to the HTTP server.
>
>           The HTTP protocol allows the clients to identify themselves
> using a "User-Agent" header field.  This enables distinguishing the
> WWW software,
>           usually for statistical purposes or for tracing of protocol
> violations.  Wget normally identifies as Wget/version, version being
> the current ver‐
>           sion number of Wget.
>
>           However, some sites have been known to impose the policy of
> tailoring the output according to the "User-Agent"-supplied
> information.  While this
>           is not such a bad idea in theory, it has been abused by
> servers denying information to clients other than (historically)
> Netscape or, more fre‐
>           quently, Microsoft Internet Explorer.  This option allows
> you to change the "User-Agent" line issued by Wget.  Use of this
> option is discouraged,
>           unless you really know what you are doing.
>
>
> On Mon, Sep 8, 2008 at 12:25 PM, Donald Allen <[EMAIL PROTECTED]> wrote:
>> There was a recent discussion concerning using wget to obtain pages
>> from yahoo logged into yahoo as a particular user. Micah replied to
>> Rick Nakroshis with instructions describing two methods for doing
>> this. This information has also been added by Micah to the wiki.
>>
>> I just tried the simpler of the two methods -- logging into yahoo with
>> my browser (Firefox 2.0.0.16) and then downloading a page with
>>
>> wget --output-document=/tmp/yahoo/yahoo.htm --load-cookies <my home
>> directory>/.mozilla/firefox/id2dmo7r.default/cookies.txt
>> 'http://<yahoo url>'
>>
>> The page I get is what would be obtained if an un-logged-in user went
>> to the specified url. Opening that same url in Firefox *does*
>> correctly indicate that it is logged in as me and reflects my
>> customizations.
>>
>> wget -V:
>> GNU Wget 1.11.1
>>
>> I am running a reasonably up-to-date Gentoo system (updated within the
>> last month) on a Thinkpad X61.
>>
>> Have I missed something here? Any help will be appreciated. Please
>> include my personal address in your replies as I am not (yet) a
>> subscriber to this list.
>>
>> Thanks --
>> /Don Allen
>>
>
>
>
> --
> Best Regards.
> Please keep in touch.
> This is unedited.
> P-)
>

Reply via email to