2010/8/23 "Martin v. Löwis" <[email protected]>:
>> Proposals: let's remove z3c.pypimirror and pep381client from the download 
>> stats.
>
> This isn't really implementable as formulated: for many of the files, I
> just don't know what user agent has downloaded them.

How come? I thought all calls were made through Apache via the same root.

[..]
> Also, what about other automatic downloaders, such as Googlebot, wget,
> or buildout?

I would count buildout and wget calls. For instance, I manually download
files using wget, so it's a legitimate hit. But yes, the definition of
what should be counted as a hit is quite fuzzy.

The only way to know which hits come from mirrors or bots without
relying on the UA would be to detect a client that acts as a bot and
discard its hits.

This can be done by grouping calls issued from the same IP and flagging
the IPs that scan the whole index in a short time. But that's some work :)
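
A rough sketch of that heuristic, just to make the idea concrete. It is
not what PyPI does: the record layout (timestamp, ip, project), the
function names, the one-hour window and the 90% coverage threshold are
all assumptions picked for illustration.

```python
from collections import defaultdict


def find_bot_ips(hits, index_size, window=3600, coverage=0.9):
    """Return the IPs that fetched most of the index within `window` seconds.

    hits: iterable of (timestamp_seconds, ip, project) tuples, in any order.
    index_size: number of distinct projects in the index.
    The window and coverage defaults are arbitrary, hypothetical values.
    """
    by_ip = defaultdict(list)
    for ts, ip, project in hits:
        by_ip[ip].append((ts, project))

    bots = set()
    for ip, calls in by_ip.items():
        calls.sort()
        seen = defaultdict(int)  # project -> hits inside the current window
        start = 0
        for ts, project in calls:
            seen[project] += 1
            # Drop calls that are older than `window` seconds.
            while ts - calls[start][0] > window:
                old = calls[start][1]
                seen[old] -= 1
                if seen[old] == 0:
                    del seen[old]
                start += 1
            # Too many distinct projects in one window: treat as a bot.
            if len(seen) >= coverage * index_size:
                bots.add(ip)
                break
    return bots


def count_hits(hits, index_size, **kwargs):
    """Count the hits that remain once bot-like IPs are discarded."""
    hits = list(hits)
    bots = find_bot_ips(hits, index_size, **kwargs)
    return sum(1 for _, ip, _ in hits if ip not in bots)
```

A client that walks the index slowly enough stays under the threshold,
and several users behind one proxy IP could be flagged by mistake, so
this only narrows the fuzziness rather than removing it.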

> I plan to display each download counter broken down by UA, so that users
> could form their own opinion on how many downloads the file has really
> seen. Implementing this would take some time, though (as would
> implementing anything else, for that matter).

That would be the best/simplest option.
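
For what it's worth, the breakdown itself is cheap once the UA is
available. A minimal sketch, assuming the log can be read as
(filename, user_agent) pairs; that layout and the "unknown" bucket for
downloads with no recorded UA are my assumptions, not the actual PyPI
schema:

```python
from collections import Counter, defaultdict


def downloads_by_user_agent(records):
    """Build a per-file download counter broken down by user agent.

    records: iterable of (filename, user_agent) pairs; user_agent may be
    None or empty when it was not recorded (hypothetical layout).
    """
    table = defaultdict(Counter)
    for filename, user_agent in records:
        table[filename][user_agent or "unknown"] += 1
    return table
```

The total for a file is then sum(table[filename].values()), and anyone
can subtract the agents they don't consider real downloads.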

Regards
Tarek
_______________________________________________
Catalog-SIG mailing list
[email protected]
http://mail.python.org/mailman/listinfo/catalog-sig