Then that sounds much more viable. I'll run a quick test now to see
how much clustering we'd see at, say, the one-second resolution level,
and throw it out here so we can make more informed decisions about a
data release on this.

On 13 April 2015 at 08:08, Hirav Gandhi <[email protected]> wrote:
> Hi Oliver,
>
> Re: Hirav: would you be looking for temporally /and/ contextually granular
> pageviews, i.e. "a view to X page at Y time", or just temporally granular,
> so "a view to a page on enwiki at X time"? If the latter you've got more of
> a shot, I suspect.
>
> I only want the latter - I am not concerned with the context so much as just
> “a view to a page on enwiki at X time.”
>
> Hirav
>
>
> On Apr 13, 2015, at 5:00 AM, [email protected] wrote:
>
> Send Analytics mailing list submissions to
> [email protected]
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://lists.wikimedia.org/mailman/listinfo/analytics
> or, via email, send a message with subject or body 'help' to
> [email protected]
>
> You can reach the person managing the list at
> [email protected]
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Analytics digest..."
>
>
> Today's Topics:
>
>   1. Re: Page views on a more frequent than hourly basis (Pine W)
>   2. Re: Page views on a more frequent than hourly basis (Oliver Keyes)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Mon, 13 Apr 2015 00:47:31 -0700
> From: Pine W <[email protected]>
> To: "A mailing list for the Analytics Team at WMF and everybody who
> has an interest in Wikipedia and analytics."
> <[email protected]>
> Cc: Bharath Sitaraman <[email protected]>
> Subject: Re: [Analytics] Page views on a more frequent than hourly
> basis
> Message-ID:
> <CAF=dyjgnut+t6n6mujq16duyiwp7et6ruht3_-tzdnsep+2...@mail.gmail.com>
> Content-Type: text/plain; charset="utf-8"
>
>
> Hi,
>
> This issue of pageview data granularity has been discussed before, and the
> answer has been that hourly is the smallest increment allowed to be
> revealed publicly, for privacy reasons.
>
> I believe that the person you will want to discuss your request with is
> Toby, who I have cc'd here.
>
> Pine
> On Apr 13, 2015 12:11 AM, "Hirav Gandhi" <[email protected]> wrote:
>
> Hi Wikimedia Analytics Team,
>
> My colleague Bharath and I are doing research on dynamic server allocation
> algorithms and we were looking for a suitable datasets to test our
> predictive algorithm on. We noticed that Wikimedia has an amazing data set
> of hourly page views, but we were looking for something a bit more
> granular, such as aggregated page requests to English Wikipedia on a minute
> by minute basis or second by second basis if possible.
>
> We are more than happy to pour through any raw data you might have that
> would help us calculate page requests at this granular level. Please let us
> know if it would be possible to get such data and if so how. Thank you in
> advance for your help.
>
> Best,
>
> Hirav Gandhi
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL:
> <https://lists.wikimedia.org/pipermail/analytics/attachments/20150413/a88287b6/attachment-0001.html>
>
> ------------------------------
>
> Message: 2
> Date: Mon, 13 Apr 2015 06:39:45 -0400
> From: Oliver Keyes <[email protected]>
> To: "A mailing list for the Analytics Team at WMF and everybody who
> has an interest in Wikipedia and analytics."
> <[email protected]>
> Cc: Bharath Sitaraman <[email protected]>
> Subject: Re: [Analytics] Page views on a more frequent than hourly
> basis
> Message-ID:
> <CAAUQgdDsnHd8s+ACL-XBtXBz6OO-T04CcJfnGfqwrYAV-=h...@mail.gmail.com>
> Content-Type: text/plain; charset=UTF-8
>
>
> Preeetty sure that Toby is on the analytics list, Pine. He's the
> director of analytics.
>
> Hirav: would you be looking for temporally /and/ contextually granular
> pageviews, i.e. "a view to X page at Y time", or just temporally
> granular, so "a view to a page on enwiki at X time"? If the latter
> you've got more of a shot, I suspect.
>
> On 13 April 2015 at 03:47, Pine W <[email protected]> wrote:
>
> Hi,
>
> This issue of pageview data granularity has been discussed before, and the
> answer has been that hourly is the smallest increment allowed to be revealed
> publicly, for privacy reasons.
>
> I believe that the person you will want to discuss your request with is
> Toby, who I have cc'd here.
>
> Pine
>
> On Apr 13, 2015 12:11 AM, "Hirav Gandhi" <[email protected]> wrote:
>
>
> Hi Wikimedia Analytics Team,
>
> My colleague Bharath and I are doing research on dynamic server allocation
> algorithms and we were looking for a suitable datasets to test our
> predictive algorithm on. We noticed that Wikimedia has an amazing data set
> of hourly page views, but we were looking for something a bit more granular,
> such as aggregated page requests to English Wikipedia on a minute by minute
> basis or second by second basis if possible.
>
> We are more than happy to pour through any raw data you might have that
> would help us calculate page requests at this granular level. Please let us
> know if it would be possible to get such data and if so how. Thank you in
> advance for your help.
>
> Best,
>
> Hirav Gandhi
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
>
>
> --
> Oliver Keyes
> Research Analyst
> Wikimedia Foundation
>
>
>
> ------------------------------
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
> End of Analytics Digest, Vol 38, Issue 21
> *****************************************
>
>
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>



-- 
Oliver Keyes
Research Analyst
Wikimedia Foundation

_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to