Oh, I also realized these files already has the data you need:

In http://dumps.wikimedia.org/other/pageviews/2015/2015-11/, files
projectviews-20151103-000000 through projectviews-20151103-230000

On Thu, Nov 5, 2015 at 9:47 AM, Dan Andreescu <[email protected]>
wrote:

> Oh, and in general, you can dump the results of queries to a location on
> stat1002 that rsyncs to a public place.  But we need people to be very
> careful with that so we usually want to go through code review for any code
> that does it.  Reportupdater is a tool that you can use to write SQL
> scripts or bash scripts which make the process of "publishing" data a
> little better.
>
> On Thu, Nov 5, 2015 at 9:45 AM, Dan Andreescu <[email protected]>
> wrote:
>
>> Max, there's a pageview API that we're not fully ready to announce
>> because we haven't finished the documentation but it works so I'll tell you
>> offline about it.  It has the data you're looking for in that query.
>>
>> Anyone else who is interested in the API - we're just finishing up docs
>> and synchronizing with a blog post, it won't be long now, the actual code
>> and infrastructure is stable.
>>
>> On Wed, Nov 4, 2015 at 7:50 PM, Max Semenik <[email protected]>
>> wrote:
>>
>>> Hey, I was wondering if it is possible to export the results of Hive
>>> queries to some world-readable place?
>>>
>>> What I'm trying to achieve: for my www portals work, I want the results
>>> of aggregation (SELECT project, sum(view_count) AS num FROM
>>> projectview_hourly WHERE year=2015 AND month=11 AND day=3 GROUP BY project)
>>> published somewhere in a machine-readable format. Ideally, this could be
>>> published externally (for example,
>>> https://stats.wikimedia.org/daily_pageviews.csv or whatever). If that
>>> is hard, making it somehow available on the cluster would suffice. What are
>>> the options for doing that?
>>>
>>> --
>>> Best regards,
>>> Max Semenik ([[User:MaxSem]])
>>>
>>> _______________________________________________
>>> Analytics mailing list
>>> [email protected]
>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>
>>>
>>
>
_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to