Oh, and in general, you can dump the results of queries to a location on stat1002 that rsyncs to a public place. But we need people to be very careful with that so we usually want to go through code review for any code that does it. Reportupdater is a tool that you can use to write SQL scripts or bash scripts which make the process of "publishing" data a little better.
On Thu, Nov 5, 2015 at 9:45 AM, Dan Andreescu <[email protected]> wrote: > Max, there's a pageview API that we're not fully ready to announce because > we haven't finished the documentation but it works so I'll tell you offline > about it. It has the data you're looking for in that query. > > Anyone else who is interested in the API - we're just finishing up docs > and synchronizing with a blog post, it won't be long now, the actual code > and infrastructure is stable. > > On Wed, Nov 4, 2015 at 7:50 PM, Max Semenik <[email protected]> wrote: > >> Hey, I was wondering if it is possible to export the results of Hive >> queries to some world-readable place? >> >> What I'm trying to achieve: for my www portals work, I want the results >> of aggregation (SELECT project, sum(view_count) AS num FROM >> projectview_hourly WHERE year=2015 AND month=11 AND day=3 GROUP BY project) >> published somewhere in a machine-readable format. Ideally, this could be >> published externally (for example, >> https://stats.wikimedia.org/daily_pageviews.csv or whatever). If that is >> hard, making it somehow available on the cluster would suffice. What are >> the options for doing that? >> >> -- >> Best regards, >> Max Semenik ([[User:MaxSem]]) >> >> _______________________________________________ >> Analytics mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/analytics >> >> >
_______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
