On Wed, Apr 24, 2013 at 9:17 PM, Dario Taraborelli <
[email protected]> wrote:

> Dave,
>
> thanks for sharing this, the referral data is particularly fascinating. I
> mentioned during the quarterly review that I'd love to get a better sense
> of (1) the proportion of requests in the mobile request logs lacking a
> referral, (2) the possible causes of this gap and (3) to what extent these
> missing entries introduce a bias in the referral ranking.
>
> The 3rd most popular query (according to your dumps) is ビッグダディ (japanese
> for "Big Daddy"), which presumably refers to this guy:
> http://metro.co.uk/2013/03/20/giant-japanese-spider-crab-big-daddy-arrives-at-blackpool-sea-life-centre-3550751/
> What's interesting is that there's no such entry on the japanese Wikipedia
> and I am baffled that people may have landed on the website via a search
> engine query for a non-existing article.
> Do you have an explanation for this or am I misinterpreting what you mean
> by search query?
>

There *is* an article on this on ja.wiki :) It may have been renamed since
then, but it's still the 2nd Google hit for ビッグダディ:
http://ja.wikipedia.org/wiki/%E7%97%9B%E5%BF%AB!%E3%83%93%E3%83%83%E3%82%B0%E3%83%80%E3%83%87%E3%82%A3


>
> Dario
>
> On Apr 24, 2013, at 8:40 PM, David Schoonover <[email protected]> wrote:
>
> Hiya all,
>
> As promised earlier today in the Analytics weekly showcase, I've got a few
> interesting bits of data to share from playing with the new Mobile Site
> Sessions dataset.
>
>
> # Visits to Mobile Site, 4/21/2013
>
> - Total Visits:                             51,624,103
> - Unique Visitors:                          37,736,120
> - Total Pageviews:                         104,972,033
> - Avg Pageviews per Session:                    2.0334
> - Max Pageviews in one Session:                141,882
>
> ## Standard Site
> - Visits:                                   51,603,221
> - Unique Visitors:                          37,723,188
> - Pageviews:                               104,910,382
> - Avg Pageviews per Session:                     2.033
>
> ## Alpha Site
> - Visits:                                          986
> - Unique Visitors:                                 822
> - Pageviews:                                     7,087
> - Avg Pageviews per Session:                     7.188
>
> ## Beta Site
> - Visits:                                       19,896
> - Unique Visitors:                              16,235
> - Pageviews:                                    54,564
> - Avg Pageviews per Session:                     2.742
>
>
> ## Notes
> - A session (or "visit") is defined as all activity with less than 30
> minutes between each hit. Intuitively speaking, a session ends when the
> user hasn't done anything in 30m.
> - As we do not set visitor_id cookies for all users, the "unique visitors"
> metric was calculated using hash(ip_address + users_agent) as visitor_id.
> - This job looked at all requests to the mobile site on 4/21/2013, which
> is 75.17 GB of request logs.
> - The job took ~17 minutes to process the day into 15.3 GB of sessions.
> - The summary above took maybe 10 minutes to set up/write in Hive, and the
> job took maybe 7 minutes.
>
>
> In addition to that summary, I ran a few jobs on the entry_referer field
> -- the URL that referred the user to us when the session started. Obvious
> caveats: this is only one day of data, and it's only the mobile site. Draw
> conclusions with care.
>
> First, I pulled out the top referring domains. It's mostly as you'd expect
> -- search engines -- though you'll also note that several Wikipedia mobile
> sites show up. My working hypothesis is that people don't tend to close
> tabs on smartphones; when they later come back, it is often to an open
> Wikipedia tab: clicking a link or perform a search means the referrer is
> still us.
>
> Since -- as expected -- so much of the data pertained to search engines, I
> also calculated the top search queries and top keywords that sent people to
> us. (For keywords, I've filtered out common "stop words": de, of, in, is,
> la, and, el, es, to, en, di, los, le, da, se, las, les, il, du, a, i, o, y,
> e.) In both, you see the predictable: lots of searches for porn, for
> "facebook", for "wiki", etc. But you also see a few things that surprised
> me:
>
> - Tons of Japanese. Japan is the most mobile-enabled country in the world
> so I guess we should have expected to see many searches in Japanese show up
> in the top queries. I've left them URL-encoded in the results -- you'll see
> them as weird lines with % in them.
>
> - Apparently people search for movies and TV so they can spoil their fun
> by reading about them on Wikipedia. Both of "movies" and "film" show up in
> the top keywords; Iron Man 1, 2, AND 3 all show up in the top search
> queries. I didn't expect this was a major use-case, but -- wikigroaning
> aside -- it's an interesting fact.
>
> I'm sure we're only scratching the surface here. This is an exciting
> dataset, and I'm sure there's lots more to learn!
>
> The full results:
> - Top Referring Entry Domains:
> http://stats.wikimedia.org/kraken-public/webrequest/mobile/views/sessions/mobile_sessions-2013-04-21-top_entry_domains.tsv
> - Top Referring Entry Search Queries:
> http://stats.wikimedia.org/kraken-public/webrequest/mobile/views/sessions/mobile_sessions-2013-04-21-top_entry_search_queries.tsv
> - Top Referring Entry Search Keywords:
> http://stats.wikimedia.org/kraken-public/webrequest/mobile/views/sessions/mobile_sessions-2013-04-21-top_entry_keywords.tsv
>
> Questions are welcome!
>
>
> --
> David Schoonover
> [email protected]
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
>


-- 
Maryana Pinchuk
Associate Product Manager, Wikimedia Foundation
wikimediafoundation.org
_______________________________________________
Mobile-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/mobile-l

Reply via email to