Doh wrong file, i meant https://github.com/wikimedia/wikimedia-discovery-analytics/blob/master/oozie/popularity_score/popularity_score.hql
On Tue, Dec 6, 2016 at 11:52 AM, Erik Bernhardson < [email protected]> wrote: > popularity score is calculated once a week for the previous weeks data. > This score is basically (article page views) / (all article page views). > See https://github.com/wikimedia/wikimedia-discovery- > analytics/blob/master/hive/popularity_score/create_ > popularity_score_table.hql > > score is an old version of popularity score, we changed the name to make > it more distinct but it lingers in some places because we update documents > rather than completely replace them. Feel free to ignore. > > > > > On Tue, Dec 6, 2016 at 11:16 AM, Adam Baso <[email protected]> wrote: > >> +discovery list. >> >> On Tue, Dec 6, 2016 at 12:53 PM, Sumit Asthana <[email protected] >> > wrote: >> >>> Hi, >>> >>> I was extracting the Wikipedia cirrus dump of articles using >>> ?action=cirrusDump for feature extraction from articles and noticed two >>> keys "score" and "popularity_score". Can anyone tell what exactly do these >>> keys denote and how're they calculated? >>> >>> I'm curious to know the possible use cases of these scores in Machine >>> Learning as I'm currently processing articles. >>> >>> -- >>> -Thanks, >>> Sumit <http://mediawiki.org/wiki/User:Sumit.iitp> >>> >>> _______________________________________________ >>> AI mailing list >>> [email protected] >>> https://lists.wikimedia.org/mailman/listinfo/ai >>> >>> >> >> _______________________________________________ >> discovery mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/discovery >> >> >
_______________________________________________ discovery mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/discovery
