Hi Gerard,

We're diverging from the initial thread. I'll respond to one point, we
should take the rest of the discussion somewhere else. :)

On Mon, Apr 17, 2017 at 11:36 AM, Gerard Meijssen <gerard.meijs...@gmail.com
> wrote:

>
> So yes, your approach is good but like the translation tool it relies on
> English content.


​No. The approach and implementation is from any language to any language.
You can play with a very simplified version of the recommendations​ at
https://recommend.wmflabs.org/. You can choose any language as source or
destination.

Best,
Leila



> Thanks,
>        GerardM
>
>
>
> On 17 April 2017 at 18:40, Leila Zia <le...@wikimedia.org> wrote:
>
> > Hoi Gerard,
> >
> > On Mon, Apr 17, 2017 at 7:54 AM, Gerard Meijssen <
> > gerard.meijs...@gmail.com>
> > wrote:
> >
> > >
> > > When you analyse articles and find that some things are missing, it
> will
> > > help a lot when you can target these articles to the people who are
> > likely
> > > interested. When people interested in soccer learn that a soccer player
> > > died, they are more likely to edit even write an article.
> > >
> >
> > ​You are absolutely right. This is what we even tested in the article
> > creation recommendation experiment and you could see that providing
> > personalized recommendations (where personalization was on the basis of
> > matching editors interests based on their history of contributions​) does
> > better than random important recommendations. A few pointers for you:
> >
> > * Check out section 2.3 of the paper at https://arxiv.org/pdf/1604.
> > 03235.pdf to
> > see how this was done.
> > * I talk briefly about how we do the editor interest modeling at
> > https://youtu.be/lHbnvRwFC_A?t=20m44s
> >
> > In general, we have at least two ways for recommending to people what
> they
> > like to edit: one would be using the information in their past edit
> history
> > and building topical models that can help us learn what topics an editor
> is
> > interested in. The other is by asking the editor to provide some seeds of
> > interest to us. For example, we ask you to tell us what kind of article
> you
> > would edit, and we give you recommendations similar to the seed you
> > provide. Each have its own advantages and you sometimes have to mix the
> two
> > approaches (and more) to give the editor enough breadth and depth of
> topics
> > to choose from.
> >
> >
> > > The approach for finding a subject that could do with more attention is
> > one
> > > I applaud. When you want to do this across languages think Wikidata to
> > > define the area of interest for users. It will always include all the
> > > articles in all the languages. As you have seen with the Listeria
> lists,
> > > showing red links and Wikidata items is trivial.
> > >
> >
> > ​Yes, finding what is missing in a Wikipedia language by comparing
> language
> > editions is relatively easy, thanks to Wikidata. :) What is hard is
> ranking
> > these millions of missing articles in any language based on some notion
> of
> > importance. We developed a ranking system for the research I mentioned
> > above. You can read about it in Section 2.2 of the paper at
> > https://arxiv.org/pdf/1604.03235.pdf​. I talk about in less details at
> > https://youtu.be/lHbnvRwFC_A?t=16m58s. In a nutshell: we built a
> > prediction
> > model that aims to predict the number of pageviews the article would
> > receive had it existed in the destination language where it's missing
> > today. The higher this predicted number for a missing article in a
> > language, the more important it is to create it.
> >
> > Best,
> > Leila
> >
> >
> >
> > > Thanks,
> > >      Gerard
> > >
> > > On 17 April 2017 at 02:04, Leila Zia <le...@wikimedia.org> wrote:
> > >
> > > > Hi John,
> > > >
> > > > This may be of interest to you:
> > > >
> > > > We are working on building recommendation systems than can help
> editors
> > > > identify how to expand already existing articles in Wikipedia. This
> > > > includes but is not limited to identifying what sections are missing
> > from
> > > > an article, what citations, what images, infobox information, etc.
> This
> > > is
> > > > research in its early days, if you'd like to follow up with it please
> > > visit
> > > > https://meta.wikimedia.org/wiki/Research:Expanding_
> > > Wikipedia_stubs_across_
> > > > languages
> > > >
> > > > Best,
> > > > Leila
> > > >
> > > >
> > > > Leila Zia
> > > > Senior Research Scientist
> > > > Wikimedia Foundation
> > > >
> > > > On Sat, Apr 15, 2017 at 2:50 PM, John Erling Blad <jeb...@gmail.com>
> > > > wrote:
> > > >
> > > > > Are anyone doing any work on automated quality assurance of
> articles?
> > > Not
> > > > > the ORES-stuff, that is about creating hints from measured
> features.
> > > I'm
> > > > > thinking about verifying existence and completeness of citations,
> and
> > > > > structure of logical arguments.
> > > > >
> > > > > John
> > > > > _______________________________________________
> > > > > Wikimedia-l mailing list, guidelines at:
> https://meta.wikimedia.org/
> > > > > wiki/Mailing_lists/Guidelines and https://meta.wikimedia.org/
> > > > > wiki/Wikimedia-l
> > > > > New messages to: Wikimedia-l@lists.wikimedia.org
> > > > > Unsubscribe: https://lists.wikimedia.org/
> > mailman/listinfo/wikimedia-l,
> > > > > <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=
> unsubscribe>
> > > > _______________________________________________
> > > > Wikimedia-l mailing list, guidelines at: https://meta.wikimedia.org/
> > > > wiki/Mailing_lists/Guidelines and https://meta.wikimedia.org/
> > > > wiki/Wikimedia-l
> > > > New messages to: Wikimedia-l@lists.wikimedia.org
> > > > Unsubscribe: https://lists.wikimedia.org/
> mailman/listinfo/wikimedia-l,
> > > > <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>
> > > >
> > > _______________________________________________
> > > Wikimedia-l mailing list, guidelines at: https://meta.wikimedia.org/
> > > wiki/Mailing_lists/Guidelines and https://meta.wikimedia.org/
> > > wiki/Wikimedia-l
> > > New messages to: Wikimedia-l@lists.wikimedia.org
> > > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> > > <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>
> > >
> > _______________________________________________
> > Wikimedia-l mailing list, guidelines at: https://meta.wikimedia.org/
> > wiki/Mailing_lists/Guidelines and https://meta.wikimedia.org/
> > wiki/Wikimedia-l
> > New messages to: Wikimedia-l@lists.wikimedia.org
> > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> > <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>
> >
> _______________________________________________
> Wikimedia-l mailing list, guidelines at: https://meta.wikimedia.org/
> wiki/Mailing_lists/Guidelines and https://meta.wikimedia.org/
> wiki/Wikimedia-l
> New messages to: Wikimedia-l@lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>
>
_______________________________________________
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines and 
https://meta.wikimedia.org/wiki/Wikimedia-l
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
<mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>

Reply via email to