OK, I have some news: 1- Today I rewrote some parts of Kian and now it automatically chooses regulation parameter (lambda), thus predictions are more accurate. I wanted to push changes to the github but It seems my ssh has issues. It'll be there soon 2- (Important) I wrote a code that can find possible mistakes in Wikidata based on Kian. The code will be in github soon. Check out this link <http://tools.wmflabs.org/dexbot/possible_mistakes_fr.txt>. It's result from comparing French Wikipedia against Wikidata e.g. this line:
Q2994923: 1 (d), 0.257480420229 (w) [0, 0, 1, 2, 0] 1 (d) means Wikidata thinks it's a human 0.25... (w) means French Wikipedia thinks it's not a human (with 74.3% certainty) And if you check the link you can see it's a mistake in Wikidata. Please check other results and fix them. Tell me if you want this test to be ran from another language too. 3- I used Kian to import unconnected pages from French Wikipedia and created about 1900 items. The result is here <http://tools.wmflabs.org/dexbot/kian_res_fr.txt> and please check if anything in this list is not human and tell me and I run some error analysis. Best On Mon, Mar 16, 2015 at 9:50 PM, Amir Ladsgroup <[email protected]> wrote: > Thanks Sjoerddebruin, > > I'm working on this so I can write a system to find possible mistakes and > it will find and report mistakes made by Dexbot or others. It works more > precise as the time goes by. > > > Best > > On Sun, Mar 15, 2015 at 8:51 PM Sjoerd de Bruin <[email protected]> > wrote: > >> Now the gender game is working again, I encountered there were a lot of >> issues with the following category: https://nl. >> wikipedia.org/wiki/Categorie:Danceact >> >> As you can see, it's about musical groups but they all were marked as >> human. >> >> Greetings, >> >> Sjoerd de Bruin >> [email protected] >> >> Op 14 mrt. 2015, om 14:18 heeft Amir Ladsgroup <[email protected]> het >> volgende geschreven: >> >> >> I'm writing a parser so I can feed gender classification to Kian, It'll >> be done soon and you can use it :) >> >> On Sat, Mar 14, 2015 at 12:53 PM Sjoerd de Bruin <[email protected]> >> wrote: >> >>> Hm, the Wikidata Game is really slow. Magnus, if you read this: do you >>> know what's going on? I play the gender game with only nlwiki articles, but >>> it never loads. It was working yesterday with just 50 items, so it should >>> work now imo. >>> >>> Greetings, >>> >>> Sjoerd de Bruin >>> [email protected] >>> >>> Op 14 mrt. 2015, om 09:39 heeft Sjoerd de Bruin <[email protected]> >>> het volgende geschreven: >>> >>> >>> I've corrected two lists (Lijst van voorzitters van de SER and Lijst van >>> voorzitters van de WRR) and a music group (Viper (Belgische danceact)). >>> Will play the gender game the next few days to check them. >>> >>> Greetings, >>> >>> Sjoerd de Bruin >>> [email protected] >>> >>> Op 14 mrt. 2015, om 00:51 heeft Amir Ladsgroup <[email protected]> >>> het volgende geschreven: >>> >>> Sorry for the late answer, got busy in the real world. >>> This is the result for unconnected pages of Dutch Wikipedia. >>> http://tools.wmflabs.org/dexbot/kian_res_nl.txt >>> Please check and tell me when they are not human. I'm producing result >>> for empty items related to Dutch Wikipedia. >>> >>> On Thu, Mar 12, 2015 at 2:58 PM Amir Ladsgroup <[email protected]> >>> wrote: >>> >>>> Sure, tonight it will be done. >>>> >>>> Best >>>> >>>> On Thu, Mar 12, 2015 at 2:08 AM, Sjoerd de Bruin <[email protected]> >>>> wrote: >>>> >>>>> I'm ready for it! All existing humans on nlwiki have a gender now, so >>>>> it's easy to review this batch. Bring it on. >>>>> >>>>> Op 11 mrt. 2015, om 22:14 heeft Maarten Dammers <[email protected]> >>>>> het volgende geschreven: >>>>> >>>>> Hi Amir, >>>>> >>>>> Amir Ladsgroup schreef op 9-3-2015 om 22:40: >>>>> >>>>> Result for English Wikipedia (6366 articles classified as human) >>>>> <https://tools.wmflabs.org/dexbot/kian_res_en.txt> >>>>> >>>>> Sounds like fun! Can you run it on the Dutch Wikipedia too? On >>>>> https://tools.wmflabs.org/multichill/queries/wikidata/ >>>>> noclaims_nlwiki.txt I have a list of items without claims (linking >>>>> them to other items). >>>>> >>>>> Maarten >>>>> _______________________________________________ >>>>> Wikidata-l mailing list >>>>> [email protected] >>>>> https://lists.wikimedia.org/mailman/listinfo/wikidata-l >>>>> >>>>> >>>>> >>>>> _______________________________________________ >>>>> Wikidata-l mailing list >>>>> [email protected] >>>>> https://lists.wikimedia.org/mailman/listinfo/wikidata-l >>>>> >>>>> >>>> >>>> >>>> -- >>>> Amir >>>> >>>> _______________________________________________ >>> Wikidata-l mailing list >>> [email protected] >>> https://lists.wikimedia.org/mailman/listinfo/wikidata-l >>> >>> >>> _______________________________________________ >>> Wikidata-l mailing list >>> [email protected] >>> https://lists.wikimedia.org/mailman/listinfo/wikidata-l >>> >>> _______________________________________________ >>> Wikidata-l mailing list >>> [email protected] >>> https://lists.wikimedia.org/mailman/listinfo/wikidata-l >>> >> _______________________________________________ >> Wikidata-l mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/wikidata-l >> >> _______________________________________________ >> Wikidata-l mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/wikidata-l >> > -- Amir
_______________________________________________ Wikidata-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-l
