OK, I have some news:
1- Today I rewrote some parts of Kian, and it now automatically chooses the
regularization parameter (lambda), so predictions are more accurate. I wanted
to push the changes to GitHub, but it seems my SSH setup has issues. They'll
be there soon.
2- (Important) I wrote code that finds possible mistakes in Wikidata
based on Kian. The code will be on GitHub soon. Check out this link
<http://tools.wmflabs.org/dexbot/possible_mistakes_fr.txt>. It's the result
of comparing French Wikipedia against Wikidata, e.g. this line:

Q2994923: 1 (d), 0.257480420229 (w) [0, 0, 1, 2, 0]

1 (d) means Wikidata thinks it's a human

0.25... (w) means French Wikipedia thinks it's not a human (with 74.3%
certainty)

And if you check the link you can see it's a mistake in Wikidata. Please
check other results and fix them.

Tell me if you want this test to be run for another language too.
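
To make the line format in point 2 concrete, here is a minimal sketch in Python of how one line of the report could be read. It is not the Kian code (that is not on GitHub yet). The names `Report`, `parse_line`, `disagreement` and `not_human_certainty` are made up for illustration, the 0.5 cut-off is an assumption, and the meaning of the bracketed numbers is not explained in this thread, so they are only carried along as-is.

```python
import re
from typing import List, NamedTuple


class Report(NamedTuple):
    item: str            # Wikidata item ID, e.g. "Q2994923"
    wikidata: int        # the (d) value: 1 if Wikidata says the item is a human
    score: float         # the (w) value: score derived from the Wikipedia article
    extra: List[int]     # bracketed numbers; their meaning is not stated in the thread


# Matches lines such as: Q2994923: 1 (d), 0.257480420229 (w) [0, 0, 1, 2, 0]
LINE_RE = re.compile(r"^(Q\d+): ([01]) \(d\), ([0-9.]+) \(w\) \[(.*)\]$")


def parse_line(line: str) -> Report:
    """Parse one line of the possible-mistakes report."""
    match = LINE_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognised line: %r" % line)
    extra = [int(part) for part in match.group(4).split(",") if part.strip()]
    return Report(match.group(1), int(match.group(2)), float(match.group(3)), extra)


def disagreement(report: Report, threshold: float = 0.5) -> bool:
    """True when the Wikidata value and the score fall on opposite sides.

    The 0.5 threshold is an assumption, not something stated in the thread.
    """
    predicted_human = report.score >= threshold
    return predicted_human != bool(report.wikidata)


def not_human_certainty(report: Report) -> float:
    """Certainty that the item is not a human, read as 1 - score."""
    return 1.0 - report.score


r = parse_line("Q2994923: 1 (d), 0.257480420229 (w) [0, 0, 1, 2, 0]")
print(r.item, disagreement(r), round(100 * not_human_certainty(r), 1))
# prints: Q2994923 True 74.3
```

Reading the certainty as 1 - score matches the 74.3% quoted above for a score of 0.257.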

3- I used Kian to import unconnected pages from French Wikipedia and
created about 1900 items. The result is here
<http://tools.wmflabs.org/dexbot/kian_res_fr.txt>. Please check whether
anything in this list is not a human and tell me, so I can run some error
analysis.
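
On point 1, for anyone wondering what "automatically chooses lambda" can look like: one common approach is to train with several candidate values and keep the one that does best on held-out data. The sketch below illustrates that idea only. It is not Kian's actual code, it swaps in a small L2-regularized logistic regression instead of Kian's model, and the data, the candidate list and all function names are invented for the example.

```python
import math
import random


def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train(X, y, lam, steps=300, lr=0.5):
    """Fit logistic regression by batch gradient descent.

    Objective: mean log loss + (lam / 2) * ||w||^2 (bias is not regularized).
    """
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(steps):
        gw, gb = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                gw[j] += err * xi[j]
            gb += err
        w = [wj - lr * (gj / n + lam * wj) for wj, gj in zip(w, gw)]
        b -= lr * gb / n
    return w, b


def log_loss(X, y, w, b):
    """Mean log loss (without the penalty term) on a data set."""
    eps = 1e-12
    total = 0.0
    for xi, yi in zip(X, y):
        p = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b)
        p = min(max(p, eps), 1.0 - eps)
        total -= yi * math.log(p) + (1 - yi) * math.log(1.0 - p)
    return total / len(X)


def choose_lambda(X_train, y_train, X_val, y_val, candidates):
    """Return the candidate lambda with the lowest validation loss."""
    scored = []
    for lam in candidates:
        w, b = train(X_train, y_train, lam)
        scored.append((log_loss(X_val, y_val, w, b), lam))
    return min(scored)[1]


def make_data(rng, n):
    # Synthetic, made-up data: the label depends on the first two features.
    X, y = [], []
    for _ in range(n):
        x = [rng.gauss(0, 1) for _ in range(3)]
        X.append(x)
        y.append(1 if x[0] + 0.5 * x[1] + rng.gauss(0, 0.5) > 0 else 0)
    return X, y


rng = random.Random(0)
X_train, y_train = make_data(rng, 80)
X_val, y_val = make_data(rng, 40)
candidates = [0.0, 0.001, 0.01, 0.1, 1.0]
best = choose_lambda(X_train, y_train, X_val, y_val, candidates)
print("chosen lambda:", best)
```

Picking lambda on a separate validation set rather than on the training set matters here, because the training loss alone would always favour the smallest penalty.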


Best



On Mon, Mar 16, 2015 at 9:50 PM, Amir Ladsgroup <[email protected]> wrote:

> Thanks Sjoerddebruin,
>
> I'm working on this so I can write a system that finds possible mistakes
> and reports them, whether they were made by Dexbot or by others. It gets
> more precise as time goes by.
>
>
> Best
>
> On Sun, Mar 15, 2015 at 8:51 PM Sjoerd de Bruin <[email protected]>
> wrote:
>
>> Now that the gender game is working again, I found a lot of issues with
>> the following category:
>> https://nl.wikipedia.org/wiki/Categorie:Danceact
>>
>> As you can see, it's about musical groups but they all were marked as
>> human.
>>
>> Greetings,
>>
>> Sjoerd de Bruin
>> [email protected]
>>
>> On 14 Mar 2015, at 14:18, Amir Ladsgroup <[email protected]> wrote:
>>
>>
>> I'm writing a parser so I can feed gender classification to Kian. It'll
>> be done soon and you can use it :)
>>
>> On Sat, Mar 14, 2015 at 12:53 PM Sjoerd de Bruin <[email protected]>
>> wrote:
>>
>>> Hm, the Wikidata Game is really slow. Magnus, if you read this: do you
>>> know what's going on? I play the gender game with only nlwiki articles, but
>>> it never loads. It was working yesterday with just 50 items, so it should
>>> work now imo.
>>>
>>> Greetings,
>>>
>>> Sjoerd de Bruin
>>> [email protected]
>>>
>>> On 14 Mar 2015, at 09:39, Sjoerd de Bruin <[email protected]> wrote:
>>>
>>>
>>> I've corrected two lists (Lijst van voorzitters van de SER and Lijst van
>>> voorzitters van de WRR) and a music group (Viper (Belgische danceact)).
>>> Will play the gender game the next few days to check them.
>>>
>>> Greetings,
>>>
>>> Sjoerd de Bruin
>>> [email protected]
>>>
>>> On 14 Mar 2015, at 00:51, Amir Ladsgroup <[email protected]> wrote:
>>>
>>> Sorry for the late answer, got busy in the real world.
>>> This is the result for unconnected pages of Dutch Wikipedia.
>>> http://tools.wmflabs.org/dexbot/kian_res_nl.txt
>>> Please check and tell me which ones are not human. I'm now producing
>>> results for empty items related to Dutch Wikipedia.
>>>
>>> On Thu, Mar 12, 2015 at 2:58 PM Amir Ladsgroup <[email protected]>
>>> wrote:
>>>
>>>> Sure, tonight it will be done.
>>>>
>>>> Best
>>>>
>>>> On Thu, Mar 12, 2015 at 2:08 AM, Sjoerd de Bruin <[email protected]>
>>>> wrote:
>>>>
>>>>> I'm ready for it! All existing humans on nlwiki have a gender now, so
>>>>> it's easy to review this batch. Bring it on.
>>>>>
>>>>> On 11 Mar 2015, at 22:14, Maarten Dammers <[email protected]> wrote:
>>>>>
>>>>>  Hi Amir,
>>>>>
>>>>> Amir Ladsgroup wrote on 9-3-2015 at 22:40:
>>>>>
>>>>> Result for English Wikipedia (6366 articles classified as human)
>>>>> <https://tools.wmflabs.org/dexbot/kian_res_en.txt>
>>>>>
>>>>>  Sounds like fun! Can you run it on the Dutch Wikipedia too? On
>>>>> https://tools.wmflabs.org/multichill/queries/wikidata/noclaims_nlwiki.txt
>>>>> I have a list of items without claims (linking them to other items).
>>>>>
>>>>> Maarten
>>>>>  _______________________________________________
>>>>> Wikidata-l mailing list
>>>>> [email protected]
>>>>> https://lists.wikimedia.org/mailman/listinfo/wikidata-l
>>>>>
>>>>
>>>>
>>>> --
>>>> Amir
>>>>
>


-- 
Amir
