One mistake <https://www.wikidata.org/wiki/Q2963097> I just found via the
report. Article in French Wikipedia is about a French type of cheese but
connected to an article in Russian Wikipedia about a French playwriter.

Best
On Fri, Mar 20, 2015 at 3:59 AM Amir Ladsgroup <ladsgr...@gmail.com> wrote:

> Try to download it, or change the character encoding to utf-8 or unicode.
>
> And yes it's based on dumps. :)
>
> On Fri, Mar 20, 2015 at 3:51 AM Ricordisamoa <ricordisa...@openmailbox.org>
> wrote:
>
>>  Il 20/03/2015 01:11, Amir Ladsgroup ha scritto:
>>
>>  OK, I have some news:
>> 1- Today I rewrote some parts of Kian and now it automatically chooses
>> regulation parameter (lambda), thus predictions are more accurate. I wanted
>> to push changes to the github but It seems my ssh has issues. It'll be
>> there soon
>> 2- (Important) I wrote a code that can find possible mistakes in Wikidata
>> based on Kian. The code will be in github soon. Check out this link
>> <http://tools.wmflabs.org/dexbot/possible_mistakes_fr.txt>. It's result
>> from comparing French Wikipedia against Wikidata e.g. this line:
>>
>> Q2994923: 1 (d), 0.257480420229 (w) [0, 0, 1, 2, 0]
>>
>> 1 (d) means Wikidata thinks it's a human
>>
>> 0.25... (w) means French Wikipedia thinks it's not a human (with 74.3%
>> certainty)
>>
>> And if you check the link you can see it's a mistake in Wikidata. Please
>> check other results and fix them.
>>
>> Tell me if you want this test to be ran from another language too.
>>
>> 3- I used Kian to import unconnected pages from French Wikipedia and
>> created about 1900 items. The result is here
>> <http://tools.wmflabs.org/dexbot/kian_res_fr.txt> and please check if
>> anything in this list is not human and tell me and I run some error
>> analysis.
>>
>>
>>  Best
>>
>> Great!
>> Unfortunately, some files seem to have been published with the wrong
>> character encoding.
>> E.g. the first name shows up as "Échécrate" in my browsers.
>> _______________________________________________
>> Wikidata-l mailing list
>> Wikidata-l@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikidata-l
>>
>
_______________________________________________
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Reply via email to