On Tue, 31 Oct 2017, at 04:04 PM, Nicolas VIGNERON wrote:
> 2017-10-31 5:21 GMT+01:00 Sam Wilson <[email protected]>:
>> I've just been experimenting with adding a quick-statements generator to
>>  https://tools.wmflabs.org/ws-search — search for something, with 'linked
>>  to wikidata = no', and then each row of the results has a popup with
>>  code you can copy and paste into quickstatements.
>>  
>>  I don't know if that sort of thing is useful. It's not working with many
>>  fields yet, but I'm finding it faster than entering the basics manually.
>>  
>>  —sam.
> 
> Interesting.
> 
> It's a good start but I see a lot of things that need heavy improvements. I 
> tried for br and fr Wikisources.
> 
> Most importantly not everything on Wikisource is an edition (there is 
> disambig page and editions pages, for instance fr:s:Accroupissements a a work 
> not an edition, it should detect that and change the P31 accordingly)
> Then the description is put on « Dfr  "edition of Accroupissements by , 
> Arthur Rimbaud" », it should be Den not Dfr, there shouldn't be a comma here, 
> and ideally we should add description in multiple languages (maybe with an 
> external bot ?).
> Some others data could probably easily be added 
> (P996[https://www.wikidata.org/wiki/Property:P996] and 
> P1957[https://www.wikidata.org/wiki/Property:P1957] at least and maybe 
> P1476[https://www.wikidata.org/wiki/Property:P1476] too)
> 

Yes it's certainly a first draft!! :-) Thanks for trying it out.

With the disambig pages, can you suggest how to detect them?

Ah, there's a couple of other bugs here:

The page https://fr.wikisource.org/wiki/Accroupissements actually
already has a WIkidata ID, but the ws-search database didn't know about
it :-( probably because it was failing for a while on some weird
problems. I've re-run the scraper, and now that work is showing up with
it's proper Q-number:
https://tools.wmflabs.org/ws-search/?title=Accroupissements&author=&lang=fr

The idea with the quickstatements is that it'll only show it for works
that are *not yet* linked to wikidata. This is where the disambig
problem comes in, because there doesn't seem to be a simple way to
determine what's an edition and what's a work without resorting to
Wikidata. We could look at categories? Is it a truth universally
acknowledged that pages in the categories defined as
https://www.wikidata.org/wiki/Q15939659 are all disambiguation pages?
That could work... 

:-) Thanks again! I'll fix up the i18n stuff. The tool isn't doing
anything at all yet, but I could register it on translatewiki if it's
thought to be useful enough?

—sam

_______________________________________________
Wikisource-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikisource-l

Reply via email to