If the page redirect titles exist in Wikipedia, they are valid in Wikidata as data, regardless of what they represent in *your view* of "quality". If cleanup needs to be done, it should be done in the context of the source first. Evaluating the value of a specific "alias" to a Wikidata item is a judgment that should be based entirely on a *referenceable* data source.
Wikidata aliases (as well as descriptions and preferred labels) are completely arbitrary and unreferenced and, in my judgment, worthless without a primary source or a clearly defined semantic relationship. Judgmental curation of Wikidata is, in fact, not that useful. Wikidata should simply seek to represent data *as it exists* (errors or not) in the primary source.

Furthermore, you apparently do not see why skos:hiddenLabel exists. Whether you feel it is worthwhile is not relevant to its primary function, which is to facilitate searching (see https://www.w3.org/2012/09/odrl/semantic/draft/doco/skos_hiddenLabel.html). And it is not difficult to argue that searching in Wikidata could use improvement.

On 16 March 2016 at 13:00, <[email protected]> wrote:

> Message: 1
> Date: Tue, 15 Mar 2016 16:49:40 +0000
> From: Lydia Pintscher <[email protected]>
> To: [email protected]
> Subject: Re: [Wikidata-tech] Wikipedia Page Redirect Titles in Wikidata
>
> On Sat, Mar 12, 2016 at 2:14 PM Christopher Johnson <[email protected]> wrote:
>
> > Hi,
> >
> > I am developing a scientific terms thesaurus and have discovered that
> > existing Wikipedia "page redirect titles" provide a useful way to resolve
> > an odd or archaic form to a "canonical" term label as it is represented by
> > the Wikipedia page title (aka Wikidata "sitelink"). For example,
> >
> > https://en.wikipedia.org//w/api.php?action=query&format=xml&prop=redirects&titles=universe
> >
> > In Wikidata, these "page redirect titles" are not represented in the data
> > model except very inconsistently and sparsely as skos:altLabel (or
> > "alias"). My use case is that I would like to be able to query Wikidata
> > for these page redirect titles in order to resolve odd multilingual
> > names to a single concept.
> >
> > My question is: if I were to create a bot that imported all "page
> > redirect titles" for a given sitelink and created them with the
> > skos:altLabel property en masse, is this a valid semantic relationship?
> > Or should it rather be represented as ?sitelink owl:sameAs <page redirect
> > URI>? Or both?
> >
> > Furthermore, in some cases (e.g. misspellings), skos:hiddenLabel may be
> > more appropriate, but this has no definition in the data model. There
> > potentially would be a lot of clutter in the UI without a hiddenLabel alias
> > property. Also, there are no types for page redirects in Wikipedia, afaik.
> >
> > Additional value for the searching in the Wikidata UI could probably be
> > obtained from indexing these alternate page titles as well.
>
> There are several points to address:
> 1) Should redirects from Wikipedia be imported as aliases on Wikidata? No.
> This has been done before and created a massive amount of cleanup work
> because the redirects contained a lot of not meaningful misspellings and
> more. Please do not import them to Wikidata without approval through the
> bot approval process and clear quality control.
> 2) Should we allow more fine-grained distinction between real aliases and
> misspellings in the UI and datamodel? No. I don't believe this is worth the
> complexity and resulting discussions/edit wars and more.
>
> Cheers
> Lydia
> --
> Lydia Pintscher - http://about.me/lydia.pintscher
> Product Manager for Wikidata
> Wikimedia Deutschland e.V.
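
For anyone who wants to look at the redirect titles under discussion, here is a minimal Python sketch of the API query quoted above. It assumes the standard MediaWiki `action=query` / `prop=redirects` JSON response layout; the function names and the sample response are hypothetical and hand-written for illustration, and continuation of long result sets is not handled.

```python
from urllib.parse import urlencode

API_ENDPOINT = "https://en.wikipedia.org/w/api.php"


def build_redirects_url(title):
    """Build a MediaWiki API URL that lists redirects pointing at `title`."""
    params = {
        "action": "query",
        "format": "json",
        "prop": "redirects",
        "titles": title,
        "rdlimit": "max",  # ask for as many redirects per request as allowed
    }
    return API_ENDPOINT + "?" + urlencode(params)


def redirect_titles(response):
    """Map each canonical page title to the list of its redirect titles.

    `response` is the decoded JSON of a prop=redirects query (default
    format, where "pages" is a dict keyed by page id).
    """
    result = {}
    pages = response.get("query", {}).get("pages", {})
    for page in pages.values():
        redirects = page.get("redirects", [])
        result[page["title"]] = [r["title"] for r in redirects]
    return result


# Hypothetical, hand-written response; the ids and titles are illustrative,
# not real API output.
sample_response = {
    "query": {
        "pages": {
            "1": {
                "pageid": 1,
                "ns": 0,
                "title": "Universe",
                "redirects": [
                    {"pageid": 2, "ns": 0, "title": "The Universe"},
                    {"pageid": 3, "ns": 0, "title": "Universes"},
                ],
            }
        }
    }
}

if __name__ == "__main__":
    print(build_redirects_url("universe"))
    print(redirect_titles(sample_response))
```

Whether each resulting title would then be treated as skos:altLabel or skos:hiddenLabel is exactly the open question in this thread; the sketch only gathers the source data.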
_______________________________________________
Wikidata-tech mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-tech
