Re: [Wikidata-l] Wikidata Transclusions

2012-06-14 Thread Nikola Smolenski

On 14/06/12 00:39, jmccl...@hypergrove.com wrote:

Transclusion is surely fundamental to wiki application design. The
[[wikidata]] proposal by contrast is a client-server API, such things an
artifact of the 20th century. What is the point of it here?

Ultimately the problem you're grappling with is not just just about
infoboxes, it's about *anything* other than article text that has
multilingual requirements. For instance, the same *pie chart* is to be
shared among wikipedias, the only difference being the graph's title,
key and other labels... [[wikidata]] is today doing format=table, later
other formats. That's alot to handle in an API.


I don't think Wikidata will ever do other formats. Wikidata will only 
export pure data.



So, it's highly advised the client-server API approach be scrapped. At a
minimum, it's outdated technology, for good reasons. Instead, wikidata
should *publish* infoboxes that are happily cached on wikidata servers.
That's the best performance that can possibly be had.


Wikidata publishing infoboxes and Wikipedias using them is again the 
client-server model. And if Wikidata publishes infoboxes, pie charts and 
the like, THAT will complicate the API, not the current approach. Not to 
mention that Wikipedias have and want to have different infobox designs.




smime.p7s
Description: S/MIME Cryptographic Signature
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Wikidata Transclusions

2012-06-14 Thread Gregor Hagedorn
While I agree that it is desirable to support simple, preformatted
Infoboxes that can, with minimal effort be re-used in a large number
of language versions of Wikipedia, I strongly disagree with the demand
to make this the only choice.

I think the present Wikidata approach to allow local Wikipedias to
customize their infoboxes by accessing wikidata properties
property-by-property is the right path.

The large Wikipedias with many editors have invested considerable
creative energy into making quite a large number of infoboxes
elaborate information containers. That includes formatting, images and
hand-crafted links in both the field name and the field value
side. Some values are expressed through svg graphics, other values
expressed through background color coding, etc.

Limiting the usability of Wikidata to plain vanilla infox boxes could
cause considerable resistance in these communities. And although small
Wikipedia will profit a lot from Wikidata, without the engagement of
editors from the large Wikipedias into curating Wikidata content, the
increased synergies will not happen.

Another issue is that (I believe that) Wikidata does not have a notion
of ordering properties. Correct? This is no issue for the present
Wikidata approach because infoboxes remain curated in each local
Wikipedia. However, in a centralized one size fits all approach,
replacing existing infoboxes where information is presented in a
logical order with an alphabetical property order would create huge
resistance (and would be a complex issue that Wikidata would have to
deal with, allowing property ordering and filtering).

I believe that Wikidata correctly aims to provide a smooth transition
path, where it is possible to obtain only part of an infobox from
wikidata and inject wikidata content into existing infobox layouts.

That said: I would encourage a third party contributor to try to
create a default Wikidata infobox generator in a way (extension
installable in multiple Wikipedias) that enables a wikipedia to
autocreate a good looking, plain vanilla infobox with minimal effort.

Gregor

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Wikidata Transclusions

2012-06-14 Thread Gerard Meijssen
Hoi,
Technically there is nothing stopping Wikidata from hosting multiple
infoboxes on the same subject. The big thing about such infoboxes is that
their layout is the same for all subjects in the same category. This does
not mean that every one looks the same but it does mean they follow a
consistent pattern.

When people talk about things like colours and stuff, it becomes highly
emotional but in the final analysis at this stage it is just more bike
shedding. It should be obvious that attributes like colour can be
overriden.. Given that info boxes will not be supported in the near future
...

The notion that people should curate the info boxes locally is something
that I do not subscribe to. Not being able to agree on data and sources is
the same as not being able to reach a neutral point of view. This does not
mean that multiple sources may not agree but equally it does not mean that
different sources cannot be maintained from within Wikidata.

Finally, when Wikidata provides data and info boxes, it does not mean that
any project is compelled to use it. As Wikidata matures, it will become
increasingly clear that it is not the best practice.
Thanks,
 GerardM

On 14 June 2012 12:11, Gregor Hagedorn g.m.haged...@gmail.com wrote:

 While I agree that it is desirable to support simple, preformatted
 Infoboxes that can, with minimal effort be re-used in a large number
 of language versions of Wikipedia, I strongly disagree with the demand
 to make this the only choice.

 I think the present Wikidata approach to allow local Wikipedias to
 customize their infoboxes by accessing wikidata properties
 property-by-property is the right path.

 The large Wikipedias with many editors have invested considerable
 creative energy into making quite a large number of infoboxes
 elaborate information containers. That includes formatting, images and
 hand-crafted links in both the field name and the field value
 side. Some values are expressed through svg graphics, other values
 expressed through background color coding, etc.

 Limiting the usability of Wikidata to plain vanilla infox boxes could
 cause considerable resistance in these communities. And although small
 Wikipedia will profit a lot from Wikidata, without the engagement of
 editors from the large Wikipedias into curating Wikidata content, the
 increased synergies will not happen.

 Another issue is that (I believe that) Wikidata does not have a notion
 of ordering properties. Correct? This is no issue for the present
 Wikidata approach because infoboxes remain curated in each local
 Wikipedia. However, in a centralized one size fits all approach,
 replacing existing infoboxes where information is presented in a
 logical order with an alphabetical property order would create huge
 resistance (and would be a complex issue that Wikidata would have to
 deal with, allowing property ordering and filtering).

 I believe that Wikidata correctly aims to provide a smooth transition
 path, where it is possible to obtain only part of an infobox from
 wikidata and inject wikidata content into existing infobox layouts.

 That said: I would encourage a third party contributor to try to
 create a default Wikidata infobox generator in a way (extension
 installable in multiple Wikipedias) that enables a wikipedia to
 autocreate a good looking, plain vanilla infobox with minimal effort.

 Gregor

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Wikidata Transclusions

2012-06-14 Thread Gregor Hagedorn
On 14 June 2012 12:33, Gerard Meijssen gerard.meijs...@gmail.com wrote:
 Finally, when Wikidata provides data and info boxes, it does not mean that
 any project is compelled to use it. As Wikidata matures, it will become
 increasingly clear that it is not the best practice.

that may be, but Wikidata needs a path to get there. I think the
ability to integrate wikidata into existing the infobox consensus of a
Wikipedia community is essential for adoption. Over time, centrally
provided infoboxes with ever increasing customization functionality
(order, selection, arrangement, linking properties to Wikipedia pages
explaining them, etc.) are desirable and at some point the evolution
of wiki data may conclude that this become the preferred method.

Gregor

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Wikidata Transclusions

2012-06-14 Thread jmcclure
 

Wikidata publishing infoboxes and Wikipedias using them is again
the client-server model. 

Not sure where this chestnut is coming from.
Transclusion is as close to client-server as my cooking is to being
gourmet! 

There's NO API. so I don't understand your commenst at all,
sorry. 

On 13.06.2012 23:48, Nikola Smolenski wrote: 

 On 14/06/12
00:39, jmcclure@hypergrove.comwrote:
 
 Transclusion is surely
fundamental to wiki application design. The [[wikidata]] proposal by
contrast is a client-server API, such things an artifact of the 20th
century. What is the point of it here? Ultimately the problem you're
grappling with is not just just about infoboxes, it's about *anything*
other than article text that has multilingual requirements. For
instance, the same *pie chart* is to be shared among wikipedias, the
only difference being the graph's title, key and other labels...
[[wikidata]] is today doing format=table, later other formats. That's
alot to handle in an API.
 
 I don't think Wikidata will ever do other
formats. Wikidata will only 
 export pure data.
 
 So, it's highly
advised the client-server API approach be scrapped. At a minimum, it's
outdated technology, for good reasons. Instead, wikidata should
*publish* infoboxes that are happily cached on wikidata servers. That's
the best performance that can possibly be had.
 
 Wikidata publishing
infoboxes and Wikipedias using them is again the 
 client-server model.
And if Wikidata publishes infoboxes, pie charts and 
 the like, THAT
will complicate the API, not the current approach. Not to 
 mention
that Wikipedias have and want to have different infobox designs.

 ___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Wikidata Transclusions

2012-06-14 Thread jmcclure
 

I strongly disagree with the demand to make this the only choice.
- 

Gregor, I'm a bit confused -- are you talking about the transclusion
design approach in this statement? because, if so, I'd think there'd be
a number of infobox styles that can be selected by an author on the
wikidata platform when 'building' the infobox page. The author can
transclude any number/any specific infobox(es) on their wikipedia page,
eg 

{{wikidata:en:Infobox:Some topic/some custom imfobox}} 

On
14.06.2012 03:11, Gregor Hagedorn wrote: 

 While I agree that it is
desirable to support simple, preformatted
 Infoboxes that can, with
minimal effort be re-used in a large number
 of language versions of
Wikipedia, I strongly disagree with the demand
 to make this the only
choice.
 
 I think the present Wikidata approach to allow local
Wikipedias to
 customize their infoboxes by accessing wikidata
properties
 property-by-property is the right path.
 
 The large
Wikipedias with many editors have invested considerable
 creative
energy into making quite a large number of infoboxes
 elaborate
information containers. That includes formatting, images and

hand-crafted links in both the field name and the field value

side. Some values are expressed through svg graphics, other values

expressed through background color coding, etc.
 
 Limiting the
usability of Wikidata to plain vanilla infox boxes could
 cause
considerable resistance in these communities. And although small

Wikipedia will profit a lot from Wikidata, without the engagement of

editors from the large Wikipedias into curating Wikidata content, the

increased synergies will not happen.
 
 Another issue is that (I
believe that) Wikidata does not have a notion
 of ordering properties.
Correct? This is no issue for the present
 Wikidata approach because
infoboxes remain curated in each local
 Wikipedia. However, in a
centralized one size fits all approach,
 replacing existing infoboxes
where information is presented in a
 logical order with an alphabetical
property order would create huge
 resistance (and would be a complex
issue that Wikidata would have to
 deal with, allowing property
ordering and filtering).
 
 I believe that Wikidata correctly aims to
provide a smooth transition
 path, where it is possible to obtain only
part of an infobox from
 wikidata and inject wikidata content into
existing infobox layouts.
 
 That said: I would encourage a third
party contributor to try to
 create a default Wikidata infobox
generator in a way (extension
 installable in multiple Wikipedias) that
enables a wikipedia to
 autocreate a good looking, plain vanilla
infobox with minimal effort.
 
 Gregor
 

___
 Wikidata-l mailing
list
 Wikidata-l@lists.wikimedia.org

https://lists.wikimedia.org/mailman/listinfo/wikidata-l

 ___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Wikidata Transclusions

2012-06-14 Thread Gregor Hagedorn
 Gregor, I'm a bit confused -- are you talking about the transclusion design
 approach in this statement?

Yes, in the sense that it demands to be the only access to wiki data
content in a Wikipedia.

 because, if so, I'd think there'd be a number of
 infobox styles that can be selected by an author on the wikidata platform
 when 'building' the infobox page. The author can transclude any number/any
 specific infobox(es) on their wikipedia page, eg

 {{wikidata:en:Infobox:Some topic/some custom imfobox}}

As I say, I look forward to see an infobox builder being developed,
but this is a serious challenge.

See, e.g. http://en.wikipedia.org/wiki/Tiger and take a look at the
hierarchical arrangement of properties, formatting of them, linking of
them (Headings link to concept explanations on the same-language
Wikipedia, with the link being different than the display text, Early
Pleistocene – Recent may be a time range, but the value is Early
Pleistocene and the link is Pleistocene; similarly each taxonomic
author - here only ony present, Linnaeus - should link on en.wikipedia
to en.wikipedia and de.wikipedia to de.wikipedia), expressing some
information with graphics, see Endangered (IUCN 3.1)[1], properties or
property values containing footnotes, the fact that a subspecies is
extinct being abbreviated with a symbol (†P. t. virgata) etc. Note
that the latter case is actually a nesting: it is a list of
subspecies, with each subspecies having multiple properties like
Scientific Name, Wikipedia Page name, extinction status - I am not
sure Wikidata plans to model such data in Phase 2 already.

My bottomline: Keep the wikidata project manageable and doable with
the available resources. Offer a method for Wikipedians to pick up
Wikidata content within the existing template infrastrukture.

But, desirable: ask a white paper which additional work would be
required to create centralized, plain vanilla infobox rendering as
well.

Would you be willing to create such a whitepaper? How much of the
above-shown Tiger example can be created centrally with a limited set
of facilities? How feature rich must the customization become?

Or are you proposing to simply use the existing template programming
with the only the difference that wikidata is the only mediawiki where
the properties can be accessed within templates? Much of my argument
assumes that you are looking for a non-template based infobox
renderer, I may be wrong there.

Gregor

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Wikidata Transclusions

2012-06-14 Thread jmcclure
 

Gregor says
Or are you proposing to simply use the existing
template programming
with the only the difference that wikidata is the
only mediawiki where
the properties can be accessed within templates?
Much of my argument
assumes that you are looking for a non-template
based infobox
renderer, I may be wrong there.

I am proposing that, from
the perspective of the Tiger article author, I would log on to wikidata
and create a page called Infobox:Tiger using a Semantic Form named
Taxobox whose fields track to the args for the Taxobox template. On
the form, one would select whether a genus, familia, ordo, ... regnum is
represented by the page... assume genus was selected,
so.
{{Taxobox
| status = EN (DUBLIN CORE PROPERTY: LANGUAGE)
|
fossil_range = Early [[Pleistocene]] - Recent (DUBLIN CORE: COVERAGE)
|
status_system = iucn3.1 (DUBLIN CORE: FORMAT)
| trend = down (DUBLIN
CORE: FORMAT)
| status_ref =ref name=IUCN/ (DUBLIN CORE: IDENTIFIER)
|
image = Tiger in Ranthambhore.jpg (EMBEDDED {{IMAGE}} TEMPLATE CALL)
|
image_caption = A [[Bengal tiger]] (''P. tigris tigris'') in India's
[[Ranthambhore National Park]]. (PART OF {{IMAGE}} TEMPLATE CALL)
|
image_width = (PART OF {{IMAGE}} TEMPLATE CALL)
| regnum = [[Animal]]ia
(UNNECESSARY - LOOK IT UP VIA #ASK)
| phylum = [[Chordate|Chordata]]
(UNNECESSARY - LOOK IT UP VIA #ASK))
| classis = [[Mammal]]ia
(UNNECESSARY - LOOK IT UP VIA #ASK))
| ordo = [[Carnivora]] (UNNECESSARY
- LOOK IT UP VIA #ASK))
| familia = [[Felidae]] (UNNECESSARY - LOOK IT
UP VIA #ASK))
| genus = ''[[Panthera]]'' (DUBLIN CORE PROPERTY:
RELATION)
| species = 'P. tigris' (DUBLIN CORE PROPERTY:
TITLE)
| binomial = ''Panthera tigris'' (EMBEDDED {{BINOMIAL}} TEMPLATE
CALL)
| binomial_authority = ([[Carl Linnaeus|Linnaeus]], 1758) (PART OF
EMBEDDED {{BINOMIAL}} TEMPLATE CALL)
| synonyms = (DUBLIN CORE PROPERTY:
TITLE) 
center'Felis tigris' small[[Carl
Linnaeus|Linnaeus]], 1758/smallref name=Linn1758 / br
/
'Tigris striatus' small[[Nikolai Severtzov|Severtzov]],
1858/smallbr /
'Tigris regalis' small[[John Edward
Gray|Gray]], 1867/center
| range_map = Tiger_map.jpg (EMBEDDED
{{IMAGE}} TEMPLATE CALL)
| range_map_width = (PART OF EMBEDDED {{IMAGE}}
TEMPLATE CALL)
| range_map_caption = Tiger's historic range in ca. 1850
(pale yellow) and range in 2006 (in green).ref name=dinerstein07 /
|
subdivision_ranks = [[Subspecies]]
| subdivision = (UNNECESSARY - LOOK
IT UP VIA #ASK)
''[[Bengal tiger|P. t. tigris]]''br/
''[[Indochinese
tiger|P. t. corbetti]]''br /
''[[Malayan tiger|P. t. jacksoni]]''br
/
''[[Sumatran tiger|P. t. sumatrae]]''br /
''[[Siberian tiger|P. t.
altaica]]''br /
''[[South China tiger|P. t. amoyensis]]''br
/
†''[[Caspian tiger|P. t. virgata]]''br /
†''[[Bali tiger|P. t.
balica]]''br /
†''[[Javan tiger|P. t. sondaica]]''
}} 

One could
create [[en:Infobox:Tiger]] and [[de:Infobox:Tiger]] to handle language
differences. Alternatively there'd be only [[Infobox:Tiger]] that has
language-qualified string properties, e.g, Title^en contains english
title for the page vs Title^de that contains the German title, with the
#ask pulling the correct one given the wiki's language eg
|?Title^{{CONTENTLANG}} 

For links, use the same magic word eg


|genus={{CONTENTLANG}}:Panthera 

To be a bit fancier:


|genus={{#ifexist:{{CONTENTLANG}}:Panthera|{{CONTENTLANG}}:Panthera|en:Panthera}}


Now, the above is a traditional non-topicmap treatment without
provenance. Let's add provenance first: 

|genus={{Dublin
Core|value=Panthera|creator={{USERNAME}}|date={{TODAY}}|language=}}


Now lets add the topicmap orientation


|genus={{Topic|name=Panthera|creator={{USERNAME}}|date={{TODAY}}|language=}}


So there's certainly alternatives to look at. The basic theme though
is NOT to create a specialized factbox editor but rather is to use
Semantic Forms to capture the values of template args - values that can
be links, text, or template calls. And you're right, Gregor, the primary
way to access wikidata's triples for purposes of regular template
programming is to logon to wikidata. imho, that establishes wikidata as
a black-box, with no additional mechanisms/extensions loaded onto any
'transcluding' wikipedia, which I think is the ideal posture for
integrating wikidata resources into the wikipedias. 

re: the
whitepaper. Sure I'd be happy to put together a real demo. But as a
strugglin' contractor doing opensource dev since 1998, it'd be nice
if. 

Gotta run - john 

On 14.06.2012 09:29, Gregor Hagedorn
wrote:

 Gregor, I'm a bit confused -- are you talking about the
transclusion design approach in this statement?

 Yes, in the sense that
it demands to be the only access to wiki data
 content in a Wikipedia.


because, if so, I'd think there'd be a number of infobox styles that
can be selected by an author on the wikidata platform when 'building'
the infobox page. The author can transclude any number/any specific
infobox(es) on their wikipedia page, eg {{wikidata:en:Infobox:Some

Re: [Wikidata-l] Wikidata test wikis are getting spammed

2012-06-14 Thread Lydia Pintscher
On Thu, Jun 14, 2012 at 7:57 PM, Erik Moeller e...@wikimedia.org wrote:
 http://wikidata-test-client.wikimedia.de/wiki/Special:RecentChanges

 It might be best to limit editing, or try to rope stewards into
 running their anti-spambots there ...

Uff of course it could only be so long until spammers find the demo
too. We'll look into some measures against them tomorrow. One
possibility would of course be to just reset the wiki every night to a
pre-defined state.


Cheers
Lydia

-- 
Lydia Pintscher - http://about.me/lydia.pintscher
Community Communications for Wikidata

Wikimedia Deutschland e.V.
Obentrautstr. 72
10963 Berlin
www.wikimedia.de

Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] [[meta:wikitopics]] updated

2012-06-14 Thread Nadja Kutz
John McClure wrote:

Of course, thanks for the pointer. Yes, I'd agree that 19788's ontology be 
closely reviewed for inclusion. 19788:2 standardizes the  Dublin Core 
properties, the same I recommend for [[wikidata]] provenance data, the same 
slated for the [[wikidata]] ontology. But more to your point is that the entire 
ISO corpus would fit really well if it were viewed as a topic map whose topics 
and sub-topics can be referenced from  [[wikidata]] artifacts such as property 
definitions.

Hello John

Frankly speaking I don't see why one would want to use topic maps.
That is RDF triples are after an identification (canonical: elements with the 
same URI are identified)
 a labeled graph, here to be called the RDF graph. (I know that some people 
call the triples themselves
the RDF graph, but why use a second word (namely graph) for triples?
Triples are very trivial highly disconnected graph.).
If I want to connect certain nodes of that graph to a topic
I only need to supply these nodes with an extra triple which says 
(this node belongs to this topic, i.e. something like (node, 
belongsto,thistopic) ) or modify
the canonical identification map and the RDF graph will be a topic map or one 
has the case that the triples are 
already set out in topics that is for example 
if I have a set of triples with the same resource URI then upon canonical 
identification these are
a kind of topic map (with all legs pointing in one direction) or am I 
missing out something crucial? 

However if you start with topics, you have no canonical information about the 
internal structure of
a topic and in some cases you would need to artificially impose this in 
retrospect onto
the datastructure.
Like if your topic is members of a society , you have all the members and you 
would need some internal structure like a hierarchy
 then you would need to supply each member with a hierarchy classification 
(i.e. with extra data, which is usually different for each member). For the RDF 
case the person who gave you
the triples could have made a choice of order which could be given upon 
canonical identification.  I.e.  in principle the
internal structure depends on your identification map but there is a canonical 
one.
You can of course mimick a RDF triple with a topic map by choosing the topic to 
be the ressource and 
one leg of your topic as a property and the topic which is connected with 
this leg as the object, but the 
choice of a leg is not canonical if there is more than one leg. Only if you 
would make all legs of a topic map into triples you would have something like 
a canonical assignment. I find these differences important. But may be I have 
overseen something or misunderstood 
about topic maps (I read about this issue what I found scattered around in the 
internet so this is not so unprobable). 



I had this kind of discussion with people from deepa mehta 
http://www.deepamehta.de/
because they used topic maps, but sofar nobody there could convince me about 
the distinguished advantages of topic maps.
But the discussion was sofar rather brief.
The discussion was because we discussed to what extend it would be possible to 
merge a student project we 
had at HTW Berlin ( a collaboration platform for visualizing RDF data called 
Mimirix
http://www.daytar.de/art/MIMIRIX/) with deepa mehta, like for example
one could use at least the backend, which has already a layout for access 
control 
(the deepa mehta people told me that they haven't yet really attacked the issue
of access control) or one could use at lease the carefully designed client.


May be you have other arguments for topic maps, as said I might have missed out 
something.


I understand that there are other issues like the speed of adressability or 
direct access issues
but these are then rather an issue of the serialization I find.

So I didn't understand why for example the pregiven JSON structure of an 
JSONarray 
http://www.json.org/javadoc/org/json/JSONArray.html
is not used in JSON-LD 
http://json-ld.org/spec/latest/json-ld-syntax/#sets-and-lists
but thats another topic.

In the context of applications of ISO metadata you may want to read:
http://www.azimuthproject.org/azimuth/show/Examples+of+semantic+web+applications+and+environment
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l