Hey, that's a great project. The current situation with ProteinBoxBot is not
ideal. We don't want to create a separate template for every protein. I suppose
they do that now to make the articles slightly shorter, but Wikidata will allow
us to accomplish the same thing without the confusing extra templates and
ultimately to synchronize the infobox data across different language
Wikipedias. With Wikidata the idea is that you will eventually just be able to
write {{GNF Protein box}} at the top of the protein article and you will get
what you want. If you look at our current list of biology properties though,
there are still some that need to be added to fully support protein
box.http://www.wikidata.org/wiki/Wikidata:List_of_properties#Biology_.2F_Biologie_.2F_Biologie_.2F_.D0.91.D0.B8.D0.BE.D0.BB.D0.BE.D0.B3.D0.B8.D1.8F
You can start adding data for the current properties already though. I just
added the Entrez ID to adenosine deaminase. http://www.wikidata.org/wiki/Q356093
A list of human proteins on Wikipedia is here.
http://en.wikipedia.org/wiki/Category:Human_proteins You can access the list
through code by using a MediaWiki API request like this:
http://en.wikipedia.org/w/api.php?action=query&list=categorymembers&cmtitle=Category:Human_proteins&format=json&cmlimit=500The
use cmcontinue to get them in pages of 500 at a time until you have all 10000.
To go from article title to Wikidata item use something like
this:http://wikidata.org/w/api.php?action=wbgetentities&sites=enwiki&titles=Adenosine%20deaminase&languages=en&format=json
Then you'll want to propose a bot to automatically add properties.
http://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Bot
The property inclusion syntax refers to the item on the article that
transcludes the template. So all of the extra templates that ProteinBoxBot is
currently making would all have identical wiki markup if it was fully using
Wikidata.Date: Tue, 16 Apr 2013 19:46:25 +0530
From: [email protected]
To: [email protected]
Subject: [Wikidata-l] Regarding wikidata inclusion syntax
Hi,
I am Chinmay Naik, an undergrad in CS at Bangalore,India. I hope to contribute
to CrowsourcingBiology through GSOC 2013. The project i am interested is
"convert bot to write to wikidata". Currently the bot does this database
---> wikipedia. The proposal is this database ----> wikidata ----->
wikipedia.
I have the basic idea of how to capture gene info onto wikidata. I was
wondering about the inclusion syntax of wikidata. I had a look at the current
inclusion of wikidata v0.3 meta. Some things are unclear to me. How best it is
to map wikidata items on to wiki articles.??
Currently, the bot uses templates. The template structure is
http://en.wikipedia.org/wiki/Template:GNF_Protein_box. The bot fills a various
gene templates http://en.wikipedia.org/wiki/Template:PBB/100. These gene
templates are included on gene wikipedia articles.
I tried to work out on the italian wikipedia. When i try to include a
template(mapped to wikidata item) onto an wiki article, the wikidata properties
were not displayed.
So is it not possible to include wikidata items in template namespace??
So transclusion where data is retreived from wikidata item not supported??
Kindly pardon me if this is not the right mailing list to post to about this.
Thanks in advance,
Chinmay
_______________________________________________
Wikidata-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-l
_______________________________________________
Wikidata-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-l