thalhamm added a subscriber: Lydia_Pintscher.
thalhamm added a comment.
@Lydia_Pintscher @Smalyshev Based on the Wikidata PageRank scores, my former colleague Steffen Thoma (KIT) and I developed a Wikidata autocomplete prototype based on Apache Solr. Please see here:
http://km.aifb.kit.edu
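To illustrate the idea of ranking autocomplete suggestions by PageRank score, here is a minimal in-memory sketch. It is not the Solr-based prototype linked above; it only shows prefix matching followed by ordering on a score. The labels, the score values, and the function names `build_index` and `autocomplete` are all made up for the example.

```python
from bisect import bisect_left


def build_index(scores):
    """Sort entries by lowercased label so prefix matches are contiguous."""
    return sorted((label.lower(), label, score) for label, score in scores.items())


def autocomplete(index, prefix, limit=3):
    """Return up to `limit` labels starting with `prefix`, highest score first."""
    prefix = prefix.lower()
    # A 1-tuple sorts before any longer tuple with the same first element,
    # so this finds the first entry whose key is >= prefix.
    start = bisect_left(index, (prefix,))
    matches = []
    for key, label, score in index[start:]:
        if not key.startswith(prefix):
            break
        matches.append((label, score))
    matches.sort(key=lambda match: match[1], reverse=True)
    return [label for label, _ in matches[:limit]]


# Hypothetical labels and scores, for illustration only.
example_scores = {
    "Berlin": 120.5,
    "Bern": 40.2,
    "Bergen": 15.7,
    "Boston": 98.1,
    "Berlin Wall": 33.0,
}
index = build_index(example_scores)
suggestions = autocomplete(index, "ber")
```

With these made-up scores, typing "ber" suggests "Berlin" first, because it has the highest score among the matching labels.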
thalhamm removed thalhamm as the assignee of this task.
TASK DETAIL
https://phabricator.wikimedia.org/T143424
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: thalhamm
Cc: Lydia_Pintscher, Smalyshev, thalhamm, thiemowmde, Sjoerddebruin, Glorian_Yapinus, Aklapper
thalhamm claimed this task.
thalhamm added a comment.
We were recently discussing a Wikipedia PageRank solution (or a combination of that ranking with other features). I could contribute these scores and would also be prepared to implement some of the integration (with some help).
thalhamm added a comment.
Hi,
I don't really think N-Triples (nt) adds more value. If you produce valid Turtle, there are tools such as the Raptor RDF Syntax Library that easily convert between different RDF syntaxes. Anyone who really needs nt can do this fairly easily themselves, e.g.
rapper --input t
thalhamm added a comment.
thalhamm added a comment.
@Smalyshev, I think we should first check whether the type of output is of any use to you. You can get most info (e.g. output/input format) at http://people.aifb.kit.edu/ath/#Wikidata_PageRank. It is not run on Hadoop and it needs fairly few resources (actually it can be
thalhamm added a comment.
@Smalyshev
I have developed a full Bash+Python3 framework that makes it possible to compute PageRank on any Wikipedia language edition (even with low-cost hardware). By default, the input is based on the latest version of the Wikidump and the output involves each page's Q-i
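For readers unfamiliar with the computation, the following is a minimal sketch of PageRank by power iteration on a tiny link graph. It is not the framework described above and says nothing about its input or output format. The page names, the function name `pagerank`, the damping factor of 0.85, and the fixed iteration count are assumptions chosen for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Compute PageRank for a dict mapping each page to the pages it links to."""
    # Collect every page that appears as a source or as a link target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)

    # Start from a uniform distribution.
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Pages without outgoing links spread their rank evenly over all pages.
        dangling = sum(rank[page] for page in pages if not links.get(page))
        new_rank = {
            page: (1.0 - damping) / n + damping * dangling / n for page in pages
        }
        # Each page passes an equal share of its rank to every page it links to.
        for source, targets in links.items():
            if not targets:
                continue
            share = damping * rank[source] / len(targets)
            for target in targets:
                new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical four-page link graph, for illustration only.
example_links = {
    "A": ["B", "C"],
    "B": ["C"],
    "C": ["A"],
    "D": ["C"],
}
ranks = pagerank(example_links)
```

In this toy graph, page "C" receives links from three pages and ends up with the highest score, while "D", which nothing links to, ends up with the lowest. A real Wikipedia link graph would need a sparse or streaming representation rather than plain dictionaries.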