thalhamm added a comment.
@Smalyshev
I have developed a full Bash+Python3 framework that enables to compute PageRank on any Wikipedia language edition (even with low-cost hardware). By default, the input is based on the latest version of the Wikidump and the output involves each page's Q-id and
thalhamm added a comment.
@Smalyshev, I think we might check first if the type of output is of any use for you. You can get most info (e.g. output/input format) at http://people.aifb.kit.edu/ath/#Wikidata_PageRank. It is not run on Hadoop and it takes fairly little resources (actually it can be
thalhamm added a comment.
TASK DETAILhttps://phabricator.wikimedia.org/T143424EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: thalhammCc: Smalyshev, thalhamm, thiemowmde, Sjoerddebruin, Glorian_Yapinus, Aklapper, QZanden, D3r1ck01, Izno, Wikidata-bugs, aude,
Smalyshev added a comment.
@thalhamm We'd like to know more about the PageRank solution, especially applied to Wikidata. In order to see if we could integrate this solution, we'd like to know more about:
What is the input for the algorithm?
What is the output produced?
What platform it is run on
Sjoerddebruin added a comment.
The number of incoming links to a item could be a indication of relevancy, but probably the most difficult one to add.TASK DETAILhttps://phabricator.wikimedia.org/T143424EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: