[Wikidata-bugs] [Maniphest] [Changed Subscribers] T143424: [Task] Explore the Entity Relevancy Scoring for Wikidata

2017-10-09 Thread thalhamm
thalhamm added a subscriber: Lydia_Pintscher. thalhamm added a comment. @Lydia_Pintscher @Smalyshev Based on the Wikidata PageRank scores, my former colleague Steffen Thoma (KIT) and I developed a Wikidata autocomplete prototype based on Apache Solr. Please see here: http://km.aifb.kit.edu

[Wikidata-bugs] [Maniphest] [Unassigned] T143424: [Task] Explore the Entity Relevancy Scoring for Wikidata

2017-11-05 Thread thalhamm
thalhamm removed thalhamm as the assignee of this task. TASK DETAIL: https://phabricator.wikimedia.org/T143424 EMAIL PREFERENCES: https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: thalhamm Cc: Lydia_Pintscher, Smalyshev, thalhamm, thiemowmde, Sjoerddebruin, Glorian_Yapinus, Aklapper

[Wikidata-bugs] [Maniphest] [Claimed] T143424: [Task] Explore the Entity Relevancy Scoring for Wikidata

2016-09-19 Thread thalhamm
thalhamm claimed this task. thalhamm added a comment. We were recently discussing a Wikipedia PageRank solution (or a combination of that ranking with other features). I could contribute these scores and am also ready to implement some integration (with some help).

[Wikidata-bugs] [Maniphest] [Commented On] T144103: Create .nt (NTriples) dumps for wikidata data

2016-11-04 Thread thalhamm
thalhamm added a comment. Hi, I don't really think nt adds more value. If you produce valid Turtle, there are tools such as the Raptor RDF Syntax Library that easily convert between different RDF syntaxes. Anyone who really needs nt can do this fairly easily themselves, i.e. rapper --input t

[Wikidata-bugs] [Maniphest] [Commented On] T143424: [Task] Explore the Entity Relevancy Scoring for Wikidata

2017-03-17 Thread thalhamm
thalhamm added a comment.

[Wikidata-bugs] [Maniphest] [Commented On] T143424: [Task] Explore the Entity Relevancy Scoring for Wikidata

2017-03-17 Thread thalhamm
thalhamm added a comment. @Smalyshev, I think we should first check whether the type of output is of any use to you. You can get most info (e.g. output/input format) at http://people.aifb.kit.edu/ath/#Wikidata_PageRank. It is not run on Hadoop and it takes fairly few resources (actually it can be

[Wikidata-bugs] [Maniphest] [Commented On] T143424: [Task] Explore the Entity Relevancy Scoring for Wikidata

2017-07-15 Thread thalhamm
thalhamm added a comment. @Smalyshev I have developed a full Bash+Python3 framework that makes it possible to compute PageRank on any Wikipedia language edition (even with low-cost hardware). By default, the input is based on the latest version of the Wikidump and the output involves each page's Q-i
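
For readers unfamiliar with the scoring discussed in this thread, the following is a minimal sketch of PageRank by power iteration over a small link graph. It is not the Bash+Python3 framework mentioned above, whose code does not appear in these snippets. The function name `pagerank`, the damping factor of 0.85, the fixed iteration count, the uniform handling of pages without outgoing links, and the identifiers Q1 to Q4 are all assumptions chosen for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Plain power-iteration PageRank (illustrative sketch only).

    links maps each page identifier to the list of pages it links to.
    Pages without outgoing links spread their rank evenly over all pages.
    """
    # Collect every page that appears as a source or as a link target.
    nodes = set(links)
    for targets in links.values():
        nodes.update(targets)
    n = len(nodes)

    # Start from a uniform distribution.
    rank = {node: 1.0 / n for node in nodes}

    for _ in range(iterations):
        # Rank currently held by pages that have no outgoing links.
        dangling = sum(rank[node] for node in nodes if not links.get(node))
        # Every page receives the teleport share plus its part of the dangling rank.
        new_rank = {
            node: (1.0 - damping) / n + damping * dangling / n for node in nodes
        }
        # Each page passes the rest of its rank on to the pages it links to.
        for source, targets in links.items():
            if not targets:
                continue
            share = damping * rank[source] / len(targets)
            for target in targets:
                new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical toy graph; the Q-style identifiers are placeholders, not real items.
toy_links = {
    "Q1": ["Q2", "Q3"],
    "Q2": ["Q3"],
    "Q3": ["Q1"],
    "Q4": ["Q3"],
}
scores = pagerank(toy_links)
for page, score in sorted(scores.items(), key=lambda item: -item[1]):
    print(page, round(score, 4))
```

In this toy graph the page with the most incoming links ranks highest and the page with none ranks lowest. A real run over a full dump would need a more memory-efficient graph representation than nested Python lists.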