[Wikidata-bugs] [Maniphest] T179681: Add HDT dump of Wikidata
jjkoehorst added a comment. Small update from my side. After downloading the latest ttl file from Wikidata I receive no errors but also no output. I tried the exact command with a small dataset and that worked. time sudo docker run -v `pwd`:/wikidata rdfhdt/hdt-cpp:v1.3.3 rdf2hdt -f turtle -p -i wikidata/latest-all.ttl.gz wikidata/latest-all.hdt sudo docker run -v `pwd`:/wikidata rdfhdt/hdt-cpp:v1.3.3 rdf2hdt -f turtle -p 19.75s user 13.90s system 0% cpu 50:21:55.81 total So I am not exactly sure what is happening. This is the temp first 103 lines of the turtle file. time sudo docker run -v `pwd`:/wikidata rdfhdt/hdt-cpp:v1.3.3 rdf2hdt -f turtle -p -i wikidata/tmp.ttl.gz wikidata/tmp.hdt Predicate Bitmap in 21 usp: 0 % / 5.4 % Count predicates in 17 userences: 0 % / 6.75 % Count Objects in 8 us Max was: 8: 0 % / 27 % Bitmap in 9 usx bitmap: 0 % / 39.6 % Bitmap bits: 56 Ones: 38 Object references in 23 usces: 0 % / 42.75 % Sort lists in 17 uslists: 0 % / 64.8 % Index generated in 119 us sudo docker run -v `pwd`:/wikidata rdfhdt/hdt-cpp:v1.3.3 rdf2hdt -f turtle -p 0.04s user 0.03s system 1% cpu 4.868 total and then I can access the turtle file on the local drive. TASK DETAIL https://phabricator.wikimedia.org/T179681 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: jjkoehorst Cc: jjkoehorst, MPhamWMF, Daniel_Mietchen, hoo, Addshore, Smalyshev, Ladsgroup, Arkanosis, Tarrow, Lucas_Werkmeister_WMDE, Aklapper, Invadibot, maantietaja, Akuckartz, Dinadineke, DannyS712, Nandana, tabish.shaikh91, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, Jayprakash12345, JakeTheDeveloper, QZanden, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, TheDJ, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T179681: Add HDT dump of Wikidata
jjkoehorst added a comment. As I was having some issues with compiling the code I used a docker instance directly for the conversion unfortunately it failed due to rdf syntax reasons while using the latest database. As I didn't time it I cannot give any details yet about the performance. ? wikidata sudo docker run -v `pwd`:/wikidata rdfhdt/hdt-cpp:v1.3.3 rdf2hdt -p -i wikidata/latest-all.nt.gz wikidata/latest-all.hdt error: wikidata/latest-all.nt.gz:604276348:139: bad IRI scheme char `2F' Catch exception load: Error parsing input. ERROR: Error parsing input. TASK DETAIL https://phabricator.wikimedia.org/T179681 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: jjkoehorst Cc: jjkoehorst, MPhamWMF, Daniel_Mietchen, hoo, Addshore, Smalyshev, Ladsgroup, Arkanosis, Tarrow, Lucas_Werkmeister_WMDE, Aklapper, maantietaja, Akuckartz, Dinadineke, DannyS712, Nandana, tabish.shaikh91, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, Jayprakash12345, JakeTheDeveloper, QZanden, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, abian, Wikidata-bugs, aude, TheDJ, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T227031: Update wikibase-docker configuration for wdqs-frontend
jjkoehorst added a comment. The issue identified was because the custom-config.json was not overwriting the settings stored in default-config.json. To overcome this a pull request was made that overwrites the default-config.json file upon startup. See: https://github.com/wmde/wikibase-docker/pull/85 TASK DETAIL https://phabricator.wikimedia.org/T227031 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: jjkoehorst Cc: jjkoehorst, Andrawaag, Addshore, Aklapper, darthmon_wmde, Jelabra, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, Salgo60, _jensen, rosalieper, Jonas, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Lydia_Pintscher, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T226238: Identical items can't be merged due to duplicate articles in ceb wikipedia
jjkoehorst added a comment. So if I understand it correctly, Q33512910 is the historical entity of Wageningen and Q1305 is the current city of Wageningen? What exactly is meant by historical? 100 years, 10 years? ... Also can we not report in the current wageningen historical changes using e.g. the end time property? In the "old" city it should then be clearly mentioned that this is an old city with a rough end estimation or not? TASK DETAIL https://phabricator.wikimedia.org/T226238 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: jjkoehorst Cc: jjkoehorst, Jane023, Andrawaag, Aklapper, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs