VladimirAlexiev created this task.
VladimirAlexiev added a project: Wikidata.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION

I'm fetching entity RDF from Wikidata (WD) and post-processing it.
WD sometimes cuts the RDF off at a random point.
E.g. this command sometimes shows different last lines from run to run, and the very last line is cut short:

curl -sL https://www.wikidata.org/entity/Q61707895.nt | tail -10

I noticed this when formatting as Turtle (I filter out non-English property names/descriptions).
(Grab the prefixes from https://www.wikidata.org/entity/Q61707895.ttl and save them to prefixes.ttl.)
E.g. these commands, run consecutively, reported errors at different lines because the RDF was cut short at different points.

curl -sL https://www.wikidata.org/entity/Q61707895.nt | cat prefixes.ttl - | riot --formatted ttl -syntax ttl - > artefact1.ttl
11:19:38 ERROR riot                 :: [line: 4509, col: 105] Not a hexadecimal character (end of file)

curl -sL https://www.wikidata.org/entity/Q61707895.nt | cat prefixes.ttl - | riot --formatted ttl -syntax ttl - > artefact1.ttl
11:19:41 ERROR riot                 :: [line: 4221, col: 83] Broken IRI (End of file)
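
A possible way to catch this kind of truncation before the data reaches riot is to save the download to a file and sanity-check its tail. The sketch below is only a heuristic, not a validator. It assumes that a complete N-Triples response ends with a newline and that its last line ends with the "." statement terminator, so a download cut exactly at a line boundary would still pass. The function name nt_looks_complete and the file name entity.nt are hypothetical, chosen for illustration.

```shell
# Heuristic check for a truncated N-Triples download (a sketch, not a validator).
# Assumption: a complete .nt file ends with a newline and its last line
# ends with "." (the N-Triples statement terminator).
nt_looks_complete() {
  file=$1
  # An empty or missing file is treated as incomplete.
  [ -s "$file" ] || return 1
  # The final byte must be a newline. Command substitution strips a
  # trailing newline, so the result is empty exactly when the last byte is one.
  [ -z "$(tail -c 1 "$file")" ] || return 1
  # The last line must end with the "." terminator (optional trailing spaces).
  tail -n 1 "$file" | grep -q '\.[[:space:]]*$'
}

# Example use (entity.nt is a hypothetical file name):
#   curl -sL https://www.wikidata.org/entity/Q61707895.nt -o entity.nt
#   nt_looks_complete entity.nt || echo "download looks truncated" >&2
```

If the check fails, the download can simply be retried before concatenating it with prefixes.ttl, instead of discovering the problem through a parser error.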

Right now (a few minutes later) WD returns the full RDF, and I'm glad of it.
But it's troubling that such a simple, fundamental operation can be cut off.


TASK DETAIL
https://phabricator.wikimedia.org/T216006

To: VladimirAlexiev
Cc: Aklapper, VladimirAlexiev, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, Wikidata-bugs, aude, Mbch331