Motagirl2 added a comment.
I have the same issue.
I have a script to extract entities from Wikidata dumps, that I've been
running successfully for years.
The last time I ran it, on current latest-all.json.bz2 (03-Mar-2021 14:10,
size 63323125695), it complained about a malformed json:
ijson.common.IncompleteJSONError: parse error: after array element, I
expect ',' or ']'
:[]}},"lastrevid":1374358285}{"type":"item","id":"Q27","labe
(right here) ------
The script runs multiple threads in parallel, so it's able to "crash" on some
threads while continuing on others, so I noticed that the error happens not
only at that point, but also in a couple more places through the json.
I'm currently downloading the version that is .gz (rather than .bz2) to try
running on it (not very hopefully, to be honest).
The last succesfully extraction happened at the beginning of January, on a
.bz2 with size 61247031499 (I'm not able to find it in the dumps page)
TASK DETAIL
https://phabricator.wikimedia.org/T276643
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: ArielGlenn, Motagirl2
Cc: Motagirl2, Addshore, Mahir256, ArielGlenn, Ash20001, maantietaja, jannee_e,
Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Lunewa, QZanden,
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, gnosygnu, abian,
Wikidata-bugs, aude, Lydia_Pintscher, Mbch331
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs