[Wikidata-bugs] [Maniphest] T276643: Wikidata JSON dump (bz2) no longer imports due to bad JSON format

2021-03-07 Thread ArielGlenn
ArielGlenn added a comment. I'll leave this open until the run is complete and folks have had time to try to use them, so probably through the coming weekend. TASK DETAIL https://phabricator.wikimedia.org/T276643 EMAIL PREFERENCES

[Wikidata-bugs] [Maniphest] T139912: Take care of disambiguation items at Wikidata

2021-03-07 Thread Daniel-Barrows
Daniel-Barrows added a comment. Relevant discussion has been archived at https://www.wikidata.org/wiki/Wikidata:Bot_requests/Archive/2017/2#Take_care_of_disambiguation_items TASK DETAIL https://phabricator.wikimedia.org/T139912 EMAIL PREFERENCES

[Wikidata-bugs] [Maniphest] T276643: Wikidata JSON dump (bz2) no longer imports due to bad JSON format

2021-03-07 Thread ArielGlenn
ArielGlenn added a comment. In T276643#6890308 , @Ash20001 wrote: > Will this patch be included in the next dump or can be put back in the last two dumps (regenerate dump) This should be in time for the dump that will be produced

[Wikidata-bugs] [Maniphest] T258590: Change incorrect usage of HTTP to HTTPS for concept URIs on Commons

2021-03-07 Thread Multichill
Multichill added a comment. In T258590#6363261 , @CBogen wrote: > Note that the SD team work to change the Concept URIs in Commons is estimated to be a small. That was August 2020, we're now in March 2021. Any update of the status?

[Wikidata-bugs] [Maniphest] T273113: Wikidata pages don't seem to show up on Google and Bing Search results

2021-03-07 Thread DanBri
DanBri added a comment. How about generating sitemap files during munging (rather than as part of mediawiki or wikibase frontend)? TASK DETAIL https://phabricator.wikimedia.org/T273113 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: DanBri Cc:

[Wikidata-bugs] [Maniphest] T276643: Wikidata JSON dump (bz2) no longer imports due to bad JSON format

2021-03-07 Thread Ash20001
Ash20001 added a comment. Will this patch be included in the next dump or can be put back in the last two dumps (regenerate dump) TASK DETAIL https://phabricator.wikimedia.org/T276643 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: ArielGlenn,

[Wikidata-bugs] [Maniphest] T276643: Wikidata JSON dump (bz2) no longer imports due to bad JSON format

2021-03-07 Thread Maintenance_bot
Maintenance_bot removed a project: Patch-For-Review. TASK DETAIL https://phabricator.wikimedia.org/T276643 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: ArielGlenn, Maintenance_bot Cc: LucasWerkmeister, Motagirl2, Addshore, Mahir256, ArielGlenn,

[Wikidata-bugs] [Maniphest] T276643: Wikidata JSON dump (bz2) no longer imports due to bad JSON format

2021-03-07 Thread gerritbot
gerritbot added a comment. Change 669404 **merged** by ArielGlenn: [operations/puppet@production] wikibase entity dumps: add comma at end of intermediate files https://gerrit.wikimedia.org/r/669404 TASK DETAIL https://phabricator.wikimedia.org/T276643 EMAIL PREFERENCES

[Wikidata-bugs] [Maniphest] T273113: Wikidata pages don't seem to show up on Google and Bing Search results

2021-03-07 Thread abian
abian added a comment. See also T200846#4487680 . TASK DETAIL https://phabricator.wikimedia.org/T273113 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: abian Cc: abian, dr0ptp4kt, Mmarx,

[Wikidata-bugs] [Maniphest] T273113: Wikidata pages don't seem to show up on Google and Bing Search results

2021-03-07 Thread Lydia_Pintscher
Lydia_Pintscher added a comment. Ok thanks to @dr0ptp4kt I now have access to the magic search console \o/ Results so far: https://www.wikidata.org/wiki/Q99922367 was indeed not indexed at all. From a quick look of it because nothing links to it. I've pressed the button to get this one

[Wikidata-bugs] [Maniphest] T276643: Wikidata JSON dump (bz2) no longer imports due to bad JSON format

2021-03-07 Thread gerritbot
gerritbot added a project: Patch-For-Review. TASK DETAIL https://phabricator.wikimedia.org/T276643 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: ArielGlenn, gerritbot Cc: LucasWerkmeister, Motagirl2, Addshore, Mahir256, ArielGlenn, Ash20001,

[Wikidata-bugs] [Maniphest] T276643: Wikidata JSON dump (bz2) no longer imports due to bad JSON format

2021-03-07 Thread gerritbot
gerritbot added a comment. Change 669404 had a related patch set uploaded (by ArielGlenn; owner: ArielGlenn): [operations/puppet@production] wikibase entity dumps: add comma at end of intermediate files https://gerrit.wikimedia.org/r/669404 TASK DETAIL

[Wikidata-bugs] [Maniphest] T276643: Wikidata JSON dump (bz2) no longer imports due to bad JSON format

2021-03-07 Thread LucasWerkmeister
LucasWerkmeister added a comment. A workaround //might// be to insert `sed 's/}{/},{/g'` into the pipeline between `bunzip2` and `mongoimport`. (Though that’ll probably at least slow down the import, since sed will run regexes against huge input lines.) TASK DETAIL

[Wikidata-bugs] [Maniphest] T276643: Wikidata JSON dump (bz2) no longer imports due to bad JSON format

2021-03-07 Thread LucasWerkmeister
LucasWerkmeister added a comment. In T276643#6889891 , @Motagirl2 wrote: > The 20210215 bz2 works perfectly  Yup, same here. TASK DETAIL https://phabricator.wikimedia.org/T276643 EMAIL PREFERENCES

[Wikidata-bugs] [Maniphest] T276643: Wikidata JSON dump (bz2) no longer imports due to bad JSON format

2021-03-07 Thread Motagirl2
Motagirl2 added a comment. The 20210215 bz2 works perfectly  TASK DETAIL https://phabricator.wikimedia.org/T276643 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: ArielGlenn, Motagirl2 Cc: LucasWerkmeister, Motagirl2, Addshore, Mahir256,