Marostegui added a comment.

I will definitely try to optimize the table, as it is getting out of hand

Now that we use Elastic for prefix search, we could drop all rows with term_type = "alias" from this table. Needs a little bit of code to make this optional, but should save a lot of space. If you think that would help, ask @hoo and @Lydia_Pintscher about it.

The % of rows with that type isn't huge

root@db2086.codfw.wmnet[wikidatawiki]> select count(*) from wb_terms where term_type="alias";
+----------+
| count(*) |
+----------+
| 16570605 |
+----------+
1 row in set (11 min 24.28 sec)


root@db2086.codfw.wmnet[wikidatawiki]> select count(*) from wb_terms;
+------------+
| count(*)   |
+------------+
| 1268234185 |
+------------+
1 row in set (6 min 4.49 sec)

So that makes it around 1.3%, not huge, but it would still be good to get them cleaned if they are not necessary.

@hoo @Lydia_Pintscher you want me to create a ticket about it or you'd do it?


TASK DETAIL
https://phabricator.wikimedia.org/T179106

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Marostegui
Cc: Lydia_Pintscher, Stashbot, Ladsgroup, daniel, Marostegui, Aklapper, hoo, Lahi, GoranSMilovanovic, QZanden, Minhnv-2809, Luke081515, Wikidata-bugs, aude, Mbch331, Jay8g, Krenair
_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to