Smalyshev added a comment.

I tried copying statement_keywords field into all field (for random 1M items) and the results don't seem to be too encouraging - all field e.g. tokenizes tt0041008 as two tokens tt and 0041008. When searching, it does produce Q18636386 which it should be, but also for example Q507445 (which has TT in it's name) with higher score. So I don't think copying it into text fields would work.

@dcausse, if you want to check it it's in stas_wikidata_test index on relforge.


TASK DETAIL
https://phabricator.wikimedia.org/T163642

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Smalyshev
Cc: dcausse, Esc3300, ArthurPSmith, Stashbot, Lea_Lacroix_WMDE, gerritbot, Liuxinyu970226, Smalyshev, debt, aude, Lydia_Pintscher, Aklapper, Multichill, stebsco, Lahi, Gq86, Darkminds3113, GoranSMilovanovic, QZanden, EBjune, LawExplorer, Avner, Gehel, FloNight, Wikidata-bugs, jayvdb, Mbch331, jeremyb
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to