JanZerebecki added a comment. >>! In T76373#799504, @Smalyshev wrote: > # Right now we completely ignore references. Do we want to keep them in the > index too?
Yes, although the other parts of statements are more important. One might query for things that are not sourced and/or only have a source in Wikipedia, or one might want to find things that refer to a particular source. > # Which fields we want to search against - i.e., do we want to search against > descriptions (we'd probably need full-text index, like Elastic, to make it > useful for anything) We probably want to search against everything in statements and site links. We might not need to import labes, descriptions and aliases until we want to have them directly in the output of the search instead of getting them from mysql/memcache afterwards. Full text not necessary at first: Label, description, aliases full text search is I think the least important feature as our current Elastic setup should already cover these (although not as separate fields). Although I'm sure someone could come up with queries they want that would need this. More interesting is probably values of monolingual text, but still not absolutely necessary to have full text index. None of the current example queries we collected need a full text index. For all the other string values full text would probably not be used. Though in the future it might be beneficial to be able to meet our searching needs through one endpoint (like Titan) even for normal full text queries, which would allow us to easily add drill down or faceted search features based on graph queries. TASK DETAIL https://phabricator.wikimedia.org/T76373 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign <username>. To: Smalyshev, JanZerebecki Cc: Smalyshev, Manybubbles, GWicke, JanZerebecki, jkroll, Wikidata-bugs, aude, daniel _______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
