JanZerebecki added a comment.

>>! In T76373#799504, @Smalyshev wrote:
> # Right now we completely ignore references. Do we want to keep them in the 
> index too?

Yes, although the other parts of statements are more important. One might query 
for things that are not sourced and/or only have a source in Wikipedia, or one 
might want to find things that refer to a particular source.

> # Which fields we want to search against - i.e., do we want to search against 
> descriptions (we'd probably need full-text index, like Elastic, to make it 
> useful for anything)

We probably want to search against everything in statements and site links. We 
might not need to import labes, descriptions and aliases until we want to have 
them directly in the output of the search instead of getting them from 
mysql/memcache afterwards.
Full text not necessary at first: Label, description, aliases full text search 
is I think the least important feature as our current Elastic setup should 
already cover these (although not as separate fields). Although I'm sure 
someone could come up with queries they want that would need this. More 
interesting is probably values of monolingual text, but still not absolutely 
necessary to have full text index. None of the current example queries we 
collected need a full text index. For all the other string values full text 
would probably not be used.
Though in the future it might be beneficial to be able to meet our searching 
needs through one endpoint (like Titan) even for normal full text queries, 
which would allow us to easily add drill down or faceted search features based 
on graph queries.

TASK DETAIL
  https://phabricator.wikimedia.org/T76373

REPLY HANDLER ACTIONS
  Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign 
<username>.

To: Smalyshev, JanZerebecki
Cc: Smalyshev, Manybubbles, GWicke, JanZerebecki, jkroll, Wikidata-bugs, aude, 
daniel



_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to