Please, any one let me know about Wikidata project because I dont know
about this.

On Fri, Jul 27, 2018 at 5:29 PM, James Heald <jpm.he...@gmail.com> wrote:

> +1 with Magnus on years of birth and death
> (but perhaps /only/ years of birth and death, or close surrogates eg years
> of baptism and burial, and inception or publication date for things,
> otherwise the search specificity would become useless with too many other
> 'significant event' dates)
>
> I have found in the last few weeks I have been using the External ID value
> search a lot, from its search-box on the talk page of the main page for a
> property.
>
> I'm finding this works very well, so I wonder whether people think that
> the ability to search for one of these strings directly in the general
> search box would actually add anything, or is the custom search eg via the
> talk-page search box already enough?
>
>  -- James.
>
>
>
> On 27/07/2018 12:49, Magnus Manske wrote:
>
>> Hi, and thanks for working on this!
>>
>> My subjective view:
>> * We don't need P2860/P1433 indexed, at least not at the moment
>> * I would really like dates (mainly, born/died), especially if they work
>> for "greater units", that is, I search for a year and get an item back,
>> even though the statament is month- or day-precise
>>
>> Cheers,
>> Magnus
>>
>> On Thu, Jul 26, 2018 at 10:48 PM Stas Malyshev <smalys...@wikimedia.org>
>> wrote:
>>
>> Hi!
>>>
>>> Today we are indexing in ElasticSearch almost all string properties
>>> (except a few) and select item properties (P31 and P279). We've been
>>> asked to extend this set and index more item properties
>>> (https://phabricator.wikimedia.org/T199884). We did not do it from the
>>> start because we did not want to add too much data to the index at once,
>>> and wanted to see how the index behaves. To evaluate what this change
>>> would mean, some statistics:
>>>
>>> All usage of item properties in statements is about 231 million uses
>>> (according to sqid tool database). Of those, about 50M uses are
>>> "instance of" which we are already indexing. Another 98M uses belong to
>>> two properties - published in (P1433) and cites (P2860). Leaving about
>>> 86M for the rest of the properties.
>>>
>>> So, if we index all the item properties except P2860 and P1433, we'll be
>>> a little more than doubling the amount of data we're storing for this
>>> field, which seems OK. But if we index those too, we'll be essentially
>>> quadrupling it - which may be OK too, but is bigger jump and one that
>>> may potentially cause some issues.
>>>
>>> So, we have two questions:
>>> 1. Do we want to enable indexing for all item properties? Note that if
>>> you just want to find items with certain statement values, Wikidata
>>> Query Service matches this use case best. It's only in combination with
>>> actual fulltext search where on-wiki search is better.
>>>
>>> 2. Do we need to index P2860 and P1433 at all, and if so, would it be ok
>>> if we omit indexing for now?
>>>
>>> Would be glad to hear thoughts on the matter.
>>>
>>> Thanks,
>>> --
>>> Stas Malyshev
>>> smalys...@wikimedia.org
>>>
>>> _______________________________________________
>>> Wikidata mailing list
>>> Wikidata@lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>>
>>>
>>
>>
>> _______________________________________________
>> Wikidata mailing list
>> Wikidata@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>>
>
> ---
> This email has been checked for viruses by AVG.
> https://www.avg.com
>
>
>
> _______________________________________________
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>



-- 
Regards,
Sushil Dutt
8800911840
_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata

Reply via email to