Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2017-01-05 Thread Lydia Pintscher
Hey folks :) Andy and Pasleim just brought this topic to my attention again. Sorry for having dropped the ball a bit. I've created https://phabricator.wikimedia.org/T154660 with a strawman proposal for the still open question of which length it should be. Please add your arguments there. Cheers

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-10-08 Thread Egon Willighagen
Dear Thomas, On Sat, Oct 8, 2016 at 12:07 PM, Thomas Douillard < thomas.douill...@gmail.com> wrote: > Probably a silly question but ... did you all consider creating a datatype > for molecule representation? This seems to be a very similar use case to > mathematical formulas. Essentially we're not

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-10-08 Thread Daniel Kinzler
That was discussed and declined a while ago, see . Though I think the proposed realization was presentational rather than functional. I'll have to re-read the discussion, though. On 08.10.2016 at 12:07, Thomas Douillard wrote: > Probably a silly

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-10-08 Thread Thomas Douillard
Probably a silly question but ... did you all consider creating a datatype for molecule representation? This seems to be a very similar use case to mathematical formulas. Essentially we're not dealing with a raw string but a representation of molecule formulas, with its own encoding ... Changing

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-10-08 Thread Egon Willighagen
On Sat, Oct 8, 2016 at 11:28 AM, Lydia Pintscher < lydia.pintsc...@wikimedia.de> wrote: > On Sat, Oct 8, 2016 at 11:23 AM, Egon Willighagen > wrote: > > Ah, those numbers are for https://www.wikidata.org/wiki/Property:P234 > ... > > External identifier then. Cool. And

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-10-08 Thread Lydia Pintscher
On Sat, Oct 8, 2016 at 11:23 AM, Egon Willighagen wrote: > Ah, those numbers are for https://www.wikidata.org/wiki/Property:P234 ... External identifier then. Cool. And for string like in https://www.wikidata.org/wiki/Property:P233? Sebastian's initial email says 1500

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-10-08 Thread Egon Willighagen
On Sat, Oct 8, 2016 at 11:19 AM, Lydia Pintscher < lydia.pintsc...@wikimedia.de> wrote: > On Sat, Oct 8, 2016 at 11:14 AM, Egon Willighagen > wrote: > > For small compounds this is answered by Sebastian's analysis... 5K would > > cover all currently known small

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-10-08 Thread Egon Willighagen
On Sat, Oct 8, 2016 at 11:07 AM, Lydia Pintscher < lydia.pintsc...@wikimedia.de> wrote: > > Based on this my proposal is to increase string and URL and > potentially external identifier if you request it. One open question > is still what the new limit should be. > For small compounds this is

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-10-08 Thread Lydia Pintscher
Hi everyone, I've been thinking more about this and we also discussed this within the development team. Here's my thinking at this point: * We do have data that you all want to see in Wikidata that is currently prevented by the limit. That is not good. * I agree that the general understanding of

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-23 Thread Egon Willighagen
On Fri, Sep 23, 2016 at 5:53 PM, Denny Vrandečić wrote: > One stupid question: due to the length of these identifiers, and since > they are not simple opaque identifiers but rather encode semantics - > if I understand it correctly - could a single such identifier be

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-23 Thread Denny Vrandečić
Thank you! I am sure that this will help the Wikidata team to make the right decision. Also, very interesting numbers. One stupid question: due to the length of these identifiers, and since they are not simple opaque identifiers but rather encode semantics - if I understand it correctly -

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-23 Thread Egon Willighagen
Sebastian, great that you found time for it! I didn't :/ (Stats are worth a tweet, IMHO :) Egon On Fri, Sep 23, 2016 at 12:20 PM, Sebastian Burgstaller < sebastian.burgstal...@gmail.com> wrote: > Hi Denny, > Sorry, I missed this email. I just did the calculation for InChI string > lengths on the 92

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-19 Thread Markus Kroetzsch
On 19.09.2016 18:12, Lydia Pintscher wrote: > On Mon, Sep 19, 2016 at 6:19 AM, Denny Vrandečić wrote: >> Can you figure out what a good limit would be for these two use cases? I.e. >> what would support 99%, 99.9%, and 100%? > Yes this would be extremely helpful. In general I

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-19 Thread Sebastian Burgstaller
Thanks, guys! I am glad to hear that the technical hurdles for implementation seem to be relatively low. Is there any realistic timeline by when this could be done? I agree with Lydia, that not all string properties should allow for unlimited (or even very many) chars. It would be nice to

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-19 Thread Lydia Pintscher
On Mon, Sep 19, 2016 at 6:19 AM, Denny Vrandečić wrote: > Can you figure out what a good limit would be for these two use cases? I.e. > what would support 99%, 99.9%, and 100%? Yes this would be extremely helpful. In general I agree that we can now be more relaxed about this
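A rough sketch of how the 99% / 99.9% / 100% question quoted above could be answered from a list of string lengths. The function name `limit_for_coverage` and the sample lengths are made up for illustration; they are not taken from the thread or from any real InChI or SMILES dataset.

```python
import math


def limit_for_coverage(lengths, fraction):
    """Return the smallest character limit that fits at least
    `fraction` of the given string lengths (hypothetical helper)."""
    if not lengths:
        raise ValueError("need at least one length")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    ordered = sorted(lengths)
    # Number of strings that must fit under the limit, rounded up,
    # converted to a zero-based index into the sorted lengths.
    k = math.ceil(fraction * len(ordered)) - 1
    return ordered[k]


# Made-up lengths, for illustration only (not real chemical data).
sample = [25, 40, 40, 60, 90, 120, 300, 410, 800, 1500]

for frac in (0.99, 0.999, 1.0):
    print(f"{frac:.1%} coverage needs a limit of {limit_for_coverage(sample, frac)}")
```

With real data the input would be the lengths of all existing values for one property, so each property could get its own answer rather than one limit for every string.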

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-18 Thread Egon Willighagen
Hi all, sorry for joining the party late... On Tue, Sep 13, 2016 at 11:39 AM, Sebastian Burgstaller wrote: > I think this topic might have been discussed many months ago. For > certain data types in the chemical compound space (P233, canonical > SMILES, P2017

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-17 Thread Hay (Husky)
One other use case for this would be citation URLs. For example, to get the number of inhabitants of all Dutch municipalities you need an 800-character (1) permalink from the central bureau of statistics. So this change would be very welcome indeed! -- Hay (1):

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-16 Thread Stas Malyshev
Hi! > However, given that we now have such a well informed community with > established practices and good quality checks, it seems unproblematic to > lift the character limit. I don't think there are major technical > reasons for having it. Surely, BlazeGraph (the WMF SPARQL engine) should > not

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-16 Thread Daniel Kinzler
On 16.09.2016 at 19:38, Denny Vrandečić wrote: > Markus' description of the decision for the limit corresponds with mine. I also > think that this decision can be revisited. I would still advise caution, due > to technical issues, but I am sure that the development team will make a >

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-16 Thread Denny Vrandečić
(in particular because I expect that character limit to have to change for Wiktionary in Wikidata) On Fri, Sep 16, 2016 at 10:38 AM Denny Vrandečić wrote: > Markus' description of the decision for the limit corresponds with mine. I > also think that this decision can be

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-16 Thread Denny Vrandečić
Markus' description of the decision for the limit corresponds with mine. I also think that this decision can be revisited. I would still advise caution, due to technical issues, but I am sure that the development team will make a well-informed decision on this. It would be sad if valid

Re: [Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-16 Thread Markus Kroetzsch
On 13.09.2016 11:39, Sebastian Burgstaller wrote: Hi all, I think this topic might have been discussed many months ago. For certain data types in the chemical compound space (P233, canonical SMILES, P2017 isomeric SMILES and P234 InChI key) a higher character limit than 400 would be really

[Wikidata] Greater than 400 char limit for Wikidata string data types

2016-09-13 Thread Sebastian Burgstaller
Hi all, I think this topic might have been discussed many months ago. For certain data types in the chemical compound space (P233, canonical SMILES, P2017 isomeric SMILES and P234 InChI key) a higher character limit than 400 would be really helpful (1500 to 2000 chars (I sense that this might