Indeed, these collaborations in high-energy physics are not static
quantities, they change essentially every day (people getting hired and had
their contract expired, and most likely every two papers have a slightly
different author list.

Cheers
Yaroslav

On Sun, May 5, 2019 at 5:58 PM Darren Cook <dar...@dcook.org> wrote:

> > We may also want to consider if Wikidata is actually the best store for
> > all kinds of data. Let's consider example:
> >
> > https://www.wikidata.org/w/index.php?title=Q57009452
> >
> > This is an entity that is almost 2M in size, almost 3000 statements ...
>
> A paper with 2884 authors! arxiv.org deals with it by calling them the
> "Atlas Collaboration": https://arxiv.org/abs/1403.0489
> The actual paper does the same (with the full list of names and
> affiliations in the Appendix).
>
> The nice thing about graph databases is we should be able to set author
> to point to an "Atlas Collaboration" node, and then have that node point
> to the 2884 individual author nodes (and each of those nodes point to
> their affiliation).
>
> What are the reasons to not re-organize it that way?
>
> My first thought was that who is in the collaboration changes over time?
> But does it change day to day, or only change each academic year?
>
> Either way, maybe we need to point the author field to something like
> "Atlas Collaboration 2014a", and clone-and-modify that node each time we
> come to a paper that describes a different membership?
>
> Or is it better to do each persons membership of such a group with a
> start and end date?
>
> (BTW, arxiv.org tells me there are 1059 results for ATLAS Collaboration;
> don't know if one "result" corresponds to one "paper", though.)
>
> > While I am not against storing this as such, I do wonder if it's
> > sustainable to keep such kind of data together with other Wikidata data
> > in a single database.
>
> It feels like it belongs in "core" Wikidata. Being able to ask "which
> papers has this researcher written?" seems like a good example of a
> Wikidata query. Similarly,  "which papers have The ATLAS Collaboration"
> worked on?"
>
> But, also, are queries like "Which authors of Physics papers went to a
> high school that had more than 1000 students?" part of the goal of
> Wikidata? If so, Wikidata needs optimizing in such a way that makes such
> queries both possible and tractable.
>
> Darren
>
> _______________________________________________
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata

Reply via email to