Again, thank you for your responses. As I stated previously, our research is more on the Jeremy Foote's line: We want to study the "wiki survival" or "wiki mortality" in highly diverse wiki ecosystem as it is Wikia and we want to focus more in *the death of wikis as a community*, so probably not so related to how much useful or outdated is the knowledge posted.
The first step to approach this is to define "Death". Indeed, It can correspond somehow with the activity of the wiki, as the activity represents the aliveness of a community. But definitely bots, changes made globally to all wikis by Wikia and, maybe, small format corrections should be kept apart. Thank you all for pointing those out! There is a major question that maybe is the key to unlock the definition of these concepts: *Is inactivity equal to death community or is inactivity just an indicator of possible death?* So, in the first definition I brought out, they used the term "dormant", leaving some room for the hope that the wiki can be retaken or awaken at some later point. So I believe that, what we are looking actually forward is for a definition closer to what Leila said about dropping-out users editors (Thank you very much for the reference!), but applied to wikis, so something like "dropped-out wikis" could be what we are looking for in order to have a proxy for wiki death or wiki abandonment. @Leila: Do you think that we could do some equivalent study to evaluate when a wiki shifts from "inactive" to "abandon" ? Is this what you meant when saying "inactive may or may not be equal to dead in your definition."? Or what did you mean by saying so? I want to add, and I think it's worth noting, that apart from activity data, we also have data regarding the visits to the wiki. This give us an idea on how much attention or usefulness the wiki is still providing to visitors. But I find this data more related to the interest or, even, "activity" of the wiki rather than the aliveness of the community behind. El mar., 6 nov. 2018 a las 14:03, Leila Zia (<[email protected]>) escribió: > Hi Abel, > > Some years back, Erik Zachte and I came up with a proposal related to > your question. (inactive may or may not be equal to dead in your > definition.) > > A project is considered active if it satisfies the following conditions: > * The wiki should be hosted by the Wikimedia Foundation > * Open (not closed/locked or private) > * The wiki should have less than a very permissive threshold of > activity: At least three active editors with 5+ edits/month/editor > needed. This threshold is perhaps too permissive but it already cuts > away a large number of almost inactive projects. > > We haven't put the above anywhere public (we totally should). > > And since you may run into this other question soon, too: If you want > to see how many months of inactivity for a username means the user has > dropped out of the project, please check section 1B in > https://meta.wikimedia.org/wiki/Research:Understanding_thanks#Discussion. > > Best, > Leila > > On Mon, Nov 5, 2018 at 2:05 PM Kerry Raymond <[email protected]> > wrote: > > > > While perhaps not relevant to the original enquiry ... > > > > For Wikipedia, do we need to know (or care) about the "death" of: > > > > * a WikiProject > > * a Portal > > * a category > > * an article > > > > While there are articles on historical topics which might, once written, > arguably not need further updating, there are many articles, categories, > portals, projects which do as they involve topics that are current in the > real world. As a simple example, for a town, we report on population and > temperatures. For an electorate, there are elections and new people > elected. Sports results etc. If readers visit articles with out-of-date > information, they may see less value in the article and therefore Wikipedia > as a whole. Have we ever thought of having some system of tagging articles > of this nature, either as a whole or in sections or in infobox fields that > indicates when the information could be considered out of date. E.g. we > have the Australian census every 5 years (the last being 2016). It takes > them about a year to release the data to the public (so 2017 we had first > access to the 2016 data), so we might think it desirable that all > Australian places with census data have been updated by 2018 and certainly > we would not think it acceptable if there were any with 2006 (or earlier) > census data (except as historical information). Yet of course there are > many such out-of-date Australian articles, but probably we don't have an > easy way to know which ones. (Before anyone rushes to tell me about > Wikidata solutions, I would point out that the average Australian Wikipedia > editor neither knows nor cares about Wikidata and our attempt to add 2016 > census data from Wikidata more-or-less collapsed from lack of community > support). I'm thinking here about solutions that Wikipedians might > understand, such as templates which have a tracking category that is > activated when the article misses an update deadline based on some template > in the article. > > > > Of course, on Wikipedia, many articles have the illusion of being > actively maintained in the sense of edits occurring, thanks to vandalism > and reverts of vandalism, endless re-categorisation, automated changes of a > trivial nature (e.g. dash length), the Internet Archive Bot and other > bots, copyedits etc. As someone who does her watchlist diligently, I am > seeing increasing activity over articles (my daily watchlist seems to be > growing faster than the number of entries on my watchlist) which suggests > we are more active, but, when I look at the edits, relatively few of them > are updates to the information content. So activity should not be equated > to information currency. Note, as anyone who deals with visible metrics > soon learns, people game them and our edit counts are a classic example. I > sometimes wonder what would happen if we suppressed that information. Or > better still, counted something that we value more than "number of times > clicked Save". What if we only counted the number of citations added (or > counted it in addition to plain old edit count)? Would that drive > behavioural change from less information-productive activities towards more > information-productive activities? > > > > But if we can have some measure of information-activity/inactivity for > an article, then I presume we can aggregate this across any natural > groupings of articles (e.g. Category trees, Portals, WikiProjects) to > discover where we are stagnating and then let humans decide if that topic > space is one that can stagnate (because it is historic) or one that must be > updated periodically to be considered useful and whether the correct > frequency of updates seems to be occurring, either macroscopically or > (ideally) microscopically around particular time-sensitive factoids. > > > > Can we measure "information growth"? > > > > Kerry > > > > > > > > _______________________________________________ > > Wiki-research-l mailing list > > [email protected] > > https://lists.wikimedia.org/mailman/listinfo/wiki-research-l > > _______________________________________________ > Wiki-research-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wiki-research-l > -- Saludos, Abel. _______________________________________________ Wiki-research-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
