Hi Travis,

I thought that was you when I read your post; yes, we did indeed talk. Actually, it was 
after our talk that I went through extensive searching to find what is considered 
top-tier in computer science. Here are brief comments I should have included earlier 
explaining how I came up with the three sources of computer science "high 
quality" conferences:

* Top Tier and 2nd tier conferences from
http://webdocs.cs.ualberta.ca/~zaiane/htmldocs/ConfRanking.html: In extensive 
searching for computer science conference rankings, this is the absolute best I 
could find, and most other rankings I found have either referred to or copied 
from this list.
* A-ranked conferences in Information and Computing Sciences from
http://lamp.infosys.deakin.edu.au/era/?page=cforsel10: This is the most 
exhaustive journal ranking exercise I have ever found anywhere. Unfortunately, 
I like you have serious questions about the face validity of these rankings; I 
think they heavily overrate many conferences in my own field of information 
systems; I assume the same is true with other fields that I don't know so well. 
(My primary reservation with conference or journal rankings by professors is 
that I strongly suspect that one of the main criteria for their rankings is 
whether or not they have published in that outlet before.) Unfortunately, I 
don't know of anything that approaches this ranking in comprehensiveness.
* We also considered including all WikiSym articles on Wikipedia: This is not 
because of any statement of WikiSym's quality, but simply because WikiSym is 
probably the closest thing that exists to an academic conference specifically 
for Wikipedia-related research.

Is there no widely-accepted listing of computer science conference rankings? You say, 
"Everyone in my field (HCI) pretty much knows what the first tier conferences are 
where wikipedia research is published." The problem is that I could say the same 
thing about my field, but another researcher would have a different list. There is 
generally consensus about the top two or three in any field, but the huge grey zone comes 
when you try to draw a line. Even your idea of getting small groups of experts to 
validate a number of conferences is pretty shaky, since another small group of experts 
would almost definitely give different results.

Citation counts are always a sticky issue; they depend mainly on indexing by 
citation count databases and recency of articles. However, I do consider them 
one of the most objective (not necessarily one of the best, but one of the most 
objective) criteria for paper quality. Based on your suggestion, I just now 
discovered that ACM Digital Library includes citation counts for conference 
papers. By way of brainstorming, I'm thinking of this possible inclusion rule:

* Calculate (a) the average citation count for Wikipedia articles (either only 
journal average, only conference average, or average of both), and (b) average 
citation count for each journal and/or conference that publishes Wikipedia 
research. (b) is basically (a) grouped by journal/conference.
* Rather than doing raw citation counts, we could try to calculate citations 
per year or some other weighting that recognizes that more recent articles 
would have fewer citations than older ones.
* Include all conference papers greater than the average (whichever average we 
choose) and/or include conference papers from all conferences greater than the 
average. Or we could include all conference papers whose average citations per 
year are greater than the average for journal articles. Or just include the top 
100 ranked conference papers, or however many we can handle.

Although still somewhat artificial, this could give possibly give us a somewhat objective 
basis to filter up the "higher quality" conference papers based on citation 
analysis.

I don't know if I'm trying to go far with this citation count possibility, but 
what do you all think?

Thanks again,
Chitu


-------- Message original --------
Sujet: Re: [Wiki-research-l] Wikipedia literature review - include or exclude 
conference articles (was Request to verify articles for Wikipedia literature 
review)
De : Travis Kriplean <[email protected]>
Pour : Research into Wikimedia content and communities
<[email protected]>
Copie à : Chitu Okoli <[email protected]>
Date : 15/03/2011 5:26 PM
Hey there,

I sympathize with your dilemma...and I think we might have actually talked 
about this at Wikimania 2009. Unfortunately, while you may be satisfied that 
600 journal articles + theses is enough (I certainly would be too), you should 
be equipped to recognize that if you keep it that way you are systematically 
excluding large, significant bodies of research deriving from computer science 
and HCI. As you make this choice, read through one or two of these conference 
papers and measure it against the quality of a randomly selected set of journal 
articles in your set:
   - http://dub.washington.edu/djangosite/media/papers/tmpZ77p1r.pdf
   - http://www.aaai.org/ocs/index.php/ICWSM/ICWSM10/paper/download/1485/1841
   - http://www.cs.cornell.edu/~danco/research/papers/suggestbot-iui07.pdf
   - http://users.soe.ucsc.edu/~luca/papers/07/wikiwww2007.pdf
   - http://portal.acm.org/citation.cfm?id18928

I bet that these conference papers are on the balance of higher quality than a 
random journal article in your set.

Unfortunately, there isn't a good answer for the best methods to follow. 
Everyone in my field (HCI) pretty much knows what the first tier conferences 
are where wikipedia research is published: CHI, CSCW, and UIST; and second tier 
at GROUP. These are all under the ACM SIGCHI banner (http://www.sigchi.org/). 
Another way to put this is that there are no objective measures, its a question 
of what the researchers themselves see as high quality. Ultimately, this is the 
same as with journals, although they tend to have impact factors. If I were to 
estimate how many high quality conference papers from the HCI angle there are, 
I would put it at about 20-30.

Of course, this is only for HCI research, not all CS research. Conferences such 
as WWW have published excellent research on Wikipedia, such as the initial 
paper out of the WikiTrust group, which, if you've been around the wiki 
community, know that they have had a big impact. WWW is considered to be a high 
quality CS conference. Likewise, there has been Wiki research published at 
database and AI conferences. For example, the Intelligence in Wikipedia project 
(summarized here http://portal.acm.org/citation.cfm?id20344).

Unfortunately, your two links to top conferences are pretty much inaccurate 
pictures of the CS conference field (for example, the deakin link puts GECCO as 
the top conference in one of the major categories, which is basically 
laughable). And while we might all love wikisym, it from an academic 
standpoint, it is definitely not a tier one venue.

I cringe to suggest this, but one possible methodology you might follow is to 
do citation count filtering, using, e.g. google scholar. Citations give you an 
indicator of whether other researchers have found it useful to draw on. Look at 
the average citation count of the journal papers, then filter your list of 1500 
conference papers down to those papers that have, say, twice the citations as 
the average citation count of a journal article.

Honestly though, your best methodology would be to have a small group of HCI 
researchers, a small group of AI researchers, and a small group of database 
researchers who have worked on wikipedia compile a list of the conference 
papers that they believe are best representative of the research that that 
community has done on wikipedia.

Hope that helps, and sorry to hear you still struggling with this issue.

Best,
Travis




On 3/15/11 11:56 AM, Chitu Okoli wrote:
James and Travis, you bring up a point that we have struggled back and
forth with for several months. We really, really would like to include
conference articles, but we just can't see how we could handle many more
articles than what we've got now. We've been working on and off on this
project for over two years now. (You can find works in progress at the
link at the bottom to my website.) We'd like to get it done eventually,
and we can only handle so many articles.

_______________________________________________
Wiki-research-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

Reply via email to