===============================
DEADLINE EXTENDED to 12th March
===============================

FOURTH CALL FOR PAPERS

LREC 2010 Workshop on

Web Logs and Question Answering (WLQA2010)

Malta, Saturday 22nd May 2010

www.csis.ul.ie/wlqa2010

Submission deadline: 12th March 2010 (EXTENDED)

Motivation & Scope
==================

An Information Retrieval system takes a user query and returns a
ranked list of documents. On the other hand, a Question Answering
system provides an exact answer [1]. There has been quite a long
period of research in factoid QA driven by annual tracks at CLEF [2],
TREC [3] and NTCIR [4]. The result of this work has been that it is
possible to construct systems which can answer simple factoid queries
with high accuracy. This has led to the belief that QA is a "solved
problem" where no more research is required. However, the questions
are not usually from real users, they are devised by the assessors at
CLEF, TREC etc. Secondly, they are restricted to certain well-known
simple types which are only a small subset of the real questions
which people wish to ask. Thirdly, questions are considered in
isolation (or in some tracks a fixed group) and not in a dialogue
context whereas in our interactions with people all questions are
answered in context and with the possibility for clarification. Thus,
there is a need to inject new ideas into QA research.

Recently there has been much interest in Web query logs and in
particular methods for analysing these in order to extract
information which can be used to improve IR systems [5,6]. Logs are
typically extremely large and contain naturally occurring and noisy
data. Automatic techniques (using for example statistical approaches
or machine learning algorithms) are therefore necessary since manual
approaches are not generally feasible.

The purpose of the workshop, therefore, is to investigate how some of
the methods developed for analysing web logs within an implicit IR
context can be applied to QA. For example:

* Can the meaning of IR queries in logs be deduced automatically in
order to extract the corresponding questions from them?

* Can NLP techniques developed within QA, e.g. Named Entity
recognition be applied to the analysis of query logs?

* Can logs be used to deduce useful new forms of question (i.e. not
simple factoids) which could be looked at next by QA researchers?

* Can questions grouped into sessions be comprehended in such a way
as to deduce the underlying implicit natural language dialogue
consisting of a coherent sequence of questions where each follows
logically from both the previous ones and the system's responses to
them?

* Are there logs from real (or experimental) QA systems like
lexxe.com which can be obtained and what can be learned from them
from the perspective of designing evaluation tasks? What about logs
from sites like answers.com (where queries are answered by human
respondents)?

* Are QA query logs different from IR query logs? Do users behave
differently in QA systems? Can QA-style questions be identified
within an IR log?

* Can click-through data - where the aim of a question can be
inferred from the returned documents which are inspected - be used
for the development of QA systems for example for the deduction of
important query types and their links to IR queries?

* Are there logs of transcribed speech made from telephone QA systems
which can be obtained and what analysis could be carried out on
those, using for example techniques developed at related tracks at
CLEF such as Cross-Language Speech Retrieval (CL-SR) and Question
Answering on Script Transcription (QAST)?

Historically, QA was a combination of NLP and IR. Much web log
analysis is a form of IR in which the same problem of retrieval is
being approached from a different direction, namely the queries
themselves. Thus we are proposing here a new combination, namely QA
and log analysis. These fields are complementary and share the goal
of building better systems for users.

1. Prager, J. (2006). Open-Domain Question Answering (2006).
Foundations and
   Trends in Information Retrieval, 1 (2), 1-141.
2. CLEF (2009). http://www.clef-campaign.org. Accessed 2009.
3. TREC (2009). http://trec.nist.gov/. Accessed 2009.
4. NTCIR (2009). http://research.nii.ac.jp/ntcir/. Accessed 2009.
5. Jansen, J., Taksa, I., & Spink, A. (eds.) (2008). Handbook of Web
Log
   Analysis. Hershey, PA: IGI Global.
6. QLA Workshop (2009). http://ir.shef.ac.uk/cloughie/qlaw2009.

Submissions
===========

Authors are invited to submit original research papers addressing
questions on the lines listed above. Papers must be related to QA and
must involve the use of a query log (but not necessarily of a QA
system). Submissions will be reviewed by members of the programme
committee and judged on technical quality, clarity and relevance to
the workshop.

Papers should be no longer than 8 pages, set in accordance with LREC
guidelines and using the LaTeX or Word templates which are available
here: http://www.lrec-conf.org/lrec2010/?Author-s-Kit-and-Templates .

Papers should be submitted in pdf via the WLQA2010 START system:
https://www.softconf.com/lrec2010/WLQA2010/ . When using START,
authors will be asked to provide essential information about
resources (in a broad sense, i.e. also technologies, standards,
evaluation kits, etc.) that have been used for the work described in
the paper or are a new result of your research.  For further
information on this new iniative, please refer to http://www.lrec-
conf.org/lrec2010/?LREC2010-Map-of-Language-Resources .

Proceedings will be produced at the workshop and it is intended that
selected papers will be published in a journal special issue after
LREC has taken place.

Important Dates
===============

First Call for Papers: December 2009
Second Call for Papers: January 2010
Third Call for Papers: February 2010
Fourth Call for Papers: March 2010
Submission deadline: 12th March 2010 - EXTENDED
Notification of acceptance: 19th March 2010
Final versions of papers: 23rd March 2010
Workshop: At LREC, Saturday 22nd May 2010

Organisers
==========

Richard Sutcliffe
University of Limerick
Richard.Sutcliffe at ul dot ie

Udo Kruschwitz
University of Essex
udo at essex dot ac dot uk

Thomas Mandl
University of Hildesheim
mandl at uni-hildesheim dot de

Programme Committee
===================

Bettina Berendt
Katholieke Universiteit Leuven, Belgium

Gosse Bouma
Rijksuniversiteit Groningen, The Netherlands

Paul Clough
University of Sheffield, UK

Giorgio Di Nunzio
University of Padoa, Italy

Jim Jansen
Pennsylvania State University, USA

Johannes Leveling
Dublin City University, Ireland

Fabrizio Silvestri
ISTI-CNR, Italy

Tomek Strzalkowski
SUNY Albany, USA

José Luis Vicedo
University of Alicante, Spain

Kieran White
University of Limerick, Ireland




Thomas Mandl

*********************************************************************
apl. Prof. Dr. Thomas Mandl
Informationswissenschaft
Universität Hildesheim

Tel.: 05121 - 883 - 837
ma...@uni-hildesheim.de
http://www.uni-hildesheim.de/~mandl

*********************************************************************
LogCLEF                 http://www.uni-hildesheim.de/logclef
LogQA Workshop  http://www.csis.ul.ie/wlqa2010/
Studiengang IIM         http://www.uni-hildesheim.de/de/iim.htm

*********************************************************************

Antwort per Email an