TIR 2010

7th International Workshop on Text-based Information Retrieval

in conjunction with DEXA 2010
University of Deusto
Bilbao, Spain
30 August - 3 September 2010

http://tir.webis.de

Workshop Program

Monday, August 30th, 2010

14:30-14:40 TIR-10 Opening
Session 1: Clustering and Classification
14:40-15:00 Extracting user interests from search query logs: A clustering approach
Lyes Limam, David Coquil, Harald Kosch, Lionel Brunie [paper] [slides]
15:00-15:30 A Comparison of Stylometric and Lexical Features for Web Genre Classification and Emotion Classification in Blogs
Elisabeth Lex, Andreas Juffinger, Michael Granitzer [paper] [slides]
15:30-16:00 Scalable Recursive Top-Down Hierarchical Clustering Approach with implicit Model Selection for Textual Data Sets
Markus Muhr, Vedran Sabol, Michael Granitzer [paper] [slides]
16:00-16:30 Using Progressive Filtering to Deal with Information Overload
Andrea Addis, Giuliano Armano, Eloisa Vargiu [paper] [slides]

Tuesday, August 31st, 2010

Session 2: IR and Web Search
09:00-09:20 Towards a search system for the Web exploiting spatial data of a web document
Stefan Dlugolinsky, Michal Laclavik, Ladislav Hluchy [paper] [slides]
09:20-09:40 A New Information Filtering Method for WebPages
Sergio López, Josep Silva [paper] [slides]
09:40-10:10 A Heuristic Search Strategy to Improve Web Queries
Matthias Hagen, Benno Stein [paper] [slides]
10:10-10:40 Direct Optimization of Evaluation Measures in Learning to Rank using Particle Swarm
Juan M. Fernández-Luna, Juan F. Huete, Óscar Alejo, Ramiro Pérez [paper] [slides]
10:40-11:00 Coffee break
Session 3: Keyword Extraction and Semantics
11:00-11:20 Thesaurus Based Term Ranking for Keyword Extraction
Luit Gazendam, Christian Wartena, Rogier Brussee [paper] [slides]
11:20-11:50 Keyword Extraction using Word Co-occurrence
Christian Wartena, Rogier Brussee, Wout Slakhorst [paper] [slides]
11:50-12:10 Identifying Sentence-Level Semantic Content Units with Topic Models
Leonhard Hennig, Thomas Strecker, Sascha Narr, Ernesto William De Luca, Sahin Albayrak [paper] [slides]
12:10 Closing Remarks

Call for Papers

Intelligent algorithms for mining and retrieval are the key technology for coping with the information needs of our media-centered society. Methods for text-based information retrieval receive special attention because of the important role of written text, the high availability of the Internet, and the enormous importance of Web communities.

Advanced information retrieval and extraction draw on methods from different areas: machine learning, computational linguistics and psychology, user interaction and modeling, information visualization, Web engineering, artificial intelligence, and distributed systems. The development of intelligent retrieval tools requires understanding and combining the achievements in these areas, and in this sense the workshop provides a common platform for presenting and discussing new solutions.

The following list organizes classic and ongoing topics from the field of text-based IR for which contributions are welcome:

  • Theory. Retrieval models, language models, similarity measures, formal analysis
  • Mining and Classification. Category formation, clustering, entity resolution, document classification
  • Web. Community mining, social network analysis, structured retrieval from XML documents
  • NLP. Text summarization, keyword extraction, topic identification
  • User Interface. Paradigms and algorithms for information visualization, personalization, privacy issues
  • User Context. Context models for IR, context analysis from user behavior and from social networks
  • Multilinguality. Cross-language retrieval, multilingual retrieval, machine translation for IR
  • Evaluation. Corpus construction, experiment design, conception of user studies
  • Semantic Web. Metadata analysis and tagging, knowledge extraction, inference, and maintenance
  • Software Engineering. Frameworks and architectures for retrieval technology, distributed IR

The workshop is being held for the seventh time. In the past, it was characterized by a stimulating atmosphere, and it attracted high-quality contributions from all over the world. In particular, we encourage participants to present research prototypes and demonstration tools of their research ideas.

Submission Guidelines

  • Research papers submitted to TIR-10 must be original contributions.
  • Submissions will be peer-reviewed by at least two experts from the related field.
  • The review process will be single-blind (the identity of authors is known to the reviewers).
  • Accepted papers will be published as IEEE proceedings by IEEE CS Press.
  • At least one author of each accepted paper must register for the conference and present the contribution.
  • Camera-ready papers are limited to 5 (five) pages in IEEE format (two columns, A4).
  • Papers must be written in English.

Submit a Contribution

Submissions must generally be in electronic form using the Portable Document Format (PDF) or PostScript. It is the responsibility of authors to ensure that their papers use no unusual format features and are printable on a standard PostScript printer.

Please use our conference management system ConfDriver to submit your paper.

Important Dates

  • Apr 05, 2010, 24:00 (CET): Postponed deadline for paper submission
  • Apr 20, 2010: Notification to authors
  • Jun 11, 2010: Postponed camera-ready copy due
  • Aug 30-31, 2010: TIR-10 Workshop

Organizing Committee

  • Benno Stein (Co-Chair), Bauhaus-Universität Weimar, Germany
  • Michael Granitzer (Co-Chair), Know-Center Graz & Graz University of Technology, Austria
  • Maik Anderka, Bauhaus-Universität Weimar, Germany

Program Committee

  • Mikhail Alexandrov, Autonomous University of Barcelona, Spain
  • Alberto Barrón-Cedeño, Universidad Politécnica de Valencia, Spain
  • Michael Busch, Twitter, USA
  • Richard Chbeir, Bourgogne University, France
  • Mario Döller, University of Passau, Germany
  • Ingo Frommholz, University of Glasgow, Scotland
  • Shlomo Geva, Queensland University of Technology, Australia
  • Thomas Gottron, Johannes Gutenberg University Mainz, Germany
  • Iryna Gurevych, Technical University of Darmstadt, Germany
  • Christian Gütl, Technical University Graz, Austria
  • Thomas Hampp-Bahnmueller, IBM Research, Germany
  • Andreas Hotho, University of Kassel, Germany
  • Roman Kern, Know-Center Graz, Austria
  • Ralf Klamma, RWTH Aachen, Germany
  • Elisabeth Lex, Know-Center Graz, Austria
  • Nedim Lipka, Bauhaus University Weimar, Germany
  • Matthias Lux, Klagenfurt University, Austria
  • Thomas Mandl, University of Hildesheim, Germany
  • Sven Meyer zu Eissen, Bayer Business Services GmbH, Germany
  • Oliver Niggemann, Ostwestfalen-Lippe University, Germany
  • Peter Prettenhofer, Bauhaus University Weimar, Germany
  • Paolo Rosso, Universidad Politécnica de Valencia, Spain
  • Marina Santini, University of Brighton, UK
  • Christin Seifert, Know-Center Graz, Austria
  • Ralf Schenkel, Max-Planck-Institut für Informatik, Germany
  • Marc Spaniol, Max-Planck-Institut für Informatik, Germany
  • Efstathios Stamatatos, University of the Aegean, Greece
  • Markus Strohmaier, Graz University of Technology, Austria
  • Andrew Trotman, University of Otago, New Zealand
  • Rob Warren, University of Zurich, Switzerland
  • Mario Zechner, Know-Center Graz, Austria
  • Guido Zuccon, University of Glasgow, Scotland