TIR 2008

Workshop Program

Monday, September 1st, 2008

09:15-09:30	Opening
	Session 1: Information Retrieval Theory
09:30-10:00	A Formalization of Logical Imaging for Information Retrieval using Quantum Theory Guido Zuccon, Leif Azzopardi, Keith van Rijsbergen [paper] [slides]
10:00-10:30	Language Models and Smoothing Methods for Collections with Large Variation in Document Length Najeeb Abdulmutalib, Norbert Fuhr [paper] [slides]
10:30-11:00	Proximity estimation and hardness of short-text corpora Marcelo Errecalde, Diego Ingaramo, Paolo Rosso [paper] [slides]
11:00-11:30	Coffee break
	Session 2: Information Extraction and Deep Text Analysis
11:30-12:00	Text Extraction from the Web via Text-to-Tag Ratio Tim Weninger, William Hsu [paper] [slides]
12:00-12:30	Content Code Blurring: A New Approach to Content Extraction Thomas Gottron [paper] [slides]
12:30-13:00	Meta Analysis within Authorship Verification Benno Stein, Nedim Lipka, Sven Meyer zu Eissen [paper] [slides]
13:00-14:30	Lunch
	Session 3: Clustering and Mining
14:30-15:00	Semantically rich spaces for document clustering Roberto Basili, Paolo Marocco, Danele Milizia [paper] [slides]
15:00-15:30	Learning Visual Entities and their Visual Attributes from Text Corpora Erik Boiy, Koen Deschacht, Marie-Francine Moens [paper] [slides]
15:30-16:00	Topic Detection by Clustering Keywords Christian Wartena, Rogier Brussee [paper] [slides]
16:00-16:30	Coffee break
	Session 4: Advanced Application
16:30-17:00	Enhanced Query Expansion in English-Arabic CLIR Abdelghani Bellaachia, Ghita Amor-Tijani [paper]
17:00-17:30	Using NLP and Ontologies for Notary Document Management Systems Flora Amato, Antonino Mazzeo, Antonio Penta, Antonio Picariello [paper] [slides]
17:30	Closing Remarks

Call for Papers

Intelligent algorithms for mining and retrieval are the key technology to cope with the information need challenges in our media-centered society. Methods for text-based information retrieval receive special attention, which results from the important role of written text, from the high availability of the Internet, and from the enormous importance of Web communities.

Advanced information retrieval and extraction uses methods from different areas: machine learning, computer linguistics and psychology, user interaction and modeling, information visualization, Web engineering, artificial intelligence, or distributed systems. The development of intelligent retrieval tools requires the understanding and combination of the achievements in these areas, and in this sense the workshop provides a common platform for presenting and discussing new solutions.

The following list organizes classic and ongoing topics from the field of text-based IR for which contributions are welcome:

Theory. Retrieval models, language models, similarity measures, formal analysis
Mining and Classification. Category formation, clustering, entity resolution, document classification
Web. Community mining, social network analysis, structured retrieval from XML documents
NLP. Text summarization, keyword extraction, topic identification
User Interface. Paradigms and algorithms for information visualization, personalization, privacy issues
User Context. Context models for IR, context analysis from user behavior and from social networks
Multilinguality. Cross-language retrieval, multilingual retrieval, machine translation for IR
Evaluation. Corpus construction, experiment design, conception of user studies
Semantic Web. Meta data analysis and tagging, knowledge extraction, inference, and maintenance
Software Engineering. Frameworks and architectures for retrieval technology, distributed IR

The workshop is held for the eighth time. In the past, it was characterized by a stimulating atmosphere, and it attracted high quality contributions from all over the world. In particular, we encourage participants to present research prototypes and demonstration tools of their research ideas.

Submission Guideline

Research papers submitted to TIR-08 must be original contributions.
Submissions will be peer-reviewed by at least two experts from the related field.
The review process will be single-blind (the identity of authors is known to the reviewers).
Accepted papers will be published as IEEE proceedings by IEEE CS Press.
At least one author of each accepted paper must register for the conference and present the contribution.
The camera ready papers are limited to 5 (five) pages in IEEE format (two columns in A4).
Papers must be written in English.

Submit a Contribution

Submissions must generally be in electronic form using the Portable Document Format (PDF) or Postscript. It is the responsibility of authors to ensure that their papers use no unusual format features and are printable on a standard Postscript printer.

Please use our conference management system ConfDriver to submit your paper.

Important Dates

~~Apr 08, 2008, 24:00 (CET):~~ Postponed deadline for paper submission
~~May 05, 2008:~~Notification to authors
~~Jun 06, 2008:~~ Camera-ready copy due
~~Sep 01, 2008:~~ TIR-08 Workshop

Organizing Committee

Benno Stein (Co-Chair), Bauhaus University Weimar
Sven Meyer zu Eissen (Co-Chair), Bauhaus University Weimar

Program Committee

Leif Azzopardi, University of Glasgow
Mikhail Alexandrov, Autonomous University of Barcelona
Michael Busch, IBM
Ingo Frommholz, University of Duisburg-Essen
Thomas Gottron, Johannes Gutenberg University Mainz
Josephine Griffith, National University of Ireland
Michael Granitzer, Know-Center Graz
Christian Gütl, Technical University Graz
Heiko Holzheuer, Lycos Europe
Andreas Hotho, University of Kassel
Álvaro Francisco Huertas-Rosero, University of Glasgow
Maciej Janik, University of Georgia
Vangelis Karkaletsis, National Center for Scientific Research "Demokritos"
Sebastian Marius Kirsch, Google
Matthias Lux, Klagenfurt University
Thomas Mandl, University of Hildesheim
Oliver Niggemann, dSPACE
Nick Pendar, Iowa State University
David Pinto, Benemerita Universidad Autónoma de Puebla
Martin Potthast, Bauhaus University Weimar
Paolo Rosso, Universidad Politécnica de Valencia
Marina Santini, University of Brighton
Serge Sharoff, University of Leeds
Peter Scheir, Know-Center Graz
Efstathios Stamatatos, University of the Aegean
Hans Friedrich Witschel, University of Leipzig
Guido Zuccon, University of Glasgow

TIR-08

5th International Workshop on Text-based Information Retrieval