09:15-09:30 | Opening |
Session 1: Information Retrieval Theory | |
09:30-10:00 | A Formalization of Logical Imaging for Information Retrieval using Quantum Theory Guido Zuccon, Leif Azzopardi, Keith van Rijsbergen [paper] [slides] |
10:00-10:30 | Language Models and Smoothing Methods for Collections with Large Variation in Document Length Najeeb Abdulmutalib, Norbert Fuhr [paper] [slides] |
10:30-11:00 | Proximity estimation and hardness of short-text corpora Marcelo Errecalde, Diego Ingaramo, Paolo Rosso [paper] [slides] |
11:00-11:30 | Coffee break |
Session 2: Information Extraction and Deep Text Analysis | |
11:30-12:00 | Text Extraction from the Web via Text-to-Tag Ratio Tim Weninger, William Hsu [paper] [slides] |
12:00-12:30 | Content Code Blurring: A New Approach to Content Extraction Thomas Gottron [paper] [slides] |
12:30-13:00 | Meta Analysis within Authorship Verification Benno Stein, Nedim Lipka, Sven Meyer zu Eissen [paper] [slides] |
13:00-14:30 | Lunch |
Session 3: Clustering and Mining | |
14:30-15:00 | Semantically rich spaces for document clustering Roberto Basili, Paolo Marocco, Danele Milizia [paper] [slides] |
15:00-15:30 | Learning Visual Entities and their Visual Attributes from Text Corpora Erik Boiy, Koen Deschacht, Marie-Francine Moens [paper] [slides] |
15:30-16:00 | Topic Detection by Clustering Keywords Christian Wartena, Rogier Brussee [paper] [slides] |
16:00-16:30 | Coffee break |
Session 4: Advanced Application | |
16:30-17:00 | Enhanced Query Expansion in English-Arabic CLIR Abdelghani Bellaachia, Ghita Amor-Tijani [paper] |
17:00-17:30 | Using NLP and Ontologies for Notary Document Management Systems Flora Amato, Antonino Mazzeo, Antonio Penta, Antonio Picariello [paper] [slides] |
17:30 | Closing Remarks |
Intelligent algorithms for mining and retrieval are the key technology to cope with the information need challenges in our media-centered society. Methods for text-based information retrieval receive special attention, which results from the important role of written text, from the high availability of the Internet, and from the enormous importance of Web communities.
Advanced information retrieval and extraction uses methods from different areas: machine learning, computer linguistics and psychology, user interaction and modeling, information visualization, Web engineering, artificial intelligence, or distributed systems. The development of intelligent retrieval tools requires the understanding and combination of the achievements in these areas, and in this sense the workshop provides a common platform for presenting and discussing new solutions.
The following list organizes classic and ongoing topics from the field of text-based IR for which contributions are welcome:
The workshop is held for the eighth time. In the past, it was characterized by a stimulating atmosphere, and it attracted high quality contributions from all over the world. In particular, we encourage participants to present research prototypes and demonstration tools of their research ideas.
Submissions must generally be in electronic form using the Portable Document Format (PDF) or Postscript. It is the responsibility of authors to ensure that their papers use no unusual format features and are printable on a standard Postscript printer.
Please use our conference management system ConfDriver to submit your paper.