Research Challenges

Challenge
Summarization and Text Synthesis [2018 - today]
How to develop specialized summarization systems?
[publications] [data] [events]
Challenge
Computational Argumentation [2015 - today]
How to develop the next generation of search engines and personal assistants?
[publications] [data] [events]
Challenge
Authorship Analytics [2011 - today]
How to develop better algorithms for authorship analysis?
[publications] [data] [events]
Challenge
Computational Ethics [2010 - today]
How to develop effective algorithms for media credibility analysis and deception detection?
[publications] [data] [events]

Information Retrieval and Search

Acqua
ACQuA [2018 - today]
Answering Comparative Questions with Arguments.
[service: comparative argument search] [api]
Conversational Search
Conversational Search [2018 - today]
Research on information-seeking conversations with machines.
Args
Args [2017 - today]
The first search engine for arguments on the web.
[service: argument search engine] [api]
ChatNoir
ChatNoir [2010 - today]
Research search engine with ranking explanation indexing the ClueWeb and CommonCrawl.
[service: search engine] [api]
Netspeak
Netspeak [2006 - today]
Technology for text correction and idiomatic writing.
[service: phrase search] [api] [video]
Plagiarism Detection
Picapica [2005 - today]
Technology for automated plagiarism detection.
[service: reuse detection] [demo: writing process] [video]
Retrieval Models
Retrieval Models [2011 - 2013]
Interactive map to overview and compare the characteristics of well-known retrieval models.
Query Segmentation
Query Segmentation [2010 - 2013]
Second-guess the user’s intent from a search query.
[demo] [api]
Wikipedia Fingerprinting
Wikipedia Fingerprinting [2007]
Search engine implementing full text queries against Wikipedia based on fingerprinting.
AIsearch
AIsearch [2003 - 2006]
Meta search engine for Web document categorization and graphical access.
[video]

Natural Language Processing and Computational Linguistics

Science Studies Logo
Science Studies [2019 - today]
Analysis of scientific documents and actors.
Clickbait Logo
CLICKBAIT [2016 - today]
Analysis of clickbait messages in social media.
ArguAna
ArguAna for the Web [2016 - today]
Argumentation analysis for the web.
[service: args.me] [api: args.me]] [demo: essay scoring]
ArguAna
ArguAna [2012 - 2015]
Argumentation analysis in customer opinion mining.
[demo: review analysis]
InfexBA
InfexBA [2009 - 2011]
Information extraction for business applications.
[demo: sentiment detection]
OpinionCloud
OpinionCloud [2008 - 2013]
On the fly comment summarization for YouTube and Flickr.
Person Resolution
Person Resolution [2007 - 2008]
Resolution of named entities in Web pages.
Market Forecast
Market Forecast [2005 - 2006]
Extraction and summarization of market forecast statements for a user-specified market.

Data Mining and Machine Learning

Web Archive
Web Archive [2018 - today]
Analysis of the web using an 8 PB dataset of the Internet Archive's web archive.
Deep Text Analytics
Deep Text Analytics [2015 - today]
Deep analysis of textual content in collaboration with Adobe.
Digital Engineering
Digital Engineering [2010 - today]
Data mining in artificially generated data to support modeling and simulation tasks.
Wikipedia Quality
Wikipedia Quality [2010 - today]
Analysis and prediction of quality flaws in Wikipedia.
Wikipedia Vandalism
Wikipedia Vandalism [2007 - today]
Analysis and detection of vandalism on Wikipedia.
[service: spatio-temporal analysis]
CAIR
CAIR [2010 - 2015]
Semantic cluster analysis in information retrieval.
Web Genres
Web Genres [2006 - 2008]
On the fly genre analysis for Web pages.
[video]

Software Engineering and Tool Development

WAT-SL
WAT-SL [2017 - today]
Web Annotation Tool for Segment Labeling.
[demo: annotation tool] [code]
Webis@Github
Webis@Github [2013 - today]
Webis github account that hosts the source code for reproducing our research.
TIRA
TIRA [2007 - today]
Online experiment configuration and execution.
[service: shared task platform]
AItools
AItools [2003 - today]
Java library for information retrieval and data mining.