Information Retrieval and Search

Args [2017 - today]
The first search engine for arguments on the web.
[service] [api]
ChatNoir [2010 - today]
Research search engine indexing the ClueWeb and CommonCrawl.
[service] [api] (ChatNoir2)
[service] [api] (ChatNoir1)
Netspeak [2006 - today]
Technology for text correction and idiomatic writing.
[project] [service] [api] [video 1 2 3 4]
Plagiarism Detection
Plagiarism Detection [2005 - today]
Technology for automated plagiarism detection.
[project] [service] [demo: writing process] [video]
Retrieval Models
Retrieval Models [2011 - 2013]
Interactive map to overview and compare the characteristics of well-known retrieval models.
Query Segmentation
Query Segmentation [2010 - 2013]
Second-guess the user’s intent from a search query.
[project] [demo] [api]
Wikipedia Fingerprinting
Wikipedia Fingerprinting [2007]
Search engine implementing full text queries against Wikipedia based on fingerprinting.
BAT [2005 - 2006]
Browser extension to facilitate the accessibility of Web pages for the visually impaired.
AIsearch [2003 - 2006]
Meta search engine for Web document categorization and graphical access.
[project] [video]

Natural Language Processing and Computational Linguistics

Clickbait Logo
CLICKBAIT [2016 - today]
Analysis of clickbait messages in social media.
ArguAna for the Web [2016 - today]
Argumentation analysis for the web.
[project] [service: argument search] [api]
ArguAna [2012 - 2015]
Argumentation analysis in customer opinion mining.
InfexBA [2009 - 2011]
Information extraction for business applications.
[project] [demo: sentiment detection]
OpinionCloud [2008 - 2013]
On the fly comment summarization for YouTube and Flickr.
Person Resolution
Person Resolution [2007 - 2008]
Resolution of named entities in Web pages.
Market Forecast
Market Forecast [2005 - 2006]
Extraction and summarization of market forecast statements for a user-specified market.

Data Mining and Machine Learning

Deep Text Analytics
Deep Text Analytics [2015 - today]
Deep analysis of textual content in collaboration with Adobe.
Digital Engineering
Digital Engineering [2010 - today]
Data mining in artificially generated data to support modeling and simulation tasks.
Wikipedia Quality
Wikipedia Quality [2010 - today]
Analysis and prediction of quality flaws in Wikipedia.
Wikipedia Vandalism
Wikipedia Vandalism [2007 - today]
Analysis and detection of vandalism on Wikipedia.
[project] [service: spatio-temporal analysis]
CAIR [2010 - 2015]
Semantic cluster analysis in information retrieval.
Web Genres
Web Genres [2006 - 2008]
On the fly genre analysis for Web pages.
[project] [video]

Software Engineering and Tool Development

WAT-SL [2017 - today]
Web Annotation Tool for Segment Labeling.
[demo] [code]
Webis@Github [2013 - today]
Webis github account that hosts the source code for reproducing our research.
TIRA [2007 - today]
Online experiment configuration and execution.
[project] [service]
AItools [2003 - today]
Java library for information retrieval and data mining.
NoSQL Stores
NoSQL Databases [2014 - 2015]
Key-value storage systems for Big Data applications.