Open Thesis Topics

Students who are eager to develop their skills by doing a research-oriented thesis in our group should mail their interests to webis@listserv.uni-weimar.de. Suitable topic candidates are shown in the following list. Your own suggestions for topics are also welcome, for which you can draw inspiration from our recent publications.

  • Extreme Multi-Label Classification of German Book Titles
  • Facets of complexity in scholarly political language
  • Jointly Learning Decoupled Bi-Encoder Representations for Retrieval
  • Rating the Degree of Search Engine Optimization of Websites
  • Simplifying the language of political argumentation

Open Student Assistant Topics

Students who want to improve their skills and work with us can apply for a position as a student assistant at webis@listserv.uni-weimar.de. We are currently looking for assistants to work on the following topics:

  • We currently have no open topics. You can always let us know that you're interested.

Ongoing Theses

  • Halle
    • Simulation von Suchanfragen durch Anchortext (supervised by Maik Fröbe, Sebastian Günther, and Matthias Hagen)
  • Jena
    • Robust Out-of-Domain Re-ranking with Cross-Encoders by Fine-tuning on Heterogeneous Collections (supervised by Ferdinand Schlatt)
    • Answering Open-Ended Health-Related Questions based on Trusted Sources (supervised by Jan Heinrich Reimer, Alexander Bonarenko, and Matthias Hagen)
    • Re-Creating Twitter-based IR and NLP-Experiments on the Feediverse (supervised by Jan Heinrich Reimer and Matti Wiegmann)
    • Large-scale Query Log Analyses (supervised by Jan Heinrich Reimer)
  • Leipzig
    • Text2SQL. Exploring Relational Databases with Natural Language User Interfaces (supervised by Tim Gollub)
    • Psychological Features of Argumentation (supervised by Maximilian Heinrich)
    • Lightweight Passage Re-ranking Using Embeddings from Pre-trained Language Models (Supervised by Ferdinand Schlatt and Harry Scells)
    • Logical Features of Neural Networks (supervised by Maximilian Heinrich)
    • Classification of Multimodal Social Media Posts (supervised by Tim Gollub)
    • Active Learning for Text Classification (supervised by Christian Kahmann and Christopher Schröder)
    • Incorporating Knowledge Graph Embeddings in Large Language Models (supervised by Ferdinand Schlatt)
    • Improving Compositionality of Images Generated by Stable Diffusion (supervised by Niklas Deckers)
    • Extracting Large-Scale Multimodal Datasets From Web Archives (supervised by Niklas Deckers)
    • Normdaten-Disambiguierung und Reconciliation auf Korpusdaten (supervised by Erik Körner and Felix Helfer)
    • Statistical Bootstrap Tests with Redundant Data (supervised by Maik Fröbe)
    • Collecting Fine-grained, Intesity-aware Annotations of Triggering Content. (supervised by Matti Wiegmann and Magdalena Wolska)
    • Mining Trigger Warnings from the Web and Social Media. (supervised by Matti Wiegmann)
  • Weimar
    • Language Model Evaluation Game (supervised by Johannes Kiesel)
    • Retrieval Augmented Generation for Enhanced Access to Industrial Documentation (supervised by Tim Gollub)
    • Chart Retrieval for Arguments (supervised by Johannes Kiesel)
    • Mimicking Personas of Dialog Participants with Large Language Models (supervised by Marcel Gohsen)
    • Personalizing Harm Reduction in Text Generation. (supervised by Marcel Gohsen and Matti Wiegmann)
    • Character-Driven Story Generation Through Character Networks (supervised by Marcel Gohsen)
    • Health-Related Queries in Large-Scale Query Logs (supervised by Jan Heinrich Reimer)
    • Information Extraction from Academic Mailing Lists (supervised by Tim Gollub)
    • Topic Segmentation with Large Language Models (supervised by Johannes Kiesel, Nailia Mirzakhmedova, and Matti Wiegmann)
    • Retrieval Augmented Generation for the IR-Anthology (supervised by Tim Gollub)
    • Mining Linked Data on Web Scale (supervised by Nikolay Kolyada)
    • Efficient and Effective Neural Translation Language Model for Search (Supervised by Harry Scells)

Resources for Students

Vacancies

Dear prospective PhD student, unsolicited applications to the Webis group (webis.de) are welcome. However, we cannot promise that open positions are available at the time of your application.

The Webis Group is a tightly cooperating research network, formed by computer science chairs at the universities of Groningen, Hannover, Jena, Kassel, Leipzig, and Weimar. Our mission is to tackle challenges of the information society by conducting basic and applied research with the goal of prototyping and evaluating future information systems. We are an experienced research group where team spirit and active collaboration has top priority. We are looking for open-minded graduates and PhDs who want to develop both as a researcher and as a person. The working language of our group is English; fluency in German is not required.

Interested students should have finished either a master or a PhD in computer science, mathematics, or a related field with excellent or very good grades. A solid background in mathematics and statistics is expected—as well as very good programming skills.

Benno Stein
Bauhaus-Universität Weimar
On behalf of the Webis group

Email: webis@listserv.uni-weimar.de
Web: webis.de