Publications
2022
Saber Zerhoudi, Sebastian Günther, Kim Plassmeier, Timo Borst, Christin Seifert, Matthias Hagen, and Michael Granitzer. The SimIIR 2.0 Framework: User Types, Markov Model-Based Interaction Simulation, and Advanced Query Generation. In 31st ACM International Conference on Information and Knowledge Management (CIKM 2022), 2022. ACM. [bib] [copylink]
Sebastian Günther, Paul Göttert, and Matthias Hagen. Exploring LSTMs for Simulating Search Sessions in Digital Libraries. In 26th International Conference on Theory and Practice of Digital Libraries (TPDL 2022), 2022. Springer. [bib] [code] [copylink]
Theresa Elstner, Johannes Kiesel, Lars Meyer, Max Martius, Sebastian Schmidt, Benno Stein, and Martin Potthast. Visual Web Archive Quality Assessment. In 26th International Conference on Theory and Practice of Digital Libraries (TPDL 2022), 2022. Springer. [bib] [code] [copylink] [data]
Maik Fröbe, Christopher Akiki, Martin Potthast, and Matthias Hagen. How Train-Test Leakage Affects Zero-shot Retrieval. In Diego Arroyuelo and Barbara Poblete, editors, 29th International Symposium on String Processing and Information Retrieval (SPIRE 2022), November 2022. [arxiv] [bib] [code] [copylink]
Alexander Bondarenko, Magdalena Wolska, Stefan Heindorf, Lukas Blübaum, Axel-Cyrille Ngonga Ngomo, Benno Stein, Pavel Braslavski, Matthias Hagen, and Martin Potthast. A Benchmark for Causal Question Answering. In 29th International Conference on Computational Linguistics (COLING 2022), October 2022. Association for Computational Linguistics. [bib] [code] [copylink]
Ferdinand Schlatt, Dieter Bettin, Matthias Hagen, Benno Stein, and Martin Potthast. Mining Health-related Cause-Effect Statements with High Precision at Large Scale. In 29th International Conference on Computational Linguistics (COLING 2022), October 2022. Association for Computational Linguistics. [bib] [code] [copylink]
Maik Fröbe, Christopher Akiki, Martin Potthast, and Matthias Hagen. Noise-Reduction for Automatically Transferred Relevance Judgments. In Experimental IR Meets Multilinguality, Multimodality, and Interaction - 13th International Conference of the CLEF Association, CLEF 2022, Bologna - Italy, September 5-8, 2022, Proceedings, Lecture Notes in Computer Science, September 2022. Springer. [bib] [copylink] [slides]
Jan Heinrich Reimer, Johannes Huck, and Alexander Bondarenko. Grimjack at Touché 2022: Axiomatic Re-ranking and Query Reformulation. In Guglielmo Faggioli, Nicola Ferro, Allan Hanbury, and Martin Potthast, editors, Working Notes Papers of the CLEF 2022 Evaluation Labs, volume 3180 of CEUR Workshop Proceedings, September 2022. [bib] [code] [copylink] [event] [publisher]
Alexander Bondarenko, Maik Fröbe, Johannes Kiesel, Shahbaz Syed, Timon Gurcke, Meriem Beloucif, Alexander Panchenko, Chris Biemann, Benno Stein, Henning Wachsmuth, Martin Potthast, and Matthias Hagen. Overview of Touché 2022: Argument Retrieval. In Alberto Barrón-Cedeño et al., editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. 13th International Conference of the CLEF Association (CLEF 2022), Lecture Notes in Computer Science, September 2022. Springer. [bib] [clef working notes] [copylink] [ecir invited paper] [event]
Lukas Gienapp, Maik Fröbe, Matthias Hagen, and Martin Potthast. Sparse Pairwise Re-ranking with Pre-trained Transformers. In 2022 ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR '22), July 2022. ACM. [arxiv] [bib] [code] [copylink] [slides]
Alexander Bondarenko, Maik Fröbe, Jan Heinrich Reimer, Benno Stein, Michael Völske, and Matthias Hagen. Axiomatic Retrieval Experimentation with ir_axioms. In Enrique Amigó et al., editors, 45th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2022), pages 3131-3140, July 2022. ACM. [bib] [code] [copylink] [doi] [poster] [publisher] [slides] [video]
Yamen Ajjour, Pavel Braslavski, Alexander Bondarenko, and Benno Stein. Identifying Argumentative Questions in Web Search Logs. In Enrique Amigó et al., editors, 45th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2022), pages 2393-2399, July 2022. ACM. [bib] [copylink] [doi] [poster] [publisher] [slides]
Johannes Kiesel. Harnessing Web Archives to Tackle Selected Societal Challenges. Dissertation, Bauhaus-Universität Weimar, June 2022. [bib] [copylink] [doi] [slides]
Aarohi Srivastava and others. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. CoRR, abs/2206.04615, June 2022. [bib] [code] [copylink] [publisher]
Svitlana Vakulenko, Johannes Kiesel, and Maik Fröbe. SCAI-QReCC Shared Task on Conversational Question Answering. In Nicoletta Calzolari et al., editors, 14th Language Resources and Evaluation Conference (LREC 2022), pages 4913-4922, June 2022. European Language Resources Association (ELRA). [arxiv] [bib] [code] [copylink] [data] [publisher] [slides] [video]
Matti Wiegmann, Michael Völske, Martin Potthast, and Benno Stein. Language Models as Context-sensitive Word Search Engines. In Huang, Ting-Hao 'Kenneth' et al., editors, Proceedings of the 1st Workshop on Intelligent and Interactive Writing Assistants (In2Writing 2022), pages 39-45, May 2022. Association for Computational Linguistics. [bib] [code] [copylink] [data] [poster] [publisher] [slides]
Christopher Schröder, Andreas Niekler, and Martin Potthast. Revisiting Uncertainty-based Query Strategies for Active Learning with Transformers. In 60th Annual Meeting of the Association for Computational Linguistics: Findings (ACL 2022), May 2022. Association for Computational Linguistics. [arxiv] [bib] [code] [copylink] [poster] [publisher] [slides] [video]
Johannes Kiesel, Milad Alshomary, Nicolas Handke, Xiaoni Cai, Henning Wachsmuth, and Benno Stein. Identifying the Human Values behind Arguments. In Smaranda Muresan, Preslav Nakov, and Aline Villavicencio, editors, 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), pages 4459-4471, May 2022. Association for Computational Linguistics. [bib] [code] [copylink] [data] [doi] [poster] [slides] [video]
Matthias Hagen, Maik Fröbe, Artur Jurk, and Martin Potthast. Clickbait Spoiling via Question Answering and Passage Retrieval. In Smaranda Muresan, Preslav Nakov, and Aline Villavicencio, editors, 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), pages 7025-7036, May 2022. Association for Computational Linguistics. [arxiv] [bib] [code] [copylink] [data] [slides]
Maik Fröbe, Sebastian Günther, Alexander Bondarenko, Johannes Huck, and Matthias Hagen. Using Keyqueries to Reduce Misinformation in Health-Related Search Results. In 2nd Workshop on Reducing Online Misinformation through Credible Information Retrieval (ROMCIR 2022), CEUR Workshop Proceedings, April 2022. CEUR-WS.org. [bib] [code] [copylink] [slides]
Maik Fröbe, Sebastian Günther, Maximilian Probst, Martin Potthast, and Matthias Hagen. The Power of Anchor Text in the Neural Retrieval Era. In Matthias Hagen et al., editors, Advances in Information Retrieval. 44th European Conference on IR Research (ECIR 2022), Lecture Notes in Computer Science, April 2022. Springer. [bib] [code] [copylink] [data] [slides]
Janek Bevendorff, Berta Chulvi, Elisabetta Fersini, Annina Heini, Mike Kestemont, Krzysztof Kredens, Maximilian Mayerl, Reyner Ortega-Bueno, Piotr Pezik, Martin Potthast, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, Benno Stein, Matti Wiegmann, Magdalena Wolska, and Eva Zangerle. Overview of PAN 2022: Authorship Verification, Profiling Irony and Stereotype Spreaders, Style Change Detection, and Trigger Detection. In Matthias Hagen et al., editors, Advances in Information Retrieval. 43rd European Conference on IR Research (ECIR 2022), volume 13186 of Lecture Notes in Computer Science, pages 331-338, April 2021. Springer. [bib] [copylink] [doi] [event] [publisher]
Maik Fröbe, Nicola Lea Libera, and Matthias Hagen. City of Disguise: A Query Obfuscation Game on the ClueWeb. In Matthias Hagen et al., editors, Advances in Information Retrieval. 44th European Conference on IR Research (ECIR 2022), Lecture Notes in Computer Science, April 2022. Springer. [bib] [code] [copylink] [demo] [poster]
Johannes Kiesel, Volker Bernhard, Marcel Gohsen, Josef Roth, and Benno Stein. What is That? Crowdsourcing Questions to a Virtual Exhibition. In David Elsweiler, editors, 2022 Conference on Human Information Interaction & Retrieval (CHIIR 2022), pages 358-362, March 2022. ACM. [bib] [copylink] [data] [doi] [poster] [publisher] [research] [video]
Alexander Bondarenko, Ekaterina Shirshakova, and Matthias Hagen. A User Study on Clarifying Comparative Questions. In David Elsweiler, editors, 2022 Conference on Human Information Interaction & Retrieval (CHIIR 2022), pages 254-258, March 2022. ACM. [bib] [copylink] [doi] [poster] [publisher] [research]
Christopher Akiki, Lukas Gienapp, and Martin Potthast. Tracking Discourse Influence in Darknet Forums. CoRR, abs/2202.02081, February 2022. [bib] [code] [copylink] [publisher]
Vaibhav Kasturia, Marcel Gohsen, and Matthias Hagen. Query Interpretations from Entity-Linked Segmentations. In 15th ACM International Conference on Web Search and Data Mining (WSDM 2022), February 2022. ACM. [arxiv] [bib] [code] [copylink] [data] [doi] [poster] [publisher] [slides] [video]
Alexander Bondarenko, Yamen Ajjour, Valentin Dittmar, Niklas Homann, Pavel Braslavski, and Matthias Hagen. Towards Understanding and Answering Comparative Questions. In K. Selcuk Candan et al., editors, 15th ACM International Conference on Web Search and Data Mining (WSDM 2022), pages 66-74, February 2022. ACM. [bib] [code] [copylink] [data] [doi] [poster] [publisher] [research] [slides] [video]
2021
Xuke Hu, Hussein S. Al-Olimat, Jens Kersten, Matti Wiegmann, Friederike Klan, Yeran Sun, and Hongchao Fan. GazPNE: Annotation-Free Deep Learning for Place Name Extraction from Microblogs Leveraging Gazetteer and Synthetic Data by Rules. International Journal of Geographical Information Science, 0 (0) : 1-28, 2021. [bib] [copylink] [doi] [publisher]
Lukas Gienapp, Wolfgang Kircheis, Bjarne Sievers, Benno Stein, and Martin Potthast. STEREO: Scientific Text Reuse in Open Access Publications. CoRR, abs/2112.11800, December 2021. [bib] [code] [copylink] [data] [publisher]
Maik Fröbe, Eric Oliver Schmidt, and Matthias Hagen. Efficient Query Obfuscation with Keyqueries. In 20th International IEEE/WIC/ACM Conference on Web Intelligence (WI-IAT '21), December 2021. ACM. [bib] [code] [copylink] [doi] [publisher]
Wei-Fan Chen, Khalid Al-Khatib, Benno Stein, and Henning Wachsmuth. Controlled Neural Sentence-Level Reframing of News Articles. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih, editors, 26th Conference on Empirical Methods in Natural Language Processing: Findings (EMNLP 2021), pages 2683-2693, November 2021. Association for Computational Linguistics. [bib] [copylink] [doi] [publisher]
Erik Körner, Ahmad Dawar Hakimi, Gerhard Heyer, and Martin Potthast. Casting the Same Sentiment Classification Problem. In 26th Conference on Empirical Methods in Natural Language Processing: Findings (EMNLP 2021), pages 584-590, November 2021. Association for Computational Linguistics. [bib] [code] [copylink] [data] [doi] [poster] [publisher] [video]
Erik Körner, Gregor Wiedemann, Ahmad Dawar Hakimi, Gerhard Heyer, and Martin Potthast. On Classifying whether Two Texts are on the Same Side of an Argument. In 26th Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), pages 10130-10138, November 2021. Association for Computational Linguistics. [bib] [code] [copylink] [data] [doi] [poster] [publisher] [slides] [video]
Shahbaz Syed, Tariq Yousef, Khalid Al-Khatib, Stefan Jänicke, and Martin Potthast. SUMMARY EXPLORER: Visualizing the State of the Art in Text Summarization. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih, editors, 26th Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), pages 185-194, November 2021. Association for Computational Linguistics. [bib] [code] [copylink] [demo] [doi] [poster] [publisher]
Alexander Bondarenko, Ekaterina Shirshakova, Marina Driker, Matthias Hagen, and Pavel Braslavski. Misbeliefs and Biases in Health-Related Searches. In Gianluca Demartini et al., editors, 30th ACM International Conference on Information and Knowledge Management (CIKM 2021), pages 2894-2899, November 2021. ACM. [bib] [copylink] [data] [doi] [poster] [publisher] [slides] [video]
Alexander Bondarenko, Maik Fröbe, Marcel Gohsen, Sebastian Günther, Johannes Kiesel, Jakob Schwerter, Shahbaz Syed, Michael Völske, Martin Potthast, Benno Stein, and Matthias Hagen. Webis at TREC 2021: Deep Learning, Health Misinformation, and Podcasts Tracks. In Ellen M. Voorhees and Angela Ellis, editors, 30th International Text Retrieval Conference (TREC 2021), NIST Special Publication, November 2021. National Institute of Standards and Technology (NIST). [bib] [copylink] [poster] [research] [slides]
Milad Alshomary, Timon Gurke, Shahbaz Syed, Philipp Heinisch, Maximilian Spliethoever, Philipp Cimiano, Martin Potthast, and Henning Wachsmuth. Key Point Analysis via Contrastive Learning and Extractive Argument Summarization. In Khalid Al-Khatib, Yufang Hou, and Manfred Stede, editors, 8th Workshop on Argument Mining (ArgMining 2021) at EMNLP, pages 184-189, November 2021. Association for Computational Linguistics. [bib] [code] [copylink] [doi] [publisher] [slides]
Johannes Kiesel, Nico Reichenbach, Benno Stein, and Martin Potthast. Image Retrieval for Arguments Using Stance-Aware Query Expansion. In Khalid Al-Khatib, Yufang Hou, and Manfred Stede, editors, 8th Workshop on Argument Mining (ArgMining 2021) at EMNLP, pages 36-45, November 2021. Association for Computational Linguistics. [bib] [copylink] [data] [doi] [publisher] [research] [slides]
Jan Heinrich Reimer, Thi Kim Hanh Luu, Max Henze, and Yamen Ajjour. Modern Talking in Key Point Analysis: Key Point Matching using Pretrained Encoders. In Khalid Al-Khatib, Yufang Hou, and Manfred Stede, editors, 8th Workshop on Argument Mining (ArgMining 2021) at EMNLP, pages 175-183, November 2021. Association for Computational Linguistics. [bib] [code] [copylink] [doi] [publisher] [slides]
Kim Breitwieser, Allison Lahnala, Charles Welch, Lucie Flek, and Martin Potthast. Modeling Proficiency with Implicit User Representations. CoRR, abs/2110.08011, October 2021. [bib] [copylink] [publisher]
Christopher Akiki and Martin Potthast. BERTian Poetics: Constrained Composition with Masked LMs. CoRR, abs/2110.15181, October 2021. [bib] [code] [copylink] [publisher]
Maik Fröbe, Matthias Hagen, Janek Bevendorff, Michael Völske, Benno Stein, Christopher Schröder, Robby Wagner, Lukas Gienapp, and Martin Potthast. The Impact of Main Content Extraction on Near-Duplicate Detection. In Andreas Wagner, Christian Guetl, Michael Granitzer, and Stefan Voigt, editors, 3rd International Symposium on Open Search Technology (OSSYM 2021), October 2021. International Open Search Symposium. [arxiv] [bib] [copylink] [slides]
Janek Bevendorff, Martin Potthast, and Benno Stein. FastWARC: Optimizing Large-Scale Web Archive Analytics. In Andreas Wagner, Christian Guetl, Michael Granitzer, and Stefan Voigt, editors, 3rd International Symposium on Open Search Technology (OSSYM 2021), October 2021. International Open Search Symposium. [arxiv] [bib] [copylink]
Arno Simons, Wolfgang Kircheis, Marion Schmidt, Martin Potthast, and Benno Stein. When a social network writes science history: How Wikipedia frames innovation processes. In Framing Innovation in a Networked World. An Interdisciplinary Workshop, September 2021. [bib] [copylink] [event] [slides]
Eva Zangerle, Maximilian Mayerl, Martin Potthast, and Benno Stein. Overview of the Style Change Detection Task at PAN 2021. In Guglielmo Faggioli et al., editors, Working Notes Papers of the CLEF 2021 Evaluation Labs, volume 2936 of CEUR Workshop Proceedings, September 2021. [bib] [copylink] [event] [publisher]
Mike Kestemont, Enrique Manjavacas, Ilia Markov, Janek Bevendorff, Matti Wiegmann, Efstathios Stamatatos, Benno Stein, and Martin Potthast. Overview of the Cross-Domain Authorship Verification Task at PAN 2021. In Guglielmo Faggioli et al., editors, Working Notes Papers of the CLEF 2021 Evaluation Labs, volume 2936 of CEUR Workshop Proceedings, September 2021. [bib] [copylink] [event] [publisher]
Christopher Akiki, Maik Fröbe, Matthias Hagen, and Martin Potthast. Learning to Rank Arguments with Feature Selection. In Guglielmo Faggioli et al., editors, Working Notes Papers of the CLEF 2021 Evaluation Labs, volume 2936 of CEUR Workshop Proceedings, September 2021. [bib] [copylink] [event] [publisher]
Martin Potthast, Sebastian Günther, Janek Bevendorff, Jan Philipp Bittner, Alexander Bondarenko, Maik Fröbe, Christian Kahmann, Andreas Niekler, Michael Völske, Benno Stein, and Matthias Hagen. The Information Retrieval Anthology. In Lernen. Wissen. Daten. Analysen. - LWDA 2021, September 2021. [bib] [copylink] [demo]
Alexander Bondarenko, Lukas Gienapp, Maik Fröbe, Meriem Beloucif, Yamen Ajjour, Alexander Panchenko, Chris Biemann, Benno Stein, Henning Wachsmuth, Martin Potthast, and Matthias Hagen. Overview of Touché 2021: Argument Retrieval. In K. Selçuk Candan et al., editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. 12th International Conference of the CLEF Association (CLEF 2021), volume 12880 of Lecture Notes in Computer Science, pages 450-467, September 2021. Springer. [bib] [clef working notes] [copylink] [doi] [ecir invited paper] [event] [publisher] [video]
Johannes Kiesel, Xiaoni Cai, Roxanne El Baff, Benno Stein, and Matthias Hagen. Toward Conversational Query Reformulation. In Omar Alonso, Marc Najork, and Gianmaria Silvello, editors, 2nd International Conference on Design of Experimental Search & Information Retrieval Systems (DESIRES 2021), volume 2950 of CEUR Workshop Proceedings, pages 91-101, September 2021. [bib] [copylink] [data] [publisher] [research] [slides] [video]
Khalid Al-Khatib, Lukas Trautner, Henning Wachsmuth, Yufang Hou, and Benno Stein. Employing Argumentation Knowledge Graphs for Neural Argument Generation. In The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), pages 4744-4754, August 2021. ACL-IJCNLP. [bib] [code] [copylink] [data] [doi] [publisher]
Xuke Hu, Zhiyong Zhou, Jens Kersten, Matti Wiegmann, and Friederike Klan. GazPNE2: A general and annotation-free place name extractor for microblogs fusing gazetteers and transformer models. In Piotr Jankowski, editors, 11th International Conference on Geographic Information Science (GIScience 2021), pages 53:1-53:6, August 2021. Leibniz-Zentrum für Informatik, Dagstuhl Publishing. [bib] [code] [copylink] [doi] [publisher]
Maximilian Spliethöver and Henning Wachsmuth. Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models. In Zhi-Hua Zhou, editors, Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21, pages 552-559, August 2021. International Joint Conferences on Artificial Intelligence Organization. [bib] [code] [copylink] [doi] [publisher]
Nikolay Kolyada, Martin Potthast, and Benno Stein. Beyond Metadata: What Paper Authors Say About Corpora They Use. In The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), pages 5085-5090, August 2021. ACL-IJCNLP. [bib] [code] [copylink] [data] [doi] [publisher] [slides]
Milad Alshomary, Shahbaz Syed, Arkajit Dhar, Martin Potthast, and Henning Wachsmuth. Argument Undermining: Counter-Argument Generation by Attacking Weak Premises. In The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), pages 1816-1827, August 2021. ACL-IJCNLP. [bib] [code] [copylink] [doi] [publisher]
Shahbaz Syed, Khalid Al-Khatib, Milad Alshomary, Henning Wachsmuth, and Martin Potthast. Generating Informative Conclusions for Argumentative Texts. In The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), pages 3482-3493, August 2021. ACL-IJCNLP. [bib] [code] [copylink] [data] [doi] [publisher]
Johannes Kiesel, Lars Meyer, Martin Potthast, and Benno Stein. Meta-Information in Conversational Search. ACM Transactions on Information Systems (ACM TOIS), 39 (4), August 2021. [bib] [copylink] [data] [doi] [publisher] [research] [slides] [video]
Christopher Schröder, Kim Bürgl, Yves Annanias, Andreas Niekler, Lydia Müller, Daniel Wiegreffe, Christian Bender, Christoph Mengs, Gerik Scheuermann, and Gerhard Heyer. Supporting Land Reuse of Former Open Pit Mining Sites using Text Classification and Active Learning. In The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), pages 4141-4152, August 2021. Association for Computational Linguistics. [bib] [copylink] [doi] [publisher]
Christopher Schröder, Lydia Müller, Andreas Niekler, and Martin Potthast. Small-Text: Active Learning for Text Classification in Python. CoRR, abs/2107.10314, July 2021. [arxiv] [bib] [code] [copylink] [publisher]
Yamen Ajjour, Khalid Al-Khatib, Philipp Cimiano, Roxanne El Baff, Basil Ell, Benno Stein, and Henning Wachsmuth, editors. Same Side Stance Classification Shared Task 2019, volume 2921 of CEUR Workshop Proceedings, July 2021. [bib] [copylink] [event] [publisher]
Benno Stein, Yamen Ajjour, Roxanne El Baff, Khalid Al-Khatib, Philipp Cimiano, and Henning Wachsmuth. Same Side Stance Classification. In Yamen Ajjour et al., editors, Same Side Stance Classification Shared Task 2019, volume 2921 of CEUR Workshop Proceedings, July 2021. [bib] [copylink] [event] [publisher]
Erik Körner, Gerhard Heyer, and Martin Potthast. Same Side Stance Classification Using Contextualized Sentence Embeddings. In Yamen Ajjour et al., editors, Same Side Stance Classification Shared Task 2019, volume 2921 of CEUR Workshop Proceedings, July 2021. [bib] [code] [copylink] [publisher]
Yamen Ajjour and Khalid Al-Khatib. Analysing the Submissions to the Same Side Stance Classification Task. In Yamen Ajjour et al., editors, Same Side Stance Classification Shared Task 2019, volume 2921 of CEUR Workshop Proceedings, July 2021. [bib] [copylink] [publisher]
Sebastian Günther and Matthias Hagen. Assessing Query Suggestions for Search Session Simulation. In Krisztian Balog et al., editors, Causality in Search and Recommendation (CSR) and Simulation of Information Retrieval Evaluation (Sim4IR) workshops at SIGIR 2021, volume 2911 of CEUR Workshop Proceedings, July 2021. [bib] [copylink] [publisher] [slides]
Martin Potthast, Sebastian Günther, Janek Bevendorff, Jan Philipp Bittner, Alexander Bondarenko, Maik Fröbe, Christian Kahmann, Andreas Niekler, Michael Völske, Benno Stein, and Matthias Hagen. The Information Retrieval Anthology. In Fernando Diaz et al., editors, 44th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2021), pages 2550-2555, July 2021. ACM. [bib] [code] [copylink] [demo] [doi] [publisher]
Maik Fröbe, Janek Bevendorff, Lukas Gienapp, Michael Völske, Benno Stein, Martin Potthast, and Matthias Hagen. CopyCat: Near-Duplicates within and between the ClueWeb and the Common Crawl. In Fernando Diaz et al., editors, 44th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2021), pages 2398-2404, July 2021. ACM. [bib] [code] [copylink] [doi] [poster] [publisher] [slides]
Markus Fischer, Kristof Komlossy, Benno Stein, Martin Potthast, and Matthias Hagen. Identifying Queries in Instant Search Logs. In Fernando Diaz et al., editors, 44th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2021), pages 1692-1696, July 2021. ACM. [bib] [code] [copylink] [data] [doi] [poster] [publisher] [slides]
Johannes Kiesel, Damiano Spina, Henning Wachsmuth, and Benno Stein. The Meant, the Said, and the Understood: Conversational Argument Search and Cognitive Biases. In Stephan Schlögl, Martin Porcheron, and Leigh Clark, editors, 3rd Conference on Conversational User Interfaces (CUI 2021), July 2021. ACM. [bib] [copylink] [doi] [publisher] [research] [slides] [video]
Michael Völske, Alexander Bondarenko, Maik Fröbe, Benno Stein, Jaspreet Singh, Matthias Hagen, and Avishek Anand. Towards Axiomatic Explanations for Neural Ranking Models. In Faegheh Hasibi, Yi Fang, and Akiko Aizawa, editors, 2021 ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR '21), pages 13-22, July 2021. ACM. [arxiv] [bib] [code] [copylink] [doi] [poster] [publisher]
Alexander Bondarenko, Ekaterina Shirshakova, Niklas Homann, and Matthias Hagen. ACQuA at Same Side Stance Classification 2019. In Yamen Ajjour et al., editors, Same Side Stance Classification Shared Task 2019, volume 2921 of CEUR Workshop Proceedings, July 2021. [bib] [code] [copylink] [publisher]
Khalid Al-Khatib, Tirthankar Ghosal, Yufang Hou, Anita de Waard, and Dayne Freitag. Argument Mining for Scholarly Document Processing: Taking Stock and Looking Ahead. In Second Workshop on Scholarly Document Processing at NAACL21, pages 56-65, June 2021. NAACL. [bib] [copylink] [doi] [publisher]
Martin Potthast, Benno Stein, and Matthias Hagen. The Information Retrieval Anthology 2021: Inaugural Status Report and Challenges Ahead. SIGIR Forum, 55 (1), June 2021. [bib] [copylink] [doi] [publisher]
Damiano Spina, Johanne R. Trippas, Paul Thomas, Hideo Joho, Byström Katriina, Leigh Clark, Nick Craswell, Mary Czerwinski, David Elsweiler, Alexander Frummet, Souvick Ghosh, Johannes Kiesel, Irene Lopatovska, Daniel McDuff, Selina Meyer, Ahmed Mourad, Owoicho Paul, Pathiyan Sachin Cherumanal, Daniel Russell, and Laurianne Sitbon. Report on the Future Conversations Workshop at CHIIR 2021. SIGIR Forum, 55 (1), June 2021. [bib] [copylink] [doi] [publisher]
Lidor Ivan, Shira Dvir Gvirsman, Mario Haim, and Martin Potthast. Don't Take the Bait: Users' Engagement with Clickbait and Its Effect on Editorial Considerations. In 71st Annual International Communication Association Conference (ICA 2021), May 2021. [bib] [copylink] [video]
Marion Schmidt, Wolfgang Kircheis, Arno Simons, Martin Potthast, and Benno Stein. Does Wikipedia Cover the Relevant Literature on Major Innovations Timely? An Exploratory Case Study of CRISPR/Cas9. In Wolfgang Glänzel, Sarah Heeffer, Pei-Shan Chi, and Ronald Rousseau, editors, 18th International Conference on Scientometrics & Informetrics (ISSI 2021), pages 1021-1026, May 2021. International Society for Scientometrics and Informetrics (I.S.S.I.). [bib] [copylink] [publisher] [video]
Henning Schmidgen, Benno Stein, Tim Gollub, Michael Braun, and Jan Willmann. Philosophische Körper. Von digitalem Text zu greifbarem Material. Zeitschrift für digitale Geisteswissenschaften (ZfdG), 6 (6), May 2021. [bib] [copylink] [doi] [publisher]
Viktoriia Chekalina, Alexander Bondarenko, Chris Biemann, Meriem Beloucif, Varvara Logacheva, and Alexander Panchenko. Which is Better for Deep Learning: Python or MATLAB? Answering Comparative Questions in Natural Language. In Dimitra Gkatzia and Djamé Seddah, editors, 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021), pages 302-311, April 2021. ACL. [bib] [copylink] [demo] [publisher] [research]
Matti Wiegmann, Jens Kersten, Hansi Senaratne, Martin Potthast, Friederike Klan, and Benno Stein. Opportunities and Risks of Disaster Data from Social Media: A Systematic Review of Incident Information. Natural Hazards and Earth System Sciences, 21 (5) : 1431-1444, April 2021. [bib] [copylink] [doi] [publisher]
Djoerd Hiemstra, Marie-Francine Moens, Josiane Mothe, Raffaele Perego, Martin Potthast, and Fabrizio Sebastiani, editors. 43rd International Conference on IR Research (ECIR 2021), volume 12656 of Lecture Notes in Computer Science, Springer, April 2021. [bib] [copylink] [doi]
Philipp Cimiano, Matthias Hagen, and Benno Stein, editors. Argumentation Technology, volume 63, De Gruyter Oldenbourg, April 2021. [bib] [copylink] [doi]
Johannes Kiesel, Lars Meyer, Florian Kneist, Benno Stein, and Martin Potthast. An Empirical Comparison of Web Page Segmentation Algorithms. In Djoerd Hiemstra et al., editors, Advances in Information Retrieval. 43rd European Conference on IR Research (ECIR 2021), volume 12657 of Lecture Notes in Computer Science, pages 62-74, March 2021. Springer. [bib] [code] [copylink] [data] [doi] [publisher] [slides] [video]
Janek Bevendorff, BERTa Chulvi, Gretel Liz De La Peña Sarracén, Mike Kestemont, Enrique Manjavacas, Ilia Markov, Maximilian Mayerl, Martin Potthast, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, Benno Stein, Matti Wiegmann, Magdalena Wolska, and Eva Zangerle. Overview of PAN 2021: Authorship Verification, Profiling Hate Speech Spreaders on Twitter, and Style Change Detection. In Djoerd Hiemstra et al., editors, Advances in Information Retrieval. 43rd European Conference on IR Research (ECIR 2021), volume 12036 of Lecture Notes in Computer Science, pages 567-573, March 2021. Springer. [bib] [copylink] [doi] [event] [publisher]
Dora Kiesel, Patrick Riehmann, Henning Wachsmuth, Benno Stein, and Bernd Fröhlich. Visual Analysis of Argumentation in Essays. IEEE Transactions on Visualization and Computer Graphics, 27 (2) : 1139-1148, February 2021. [bib] [copylink] [doi] [video]
Michael Völske, Janek Bevendorff, Johannes Kiesel, Benno Stein, Maik Fröbe, Matthias Hagen, and Martin Potthast. Web Archive Analytics. In Ralf H. Reussner, Anne Koziolek, and Robert Heinrich, editors, 50. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2020, volume P-307 of Lecture Notes in Informatics, LNI, pages 61-72, January 2021. Gesellschaft für Informatik, GI. [arxiv] [bib] [copylink] [doi] [publisher] [research]
2020
Henning Wachsmuth and Till Werner. Intrinsic Quality Assessment of Arguments. In 28th International Conference on Computational Linguistics (COLING 2020), pages 6739-6745, December 2020. International Committee on Computational Linguistics. [bib] [copylink] [doi] [publisher]
Shahbaz Syed, Wei-Fan Chen, Matthias Hagen, Benno Stein, Henning Wachsmuth, and Martin Potthast. Task Proposal: Abstractive Snippet Generation for Web Pages. In 13th International Conference on Natural Language Generation, pages 237-241, December 2020. Association for Computational Linguistics. [bib] [copylink] [data] [publisher]
Shahbaz Syed, Roxanne El Baff, Khalid Al-Khatib, Johannes Kiesel, Benno Stein, and Martin Potthast. News Editorials: Towards Summarizing Long Argumentative Texts. In 28th International Conference on Computational Linguistics (COLING 2020), December 2020. Association for Computational Linguistics. [bib] [code] [copylink] [data] [slides]
Khalid Al-Khatib, Viorel Morari, and Benno Stein. Style Analysis of Argumentative Texts by Mining Rhetorical Devices. In the 7th Workshop on Argument Mining, pages 106-116, December 2020. ACL. [bib] [copylink] [publisher]
Maximilian Spliethöver and Henning Wachsmuth. Argument from Old Man's View: Assessing Social Bias in Argumentation. In 7th Workshop on Argument Mining, pages 76-87, December 2020. Association for Computational Linguistics. [bib] [code] [copylink] [publisher]
Roxanne El Baff, Khalid Al-Khatib, Benno Stein, and Henning Wachsmuth. Persuasiveness of News Editorials Depending on Ideology and Personality. In Third Workshop on Computational Modeling of People's Opinions, Personality, and Emotions in Social Media co-located with COLING, December 2020. Association for Computational Linguistics. [bib] [code] [copylink] [data]
Christopher Akiki and Manuel Burghardt. Toward a Musical Sentiment (MuSe) Dataset for Affective Distant Hearing. In Folgert Karsdorp, Barbara McGillivray, Adina Nerghes, and Melvin Wevers, editors, Workshop on Computational Humanities Research (CHR 2020), volume 2723, pages 225-235, November 2020. [bib] [copylink] [event]
Janek Bevendorff, Alexander Bondarenko, Maik Fröbe, Sebastian Günther, Michael Völske, Benno Stein, and Matthias Hagen. Webis at TREC 2020: Health Misinformation Track. In Ellen M. Voorhees and Angela Ellis, editors, 29th International Text Retrieval Conference (TREC 2020), NIST Special Publication, November 2020. National Institute of Standards and Technology (NIST). [bib] [copylink] [research] [slides]
Wei-Fan Chen, Khalid Al-Khatib, Henning Wachsmuth, and Benno Stein. Analyzing Political Bias and Unfairness in News Articles at Different Levels of Granularity. In 4th Workshop on Natural Language Processing and Computational Social Science, pages 149-154, October 2020. [bib] [copylink] [data]
Michael Völske, Janek Bevendorff, Johannes Kiesel, Benno Stein, Maik Fröbe, Matthias Hagen, and Martin Potthast. Web Archive Analytics: Infrastructure & Applications @ Webis (extended abstract). In Andreas Wagner, Christian Guetl, Michael Granitzer, and Stefan Voigt, editors, 2nd International Symposium on Open Search Technology (OSSYM 2020), October 2020. International Open Search Symposium. [bib] [copylink] [doi] [poster] [publisher] [research]
Michael Völske, Janek Bevendorff, Johannes Kiesel, Benno Stein, Maik Fröbe, Matthias Hagen, and Martin Potthast. Towards an Open Web Index: Lessons From the Past. In Andreas Wagner, Christian Guetl, Michael Granitzer, and Stefan Voigt, editors, 2nd International Symposium on Open Search Technology (OSSYM 2020), October 2020. International Open Search Symposium. [bib] [copylink] [doi] [publisher] [research] [slides]
Johannes Kiesel, Florian Kneist, Lars Meyer, Kristof Komlossy, Benno Stein, and Martin Potthast. Web Page Segmentation Revisited: Evaluation Framework and Dataset. In Mathieu d'Aquin et al., editors, 29th ACM International Conference on Information and Knowledge Management (CIKM 2020), pages 3047-3054, October 2020. ACM. [bib] [code] [copylink] [data] [doi] [publisher] [research] [slides] [video]
Stefan Heindorf, Yan Scholten, Henning Wachsmuth, Axel-Cyrille Ngonga Ngomo, and Martin Potthast. CauseNet: Towards a Causality Graph Extracted from the Web. In Mathieu d'Aquin et al., editors, 29th ACM International Conference on Information and Knowledge Management (CIKM 2020), pages 3023-3030, October 2020. ACM. [bib] [code] [copylink] [data] [doi] [video]
Lukas Gienapp, Maik Fröbe, Matthias Hagen, and Martin Potthast. The Impact of Negative Relevance Judgments on NDCG. In Mathieu d'Aquin et al., editors, 29th ACM International Conference on Information and Knowledge Management (CIKM 2020), pages 2037-2040, October 2020. ACM. [bib] [code] [copylink] [doi] [slides] [video]
Lukas Gienapp, Benno Stein, Matthias Hagen, and Martin Potthast. Estimating Topic Difficulty Using Normalized Discounted Cumulated Gain. In Mathieu d'Aquin et al., editors, 29th ACM International Conference on Information and Knowledge Management (CIKM 2020), pages 2033-2036, October 2020. ACM. [bib] [code] [copylink] [doi] [slides] [video]
Wei-Fan Chen, Khalid Al-Khatib, Henning Wachsmuth, and Benno Stein. Detecting Media Bias in News Articles Using Gaussian Bias Distributions. In 25th Conference on Empirical Methods in Natural Language Processing: Findings (EMNLP 2020), pages 4290-4330, October 2020. [bib] [code] [copylink]
Christopher Akiki and Martin Potthast. Exploring Argument Retrieval with Transformers. In Linda Cappellato, Carsten Eickhoff, Nicola Ferro, and Aurélie Névéol, editors, Working Notes Papers of the CLEF 2020 Evaluation Labs, volume 2696, September 2020. [bib] [copylink] [event] [publisher] [slides]
Konstantin Kobs, Martin Potthast, Matti Wiegmann, Albin Zehe, Benno Stein, and Andreas Hotho. Towards Predicting the Subscription Status of Twitch.tv Users – ECML-PKDD ChAT Discovery Challenge 2020. In ECML-PKDD 2020 ChAT Discovery Challenge on Chat Analytics for Twitch, volume 2661 of CEUR Workshop Proceedings, September 2020. [bib] [copylink] [event] [publisher]
Janek Bevendorff, Bilal Ghanem, Anastasia Giachanou, Mike Kestemont, Enrique Manjavacas, Ilia Markov, Maximilian Mayerl, Martin Potthast, Francisco Rangel, Paolo Rosso, Günther Specht, Efstathios Stamatatos, Benno Stein, Matti Wiegmann, and Eva Zangerle. Overview of PAN 2020: Authorship Verification, Celebrity Profiling, Profiling Fake News Spreaders on Twitter, and Style Change Detection. In Avi Arampatzis et al., editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. 10th International Conference of the CLEF Initiative (CLEF 2020), volume 12260 of Lecture Notes in Computer Science, pages 372-383, September 2020. Springer. [bib] [copylink] [doi] [event] [publisher]
Alexander Bondarenko, Maik Fröbe, Meriem Beloucif, Lukas Gienapp, Yamen Ajjour, Alexander Panchenko, Chris Biemann, Benno Stein, Henning Wachsmuth, Martin Potthast, and Matthias Hagen. Overview of Touché 2020: Argument Retrieval. In Avi Arampatzis et al., editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. 11th International Conference of the CLEF Association (CLEF 2020), volume 12260 of Lecture Notes in Computer Science, pages 384-395, September 2020. Springer. [bib] [clef working notes] [copylink] [doi] [event] [publisher] [slides] [video]
Mike Kestemont, Enrique Manjavacas, Ilia Markov, Janek Bevendorff, Matti Wiegmann, Efstathios Stamatatos, Martin Potthast, and Benno Stein. Overview of the Cross-Domain Authorship Verification Task at PAN 2020. In Linda Cappellato, Carsten Eickhoff, Nicola Ferro, and Aurélie Névéol, editors, Working Notes Papers of the CLEF 2020 Evaluation Labs, volume 2696 of CEUR Workshop Proceedings, September 2020. [bib] [copylink] [event] [publisher]
Matti Wiegmann, Benno Stein, and Martin Potthast. Overview of the Celebrity Profiling Task at PAN 2020. In Linda Cappellato, Carsten Eickhoff, Nicola Ferro, and Aurélie Névéol, editors, Working Notes Papers of the CLEF 2020 Evaluation Labs, volume 2696 of CEUR Workshop Proceedings, September 2020. [bib] [copylink] [event] [publisher]
Eva Zangerle, Maximilian Mayerl, Günther Specht, Benno Stein, and Martin Potthast. Overview of the Style Change Detection Task at PAN 2020. In Linda Cappellato, Carsten Eickhoff, Nicola Ferro, and Aurélie Névéol, editors, Working Notes Papers of the CLEF 2020 Evaluation Labs, volume 2696 of CEUR Workshop Proceedings, September 2020. [bib] [copylink] [event] [publisher]
Philipp Cimiano, Gerhard Heyer, Michael Kohlhase, Benno Stein, Jürgen Ziegler, and Theo Härder, editors. Robust Argumentation Machines, volume 20 of Datenbank-Spektrum, Springer, July 2020. [bib] [copylink] [doi] [publisher]
Alexander Bondarenko, Alexander Panchenko, Meriem Beloucif, Chris Biemann, and Matthias Hagen. Answering Comparative Questions with Arguments. Datenbank-Spektrum, 20 (2) : 155-160, July 2020. [bib] [copylink] [demo] [doi] [publisher] [research]
Lukas Gienapp, Benno Stein, Matthias Hagen, and Martin Potthast. Efficient Pairwise Annotation of Argument Quality. In 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), pages 5772-5781, July 2020. Association for Computational Linguistics. [bib] [code] [copylink] [data] [publisher]
Roxanne El Baff, Henning Wachsmuth, Khalid Al-Khatib, and Benno Stein. Analyzing the Persuasive Effect of Style in News Editorial Argumentation. In 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), pages 3154-3160, July 2020. Association for Computational Linguistics. [bib] [code] [copylink] [data] [publisher] [video]
Janek Bevendorff, Khalid Al-Khatib, Martin Potthast, and Benno Stein. Crawling and Preprocessing Mailing Lists at Scale for Dialog Analysis. In 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), pages 1151-1158, July 2020. Association for Computational Linguistics. [bib] [code] [copylink] [data] [publisher] [video]
Milad Alshomary, Shahbaz Syed, Martin Potthast, and Henning Wachsmuth. Target Inference in Argument Conclusion Generation. In 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), July 2020. Association for Computational Linguistics. [bib] [code] [copylink]
Khalid Al-Khatib, Michael Völske, Shahbaz Syed, Nikolay Kolyada, and Benno Stein. Exploiting Personal Characteristics of Debaters for Predicting Persuasiveness. In 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), pages 7067-7072, July 2020. Association for Computational Linguistics. [bib] [code] [copylink] [data] [publisher]
Maik Fröbe, Janek Bevendorff, Jan Heinrich Reimer, Martin Potthast, and Matthias Hagen. Sampling Bias Due to Near-Duplicates in Learning to Rank. In 43rd International ACM Conference on Research and Development in Information Retrieval (SIGIR 2020), pages 1997-2000, July 2020. ACM. [bib] [code] [copylink] [doi] [publisher] [video]
Milad Alshomary, Nick Düsterhus, and Henning Wachsmuth. Extractive Snippet Generation for Arguments. In 43nd International ACM Conference on Research and Development in Information Retrieval (SIGIR 2020), July 2020. ACM. [bib] [copylink]
Avishek Anand, Lawrence Cavedon, Matthias Hagen, Hideo Joho, Mark Sanderson, and Benno Stein. Dagstuhl Seminar 19461 on Conversational Search. SIGIR Forum, 54 (1), June 2020. [bib] [copylink] [publisher]
Martin Potthast, Matthias Hagen, and Benno Stein. The Dilemma of the Direct Answer. SIGIR Forum, 54 (1), June 2020. [bib] [copylink] [publisher]
Matti Wiegmann, Jens Kersten, Friederike Klan, Martin Potthast, and Benno Stein. Analysis of Detection Models for Disaster-Related Tweets. In Amanda Lee Hughes, Fiona McNeill, and Christopher Zobel, editors, 17th ISCRAM Conference, May 2020. ISCRAM. [bib] [copylink]
Sebastian Bischoff, Niklas Deckers, Marcel Schliebs, Ben Thies, Matthias Hagen, Efstathios Stamatatos, Benno Stein, and Martin Potthast. The Importance of Suppressing Domain Style in Authorship Analysis. CoRR, abs/2005.14714, May 2020. [bib] [copylink] [publisher]
Avishek Anand, Lawrence Cavedon, Hideo Joho, Mark Sanderson, and Benno Stein. Conversational Search (Dagstuhl Seminar 19461). Dagstuhl Reports, 9 (11) : 34-83, April 2020. [arxiv] [bib] [copylink] [doi] [event] [publisher]
Wei-Fan Chen, Shahbaz Syed, Benno Stein, Matthias Hagen, and Martin Potthast. Abstractive Snippet Generation. In Yennung Huang, Irwin King, Tie-Yan Liu, and Maarten van Steen, editors, Web Conference (WWW 2020), pages 1309-1319, April 2020. ACM. [arxiv] [bib] [copylink] [doi] [publisher] [research] [video]
Maik Fröbe, Nina Schwanke, Matthias Hagen, and Martin Potthast. A Search Engine for Police Press Releases to Double-check the News. In Joemon M. Jose et al., editors, Advances in Information Retrieval. 42nd European Conference on IR Research (ECIR 2020), volume 12036 of Lecture Notes in Computer Science, pages 454-458, April 2020. Springer. [bib] [copylink] [demo] [doi] [publisher]
Maik Fröbe, Jan Philipp Bittner, Martin Potthast, and Matthias Hagen. The Effect of Content-Equivalent Near-Duplicates on the Evaluation of Search Engines. In Joemon M. Jose et al., editors, Advances in Information Retrieval. 42nd European Conference on IR Research (ECIR 2020), volume 12036 of Lecture Notes in Computer Science, pages 12-19, April 2020. Springer. [bib] [code] [copylink] [doi] [publisher]
Alexander Bondarenko, Matthias Hagen, Martin Potthast, Henning Wachsmuth, Meriem Beloucif, Chris Biemann, Alexander Panchenko, and Benno Stein. Touché: First Shared Task on Argument Retrieval. In Pablo Castells et al., editors, Advances in Information Retrieval. 42nd European Conference on IR Research (ECIR 2020), volume 12036 of Lecture Notes in Computer Science, pages 517-523, April 2020. Springer. [bib] [copylink] [doi] [event] [publisher]
Janek Bevendorff, Mike Kestemont, Enrique Manjavacas, Martin Potthast, Francisco Rangel, Paolo Rosso, Günther Specht, Efstathios Stamatatos, Benno Stein, Matti Wiegmann, and Eva Zangerle. Shared Tasks on Authorship Analysis at PAN 2020. In Pablo Castells et al., editors, Advances in Information Retrieval. 42nd European Conference on IR Research (ECIR 2020), volume 12036 of Lecture Notes in Computer Science, pages 508-516, April 2020. Springer. [bib] [copylink] [doi] [publisher]
Janek Bevendorff, Tobias Wenzel, Martin Potthast, Matthias Hagen, and Benno Stein. On Divergence-based Author Obfuscation: An Attack on the State of the Art in Statistical Authorship Verification. it - Information Technology, 62 (2) : 99-115, March 2020. [bib] [copylink] [doi] [publisher]
Johannes Kiesel, Kevin Lang, Henning Wachsmuth, Eva Hornecker, and Benno Stein. Investigating Expectations for Voice-based and Conversational Argument Search on the Web. In Luanne Freund et al., editors, 2020 Conference on Human Information Interaction & Retrieval (CHIIR 2020), pages 53-62, March 2020. ACM. [bib] [copylink] [data] [doi] [publisher] [research] [slides] [video]
Khalid Al-Khatib, Yufang Hou, Henning Wachsmuth, Charles Jochim, Francesca Bonin, and Benno Stein. End-to-End Argumentation Knowledge Graph Construction. In 34th AAAI Conference on Artificial Intelligence (AAAI 2020), pages 7367-7374, February 2020. AAAI. [bib] [copylink] [data] [doi]
Alexander Bondarenko, Pavel Braslavski, Michael Völske, Rami Aly, Maik Fröbe, Alexander Panchenko, Chris Biemann, Benno Stein, and Matthias Hagen. Comparative Web Search Questions. In James Caverlee, Xia (Ben) Hu, Mounia Lalmas, and Wei Wang, editors, 13th ACM International Conference on Web Search and Data Mining (WSDM 2020), pages 52-60, February 2020. ACM. [bib] [copylink] [data] [publisher] [research] [slides]
Krisztian Balog, Lucie Flekova, Matthias Hagen, Rosie Jones, Martin Potthast, Filip Radlinski, Mark Sanderson, Svitlana Vakulenko, and Hamed Zamani. Common Conversational Community Prototype: Scholarly Conversational Assistant. CoRR, abs/2001.06910, January 2020. [bib] [copylink] [publisher]
2019
Khalid Al-Khatib. Computational Analysis of Argumentation Strategies. Dissertation, Bauhaus-Universität Weimar, December 2019. [bib] [copylink] [data] [doi]
Shahbaz Syed, Michael Völske, Nedim Lipka, Benno Stein, Hinrich Schütze, and Martin Potthast. Towards Summarization for Social Media - Results of the TL;DR Challenge. In Kees van Deemter, Chenghua Lin, and Hiroya Takamura, editors, 12th International Natural Language Generation Conference (INLG 2019), November 2019. [bib] [copylink] [event] [publisher]
Roxanne El Baff, Henning Wachsmuth, Khalid Al-Khatib, Manfred Stede, and Benno Stein. Computational Argumentation Synthesis as a Language Modeling Task. In Kees van Deemter, Chenghua Lin, and Hiroya Takamura, editors, 12th International Natural Language Generation Conference (INLG 2019), pages 54-64, November 2019. ACL. [bib] [copylink] [doi] [publisher]
Yamen Ajjour, Milad Alshomary, Henning Wachsmuth, and Benno Stein. Modeling Frames in Argumentation. In Kentaro Inui, Jing Jiang, Vincent Ng, and Xiaojun Wan, editors, 24th Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP 2019), pages 2922-2932, November 2019. ACL. [bib] [copylink] [data] [publisher] [research]
Alexander Bondarenko, Maik Fröbe, Vaibhav Kasturia, Michael Völske, Benno Stein, and Matthias Hagen. Webis at TREC 2019: Decision Track. In Ellen M. Voorhees and Angela Ellis, editors, 28th International Text Retrieval Conference (TREC 2019), NIST Special Publication, November 2019. National Institute of Standards and Technology (NIST). [bib] [copylink] [research]
Wei-Fan Chen, Khalid Al-Khatib, Matthias Hagen, Henning Wachsmuth, and Benno Stein. Unraveling the Search Space of Abusive Language in Wikipedia with Dynamic Lexicon Acquisition. In Alberto Barrón-Cedeño et al., editors, 2nd Workshop on NLP for Internet Freedom (NLP4IF 2019) at EMNLP, November 2019. ACL. [bib] [copylink] [publisher]
Henning Wachsmuth. Argumentation Mining. Computational Linguistics, 45 (3) : 603-606, October 2019. [bib] [copylink] [doi] [publisher]
Tim Gollub, Leon Hutans, Tanveer Al Jami, and Benno Stein. Exploratory Search Pipes with Scoped Facets. In 2019 ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR 2019), October 2019. ACM. [bib] [copylink] [doi] [poster] [publisher] [research] [slides]
Paolo Rosso, Martin Potthast, Benno Stein, Efstathios Stamatatos, Francisco Rangel, and Walter Daelemans. Evolution of the PAN Lab on Digital Text Forensics. In Nicola Ferro and Carol Peters, editors, Information Retrieval Evaluation in a Changing World, The Information Retrieval Series, Springer. September 2019. [bib] [copylink] [doi]
Eva Zangerle, Michael Tschuggnall, Günther Specht, Benno Stein, and Martin Potthast. Overview of the Style Change Detection Task at PAN 2019. In Linda Cappellato, Nicola Ferro, David E. Losada, and Henning Müller, editors, Working Notes Papers of the CLEF 2019 Evaluation Labs, volume 2380 of CEUR Workshop Proceedings, September 2019. [bib] [copylink] [event] [publisher]
Matti Wiegmann, Benno Stein, and Martin Potthast. Overview of the Celebrity Profiling Task at PAN 2019. In Linda Cappellato, Nicola Ferro, David E. Losada, and Henning Müller, editors, Working Notes Papers of the CLEF 2019 Evaluation Labs, volume 2380 of CEUR Workshop Proceedings, September 2019. [bib] [copylink] [event] [publisher]
Mike Kestemont, Efstathios Stamatatos, Enrique Manjavacas, Walter Daelemans, Martin Potthast, and Benno Stein. Overview of the Cross-domain Authorship Attribution Task at PAN 2019. In Linda Cappellato, Nicola Ferro, David E. Losada, and Henning Müller, editors, Working Notes Papers of the CLEF 2019 Evaluation Labs, volume 2380 of CEUR Workshop Proceedings, September 2019. [bib] [copylink] [event] [publisher]
Martin Potthast, Tim Gollub, Matti Wiegmann, and Benno Stein. TIRA Integrated Research Architecture. In Nicola Ferro and Carol Peters, editors, Information Retrieval Evaluation in a Changing World, The Information Retrieval Series, Springer. September 2019. [bib] [copylink] [doi]
Walter Daelemans, Mike Kestemont, Enrique Manjavacas, Martin Potthast, Francisco Rangel, Paolo Rosso, Günther Specht, Efstathios Stamatatos, Benno Stein, Michael Tschuggnall, Matti Wiegmann, and Eva Zangerle. Overview of PAN 2019: Bots and Gender Profiling, Celebrity Profiling, Cross-domain Authorship Attribution and Style Change Detection. In Fabio Crestani et al., editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. 10th International Conference of the CLEF Initiative (CLEF 2019), volume 11696 of Lecture Notes in Computer Science, pages 402-416, September 2019. Springer. [bib] [copylink] [doi] [event] [publisher]
Yamen Ajjour, Henning Wachsmuth, Johannes Kiesel, Martin Potthast, Matthias Hagen, and Benno Stein. Data Acquisition for Argument Search: The args.me corpus. In Christoph Benzmüller and Heiner Stuckenschmidt, editors, 42nd German Conference on Artificial Intelligence (KI 2019), pages 48-59, September 2019. Springer. [award] [bib] [copylink] [data] [doi] [research]
Benno Stein and Henning Wachsmuth, editors. 6th Workshop on Argument Mining (ArgMining 2019) at ACL, Association for Computational Linguistics, August 2019. [bib] [copylink] [event] [publisher]
Alexander Panchenko, Alexander Bondarenko, Mirco Franzek, Matthias Hagen, and Chris Biemann. Categorizing Comparative Sentences. In Benno Stein and Henning Wachsmuth, editors, 6th Workshop on Argument Mining (ArgMining 2019) at ACL, pages 136-145, August 2019. Association for Computational Linguistics. [bib] [copylink] [data] [demo] [poster] [publisher] [research]
Martin Potthast, Lukas Gienapp, Florian Euchner, Nick Heilenkötter, Nico Weidmann, Henning Wachsmuth, Benno Stein, and Matthias Hagen. Argument Search: Assessing Argument Relevance. In 42nd International ACM Conference on Research and Development in Information Retrieval (SIGIR 2019), July 2019. ACM. [bib] [copylink] [doi] [publisher] [research]
Janek Bevendorff, Benno Stein, Matthias Hagen, and Martin Potthast. Bias Analysis and Mitigation in the Evaluation of Authorship Verification. In Anna Korhonen, Lluís Màrquez, and David Traum, editors, 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), pages 6301-6306, July 2019. Association for Computational Linguistics. [bib] [copylink] [poster] [publisher]
Christina Lohr, Johannes Kiesel, Stephanie Luther, Johannes Hellrich, Benno Stein, and Udo Hahn. Continuous Annotation Quality Control, Support for Hierarchically Structured Label Sets and Long-Segment Annotation with WAT-SL 2.0. In Annemarie Friedrich, Jet Hoek, and Deniz Zeyrek, editors, 13th Linguistic Annotation Workshop (LAW 2019) at ACL, July 2019. Association for Computational Linguistics. [bib] [code] [copylink] [demo] [poster] [publisher] [research]
Michael Völske, Ehsan Fatehifar, Benno Stein, and Matthias Hagen. Query-Task Mapping. In 42nd International ACM Conference on Research and Development in Information Retrieval (SIGIR 2019), July 2019. ACM. [bib] [copylink] [data] [doi] [poster] [publisher]
Janek Bevendorff, Martin Potthast, Matthias Hagen, and Benno Stein. Heuristic Authorship Obfuscation. In Anna Korhonen, Lluís Màrquez, and David Traum, editors, 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), pages 1098-1108, July 2019. Association for Computational Linguistics. [bib] [copylink] [publisher] [video]
Artem Chernodub, Oleksiy Oliynyk, Philipp Heidenreich, Alexander Bondarenko, Matthias Hagen, Chris Biemann, and Alexander Panchenko. TARGER: Neural Argument Mining at Your Fingertips. In Martha R. Costa-jussà and Enrique Alfonseca, editors, 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), pages 195-200, July 2019. Association for Computational Linguistics. [bib] [copylink] [demo] [poster] [publisher] [research]
Matti Wiegmann, Benno Stein, and Martin Potthast. Celebrity Profiling. In Anna Korhonen, Lluís Màrquez, and David Traum, editors, 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), pages 2611-2618, July 2019. Association for Computational Linguistics. [bib] [copylink] [poster] [publisher]
Michael Völske. Retrieval Enhancements for Task-based Web Search. Dissertation, Bauhaus-Universität Weimar, July 2019. [bib] [copylink] [doi] [slides]
Johannes Kiesel, Fabienne Hubricht, Benno Stein, and Martin Potthast. A Dataset for Content Error Detection in Web Archives. In Maria Bonn, Stephen J. Downie, Alain Martaus, and Dan Wu, editors, 18th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2019), pages 349-350, June 2019. ACM. [award] [bib] [copylink] [data] [doi] [poster] [publisher] [research] [slides]
Janek Bevendorff, Benno Stein, Matthias Hagen, and Martin Potthast. Generalizing Unmasking for Short Texts. In Jill Burstein, Christy Doran, and Thamar Solorio, editors, 14th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2019), pages 654-659, June 2019. Association for Computational Linguistics. [bib] [copylink] [publisher]
Johannes Kiesel, Maria Mestre, Rishabh Shukla, Emmanuel Vincent, Payam Adineh, David Corney, Benno Stein, and Martin Potthast. SemEval-2019 Task 4: Hyperpartisan News Detection. In 13th International Workshop on Semantic Evaluation (SemEval 2019), pages 829-839, June 2019. Association for Computational Linguistics. [bib] [copylink] [data] [doi] [event] [poster] [publisher] [research] [slides] [video]
Pertti Vakkari, Michael Völske, Martin Potthast, Matthias Hagen, and Benno Stein. Modeling the Usefulness of Search Results as Measured by Information Use. Information Processing & Management, 56 (3) : 879-894, May 2019. [bib] [copylink] [data] [doi] [publisher]
Stefan Heindorf, Yan Scholten, Gregor Engels, and Martin Potthast. Debiasing Vandalism Detection Models at Wikidata. In Web Conference (WWW 2019), May 2019. ACM. [bib] [copylink] [research]
Jens Kersten, Anna Kruspe, Matti Wiegmann, and Friederike Klan. Robust Filtering of Crisis-related Tweets. In Zeno Franco, José J. González, and José H. Canós, editors, 16th International Conference on Information Systems for Crisis Response And Management (ISCRAM 2019), pages 814-824, May 2019. [bib] [copylink]
Mathias Lux, Pål Halvorsen, Duc-Tien Dang-Nguyen, Håkon Stensland, Manoj Kesavulu, Martin Potthast, and Michael Riegler. Summarizing E-Sports Matches and Tournaments: The Example of Counter-Strike: Global Offensive. In 11th International Workshop on Immersive Mixed and Virtual Environment Systems (MMVE 2019), May 2019. ACM. [bib] [copylink] [doi] [poster] [slides]
Leif Azzopardi, Benno Stein, Norbert Fuhr, Philipp Mayr, Claudia Hauff, and Djoerd Hiemstra, editors. 41th International Conference on IR Research (ECIR 2019), volume 11437 of Lecture Notes in Computer Science, Springer, April 2019. [bib] [copylink] [doi]
Martin Potthast, Paolo Rosso, Efstathios Stamatatos, and Benno Stein. A Decade of Shared Tasks in Digital Text Forensics at PAN. In Leif Azzopardi et al., editors, Advances in Information Retrieval. 41st European Conference on IR Research (ECIR 2019), volume 11438 of Lecture Notes in Computer Science, pages 291-300, April 2019. Springer. [bib] [copylink] [doi] [poster] [research] [slides]
Milad Alshomary, Michael Völske, Tristan Licht, Henning Wachsmuth, Benno Stein, Matthias Hagen, and Martin Potthast. Wikipedia Text Reuse: Within and Without. In Leif Azzopardi et al., editors, Advances in Information Retrieval. 41st European Conference on IR Research (ECIR 2019), volume 11437 of Lecture Notes in Computer Science, pages 747-754, April 2019. Springer. [bib] [code] [copylink] [data] [demo] [doi] [research] [slides] [video] [wikipedia]
Johannes Kiesel, Arefeh Bahrami, Benno Stein, Avishek Anand, and Matthias Hagen. Clarifying False Memories in Voice-based Search. In Martin Halvey et al., editors, 2019 Conference on Human Information Interaction & Retrieval (CHIIR 2019), pages 331-335, March 2019. ACM. [bib] [copylink] [doi] [poster] [publisher] [research] [slides]
Matthias Schildwächter, Alexander Bondarenko, Julian Zenker, Matthias Hagen, Chris Biemann, and Alexander Panchenko. Answering Comparative Questions: Better than Ten-Blue-Links?. In Martin Halvey et al., editors, 2019 Conference on Human Information Interaction and Retrieval (CHIIR 2019), pages 361-365, March 2019. ACM. [bib] [copylink] [demo] [doi] [publisher] [research]
Andreas Bunte, Benno Stein, and Oliver Niggemann. Model-Based Diagnosis for Cyber-Physical Production Systems Based on Machine Learning and Residual-Based Diagnosis Models. In 33rd International Conference on Artificial Intelligence (AAAI 2019), February 2019. AAAI. [bib] [copylink] [poster] [research]
2018
Martin Potthast, Tim Gollub, Matthias Hagen, and Benno Stein. The Clickbait Challenge 2017: Towards a Regression Model for Clickbait Strength. CoRR, abs/1812.10847, December 2018. [bib] [copylink] [event] [publisher] [research]
Alexander Bondarenko, Michael Völske, Alexander Panchenko, Chris Biemann, Benno Stein, and Matthias Hagen. Webis at TREC 2018: Common Core Track. In Ellen M. Voorhees and Angela Ellis, editors, 27th International Text Retrieval Conference (TREC 2018), NIST Special Publication, November 2018. National Institute of Standards and Technology (NIST). [bib] [copylink] [research]
Yamen Ajjour, Henning Wachsmuth, Dora Kiesel, Patrick Riehmann, Fan Fan, Giuliano Castiglia, Rosemary Adejoh, Bernd Fröhlich, and Benno Stein. Visualization of the Topic Space of Argument Search Results in args.me. In Eduardo Blanco and Wei Lu, editors, 23rd Conference on Empirical Methods in Natural Language Processing (EMNLP 2018) – System Demonstrations, pages 60-65, November 2018. Association for Computational Linguistics. [bib] [copylink] [publisher] [research]
Shahbaz Syed, Michael Völske, Martin Potthast, Nedim Lipka, Benno Stein, and Hinrich Schütze. Task Proposal: The TL;DR Challenge. In Albert Gatt, Martijn Goudbeek, and Emiel Krahmer, editors, 11th International Natural Language Generation Conference (INLG 2018), pages 318-321, November 2018. Association for Computational Linguistics. [bib] [copylink] [data] [publisher]
Wei-Fan Chen, Henning Wachsmuth, Khalid Al-Khatib, and Benno Stein. Learning to Flip the Bias of News Headlines. In Albert Gatt, Martijn Goudbeek, and Emiel Krahmer, editors, 11th International Natural Language Generation Conference (INLG 2018), pages 79-88, November 2018. Association for Computational Linguistics. [bib] [copylink] [data] [publisher]
Ivan Habernal, Henning Wachsmuth, Iryna Gurevych, and Benno Stein. SemEval 2018 Task 12: The Argument Reasoning Comprehension Task. In Marianna Apidianaki et al., editors, 12th International Workshop on Semantic Evaluation (SemEval 2018), October 2018. Association for Computational Linguistics. [bib] [copylink] [event] [publisher] [research]
Johannes Kiesel, Florian Kneist, Milad Alshomary, Benno Stein, Matthias Hagen, and Martin Potthast. Reproducible Web Corpora: Interactive Archiving with Automatic Quality Assessment. Journal of Data and Information Quality (JDIQ), 10 (4) : 17:1-17:25, October 2018. [award] [bib] [code] [copylink] [data] [doi] [publisher] [research]
Frank Hopfgartner, Allan Hanbury, Henning Müller, Ivan Eggel, Krisztian Balog, Torben Brodt, Gordon V. Cormack, Jimmy Lin, Jayashree Kalpathy-Cramer, Noriko Kando, Makoto P. Kato, Anastasia Krithara, Tim Gollub, Martin Potthast, Evelyne Viegas, and Simon Mercer. Evaluation-as-a-Service for the Computational Sciences: Overview and Outlook. Journal of Data and Information Quality (JDIQ), 10 (4) : 15:1-15:32, October 2018. [bib] [copylink] [doi]
Mathias Lux, Michael Riegler, Duc-Tien Dang-Nguyen, Marcus Larson, Martin Potthast, and Pål Halvorsen. Team ORG @ GameStory Task 2018. In Martha Larson et al., editors, Working Notes of the MediaEval 2018 Workshop, volume 1866 of CEUR Workshop Proceedings, October 2018. [bib] [copylink] [publisher]
Mathias Lux, Michael Riegler, Duc-Tien Dang-Nguyen, Marcus Larson, Martin Potthast, and Pål Halvorsen. GameStory Task at MediaEval 2018. In Martha Larson et al., editors, Working Notes of the MediaEval 2018 Workshop, volume 1866 of CEUR Workshop Proceedings, October 2018. [bib] [copylink] [event] [publisher]
Dora Kiesel, Patrick Riehmann, Fan Fan, Yamen Ajjour, Henning Wachsmuth, Benno Stein, and Bernd Fröhlich. Improving Barycentric Embeddings of Topics Spaces. In IEEE VIS 2018, October 2018. IEEE. [bib] [copylink] [poster] [video]
Daniel Zeman, Jan Hajič, Martin Popel, Martin Potthast, Milan Straka, Filip Ginter, Joakim Nivre, and Slav Petrov. CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. In CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pages 1-21, October 2018. Association for Computational Linguistics. [bib] [copylink] [event] [publisher]
Roxanne El Baff, Henning Wachsmuth, Khalid Al-Khatib, and Benno Stein. Challenge or Empower: Revisiting Argumentation Quality in a News Editorial Corpus. In Anna Korhonen and Ivan Titov, editors, 22nd Conference on Computational Natural Language Learning (CoNLL 2018), pages 454-464, October 2018. Association for Computational Linguistics. [bib] [copylink] [data] [publisher]
Mathias Lux, Michael Riegler, Pål Halvorsen, Duc-Tien Dang-Nguyen, and Martin Potthast. E-Sports and Audience. In 12th Vienna Games Conference on Future and Reality of Gaming (FROG 2018), October 2018. [bib] [copylink]
Efstathios Stamatatos, Francisco Rangel, Michael Tschuggnall, Benno Stein, Mike Kestemont, Paolo Rosso, and Martin Potthast. Overview of PAN 2018: Author Identification, Author Profiling, and Author Obfuscation. In Patrice Bellot et al., editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. 9th International Conference of the CLEF Initiative (CLEF 2018), volume 11018 of Lecture Notes in Computer Science, pages 267-285, September 2018. Springer. [bib] [copylink] [doi] [event] [publisher]
Pertti Vakkari, Michael Völske, Matthias Hagen, Martin Potthast, and Benno Stein. Predicting Retrieval Success Based on Information Use for Writing Tasks. In Fabio Crestani et al., editors, 22nd International Conference on Theory and Practice of Digital Libraries (TPDL 2018), pages 161-173, September 2018. Springer. [bib] [copylink] [data] [doi] [publisher]
Tim Gollub, Erdan Genc, Nedim Lipka, and Benno Stein. Pseudo Descriptions for Meta-Data Retrieval. In 8th International Conference on the Theory of Information Retrieval (ICTIR 2018), pages 139-146, September 2018. ACM. [bib] [copylink] [doi] [publisher] [research]
Andreas Bunte, Oliver Niggemann, and Benno Stein. Integrating OWL Ontologies for Smart Services into AutomationML and OPC UA. In 23rd International Conference on Emerging Technologies and Factory Automation (ETFA 2018), pages 1383-1390, September 2018. [bib] [copylink] [doi] [research]
Mike Kestemont, Michael Tschuggnall, Efstathios Stamatatos, Walter Daelemans, Günther Specht, Benno Stein, and Martin Potthast. Overview of the Author Identification Task at PAN-2018: Cross-domain Authorship Attribution and Style Change Detection. In Linda Cappellato, Nicola Ferro, Jian-Yun Nie, and Laure Soulier, editors, Working Notes Papers of the CLEF 2018 Evaluation Labs, volume 2125 of CEUR Workshop Proceedings, September 2018. [bib] [copylink] [event] [publisher]
Francisco Rangel, Paolo Rosso, Manuel Montes-y-Gómez, Martin Potthast, and Benno Stein. Overview of the 6th Author Profiling Task at PAN 2018: Multimodal Gender Identification in Twitter. In Linda Cappellato, Nicola Ferro, Jian-Yun Nie, and Laure Soulier, editors, Working Notes Papers of the CLEF 2018 Evaluation Labs, volume 2125 of CEUR Workshop Proceedings, September 2018. [bib] [copylink] [event] [publisher]
Martin Potthast, Felix Schremmer, Matthias Hagen, and Benno Stein. Overview of the Author Obfuscation Task at PAN 2018: A New Approach to Measuring Safety. In Linda Cappellato, Nicola Ferro, Jian-Yun Nie, and Laure Soulier, editors, Working Notes Papers of the CLEF 2018 Evaluation Labs, volume 2125 of CEUR Workshop Proceedings, September 2018. [bib] [copylink] [event] [publisher]
Martin Potthast, Tim Gollub, Kristof Komlossy, Sebastian Schuster, Matti Wiegmann, Erika Patricia Garces Fernandez, Matthias Hagen, and Benno Stein. Crowdsourcing a Large Corpus of Clickbait on Twitter. In Emily M. Bender, Leon Derczynski, and Pierre Isabelle, editors, 27th International Conference on Computational Linguistics (COLING 2018), pages 1498-1507, August 2018. The COLING 2018 Organizing Committee. [bib] [copylink] [data] [poster] [publisher] [research]
Johannes Kiesel, Arjen P. de Vries, Matthias Hagen, Benno Stein, and Martin Potthast. WASP: Web Archiving and Search Personalized. In Omar Alonso and Gianmaria Silvello, editors, 1st International Conference on Design of Experimental Search & Information Retrieval Systems (DESIRES 2018), volume 2167 of CEUR Workshop Proceedings, pages 16-21, August 2018. [bib] [code] [copylink] [publisher] [research] [slides]
Henning Wachsmuth, Manfred Stede, Roxanne El Baff, Khalid Al-Khatib, Maria Skeppstedt, and Benno Stein. Argumentation Synthesis following Rhetorical Strategies. In Emily M. Bender, Leon Derczynski, and Pierre Isabelle, editors, 27th International Conference on Computational Linguistics (COLING 2018), pages 3753-3765, August 2018. Association for Computational Linguistics. [bib] [copylink] [publisher] [research]
Jiani Qu, Anny Marleen Hißbach, Tim Gollub, and Martin Potthast. Towards Crowdsourcing Clickbait Labels for YouTube Videos. In Yiling Chen and Gabrielle Kazai, editors, 6th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2018), July 2018. [bib] [copylink] [data] [research]
Henning Wachsmuth, Shahbaz Syed, and Benno Stein. Retrieval of the Best Counterargument without Prior Topic Knowledge. In Iryna Gurevych and Yusuke Miyao, editors, 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), pages 241-251, July 2018. Association for Computational Linguistics. [bib] [copylink] [publisher] [research] [slides]
Martin Potthast, Johannes Kiesel, Kevin Reinartz, Janek Bevendorff, and Benno Stein. A Stylometric Inquiry into Hyperpartisan and Fake News. In Iryna Gurevych and Yusuke Miyao, editors, 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), pages 231-240, July 2018. Association for Computational Linguistics. [arxiv] [bib] [code] [copylink] [data] [publisher] [research] [slides] [video]
Khalid Al-Khatib, Henning Wachsmuth, Kevin Lang, Jakob Herpel, Matthias Hagen, and Benno Stein. Modeling Deliberative Argumentation Strategies on Wikipedia. In Iryna Gurevych and Yusuke Miyao, editors, 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), pages 2545-2555, July 2018. Association for Computational Linguistics. [bib] [copylink] [publisher] [research]
Johannes Kiesel, Arefeh Bahrami, Benno Stein, Avishek Anand, and Matthias Hagen. Toward Voice Query Clarification. In 41st International ACM Conference on Research and Development in Information Retrieval (SIGIR 2018), pages 1257-1260, July 2018. ACM. [bib] [copylink] [doi] [poster] [publisher] [research]
Wei-Fan Chen, Matthias Hagen, Benno Stein, and Martin Potthast. A User Study on Snippet Generation: Text Reuse vs. Paraphrases. In 41st International ACM Conference on Research and Development in Information Retrieval (SIGIR 2018), pages 1033-1036, July 2018. ACM. [bib] [copylink] [doi] [poster] [publisher]
Ivan Habernal, Henning Wachsmuth, Iryna Gurevych, and Benno Stein. Before Name-calling: Dynamics and Triggers of Ad Hominem Fallacies in Web Argumentation. In Heng Ji, Amanda Stent, and Marilyn Walker, editors, 13th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018), June 2018. Association for Computational Linguistics. [bib] [copylink] [publisher] [research]
Ivan Habernal, Henning Wachsmuth, Iryna Gurevych, and Benno Stein. The Argument Reasoning Comprehension Task: Identification and Reconstruction of Implicit Warrants. In 13th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018), June 2018. Association for Computational Linguistics. [arxiv] [bib] [copylink] [research]
Tim Gollub, Martin Potthast, and Benno Stein. Shaping the Information Nutrition Label. In Dyaa Albakour et al., editors, 2nd International Workshop on Recent Trends in News Information Retrieval (NewsIR 2018) at ECIR, volume 2079 of CEUR Workshop Proceedings, pages 9-11, March 2018. [bib] [copylink] [poster] [publisher] [slides]
Janek Bevendorff, Benno Stein, Matthias Hagen, and Martin Potthast. Elastic ChatNoir: Search Engine for the ClueWeb and the Common Crawl. In Leif Azzopardi, Allan Hanbury, Gabriella Pasi, and Benjamin Piwowarski, editors, Advances in Information Retrieval. 40th European Conference on IR Research (ECIR 2018), Lecture Notes in Computer Science, March 2018. Springer. [bib] [copylink] [research]
Shahbaz Syed, Tim Gollub, Marcel Gohsen, Nikolay Kolyada, Benno Stein, and Matthias Hagen. Cross-Reading News. In Dyaa Albakour et al., editors, 2nd International Workshop on Recent Trends in News Information Retrieval (NewsIR 2018) at ECIR, volume 2079 of CEUR Workshop Proceedings, pages 24-26, March 2018. [bib] [copylink] [publisher]
Martin Potthast, Wei-Fan Chen, Matthias Hagen, and Benno Stein. A Plan for Ancillary Copyright: Original Snippets. In Dyaa Albakour et al., editors, 2nd International Workshop on Recent Trends in News Information Retrieval (NewsIR 2018) at ECIR, volume 2079 of CEUR Workshop Proceedings, pages 3-5, March 2018. [bib] [copylink] [publisher]
Matti Wiegmann, Michael Völske, Benno Stein, Matthias Hagen, and Martin Potthast. Heuristic Feature Selection for Clickbait Detection. CoRR, abs/1802.01191, February 2018. [bib] [copylink] [publisher]
Anna Kruspe, Jens Kersten, Matti Wiegmann, Benno Stein, and Friederike Klan. Classification of Incident-related Tweets: Tackling Imbalanced Training Data using Hybrid CNNs and Translation-based Data Augmentation. In Text REtrieval Conference (TREC), January 2018. [bib] [copylink] [publisher]
2017
Norbert Fuhr, Anastasia Giachanou, Gregory Grefenstette, Iryna Gurevych, Andreas Hanselowski, Kalervo Jarvelin, Rosie Jones, Yiqun Liu, Josiane Mothe, Wolfgang Nejdl, Isabella Peters, and Benno Stein. An Information Nutritional Label for Online Documents. SIGIR Forum, 51 (3) : 44-66, December 2017. [bib] [copylink] [doi] [publisher]
Stefan Heindorf, Martin Potthast, Gregor Engels, and Benno Stein. Overview of the Wikidata Vandalism Detection Task at WSDM Cup 2017. In Martin Potthast, Stefan Heindorf, and Hannah Bast, editors, WSDM Cup 2017 Notebook Papers, December 2017. [arxiv] [bib] [copylink] [event] [publisher] [research]
Matthias Hagen, Yamen Ajjour, Johannes Kiesel, Payam Adineh, and Benno Stein. Webis at TREC 2017: Open Search and Core Tracks. In Ellen M. Voorhees and Lori P. Buckland, editors, 26th International Text Retrieval Conference (TREC 2017), NIST Special Publication, November 2017. National Institute of Standards and Technology (NIST). [bib] [copylink] [publisher]
Matthias Hagen, Martin Potthast, Payam Adineh, Ehsan Fatehifar, and Benno Stein. Source Retrieval for Web-Scale Text Reuse Detection. In Ee-Peng Lim et al., editors, 26th ACM International Conference on Information and Knowledge Management (CIKM 2017), pages 2091-2094, November 2017. ACM. [bib] [copylink] [doi] [research]
Matthias Hagen, Johannes Kiesel, Milad Alshomary, and Benno Stein. Webis at the CLEF 2017 Dynamic Search Lab. In Linda Cappellato, Nicola Ferro, Lorraine Goeuriot, and Thomad Mandl, editors, Working Notes Papers of the CLEF 2017 Evaluation Labs, volume 1866 of CEUR Workshop Proceedings, September 2017. [bib] [copylink] [publisher]
Khalid Al-Khatib, Henning Wachsmuth, Matthias Hagen, and Benno Stein. Patterns of Argumentation Strategies across Topics. In Rebecca Hwa, Martha Palmer, and Sebastian Riedel, editors, 22nd Conference on Empirical Methods in Natural Language Processing (EMNLP 2017), pages 1362-1368, September 2017. Association for Computational Linguistics. [bib] [copylink] [publisher] [research]
Michael Völske, Martin Potthast, Shahbaz Syed, and Benno Stein. TL;DR: Mining Reddit to Learn Automatic Summarization. In Giuseppe Carenini, Jackie Chi Kit Cheung, Fei Liu, and Lu Wang, editors, Workshop on New Frontiers in Summarization at EMNLP 2017, pages 59-63, September 2017. Association for Computational Linguistics. [bib] [copylink] [data] [doi] [publisher] [research]
Michael Tschuggnall, Efstathios Stamatatos, Ben Verhoeven, Walter Daelemans, Günther Specht, Benno Stein, and Martin Potthast. Overview of the Author Identification Task at PAN 2017: Style Breach Detection and Author Clustering. In Linda Cappellato, Nicola Ferro, Lorraine Goeuriot, and Thomad Mandl, editors, Working Notes Papers of the CLEF 2017 Evaluation Labs, volume 1866 of CEUR Workshop Proceedings, September 2017. [bib] [copylink] [event] [publisher]
Francisco Rangel, Paolo Rosso, Martin Potthast, and Benno Stein. Overview of the 5th Author Profiling Task at PAN 2017: Gender and Language Variety Identification in Twitter. In Linda Cappellato, Nicola Ferro, Lorraine Goeuriot, and Thomad Mandl, editors, Working Notes Papers of the CLEF 2017 Evaluation Labs, volume 1866 of CEUR Workshop Proceedings, September 2017. [bib] [copylink] [event] [publisher]
Matthias Hagen, Martin Potthast, and Benno Stein. Overview of the Author Obfuscation Task at PAN 2017: Safety Evaluation Revisited. In Linda Cappellato, Nicola Ferro, Lorraine Goeuriot, and Thomad Mandl, editors, Working Notes Papers of the CLEF 2017 Evaluation Labs, volume 1866 of CEUR Workshop Proceedings, September 2017. [bib] [copylink] [event] [publisher]
Martin Potthast, Francisco Rangel, Michael Tschuggnall, Efstathios Stamatatos, Paolo Rosso, and Benno Stein. Overview of PAN 2017: Author Identification, Author Profiling, and Author Obfuscation. In Gareth J. F. Jones et al., editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. 8th International Conference of the CLEF Initiative (CLEF 2017), volume 10456 of Lecture Notes in Computer Science, pages 275-290, September 2017. Springer. [bib] [copylink] [event]
Henning Wachsmuth, Martin Potthast, Khalid Al-Khatib, Yamen Ajjour, Jana Puschmann, Jiani Qu, Jonas Dorsch, Viorel Morari, Janek Bevendorff, and Benno Stein. Building an Argument Search Engine for the Web. In Kevin Ashley et al., editors, 4th Workshop on Argument Mining (ArgMining 2017) at EMNLP, pages 49-59, September 2017. Association for Computational Linguistics. [bib] [copylink] [demo] [publisher] [research] [slides]
Yamen Ajjour, Wei-Fan Chen, Johannes Kiesel, Henning Wachsmuth, and Benno Stein. Unit Segmentation of Argumentative Texts. In Kevin Ashley et al., editors, 4th Workshop on Argument Mining (ArgMining 2017) at EMNLP, pages 118-128, September 2017. Association for Computational Linguistics. [bib] [code] [copylink] [doi] [publisher] [research]
Henning Wachsmuth, Giovanni Da San Martino, Dora Kiesel, and Benno Stein. The Impact of Modeling Overall Argumentation with Tree Kernels. In Rebecca Hwa, Martha Palmer, and Sebastian Riedel, editors, 22nd Conference on Empirical Methods in Natural Language Processing (EMNLP 2017), pages 2369-2379, September 2017. Association for Computational Linguistics. [bib] [code] [copylink] [publisher] [research] [slides] [video]
Daniel Zeman, Martin Popel, Milan Straka, Jan Hajic, Joakim Nivre, Filip Ginter, Juhani Luotolahti, Sampo Pyysalo, Slav Petrov, Martin Potthast, Francis Tyers, Elena Badmaeva, Memduh Gokirmak, Anna Nedoluzhko, Silvie Cinkova, Jan Hajic jr., Jaroslava Hlavacova, Václava Kettnerová, Zdenka Uresova, Jenna Kanerva, Stina Ojala, Anna Missilä, Christopher D. Manning, Sebastian Schuster, Siva Reddy, Dima Taji, Nizar Habash, Herman Leung, Marie-Catherine de Marneffe, Manuela Sanguinetti, Maria Simi, Hiroshi Kanayama, Valeria de Paiva, Kira Droganova, Héctor Martínez Alonso, Çağrı Çöltekin, Umut Sulubacak, Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Georg Rehm, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Michael Mandl, Jesse Kirchner, Hector Fernandez Alcalde, Jana Strnadová, Esha Banerjee, Ruli Manurung, Antonio Stella, Atsuko Shimada, Sookyoung Kwak, Gustavo Mendonca, Tatiana Lando, Rattima Nitisaroj, and Josie Li. CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. In CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pages 1-19, August 2017. Association for Computational Linguistics. [bib] [copylink] [doi] [event] [publisher] [research]
Henning Wachsmuth, Nona Naderi, Ivan Habernal, Yufang Hou, Graeme Hirst, Iryna Gurevych, and Benno Stein. Argumentation Quality Assessment: Theory vs. Practice. In Regina Barzilay and Min-Yen Kan, editors, 55th Annual Meeting of the Association for Computational Linguistics (ACL 2017), pages 250-255, August 2017. Association for Computational Linguistics. [bib] [code] [copylink] [poster] [research]
Matthias Hagen, Martin Potthast, Marcel Gohsen, Anja Rathgeber, and Benno Stein. A Large-Scale Query Spelling Correction Corpus. In Noriko Kando et al., editors, 40th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2017), pages 1261-1264, August 2017. ACM. [bib] [copylink] [data] [doi] [poster]
Benno Stein, Tim Gollub, and Maik Anderka. Retrieval Models. In Reda Alhajj and Jon G. Rokne, editors, Encyclopedia of Social Network Analysis and Mining (ESNAM), pages 1-7, Springer. August 2017. [bib] [copylink] [demo] [doi] [research]
Matthias Hagen and Benno Stein. Weblog Analysis. In Reda Alhajj and Jon G. Rokne, editors, Encyclopedia of Social Network Analysis and Mining (ESNAM), pages 1-9, Springer. August 2017. [bib] [copylink] [doi]
Astrid Frey, Matthias Hagen, and Benno Stein. Anomalieerkennung im Bereich Industrie 4.0. Industrie 4.0 Management, 33 (4) : 53-56, August 2017. [bib] [copylink] [research]
Henning Wachsmuth and Benno Stein. A Universal Model for Discourse-Level Argumentation Analysis. Special Section of the ACM Transactions on Internet Technology: Argumentation in Social Media (ACM TOIT), 17 (3) : 28:1-28:24, June 2017. [bib] [copylink] [doi] [research]
Johannes Kiesel, Martin Potthast, Matthias Hagen, and Benno Stein. Spatio-temporal Analysis of Reverted Wikipedia Edits. In Eleventh International AAAI Conference on Web and Social Media (ICWSM 2017), pages 122-131, May 2017. [award] [bib] [code] [copylink] [demo] [publisher] [research] [slides] [video]
Johannes Kiesel, Henning Wachsmuth, Khalid Al-Khatib, and Benno Stein. WAT-SL: A Customizable Web Annotation Tool for Segment Labeling. In Phil Blunsom, Alexander Koller, and Mirella Lapata, editors, Software Demonstrations at the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), pages 13-16, April 2017. [bib] [code] [copylink] [demo] [publisher] [research]
Henning Wachsmuth, Nona Naderi, Yufang Hou, Yonatan Bilu, Vinodkumar Prabhakaran, Tim Alberdingk Thijm, Graeme Hirst, and Benno Stein. Computational Argumentation Quality Assessment in Natural Language. In Phil Blunsom, Alexander Koller, and Mirella Lapata, editors, 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), pages 176-187, April 2017. [bib] [copylink] [publisher] [research] [slides]
Henning Wachsmuth, Benno Stein, and Yamen Ajjour. "PageRank" for Argument Relevance. In Phil Blunsom, Alexander Koller, and Mirella Lapata, editors, 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), pages 1116-1126, April 2017. Association for Computational Linguistics. [bib] [code] [copylink] [publisher] [research] [slides]
Stefan Heindorf, Martin Potthast, Hannah Bast, Björn Buchhold, and Elmar Haussmann. WSDM Cup 2017: Vandalism Detection and Triple Scoring. In 10th ACM International Conference on Web Search and Data Mining (WSDM 2017), pages 827-828, February 2017. ACM. [bib] [copylink] [doi] [event]
Martin Potthast, Christian Forler, Eik List, and Stefan Lucks. Passphone: Outsourcing Phone-Based Web Authentication While Protecting User Privacy. Cryptology ePrint Archive, Report 2017/158, 2017. [bib] [copylink] [publisher]
2016
Henning Wachsmuth, Khalid Al-Khatib, and Benno Stein. Using Argument Mining to Assess the Argumentation Quality of Essays. In Yuji Matsumoto and Rashmi Prasad, editors, 26th International Conference on Computational Linguistics (COLING 2016), pages 1680-1692, December 2016. Association for Computational Linguistics. [bib] [code] [copylink] [demo] [poster] [publisher] [research]
Khalid Al-Khatib, Henning Wachsmuth, Johannes Kiesel, Matthias Hagen, and Benno Stein. A News Editorial Corpus for Mining Argumentation Strategies. In Yuji Matsumoto and Rashmi Prasad, editors, 26th International Conference on Computational Linguistics (COLING 2016), pages 3433-3443, December 2016. Association for Computational Linguistics. [bib] [code] [copylink] [data] [poster] [publisher] [research]
Matthias Hagen, Johannes Kiesel, Payam Adineh, Masoud Alahyari, Ehsan Fatehifar, Arefeh Bahrami, Pia Fichtl, and Benno Stein. Webis at TREC 2016: Tasks, Total Recall, and Open Search Tracks. In Ellen M. Voorhees and Lori P. Buckland, editors, 25th International Text Retrieval Conference (TREC 2016), NIST Special Publication, November 2016. National Institute of Standards and Technology (NIST). [bib] [copylink] [publisher]
Martin Potthast, Christian Forler, Eik List, and Stefan Lucks. Passphone: Outsourcing Phone-Based Web Authentication While Protecting User Privacy. In Billy Bob Brumley and Juha Röning, editors, 21st Nordic Conference on Secure IT Systems (NordSec 2016), pages 235-255, November 2016. Springer. [bib] [copylink] [doi]
Matthias Hagen, Maximilian Michel, and Benno Stein. Simulating Ideal and Average Users. In Yi Chang et al., editors, 12th Asia Information Retrieval Societies Conference (AIRS 2016), November 2016. Springer. [bib] [copylink] [doi]
Tim Gollub, Matthias Busse, Benno Stein, and Matthias Hagen. Keyqueries for Clustering and Labeling. In Yi Chang et al., editors, 12th Asia Information Retrieval Societies Conference (AIRS 2016), pages 42-55, November 2016. Springer. [bib] [copylink] [data] [doi]
Matthias Hagen, Michael Völske, Steve Göring, and Benno Stein. Axiomatic Result Re-Ranking. In 25th ACM International Conference on Information and Knowledge Management (CIKM 2016), pages 721-730, October 2016. ACM. [bib] [copylink] [slides]
Stefan Heindorf, Martin Potthast, Benno Stein, and Gregor Engels. Vandalism Detection in Wikidata. In Snehasis Mukhopadhyay et al., editors, 25th ACM International Conference on Information and Knowledge Management (CIKM 2016), pages 327-336, October 2016. ACM. [award] [bib] [copylink] [doi] [research] [slides]
Efstathios Stamatatos, Michael Tschuggnall, Ben Verhoeven, Walter Daelemans, Günther Specht, Benno Stein, and Martin Potthast. Clustering by Authorship Within and Across Documents. In Krisztian Balog, Linda Cappellato, Nicola Ferro, and Craig Macdonald, editors, Working Notes Papers of the CLEF 2016 Evaluation Labs, volume 1609 of Lecture Notes in Computer Science, September 2016. [bib] [copylink] [publisher] [slides]
Francisco Rangel, Paolo Rosso, Ben Verhoeven, Walter Daelemans, Martin Potthast, and Benno Stein. Overview of the 4th Author Profiling Task at PAN 2016: Cross-Genre Evaluations. In Krisztian Balog, Linda Cappellato, Nicola Ferro, and Craig Macdonald, editors, Working Notes Papers of the CLEF 2016 Evaluation Labs, volume 1609 of Lecture Notes in Computer Science, September 2016. [bib] [copylink] [event] [publisher] [slides]
Martin Potthast, Matthias Hagen, and Benno Stein. Author Obfuscation: Attacking the State of the Art in Authorship Verification. In Krisztian Balog, Linda Cappellato, Nicola Ferro, and Craig Macdonald, editors, Working Notes Papers of the CLEF 2016 Evaluation Labs, volume 1609 of Lecture Notes in Computer Science, September 2016. [bib] [copylink] [publisher]
Paolo Rosso, Francisco Rangel, Martin Potthast, Efstathios Stamatatos, Michael Tschuggnall, and Benno Stein. Overview of PAN 2016–New Challenges for Authorship Analysis: Cross-genre Profiling, Clustering, Diarization, and Obfuscation. In Norbert Fuhr et al., editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. 7th International Conference of the CLEF Initiative (CLEF 2016), volume 9822 of Lecture Notes in Computer Science, pages 518-538, September 2016. Springer. [bib] [copylink] [doi] [event] [slides]
Tim Gollub, Nedim Lipka, Eunyee Koh, Erdan Genc, and Benno Stein. Topical Sequence Profiling. In A Min Tjoa, Zita Vale, and Roland Wagner, editors, 13th International Workshop on Text-based Information Retrieval (TIR 2016) at DEXA, pages 207-211, September 2016. IEEE. [bib] [copylink] [demo] [doi] [research]
Ingo Frommholz, Haider M. al-Khateeb, Martin Potthast, Zinnar Ghasem, Mitul Shukla, and Emma Short. On Textual Analysis and Machine Learning for Cyberstalking Detection. Datenbank-Spektrum, 16 (2) : 127-135, June 2016. [bib] [copylink] [doi]
Patrick Riehmann, Martin Potthast, Henning Gruendl, Johannes Kiesel, Dean Jürges, Giuliano Castiglia, Bagrat Ter-Akopyan, and Bernd Fröhlich. Visualizing Article Similarities in Wikipedia. In Tobias Isenberg and Filip Sadlo, editors, 18th Eurographics Conference on Visualization (EuroVis 2016), pages 69-71, June 2016. The Eurographics Association. [award] [bib] [copylink] [doi] [poster] [research]
Khalid Al-Khatib, Henning Wachsmuth, Matthias Hagen, Jonas Köhler, and Benno Stein. Cross-Domain Mining of Argumentative Text through Distant Supervision. In Kevin Knight, Ani Nenkova, and Owen Rambow, editors, 12th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2016), pages 1395-1404, June 2016. Association for Computational Linguistics. [bib] [copylink] [doi] [publisher] [research]
Henning Wachsmuth. Pipelines für effiziente und robuste Ad-hoc Textanalyse. In Ausgezeichnete Informatikdissertationen 2015, volume D-16 of Lecture Notes in Informatics, pages 329-338, May 2016. Gesellschaft für Informatik. [bib] [copylink] [publisher]
Iryna Gurevych, Eduard H. Hovy, Noam Slonim, and Benno Stein. Debating Technologies (Dagstuhl Seminar 15512). Dagstuhl Reports, 5 (12) : 18-46, April 2016. [bib] [copylink] [doi] [event] [publisher]
Matthias Hagen, Benno Stein, and Theo Härder, editors. Big Data and IR, volume 16 of Datenbank-Spektrum, Springer, March 2016. [bib] [copylink] [doi]
Martin Potthast, Sarah Braun, Tolga Buz, Fabian Duffhauss, Florian Friedrich, Jörg Marvin Gülzow, Jakob Köhler, Winfried Lötzsch, Fabian Müller, Maike Elisa Müller, Robert Paßmann, Bernhard Reinke, Lucas Rettenmeier, Thomas Rometsch, Timo Sommer, Michael Träger, Sebastian Wilhelm, Benno Stein, Efstathios Stamatatos, and Matthias Hagen. Who Wrote the Web? Revisiting Influential Author Identification Research Applicable to Information Retrieval. In Nicola Ferro et al., editors, Advances in Information Retrieval. 38th European Conference on IR Research (ECIR 2016), volume 9626 of Lecture Notes in Computer Science, pages 393-407, March 2016. Springer. [bib] [copylink] [doi] [slides]
Martin Potthast, Sebastian Köpsel, Benno Stein, and Matthias Hagen. Clickbait Detection. In Nicola Ferro et al., editors, Advances in Information Retrieval. 38th European Conference on IR Research (ECIR 2016), volume 9626 of Lecture Notes in Computer Science, pages 810-817, March 2016. Springer. [award] [bib] [copylink] [data] [doi] [poster] [research] [slides]
Matthias Hagen, Anna Beyer, Tim Gollub, Kristof Komlossy, and Benno Stein. Supporting Scholarly Search with Keyqueries. In Nicola Ferro et al., editors, Advances in Information Retrieval. 38th European Conference on IR Research (ECIR 2016), volume 9626 of Lecture Notes in Computer Science, pages 507-520, March 2016. Springer. [bib] [copylink] [doi] [slides]
Matthias Hagen, Martin Potthast, Michael Völske, Jakob Gomoll, and Benno Stein. How Writers Search: Analyzing the Search and Writing Logs of Non-fictional Essays. In Diane Kelly et al., editors, 1st ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR 2016), pages 193-202, March 2016. ACM. [bib] [copylink] [data] [doi] [slides]
2015
Henning Wachsmuth. Text Analysis Pipelines–Towards Ad-hoc Large-scale Text Mining. Springer, 2015. [bib] [copylink] [doi] [publisher] [research]
Allan Hanbury, Henning Müller, Krisztian Balog, Torben Brodt, Gordon V. Cormack, Ivan Eggel, Tim Gollub, Frank Hopfgartner, Jayashree Kalpathy-Cramer, Noriko Kando, Anastasia Krithara, Jimmy Lin, Simon Mercer, and Martin Potthast. Evaluation-as-a-Service: Overview and Outlook. CoRR, abs/1512.07454, December 2015. [bib] [copylink] [publisher] [research]
Matthias Hagen, Steve Göring, Magdalena Keil, Olaoluwa Anifowose, Amir Othman, and Benno Stein. Webis at TREC 2015: Tasks and Total Recall Tracks. In Ellen M. Voorhees and Lori P. Buckland, editors, 24th International Text Retrieval Conference (TREC 2015), NIST Special Publication, November 2015. National Institute of Standards and Technology (NIST). [bib] [copylink] [publisher]
Michael Völske, Pavel Braslavski, Matthias Hagen, Galina Lezina, and Benno Stein. What Users Ask a Search Engine: Analyzing One Billion Russian Question Queries. In 24th ACM International Conference on Information and Knowledge Management (CIKM 2015), pages 1571-1580, October 2015. ACM. [bib] [copylink] [doi] [publisher] [slides]
Efstathios Stamatatos, Walter Daelemans, Ben Verhoeven, Patrick Juola, Aurelio López López, Martin Potthast, and Benno Stein. Overview of the Author Identification Task at PAN 2015. In Linda Cappellato, Nicola Ferro, Gareth J.F. Jones, and Eric San Juan, editors, Working Notes Papers of the CLEF 2015 Evaluation Labs, volume 1391 of Lecture Notes in Computer Science, September 2015. [bib] [copylink] [event] [publisher]
Francisco Rangel, Fabio Celli, Paolo Rosso, Martin Potthast, Benno Stein, and Walter Daelemans. Overview of the 3rd Author Profiling Task at PAN 2015. In Linda Cappellato, Nicola Ferro, Gareth J.F. Jones, and Eric San Juan, editors, Working Notes Papers of the CLEF 2015 Evaluation Labs, volume 1391 of Lecture Notes in Computer Science, September 2015. [bib] [copylink] [event] [publisher]
Martin Potthast, Steve Göring, Paolo Rosso, and Benno Stein. Towards Data Submissions for Shared Tasks: First Experiences for the Task of Text Alignment. In Linda Cappellato, Nicola Ferro, Gareth J.F. Jones, and Eric San Juan, editors, Working Notes Papers of the CLEF 2015 Evaluation Labs, volume 1391 of Lecture Notes in Computer Science, September 2015. [bib] [copylink] [publisher] [research]
Matthias Hagen, Martin Potthast, and Benno Stein. Source Retrieval for Plagiarism Detection from Large Web Corpora: Recent Approaches. In Linda Cappellato, Nicola Ferro, Gareth J.F. Jones, and Eric San Juan, editors, Working Notes Papers of the CLEF 2015 Evaluation Labs, volume 1391 of Lecture Notes in Computer Science, September 2015. [bib] [copylink] [publisher] [research]
Efstathios Stamatatos, Martin Potthast, Francisco Rangel, Paolo Rosso, and Benno Stein. Overview of the PAN/CLEF 2015 Evaluation Lab. In Josiane Mothe et al., editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. 6th International Conference of the CLEF Initiative (CLEF 2015), volume 9283 of Lecture Notes in Computer Science, pages 518-538, September 2015. Springer. [bib] [copylink] [doi] [event]
Henning Wachsmuth, Johannes Kiesel, and Benno Stein. Sentiment Flow–A General Model of Web Review Argumentation. In Lluís Márquez, Chris Callison-Burch, and Jian Su, editors, 20th Conference on Empirical Methods in Natural Language Processing (EMNLP 2015), pages 601-611, September 2015. Association for Computational Linguistics. [bib] [code] [copylink] [doi] [publisher] [research] [slides] [video]
Stefan Heindorf, Martin Potthast, Benno Stein, and Gregor Engels. Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis. In Ricardo Baeza-Yates, Mounia Lalmas, Alistair Moffat, and Berthier Ribeiro-Neto, editors, 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2015), pages 831-834, August 2015. ACM. [bib] [copylink] [data] [doi] [poster]
Patrick Riehmann, Martin Potthast, Benno Stein, and Bernd Fröhlich. Visual Assessment of Alleged Plagiarism Cases. Computer Graphics Forum, 34 (3) : 1-10, July 2015. [bib] [copylink] [doi] [publisher] [research] [video]
Frank Hopfgartner, Allan Hanbury, Henning Müller, Noriko Kando, Simon Mercer, Jayashree Kalpathy-Cramer, Martin Potthast, Tim Gollub, Anastasia Krithara, Jimmy Lin, Krisztian Balog, and Ivan Eggel. Report on the Evaluation-as-a-Service (EaaS) Expert Workshop. SIGIR Forum, 49 (1) : 57-65, June 2015. [bib] [copylink] [doi] [publisher] [research]
Matthias Hagen, Martin Potthast, Michel Büchner, and Benno Stein. Webis: An Ensemble for Twitter Sentiment Detection. In Daniel Cer, David Jurgens, Preslav Nakov, and Torsten Zesch, editors, 9th International Workshop on Semantic Evaluation (SemEval 2015), pages 582-589, June 2015. Association for Computational Linguistics. [award] [bib] [code] [copylink] [publisher]
Johannes Kiesel, Khalid Al-Khatib, Matthias Hagen, and Benno Stein. A Shared Task on Argumentation Mining in Newspaper Editorials. In Claire Cardie, editors, 2nd Workshop on Argumentation Mining (ArgMining 2015) at NAACL, pages 35-38, June 2015. Association for Computational Linguistics. [bib] [copylink] [doi] [publisher]
Matthias Hagen, Maximilian Michel, and Benno Stein. What Was the Query? Automatically Generating Queries for Document Sets with Applications in Cluster Labeling. In Elisabeth Métais, Mathieu Roche, and Maguelonne Tesseire, editors, 19th International Conference on Applications of Natural Language to Information Systems (NLDB 2015), volume 9103 of Lecture Notes in Computer Science, pages 124-133, June 2015. Springer. [bib] [copylink] [doi]
Matthias Hagen, Martin Potthast, Michel Büchner, and Benno Stein. Twitter Sentiment Detection via Ensemble Classification Using Averaged Confidence Scores. In Norbert Fuhr, Allan Hanbury, Gabriella Kazai, and Andreas Rauber, editors, Advances in Information Retrieval. 37th European Conference on IR Research (ECIR 2015), volume 9022 of Lecture Notes in Computer Science, pages 513-525, March 2015. Springer. [bib] [code] [copylink] [doi] [slides]
Steven Burrows, Iryna Gurevych, and Benno Stein. The Eras and Trends of Automatic Short Answer Grading. Artificial Intelligence in Education, 25 (1) : 60-117, March 2015. [bib] [copylink] [doi] [publisher]
Matthias Hagen, Daniel Wägner, and Benno Stein. A Corpus of Realistic Known-Item Topics with Associated Web Pages in the ClueWeb09. In Norbert Fuhr, Allan Hanbury, Gabriella Kazai, and Andreas Rauber, editors, Advances in Information Retrieval. 37th European Conference on IR Research (ECIR 2015), volume 9022 of Lecture Notes in Computer Science, pages 741-754, March 2015. Springer. [bib] [copylink] [data] [doi] [slides]
2014
Matthias Hagen, Steve Göring, Maximilian Michel, Georg Müller, and Benno Stein. Webis at TREC 2014: Web, Session, and Contextual Suggestion Tracks. In Ellen M. Voorhees and Lori P. Buckland, editors, 23nd International Text Retrieval Conference (TREC 2014), NIST Special Publication, November 2014. National Institute of Standards and Technology (NIST). [bib] [copylink] [publisher]
Benno Stein, Tim Gollub, and Maik Anderka. Retrieval Models. In Reda Alhajj and Jon G. Rokne, editors, Encyclopedia of Social Network Analysis and Mining (ESNAM), pages 1583-1586, Springer. October 2014. [bib] [copylink] [doi] [research]
Matthias Hagen and Benno Stein. Weblog Analysis. In Reda Alhajj and Jon G. Rokne, editors, Encyclopedia of Social Network Analysis and Mining (ESNAM), pages 2355-2362, Springer. October 2014. [bib] [copylink] [doi]
Khaled M. Elbassioni, Matthias Hagen, and Imran Rauf. A Lower Bound for the HBC Transversal Hypergraph Generation. Fundamenta Informaticae, 130 (4) : 409-414, September 2014. [bib] [copylink]
Efstathios Stamatatos, Walter Daelemans, Ben Verhoeven, Martin Potthast, Benno Stein, Patrick Juola, Miguel A. Sanchez-Perez, and Alberto Barrón-Cedeño. Overview of the Author Identification Task at PAN 2014. In Linda Cappellato, Nicola Ferro, Martin Halvey, and Wessel Kraaij, editors, Working Notes Papers of the CLEF 2014 Evaluation Labs, volume 1180 of Lecture Notes in Computer Science, September 2014. [bib] [copylink] [event] [publisher]
Francisco Rangel, Paolo Rosso, Irina Chugur, Martin Potthast, Martin Trenkmann, Benno Stein, Ben Verhoeven, and Walter Daelemans. Overview of the 2nd Author Profiling Task at PAN 2014. In Linda Cappellato, Nicola Ferro, Martin Halvey, and Wessel Kraaij, editors, Working Notes Papers of the CLEF 2014 Evaluation Labs, volume 1180 of Lecture Notes in Computer Science, September 2014. [bib] [copylink] [event] [publisher] [slides]
Martin Potthast, Matthias Hagen, Anna Beyer, Matthias Busse, Martin Tippmann, Paolo Rosso, and Benno Stein. Overview of the 6th International Competition on Plagiarism Detection. In Linda Cappellato, Nicola Ferro, Martin Halvey, and Wessel Kraaij, editors, Working Notes Papers of the CLEF 2014 Evaluation Labs, volume 1180 of Lecture Notes in Computer Science, September 2014. [bib] [copylink] [event] [publisher] [research] [slides]
Martin Potthast, Tim Gollub, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, and Benno Stein. Improving the Reproducibility of PAN's Shared Tasks: Plagiarism Detection, Author Identification, and Author Profiling. In Evangelos Kanoulas et al., editors, Information Access Evaluation meets Multilinguality, Multimodality, and Visualization. 5th International Conference of the CLEF Initiative (CLEF 2014), pages 268-299, September 2014. Springer. [bib] [copylink] [doi] [research] [slides]
Matthias Hagen and Christiane Glimm. Supporting More-Like-This Information Needs: Finding Similar Web Content in Different Scenarios. In Evangelos Kanoulas et al., editors, Information Access Evaluation meets Multilinguality, Multimodality, and Visualization. 5th International Conference of the CLEF Initiative (CLEF 2014), pages 50-61, September 2014. [bib] [copylink] [doi]
Michael Völske, Tim Gollub, Matthias Hagen, and Benno Stein. A Keyquery-Based Classification System for CORE. In Laurence Lannom, editors, 3rd International Workshop on Mining Scientific Publications (WOSP 2014), volume 20, September 2014. Corporation for National Research Initiatives (CNRI). [bib] [copylink] [doi] [publisher] [research]
Oliver Niggemann, Stefan Windmann, Sören Volgmann, Andreas Bunte, and Benno Stein. Using Learned Models for the Root Cause Analysis of Cyber-Physical Production Systems. In 25th International Workshop on Principles of Diagnosis (DX 2014), September 2014. [bib] [copylink] [publisher] [research]
Tim Gollub, Michael Völske, Matthias Hagen, and Benno Stein. Dynamic Taxonomy Composition via Keyqueries. In 14th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL 2014), pages 39-48, September 2014. ACM. [bib] [copylink] [publisher] [research]
Edgardo Ferretti, Marcelo Errecalde, Maik Anderka, and Benno Stein. On the Use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia. In Franck Morvan, Roland Wagner, and A Min Tjoa, editors, 11th International Workshop on Text-based Information Retrieval (TIR 2014) at DEXA, pages 211-215, September 2014. IEEE. [bib] [copylink] [doi] [research] [slides]
Benno Stein, Matthias Hagen, and Christof Bräutigam. Generating Acrostics via Paraphrasing and Heuristic Search. In Junichi Tsujii and Jan Hajic, editors, 25th International Conference on Computational Linguistics (COLING 2014), pages 2018-2029, August 2014. Association for Computational Linguistics. [bib] [copylink]
Martin Potthast, Matthias Hagen, Anna Beyer, and Benno Stein. Improving Cloze Test Performance of Language Learners Using Web N-Grams. In Junichi Tsujii and Jan Hajic, editors, 25th International Conference on Computational Linguistics (COLING 2014), pages 962-973, August 2014. Association for Computational Linguistics. [bib] [copylink] [research]
Henning Wachsmuth, Martin Trenkmann, Benno Stein, and Gregor Engels. Modeling Review Argumentation for Robust Sentiment Analysis. In Junichi Tsujii and Jan Hajic, editors, 25th International Conference on Computational Linguistics (COLING 2014), pages 553-564, August 2014. Association for Computational Linguistics. [bib] [code] [copylink] [poster] [research]
Petra Löffler and Benno Stein. Korrelationen sind überall da, wo sie gesucht werden. Zeitschrift für Medienwissenschaft (ZFM), 10 : 91-96, April 2014. [bib] [copylink] [publisher]
Henning Wachsmuth, Martin Trenkmann, Benno Stein, Gregor Engels, and Tsvetomira Palakarska. A Review Corpus for Argumentation Analysis. In Alexander Gelbukh, editors, 15th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2014), pages 115-127, April 2014. Springer. [award] [bib] [copylink] [data] [doi] [research] [slides]
2013
Nicola Ferro, Pamela Forner, Henning Müller, Roberto Navigli, Roberto Paredes, Paolo Rosso, Benno Stein, and Dan Tufis. 4th International Conference of the CLEF Initiative (CLEF 2013). SIGIR Forum, 47 (2) : 15-20, December 2013. [bib] [copylink] [doi] [publisher] [research]
Matthias Hagen, Michael Völske, Jakob Gomoll, Marie Bornemann, Lene Ganschow, Florian Kneist, Abdul Hamid Sabri, and Benno Stein. Webis at TREC 2013 Sessions and Web Track. In Ellen M. Voorhees and Lori P. Buckland, editors, 22nd International Text Retrieval Conference (TREC 2013), number (SP 500-302) in NIST Special Publication, November 2013. National Institute of Standards and Technology (NIST). [bib] [copylink] [publisher]
Henning Wachsmuth, Benno Stein, and Gregor Engels. Learning Efficient Information Extraction on Heterogeneous Texts. In Ruslan Mitkov and Jong C. Park, editors, 6th International Joint Conference on Natural Language Processing (IJCNLP 2013), pages 534-542, October 2013. Asian Federation of Natural Language Processing. [bib] [code] [copylink] [publisher]
Henning Wachsmuth, Benno Stein, and Gregor Engels. Information Extraction as a Filtering Task. In Qi He and Arun Iyengar, editors, 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), pages 2049-2058, October 2013. ACM. [bib] [code] [copylink] [doi]
Pamela Forner, Henning Müller, Roberto Paredes, Paolo Rosso, and Benno Stein, editors. 4th International Conference of the CLEF Initiative (CLEF 2013), volume 8138 of Lecture Notes in Computer Science, Springer, September 2013. [bib] [copylink] [doi] [event]
Martin Potthast, Tim Gollub, Matthias Hagen, Martin Tippmann, Johannes Kiesel, Paolo Rosso, Efstathios Stamatatos, and Benno Stein. Overview of the 5th International Competition on Plagiarism Detection. In Pamela Forner, Roberto Navigli, and Dan Tufis, editors, Working Notes Papers of the CLEF 2013 Evaluation Labs, volume 1179 of Lecture Notes in Computer Science, September 2013. [bib] [copylink] [event] [publisher] [research] [slides]
Tim Gollub, Martin Potthast, Anna Beyer, Matthias Busse, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, and Benno Stein. Recent Trends in Digital Text Forensics and its Evaluation. In Pamela Forner et al., editors, Information Access Evaluation meets Multilinguality, Multimodality, and Visualization. 4th International Conference of the CLEF Initiative (CLEF 2013), pages 282-302, September 2013. Springer. [bib] [copylink] [doi] [research] [slides]
Martin Potthast, Matthias Hagen, Michael Völske, and Benno Stein. Crowdsourcing Interaction Logs to Understand Text Reuse from the Web. In Pascale Fung and Massimo Poesio, editors, 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), pages 1212-1221, August 2013. Association for Computational Linguistics. [award] [bib] [copylink] [data] [demo] [publisher] [research] [slides]
Martin Potthast, Matthias Hagen, Michael Völske, and Benno Stein. Exploratory Search Missions for TREC Topics. In Max L. Wilson et al., editors, 3rd European Workshop on Human-Computer Interaction and Information Retrieval (EuroHCIR 2013), volume 1033 of Lecture Notes in Computer Science, pages 11-14, August 2013. [bib] [copylink] [data] [publisher] [research]
Tim Gollub, Matthias Hagen, Maximilian Michel, and Benno Stein. From Keywords to Keyqueries: Content Descriptors for the Web. In Cathal Gurrin et al., editors, 36th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2013), pages 981-984, July 2013. ACM. [bib] [copylink] [doi] [publisher]
Maik Anderka. Analyzing and Predicting Quality Flaws in User-generated Content: The Case of Wikipedia. Dissertation, Bauhaus-Universität Weimar, June 2013. [bib] [copylink] [publisher] [research]
Steven Burrows, Martin Potthast, and Benno Stein. Paraphrase Acquisition via Crowdsourcing and Machine Learning. Transactions on Intelligent Systems and Technology (ACM TIST), 4 (3) : 43:1-43:21, June 2013. [bib] [copylink] [data] [doi] [publisher]
Matthias Hagen, Jakob Gomoll, Anna Beyer, and Benno Stein. From Search Session Detection to Search Mission Detection. In João Ferreira, João Magalhães, and Pável Calado, editors, 10th International Conference Open Research Areas in Information Retrieval (OAIR 2013), pages 85-92, May 2013. ACM. [bib] [copylink] [data] [publisher] [slides]
Steven Burrows, Jörg Frochte, Michael Völske, Ana Belén Martínez Torres, and Benno Stein. Learning Overlap Optimization for Domain Decomposition Methods. In Jian Pei et al., editors, 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2013), pages 438-449, April 2013. Springer. [bib] [copylink] [doi] [research] [slides]
Nedim Lipka. Modeling Non-Standard Text Classification Tasks. Dissertation, Bauhaus-Universität Weimar, March 2013. [bib] [copylink] [publisher] [research]
Henning Wachsmuth, Mirko Rose, and Gregor Engels. Automatic Pipeline Construction for Real-Time Annotation. In Alexander Gelbukh, editors, 14th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2013), volume 7816 of Lecture Notes in Computer Science, pages 38-49, March 2013. Springer. [bib] [code] [copylink] [doi]
2012
Martin Potthast. Technologien zur Wiederverwendung von Texten aus dem Web. In Steffen Hölldobler et al., editors, Ausgezeichnete Informatikdissertationen 2011, volume D-12 LNI of Lecture Notes in Informatics, pages 141-150, December 2012. Gesellschaft für Informatik. [bib] [copylink] [publisher] [research] [slides]
Henning Wachsmuth and Benno Stein. Optimal Scheduling of Information Extraction Algorithms. In Martin Kay and Christian Boitet, editors, 24th International Conference on Computational Linguistics: Posters (COLING 2012), pages 1281-1290, December 2012. COLING 2012 Organizing Committee. [bib] [copylink]
Khalid Al-Khatib, Hinrich Schütze, and Cathleen Kantner. Automatic Detection of Point of View Differences in Wikipedia. In Martin Kay and Christian Boitet, editors, 24th International Conference on Computational Linguistics (COLING 2012), pages 33-50, December 2012. Indian Institute of Technology Bombay. [bib] [copylink]
Nedim Lipka, Benno Stein, and James G. Shanahan. Estimating the Expected Effectiveness of Text Classification Solutions under Subclass Distribution Shifts. In Mohammed J. Zaki et al., editors, 12th IEEE International Conference on Data Mining (ICDM 2012), pages 972-977, December 2012. IEEE. [bib] [copylink] [doi] [publisher]
Matthias Hagen, Martin Potthast, Matthias Busse, Jakob Gomoll, Jannis Harder, and Benno Stein. Webis at the TREC 2012 Session Track. In Ellen M. Voorhees and Lori P. Buckland, editors, 21st International Text Retrieval Conference (TREC 2012), number (SP 500-298) in NIST Special Publication, November 2012. National Institute of Standards and Technology (NIST). [bib] [copylink] [publisher] [slides]
Benno Stein, Tim Gollub, and Dennis Hoppe. Search Result Presentation Based on Faceted Clustering. In Xuewen Chen, Guy Lebanon, Haixun Wang, and Mohammed J. Zaki, editors, 21st ACM International Conference on Information and Knowledge Management (CIKM 2012), pages 1940-1944, October 2012. ACM. [bib] [copylink] [doi]
Matthias Hagen, Martin Potthast, Anna Beyer, and Benno Stein. Towards Optimum Query Segmentation: In Doubt Without. In Xuewen Chen, Guy Lebanon, Haixun Wang, and Mohammed J. Zaki, editors, 21st ACM International Conference on Information and Knowledge Management (CIKM 2012), pages 1015-1024, October 2012. ACM. [bib] [copylink] [doi] [research] [slides]
Tim Gollub, Benno Stein, Steven Burrows, and Dennis Hoppe. TIRA: Configuring, Executing, and Disseminating Information Retrieval Experiments. In A Min Tjoa, Stephen Liddle, Klaus-Dieter Schewe, and Xiaofang Zhou, editors, 9th International Workshop on Text-based Information Retrieval (TIR 2012) at DEXA, pages 151-155, September 2012. IEEE. [bib] [copylink] [doi] [research] [slides]
Patrick Riehmann, Henning Gruendl, Martin Potthast, Martin Trenkmann, Benno Stein, and Bernd Fröhlich. WORDGRAPH: Keyword-in-Context Visualization for NETSPEAK's Wildcard Search. IEEE Transactions on Visualization and Computer Graphics, 18 (9) : 1411-1423, September 2012. [bib] [copylink] [doi] [research]
Martin Potthast, Benno Stein, Fabian Loose, and Steffen Becker. Information Retrieval in the Commentsphere. Transactions on Intelligent Systems and Technology (ACM TIST), 3 (4) : 68:1-68:21, September 2012. [bib] [copylink] [doi] [publisher] [research]
Martin Potthast, Tim Gollub, Matthias Hagen, Jan Graßegger, Johannes Kiesel, Maximilian Michel, Arnd Oberländer, Martin Tippmann, Alberto Barrón-Cedeño, Parth Gupta, Paolo Rosso, and Benno Stein. Overview of the 4th International Competition on Plagiarism Detection. In Pamela Forner, Jussi Karlgren, and Christa Womser-Hacker, editors, Working Notes Papers of the CLEF 2012 Evaluation Labs, volume 1178 of Lecture Notes in Computer Science, September 2012. [bib] [copylink] [event] [publisher] [research]
Maik Anderka and Benno Stein. Overview of the 1st International Competition on Quality Flaw Prediction in Wikipedia. In Pamela Forner, Jussi Karlgren, and Christa Womser-Hacker, editors, Working Notes Papers of the CLEF 2012 Evaluation Labs, volume 1178 of Lecture Notes in Computer Science, September 2012. [bib] [copylink] [data] [event] [publisher] [research] [slides]
Nedim Lipka, Benno Stein, and Maik Anderka. Cluster-Based One-Class Ensemble for Classification Problems in Information Retrieval. In Bill Hersh, Jamie Callan, Yoelle Maarek, and Mark Sanderson, editors, 35th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2012), pages 1041-1042, August 2012. ACM. [bib] [copylink] [doi]
Maik Anderka, Benno Stein, and Nedim Lipka. Predicting Quality Flaws in User-generated Content: The Case of Wikipedia. In Bill Hersh, Jamie Callan, Yoelle Maarek, and Mark Sanderson, editors, 35th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2012), pages 981-990, August 2012. ACM. [bib] [copylink] [data] [doi] [research]
Tim Gollub, Benno Stein, and Steven Burrows. Ousting Ivory Tower Research: Towards a Web Framework for Providing Experiments as a Service. In Bill Hersh, Jamie Callan, Yoelle Maarek, and Mark Sanderson, editors, 35th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2012), pages 1125-1126, August 2012. ACM. [bib] [copylink] [doi] [poster] [research]
Martin Potthast, Matthias Hagen, Benno Stein, Jan Graßegger, Maximilian Michel, Martin Tippmann, and Clement Welsch. ChatNoir: A Search Engine for the ClueWeb09 Corpus. In Bill Hersh, Jamie Callan, Yoelle Maarek, and Mark Sanderson, editors, 35th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2012), pages 1004, August 2012. ACM. [bib] [copylink] [doi] [research]
Claudia Hauff, Matthias Hagen, Anna Beyer, and Benno Stein. Towards Realistic Known-Item Topics for the ClueWeb. In Jaap Kamps et al., editors, 4th Information Interaction in Context Symposium (IIiX 2012), pages 274-277, August 2012. ACM. [bib] [copylink] [doi] [publisher]
Tim Gollub, Steven Burrows, and Benno Stein. First Experiences with TIRA for Reproducible Evaluation in Information Retrieval. In Andrew Trotman et al., editors, Workshop on Open Source Information Retrieval (OSIR 2012) at SIGIR, pages 52-55, August 2012. opensearchlab.otago.ac.nz. [bib] [copylink] [poster] [research] [slides]
Maik Anderka, Benno Stein, and Matthias Busse. On the Evolution of Quality Flaws and the Effectiveness of Cleanup Tags in the English Wikipedia. In Wikipedia Academy 2012, July 2012. Wikipedia. [bib] [copylink] [publisher] [research]
Oliver Niggemann, Benno Stein, Asmir Vodencarevic, Alexander Maier, and Hans Kleine Büning. Learning Behavior Models for Hybrid Timed Systems. In Jörg Hoffmann and Bart Selman, editors, 26th International Conference on Artificial Intelligence (AAAI 2012), pages 1083-1090, July 2012. AAAI. [bib] [copylink] [publisher] [research]
Sumeet Dua, Aryya Gangopadhyay, Parimala Thulasiraman, Umberto Straccia, Michael Shepherd, and Benno Stein, editors. 6th International Conference on Information Systems, Technology and Management (ICISTM 2012), volume 285 of Communications in Computer and Information Science, Springer, May 2012. [bib] [copylink] [doi]
Andre Schmidt, Michael Rzanny, Astrid Schmidt, Matthias Hagen, Eileen Schütze, and Erika Kothe. GC content-independent amino acid patterns in Bacteria and Archaea. Journal of Basic Microbiology, 52 (2) : 195-205, April 2012. [bib] [copylink]
Matthias Hagen, Jakob Gomoll, and Benno Stein. Improved Cascade for Search Mission Detection. In Ben Carterette, Evangelos Kanoulas, Paul D. Clough, and Mark Sanderson, editors, Workshop on Information Retrieval over Query Sessions (SIR 2012) at ECIR, April 2012. [bib] [copylink] [publisher] [slides]
Elisabeth Lex, Michael Völske, Marcelo Errecalde, Edgardo Ferretti, Leticia Cagnina, Christopher Horn, Benno Stein, and Michael Granitzer. Measuring the Quality of Web Content using Factual Information. In Carlos Castillo, Zoltan Gyongyi, Adam Jatowt, and Katsumi Tanaka, editors, 2nd Workshop on Web Quality (WebQuality 2012) at WICOW/AIRWeb, pages 7-10, April 2012. ACM. [bib] [copylink] [doi] [publisher] [research]
Maik Anderka and Benno Stein. A Breakdown of Quality Flaws in Wikipedia. In Carlos Castillo, Zoltan Gyongyi, Adam Jatowt, and Katsumi Tanaka, editors, 2nd Workshop on Web Quality (WebQuality 2012) at WICOW/AIRWeb, pages 11-18, April 2012. ACM. [bib] [copylink] [doi] [research]
Benno Stein, Dennis Hoppe, and Tim Gollub. The Impact of Spelling Errors on Patent Search. In Walter Daelemans, editors, 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), pages 570-579, April 2012. Association for Computational Linguistics. [bib] [copylink] [data] [publisher] [research]
Oliver Niggemann, Benno Stein, and Alexander Maier. Modeling Problems with Machine Learning–A Classification Scheme of Model Learning Approaches for Technical Systems. In Holger Giese, Michaela Huhn, Jan Philipps, and Bernhard Schätz, editors, 8th Dagstuhl Workshop Model-Based Development of Embedded Systems (MBEES 2012), pages 21-29, February 2012. fortiss GmbH. [bib] [copylink]
2011
Martin Potthast. Technologies for Reusing Text from the Web. Dissertation, Bauhaus-Universität Weimar, December 2011. [award] [bib] [copylink] [publisher] [research] [slides] [video]
Steven Burrows, Benno Stein, Jörg Frochte, David Wiesner, and Katja Müller. Simulation Data Mining for Supporting Bridge Design. In Peter Christen et al., editors, 9th Australasian Data Mining Conference (AusDM 2011), volume 121 of CRPIT, pages 163-170, December 2011. ACM. [bib] [copylink] [data] [research] [slides]
Matthias Hagen, Jan Graßegger, Maximilian Michel, and Benno Stein. Webis at the TREC 2011 Sessions Track. In Ellen M. Voorhees and Lori P. Buckland, editors, 20th International Text Retrieval Conference (TREC 2011), November 2011. National Institute of Standards and Technology (NIST). [bib] [copylink]
Henning Wachsmuth and Kathrin Bujna. Back to the Roots of Genres: Text Classification by Language Function. In 5th International Joint Conference on Natural Language Processing, pages 632-640, November 2011. Asian Federation of Natural Language Processing. [bib] [copylink] [data]
Peter Prettenhofer and Benno Stein. Cross-Lingual Adaptation using Structural Correspondence Learning. Transactions on Intelligent Systems and Technology (ACM TIST), 3 (1) : 13:1-13:22, October 2011. [bib] [copylink] [doi] [publisher] [research]
Daniel Blank, Norbert Fuhr, Andreas Henrich, Thomas Mandl, Thomas Rölleke, Hinrich Schütze, and Benno Stein. Teaching IR: Curricular Considerations. In Efthimis Efthimiadis, Juan M. Fernández-Luna, Juan F. Huete, Andrew MacFarlane, and W. Bruce Croft, editors, Teaching and Learning in Information Retrieval, volume 31 of The Information Retrieval Series, pages 31-46, Springer. October 2011. [bib] [copylink] [doi]
Henning Wachsmuth, Benno Stein, and Gregor Engels. Constructing Efficient Information Extraction Pipelines. In Bettina Berendt et al., editors, 20th ACM International Conference on Information and Knowledge Management (CIKM 2011), pages 2237-2240, October 2011. ACM. [bib] [copylink] [doi] [research]
Benno Stein, Tim Gollub, and Dennis Hoppe. Beyond [email protected]: Clustering the Long Tail of Web Search Results. In Bettina Berendt et al., editors, 20th ACM International Conference on Information and Knowledge Management (CIKM 2011), pages 2141-2144, October 2011. ACM. [bib] [copylink] [doi] [research]
Matthias Hagen, Benno Stein, and Tino Rüb. Query Session Detection as a Cascade. In Bettina Berendt et al., editors, 20th ACM International Conference on Information and Knowledge Management (CIKM 2011), pages 147-152, October 2011. ACM. [bib] [copylink] [doi] [slides]
Thomas Gottron, Maik Anderka, and Benno Stein. Insights into Explicit Semantic Analysis. In Bettina Berendt et al., editors, 20th ACM International Conference on Information and Knowledge Management (CIKM 2011), pages 1961-1964, October 2011. ACM. [bib] [copylink] [doi] [research] [wikipedia]
Maik Anderka, Benno Stein, and Nedim Lipka. Detection of Text Quality Flaws as a One-class Classification Problem. In Bettina Berendt et al., editors, 20th ACM International Conference on Information and Knowledge Management (CIKM 2011), pages 2313-2316, October 2011. ACM. [bib] [copylink] [doi] [research]
Matthias Hagen and Benno Stein. Candidate Document Retrieval for Web-Scale Text Reuse Detection. In Roberto Grossi, Fabrizio Sebastiani, and Fabrizio Silvestri, editors, 18th International Symposium on String Processing and Information Retrieval (SPIRE 2011), volume 7024 of Lecture Notes in Computer Science, pages 356-367, October 2011. Springer. [bib] [copylink] [doi] [slides]
Martin Potthast, Andreas Eiselt, Alberto Barrón-Cedeño, Benno Stein, and Paolo Rosso. Overview of the 3rd International Competition on Plagiarism Detection. In Vivien Petras, Pamela Forner, and Paul D. Clough, editors, Working Notes Papers of the CLEF 2011 Evaluation Labs, volume 1177 of Lecture Notes in Computer Science, September 2011. [bib] [copylink] [data] [event] [publisher] [research] [slides]
Martin Potthast and Teresa Holfeld. Overview of the 2nd International Competition on Wikipedia Vandalism Detection. In Vivien Petras, Pamela Forner, and Paul D. Clough, editors, Notebook Papers of CLEF 2011 Labs and Workshops, September 2011. [bib] [copylink] [data] [event] [publisher] [research]
Matthias Hagen and Benno Stein. Applying the User-over-Ranking Hypothesis to Query Formulation. In Giambattista Amati and Fabio Crestani, editors, Advances in Information Retrieval Theory. 3rd International Conference on the Theory of Information Retrieval (ICTIR 2011), volume 6931 of Lecture Notes in Computer Science, pages 225-237, September 2011. Springer. [bib] [copylink] [doi] [research] [slides]
Nedim Lipka and Benno Stein. Robust Models in Information Retrieval. In A Min Tjoa and Roland Wagner, editors, 8th International Workshop on Text-Based Information Retrieval (TIR 2011) at DEXA, volume 0, pages 185-189, September 2011. IEEE. [bib] [copylink] [doi] [research] [slides]
Hamish Cunningham, Norbert Fuhr, and Benno Stein. Challenges in Document Mining (Dagstuhl Seminar 11171). Dagstuhl Reports, 1 (4) : 65-99, August 2011. [bib] [copylink] [doi] [event] [publisher] [research]
Norbert Fuhr, Marc Lechtenfeld, Benno Stein, and Tim Gollub. The Optimum Clustering Framework: Implementing the Cluster Hypothesis. Information Retrieval, 15 (2) : 93-115, July 2011. [bib] [copylink] [doi] [research]
Benno Stein, Martin Potthast, Alberto Barrón-Cedeño, Paolo Rosso, Efstathios Stamatatos, and Moshe Koppel. 4th Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse (PAN 2010). SIGIR Forum, 45 (1) : 45-48, June 2011. [bib] [copylink] [data] [doi] [event] [publisher] [research]
Nedim Lipka and Benno Stein. Classifying with Co-Stems: A New Representation for Information Filtering. In Paul Clough et al., editors, Advances in Information Retrieval. 33rd European Conference on IR Research (ECIR 2011), volume 6611 of Lecture Notes in Computer Science, pages 307-313, April 2011. Springer. [bib] [copylink] [doi]
Benno Stein and Matthias Hagen. Introducing the User-over-Ranking Hypothesis. In Paul Clough et al., editors, Advances in Information Retrieval. 33rd European Conference on IR Research (ECIR 2011), volume 6611 of Lecture Notes in Computer Science, pages 503-509, April 2011. Springer. [bib] [copylink] [doi] [research] [slides]
Matthias Hagen, Benno Stein, and Tino Rüb. Query Session Detection as a Cascade. In Ben Carterette, Paul D. Clough, Evangelos Kanoulas, and Mark Sanderson, editors, Workshop on Information Retrieval over Query Sessions (SIR 2011) at ECIR, April 2011. [bib] [copylink] [publisher] [slides]
Benno Stein, Nedim Lipka, and Peter Prettenhofer. Intrinsic Plagiarism Analysis. Language Resources and Evaluation (LRE), 45 (1) : 63-82, March 2011. [bib] [copylink] [doi] [research]
Martin Potthast, Alberto Barrón-Cedeño, Benno Stein, and Paolo Rosso. Cross-Language Plagiarism Detection. Language Resources and Evaluation (LRE), 45 (1) : 45-62, March 2011. [bib] [copylink] [doi] [research]
Patrick Riehmann, Henning Gruendl, Bernd Fröhlich, Martin Potthast, Martin Trenkmann, and Benno Stein. The Netspeak WordGraph: Visualizing Keywords in Context. In Giuseppe Di Battista, Jean-Daniel Fekete, and Huamin Qu, editors, 4th IEEE Pacific Visualization Symposium (PacificVis 2011), pages 123-130, March 2011. IEEE. [award] [bib] [copylink] [doi] [research]
Matthias Hagen, Martin Potthast, Benno Stein, and Christof Bräutigam. Query Segmentation Revisited. In Sadagopan Srinivasan et al., editors, 20th International Conference on World Wide Web (WWW 2011), pages 97-106, March 2011. ACM. [bib] [copylink] [data] [doi] [research] [slides]
Maik Anderka, Benno Stein, and Nedim Lipka. Towards Automatic Quality Assurance in Wikipedia. In Sadagopan Srinivasan et al., editors, 20th International Conference on World Wide Web (WWW 2011), pages 5-6, March 2011. ACM. [bib] [copylink] [doi] [research]
2010
Andre Schmidt, Matthias Hagen, Eileen Schütze, Astrid Schmidt, and Erika Kothe. In silico prediction of potential metallothioneins and metallohistins in actinobacteria. Journal of Basic Microbiology, 50 (6) : 562-569, December 2010. [bib] [copylink]
Matthias Hagen, Benno Stein, and Michael Völske. Webis at the TREC 2010 Sessions Track. In Ellen M. Voorhees and Lori P. Buckland, editors, 19th International Text Retrieval Conference (TREC 2010), November 2010. National Institute of Standards and Technology (NIST). [bib] [copylink]
Maik Anderka, Nedim Lipka, and Benno Stein. Evaluating Cross-Language Explicit Semantic Analysis and Cross Querying at [email protected] 2009. In Carol Peters et al., editors, Multilingual Information Access Evaluation I: Text Retrieval Experiments. Selected papers of the 10th Cross-Language Evaluation Forum (CLEF 2009), volume 6241 of Lecture Notes in Computer Science, pages 50-57, October 2010. Springer. [bib] [copylink] [doi]
Martin Potthast, Benno Stein, and Teresa Holfeld. Overview of the 1st International Competition on Wikipedia Vandalism Detection. In Martin Braschler, Donna Harman, and Emanuele Pianta, editors, Working Notes Papers of the CLEF 2010 Evaluation Labs, volume 1176 of Lecture Notes in Computer Science, September 2010. [bib] [copylink] [data] [event] [publisher] [research] [slides]
Martin Potthast, Alberto Barrón-Cedeño, Andreas Eiselt, Benno Stein, and Paolo Rosso. Overview of the 2nd International Competition on Plagiarism Detection. In Martin Braschler, Donna Harman, and Emanuele Pianta, editors, Working Notes Papers of the CLEF 2010 Evaluation Labs, volume 1176 of Lecture Notes in Computer Science, September 2010. [bib] [copylink] [data] [event] [publisher] [research] [slides]
Matthias Hagen and Benno Stein. Capacity-Constrained Query Formulation. In Mounia Lalmas et al., editors, Research and Advanced Technology for Digital Libraries. 14th European Conference on Digital Libraries (ECDL 2010), volume 6273 of Lecture Notes in Computer Science, pages 384-388, September 2010. Springer. [bib] [copylink] [doi] [research]
Matthias Hagen and Benno Stein. Search Strategies for Keyword-based Queries. In A Min Tjoa and Roland Wagner, editors, 7th International Workshop on Text-Based Information Retrieval (TIR 2010) at DEXA, pages 37-41, September 2010. IEEE. [bib] [copylink] [doi]
Henning Wachsmuth, Peter Prettenhofer, and Benno Stein. Efficient Statement Identification for Automatic Market Forecasting. In Chu-Ren Huang and Dan Jurafsky, editors, 23rd International Conference on Computational Linguistics (COLING 2010), pages 1128-1136, August 2010. Association for Computational Linguistics. [bib] [copylink] [data] [publisher] [research]
Martin Potthast, Benno Stein, Alberto Barrón-Cedeño, and Paolo Rosso. An Evaluation Framework for Plagiarism Detection. In Chu-Ren Huang and Dan Jurafsky, editors, 23rd International Conference on Computational Linguistics (COLING 2010), pages 997-1005, August 2010. Association for Computational Linguistics. [bib] [copylink] [research]
Benno Stein and Matthias Hagen. Making the Most of a Web Search Session. In Xiangji Jimmy Huang, Irwin King, Vijay Raghavan, and Stefan Rueger, editors, IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT 2010), pages 90-97, August 2010. IEEE. [bib] [copylink] [doi]
Benno Stein, Sven Meyer zu Eißen, and Nedim Lipka. Web Genre Analysis: Use Cases, Retrieval Models, and Implementation Issues. In Alexander Mehler, Serge Sharoff, and Marina Santini, editors, Genres on the Web, volume 42 of Text, Speech and Language Technology, pages 167-190, Springer. August 2010. [bib] [copylink] [doi] [research]
Martin Potthast, Martin Trenkmann, and Benno Stein. Using Web N-Grams to Help Second-Language Speakers. In Web N-Gram Workshop at SIGIR 2010, pages 49, July 2010. [bib] [copylink] [publisher] [research]
Peter Prettenhofer and Benno Stein. Cross-Language Text Classification using Structural Correspondence Learning. In Jan Hajič, Sandra Carberry, Stephen Clark, and Joakim Nivre, editors, 48th Annual Meeting of the Association of Computational Linguistics (ACL 2010), pages 1118-1127, July 2010. Association for Computational Linguistics. [bib] [copylink] [data] [publisher] [research]
Martin Potthast. Crowdsourcing a Wikipedia Vandalism Corpus. In Fabio Crestani et al., editors, 33rd International ACM Conference on Research and Development in Information Retrieval (SIGIR 2010), pages 789-790, July 2010. ACM. [bib] [copylink] [data] [doi] [poster] [research]
Matthias Hagen, Martin Potthast, Benno Stein, and Christof Bräutigam. The Power of Naïve Query Segmentation. In Fabio Crestani et al., editors, 33rd International ACM Conference on Research and Development in Information Retrieval (SIGIR 2010), pages 797-798, July 2010. ACM. [bib] [copylink] [data] [doi] [poster] [research]
Ralf Schenkel and Benno Stein, editors. Social Mining and Search, volume 10 of Datenbank-Spektrum, Springer, June 2010. [bib] [copylink] [doi]
Tim Gollub and Benno Stein. Unsupervised Sparsification of Similarity Graphs. In Hermann Locarek-Junge and Claus Weihs, editors, Classification as a Tool for Research. Selected papers from the 33rd Annual Conference of the German Classification Society (GFKL 2009), Studies in Classification, Data Analysis, and Knowledge Organization, pages 71-79, May 2010. Springer. [bib] [copylink] [doi] [research]
Antonio Reyes, Martin Potthast, Paolo Rosso, and Benno Stein. Evaluating Humor Features on Web Comments. In Nicoletta Calzolari et al., editors, 7th Conference on International Language Resources and Evaluation (LREC 2010), May 2010. European Language Resources Association (ELRA). [bib] [copylink] [poster] [research]
Alberto Barrón-Cedeño, Martin Potthast, Paolo Rosso, Benno Stein, and Andreas Eiselt. Corpus and Evaluation Measures for Automatic Plagiarism Detection. In Nicoletta Calzolari et al., editors, 7th Conference on International Language Resources and Evaluation (LREC 2010), May 2010. European Language Resources Association (ELRA). [bib] [copylink] [data] [research] [slides]
Martin Potthast, Benno Stein, and Steffen Becker. Towards Comment-based Cross-Media Retrieval. In Michael Rappa, Paul Jones, Juliana Freire, and Soumen Chakrabarti, editors, 19th International Conference on World Wide Web (WWW 2010), pages 1169-1170, April 2010. ACM. [bib] [copylink] [doi] [poster] [research]
Nedim Lipka and Benno Stein. Identifying Featured Articles in Wikipedia: Writing Style Matters. In Michael Rappa, Paul Jones, Juliana Freire, and Soumen Chakrabarti, editors, 19th International Conference on World Wide Web (WWW 2010), pages 1147-1148, April 2010. ACM. [bib] [copylink] [doi]
Martin Potthast and Steffen Becker. Opinion Summarization of Web Comments. In Cathal Gurrin et al., editors, Advances in Information Retrieval. 32nd European Conference on Information Retrieval (ECIR 2010), volume 5993 of Lecture Notes in Computer Science, pages 668-669, March 2010. Springer. [bib] [copylink] [doi] [poster] [research]
Martin Potthast, Martin Trenkmann, and Benno Stein. Netspeak: Assisting Writers in Choosing Words. In Cathal Gurrin et al., editors, Advances in Information Retrieval. 32nd European Conference on Information Retrieval (ECIR 2010), volume 5993 of Lecture Notes in Computer Science, pages 672, March 2010. Springer. [bib] [copylink] [doi] [research]
Maik Anderka, Benno Stein, and Martin Potthast. Cross-language High Similarity Search: Why no Sub-linear Time Bound can be Expected. In Cathal Gurrin et al., editors, Advances in Information Retrieval. 32nd European Conference on Information Retrieval (ECIR 2010), volume 5993 of Lecture Notes in Computer Science, pages 640-644, March 2010. Springer. [bib] [copylink] [doi] [poster]
Benno Stein, Martin Potthast, and Martin Trenkmann. Retrieving Customary Web Language to Assist Writers. In Cathal Gurrin et al., editors, Advances in Information Retrieval. 32nd European Conference on Information Retrieval (ECIR 2010), volume 5993 of Lecture Notes in Computer Science, pages 631-635, March 2010. Springer. [bib] [copylink] [doi] [poster] [research]
Eric Berberich, Matthias Hagen, Benjamin Hiller, and Hannes Moser. Experiments. In Matthias Müller-Hannemann and Stefan Schirra, editors, Algorithm Engineering: Bridging the Gap between Algorithm Theory and Practice, volume 5971 of Lecture Notes in Computer Science, pages 325-388, March 2010. Springer. [bib] [copylink] [doi]
Thomas Gottron and Nedim Lipka. A Comparison of Language Identification Approaches on Short, Query-Style Texts. In Cathal Gurrin, Yulan He, Gabriella Kazai, Udo Kruschwitz, Suzanne Little, Thomas Roelleke, Stefan M. Rüger, and Keith van Rijsbergen, editors, Advances in Information Retrieval. 32nd European Conference on Information Retrieval (ECIR 2010), volume 5993 of Lecture Notes in Computer Science, pages 611-614, Springer. March 2010. [bib] [copylink] [doi]
2009
Alberto Barrón-Cedeño, Andreas Eiselt, and Paolo Rosso. Monolingual Text Similarity Measures: A Comparison of Models over Wikipedia Articles Revisions. In Dipti Misra Sharma, Vasudeva Varma, and Rajeev Sangal, editors, 7th International Conference on Natural Language Processing (ICON 2009), pages 29-38, December 2009. Macmillan Publishers. [bib] [copylink]
Benno Stein and Maik Anderka. Collection-Relative Representations: A Unifying View to Retrieval Models. In A Min Tjoa and Roland Wagner, editors, 6th International Workshop on Text-Based Information Retrieval (TIR 2009) at DEXA, pages 383-387, September 2009. IEEE. [bib] [copylink] [doi] [research]
Martin Potthast, Benno Stein, Andreas Eiselt, Alberto Barrón-Cedeño, and Paolo Rosso. Overview of the 1st International Competition on Plagiarism Detection. In Benno Stein et al., editors, 3rd Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse (PAN 2009) at SEPLN, volume 502 of CEUR Workshop Proceedings, pages 1-9, September 2009. [bib] [copylink] [data] [event] [publisher] [research] [slides]
Benno Stein, Paolo Rosso, Efstathios Stamatatos, Moshe Koppel, and Eneko Agirre, editors. 3rd Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse (PAN 2009) at SEPLN, volume 502 of CEUR Workshop Proceedings, September 2009. [bib] [copylink] [event] [publisher] [research]
Martin Potthast. Measuring the Descriptiveness of Web Comments. In Mark Sanderson et al., editors, 32nd International ACM Conference on Research and Development in Information Retrieval (SIGIR 2009), pages 724-725, July 2009. ACM. [bib] [copylink] [doi] [poster] [research]
Maik Anderka and Benno Stein. The ESA Retrieval Model Revisited. In Mark Sanderson et al., editors, 32th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2009), pages 670-671, July 2009. ACM. [bib] [copylink] [doi] [wikipedia]
Daniel Blank, Norbert Fuhr, Andreas Henrich, Thomas Mandl, Thomas Rölleke, Hinrich Schütze, and Benno Stein. Information Retrieval: Concepts and Practical Considerations for Teaching a Rising Topic. Datenbank-Spektrum, 9 (29) : 30-41, May 2009. [bib] [copylink]
Matthias Hagen. Lower bounds for three algorithms for transversal hypergraph generation. Discrete Applied Mathematics, 157 (7) : 1460-1469, April 2009. [bib] [copylink]
Oliver Niggemann, Benno Stein, Thomas Spanuth, and Heinrich Balzer. Using Models for Dynamic System Diagnosis: A Case Study in Automotive Engineering. In Holger Giese, Michaela Huhn, Ulrich Nickel, and Bernhard Schätz, editors, 5th Dagstuhl Workshop Model-Based Development of Embedded Systems (MBEES 2009), number (2009-01) in IB, pages 46-56, April 2009. TU Braunschweig. [bib] [copylink]
Matthias Hagen, Peter Horatschek, and Martin Mundhenk. Experimental comparison of the two Fredman-Khachiyan-algorithms. In Irene Finocchi and John Hershberger, editors, Workshop on Algorithm Engineering and Experiments (ALENEX 2009), pages 154-161, January 2009. SIAM. [bib] [copylink]
2008
Matthias Hagen. Algorithmic and Computational Complexity Issues of MONET. Dissertation, Institut für Informatik, Friedrich-Schiller-Universität Jena, December 2008. [bib] [copylink]
Benno Stein and Sven Meyer zu Eißen. Retrieval Models for Genre Classification. Scandinavian Journal of Information Systems (SJIS), 20 (1) : 91-117, October 2008. [bib] [copylink] [publisher] [research]
Fabian Loose, Steffen Becker, Martin Potthast, and Benno Stein. Retrieval-Technologien für die Plagiaterkennung in Programmen. In Joachim Baumeister and Martin Atzmüller, editors, Workshop Special Interest Group Information Retrieval (FGIR 2008), Technical Report 448, pages 5-12, October 2008. University of Würzburg, Germany. [bib] [copylink] [research] [slides]
Benno Stein and Sven Meyer zu Eißen. Weighted Experts: A Solution for the Spock Data Mining Challenge. In Klaus Tochtermann and Hermann Maurer, editors, 8th International Conference on Knowledge Management (I-KNOW 2008), Journal of Universal Computer Science, pages 358-365, September 2008. Springer. [bib] [copylink] [research] [wikipedia]
Benno Stein, Nedim Lipka, and Sven Meyer zu Eißen. Meta Analysis within Authorship Verification. In A Min Tjoa and Roland Wagner, editors, 5th International Workshop on Text-Based Information Retrieval (TIR 2008) at DEXA, pages 34-39, September 2008. IEEE. [bib] [copylink] [doi] [research]
Benno Stein, Efstathios Stamatatos, and Moshe Koppel, editors. 2nd Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse (PAN 2008) at ECAI, volume 377 of CEUR Workshop Proceedings, July 2008. [bib] [copylink] [event] [publisher] [research]
Benno Stein. Coping with Large Design Spaces. International Journal on Software Tools for Technology Transfer (STTT), 10 (3) : 233-245, June 2008. [bib] [copylink] [doi] [research]
Judy Goldsmith, Matthias Hagen, and Martin Mundhenk. Complexity of DNF minimization and isomorphism testing for monotone formulas. Information and Computation, 206 (6) : 760-775, June 2008. [bib] [copylink]
Benno Stein. Model Construction for Knowledge-Intensive Engineering Tasks. In Ying Liu, Aixin Sun, Han Tong Loh, Wen Feng Lu, and Ee-Peng Lima, editors, Advances of Computational Intelligence in Industrial Systems, pages 139-167, Springer. June 2008. [bib] [copylink] [doi] [research]
Martin Potthast and Benno Stein. New Issues in Near-duplicate Detection. In Christine Preisach, Hans Burkhardt, Lars Schmidt-Thieme, and Reinhold Decker, editors, Data Analysis, Machine Learning and Applications. Selected papers from the 31th Annual Conference of the German Classification Society (GFKL 2007), Studies in Classification, Data Analysis, and Knowledge Organization, pages 601-609, May 2008. Springer. [bib] [copylink] [doi] [research] [slides]
Khaled M. Elbassioni, Matthias Hagen, and Imran Rauf. Some Fixed-Parameter Tractable Classes of Hypergraph Duality and Related Problems. In Martin Grohe and Rolf Niedermeier, editors, Third International Workshop on Parameterized and Exact Computation (IWPEC 2008), volume 5018 of Lecture Notes in Computer Science, pages 91-102, May 2008. Springer. [bib] [copylink]
Martin Potthast, Benno Stein, and Maik Anderka. A Wikipedia-Based Multilingual Retrieval Model. In Craig Macdonald et al., editors, Advances in Information Retrieval. 30th European Conference on IR Research (ECIR 2008), volume 4956 of Lecture Notes in Computer Science, pages 522-530, March 2008. Springer. [bib] [copylink] [doi] [poster] [research] [slides] [wikipedia]
Martin Potthast, Benno Stein, and Robert Gerling. Automatic Vandalism Detection in Wikipedia. In Craig Macdonald et al., editors, Advances in Information Retrieval. 30th European Conference on IR Research (ECIR 2008), volume 4956 of Lecture Notes in Computer Science, pages 663-668, March 2008. Springer. [award] [bib] [copylink] [data] [doi] [poster] [research]
2007
Benno Stein, Moshe Koppel, and Efstathios Stamatatos. Plagiarism Analysis, Authorship Identification, and Near-Duplicate Detection (PAN 2007). SIGIR Forum, 41 (2) : 68-71, December 2007. [bib] [copylink] [doi] [event] [publisher]
Benno Stein and Sven Meyer zu Eißen. Fingerprint-based Similarity Search and its Applications. In Kurt Kremer and Volker Macho, editors, Forschung und wissenschaftliches Rechnen 2006, pages 85-98, Gesellschaft für wissenschaftliche Datenverarbeitung. November 2007. [bib] [copylink]
Benno Stein and Martin Potthast. Construction of Compact Retrieval Models. In Sándor Dominich and Ferenc Kiss, editors, Studies in Theory of Information Retrieval. 1st International Conference on the Theory of Information Retrieval (ICTIR 2007), pages 85-93, October 2007. Foundation for Information Society. [bib] [copylink] [research] [slides]
Stephan Arens, Alexander Buss, Helena Deck, Miroslaw Dynia, Matthias Fischer, Holger Hagedorn, Peter Isaak, Jaroslaw Kutylowski, Friedhelm Meyer auf der Heide, Viktor Nesterow, Adrian Ogiermann, Boris Stobbe, Thomas Storm, and Henning Wachsmuth. Smart Teams: Simulating Large Robotic Swarms in Vast Environments. In Ulrich Rückert, Joaquin Sitte, and Ulf Witkowski, editors, 4th International Symposium on Autonomous Minirobots for Research and Edutainment, pages 215-222, October 2007. Heinz Nixdorf Institut, University of Paderborn. [bib] [copylink]
Benno Stein and Frank Benteler. On the Generalized Box-Drawing of Trees: Survey and New Technology. In Klaus Tochtermann and Hermann Maurer, editors, 7th International Conference on Knowledge Management (I-KNOW 2007), Journal of Universal Computer Science, pages 408-415, September 2007. Springer. [bib] [copylink]
Sven Meyer zu Eißen and Benno Stein. An MDA Approach to Implement Personal Information Retrieval Tools. In A Min Tjoa and Roland Wagner, editors, 4th International Workshop on Text-Based Information Retrieval (TIR 2007) at DEXA, pages 259-263, September 2007. IEEE. [bib] [copylink] [doi]
Matthias Hagen. On the fixed-parameter tractability of the equivalence test of monotone normal forms. Information Processing Letters, 103 (4) : 163-167, August 2007. [bib] [copylink]
Benno Stein and Sven Meyer zu Eißen. Intrinsic Plagiarism Analysis with Meta Learning. In Benno Stein, Moshe Koppel, and Efstathios Stamatatos, editors, 1st Workshop on Plagiarism Analysis, Authorship Identification, and Near-Duplicate Detection (PAN 2007) at SIGIR, volume 276 of CEUR Workshop Proceedings, pages 45-50, July 2007. [bib] [copylink] [publisher] [research]
Benno Stein and Sven Meyer zu Eißen. Topic-Identifikation: Formalisierung, Analyse und neue Verfahren. KI - Künstliche Intelligenz, 3 : 16-22, July 2007. [bib] [copylink] [publisher] [research]
Benno Stein, Moshe Koppel, and Efstathios Stamatatos, editors. 1st Workshop on Plagiarism Analysis, Authorship Identification, and Near-Duplicate Detection (PAN 2007) at SIGIR, volume 276 of CEUR Workshop Proceedings, July 2007. [bib] [copylink] [event] [publisher]
Benno Stein. Principles of Hash-based Text Retrieval. In Charles Clarke et al., editors, 30th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2007), pages 527-534, July 2007. ACM. [bib] [copylink] [doi] [research]
Benno Stein, Sven Meyer zu Eißen, and Martin Potthast. Strategies for Retrieving Plagiarized Documents. In Charles Clarke et al., editors, 30th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2007), pages 825-826, July 2007. ACM. [bib] [copylink] [doi] [poster] [research]
Martin Potthast. Wikipedia in the Pocket–Indexing Technology for Near-duplicate Detection and High Similarity Search. In Charles Clarke et al., editors, 30th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2007), pages 909, July 2007. ACM. [bib] [copylink] [research]
Matthias Hagen. Lower Bounds for Three Algorithms for the Transversal Hypergraph Generation. In Andreas Brandstädt, Dieter Kratsch, and Haiko Müller, editors, 33rd International Workshop on Graph-Theoretic Concepts in Computer Science (WG 2007), volume 4769 of Lecture Notes in Computer Science, pages 316-327, June 2007. Springer. [bib] [copylink]
Heinrich Balzer, Benno Stein, and Oliver Niggemann. Diagnose in verteilten automotiven Systemen. In Jürgen Gausemeier, editors, 5. Paderborner Workshop Entwurf mechatronischer Systeme, volume 210 of HNI-Schriftenreihe, pages 243-254, March 2007. Heinz Nixdorf Institut. [bib] [copylink]
Benno Stein and Martin Potthast. Applying Hash-based Indexing in Text-Based Information Retrieval. In Marie-Francine Moens, Tinne Tuytelaars, and Arjen P. de Vries, editors, 7th Dutch-Belgian Information Retrieval Workshop (DIR 2007), pages 29-35, March 2007. Faculty of Engineering, Universiteit Leuven. [bib] [copylink] [research] [slides]
Sven Meyer zu Eißen. On Information Need and Categorizing Search. Dissertation, University of Paderborn, February 2007. [bib] [copylink] [publisher] [research]