Webis-Wikipedia-Text-Reuse-18

Synopsis

The Wikipedia Text Reuse Corpus 2018 (Webis-Wikipedia-Text-Reuse-18) contains text reuse cases extracted from within Wikipedia and between Wikipedia and a sample of the Common Crawl.

Download

You can access the "Within Wikipedia Text Reuse" corpus on Zenodo.

  • wikipedia.tar.gz (3.6 GB)

    - Each line, representing a Wikipedia article, contains a JSON array of article_id, article_title, and article_body

  • within-wikipedia-tr-01.gz (4.4 GB)

    - Each line, representing a text reuse case, contains a JSON array of s_id (source article id), t_id (target article id), s_text (source text), t_text (target text)

  • within-wikipedia-tr-02.gz (3.7 GB)

    - Each line, representing a text reuse case, contains a JSON array of s_id (source article id), t_id (target article id), s_text (source text), t_text (target text)
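The line format above can be consumed with a few lines of Python. This is a minimal sketch, assuming each line of a gzipped file such as within-wikipedia-tr-01.gz is a JSON array in the field order listed (s_id, t_id, s_text, t_text); the function name is our own, not part of the corpus distribution.

```python
import gzip
import json

def read_reuse_cases(path):
    """Yield one dict per text reuse case from a gzipped JSON-lines file.

    Assumes each line is a JSON array [s_id, t_id, s_text, t_text],
    as described in the corpus documentation above.
    """
    with gzip.open(path, "rt", encoding="utf-8") as f:
        for line in f:
            s_id, t_id, s_text, t_text = json.loads(line)
            yield {"s_id": s_id, "t_id": t_id,
                   "s_text": s_text, "t_text": t_text}
```

Because the function is a generator, even the multi-gigabyte archives can be scanned without loading them fully into memory.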

You can access the "Commoncrawl & Wikipedia Text Reuse" corpus on Zenodo.

  • preprocessed_web_sample.tar.gz (download)

    - Each line, representing a web page, contains a JSON array of page_id, page_url, and content

  • without-wikipedia-tr.zip (download)

    - Each line, representing a text reuse case, contains a JSON array of s_id (Wikipedia page id), t_id (web page id), s_text (source text), t_text (target text)
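The zipped variant can be read the same way with the standard library. This is a hedged sketch, not part of the corpus tooling: the member names inside without-wikipedia-tr.zip are assumed to be JSON-lines files, each line a JSON array [s_id, t_id, s_text, t_text] as documented above.

```python
import json
import zipfile

def iter_cases(zip_path):
    """Yield (s_id, t_id, s_text, t_text) tuples from a zip of
    JSON-lines members. Member names inside the archive are assumed,
    not specified by the corpus documentation."""
    with zipfile.ZipFile(zip_path) as zf:
        for name in zf.namelist():
            with zf.open(name) as f:
                for raw in f:
                    s_id, t_id, s_text, t_text = json.loads(raw)
                    yield s_id, t_id, s_text, t_text
```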

Research

The datasets were extracted by Alshomary et al. (2018) in work that aimed to study the text reuse phenomenon related to Wikipedia at scale. A pipeline for large-scale text reuse extraction was developed and applied to Wikipedia and the Common Crawl.

People

Publications