Archive Query Log 2022

Synopsis

The AQL-22 is a query log collected at the Internet Archive over the last 25 years. Includes 356 M queries, 137 M search result pages, and 1.4 B billion search results across 550 search providers. The AQL-22 is the first public query log that is on par with commercial logs with respect to size, scope, and diversity. Provided in a privacy-preserving manner, it promotes open research as well as more transparency and accountability in the search industry.

Access

Please refer to the publications for citing the dataset. If you want to link the dataset, please use the dataset permalink [doi].

  • Browse the dataset here.
  • Find the related metadata at Google.

People

Publications