Using the ProT Nordic Web Dataset
2011 (English) Report (Other academic)
In this paper we present a free dataset for testing web search engines. The dataset is a snapshot of the Nordic part of the Internet from early 2007 and is highly abstracted: each web page is represented by a number. The released dataset consists of three parts: a graph, 76 sets of pages, each listing the pages that contain one tested word combination, and a set of files for calculating the relevance of the result sets returned by algorithms or search engines. We also present statistics for some search engine algorithms.
Place, publisher, year, edition, pages
2011. 29 p.
Report / UMINF, ISSN 0348-0542 ; 13
Nordic Web Dataset, Search Engine Evaluation, Relevance Metrics
Other Electrical Engineering, Electronic Engineering, Information Engineering
Research subject Computer and Information Science
Identifiers
URN: urn:nbn:se:umu:diva-49307
OAI: oai:DiVA.org:umu-49307
DiVA: diva2:454557