The ProT Nordic Web Dataset
2012 (English). In: Proceedings of the International Conference on Internet Computing: ICOMP 2012 / [ed] Hamid R. Arabnia, Victor A. Clincy, Leonidas Deligiannidis, Andy Marsh, Ashu M. G. Solo, Las Vegas, Nevada: CSREA Press, 2012, 125-128 p. Conference paper, Presentation (Refereed)
In this paper we present a free dataset for testing web search engines. The dataset corresponds to a snapshot of the Nordic part of the Internet in early 2007 and is highly abstracted, with each web page represented by a number. The released dataset consists of three parts: a graph, 76 sets of pages, each holding the pages that contain a tested word combination, and some files to use when calculating the relevance of the result sets returned by algorithms or search engines. We also present a new compound statistic as well as statistical results for some search engine and information retrieval algorithms.
Place, publisher, year, edition, pages
Las Vegas, Nevada: CSREA Press, 2012. 125-128 p.
Keywords
Nordic Web Dataset, Search Engine Evaluation, Relevance Metrics
Research subject Computer Science
Identifiers
URN: urn:nbn:se:umu:diva-64012; ISBN: 1-60132-220-8; OAI: oai:DiVA.org:umu-64012; DiVA: diva2:586455
The 2012 International Conference on Internet Computing (ICOMP'12)