Document distances using the Zipf distribution and a novel metric
2003 (English)Report (Other academic)
A novel metric is proposed in the present report for the evaluation of the goodness-of-fit criterion between the distribution functions of two samples. We extend the usage of the proposed criterion for the case of the generalized Zipf distribution. Detailed mathematical analysis of the proposed metric, which is embodied in a hypothesis testing, is also provided.
Place, publisher, year, edition, pages
Umeå: Tillämpad fysik och elektronik , 2003. , 15 p.
DML Technical Report, ISSN 1652-8441 ; DML-TR-2003:01
Signalbehandling, Zipf distribution, n-gram frequencies, bhattacharyya metric
IdentifiersURN: urn:nbn:se:umu:diva-407OAI: oai:DiVA.org:umu-407DiVA: diva2:143387