umu.sePublications
Change search

Cite
Citation style
• apa
• ieee
• modern-language-association-8th-edition
• vancouver
• Other style
More styles
Language
• de-DE
• en-GB
• en-US
• fi-FI
• nn-NO
• nn-NB
• sv-SE
• Other locale
More languages
Output format
• html
• text
• asciidoc
• rtf
Zipf's law unzipped
Umeå University, Faculty of Science and Technology, Department of Physics.
Umeå University, Faculty of Science and Technology, Department of Physics.
2011 (English)In: New Journal of Physics, ISSN 1367-2630, E-ISSN 1367-2630, Vol. 13, p. 043004-Article in journal (Refereed) Published
##### Abstract [en]

Why does Zipf's law give a good description of data from seemingly completely unrelated phenomena? Here it is argued that the reason is that they can all be described as outcomes of a ubiquitous random group division: the elements can be citizens of a country and the groups family names, or the elements can be all the words making up a novel and the groups the unique words, or the elements could be inhabitants and the groups the cities in a country, and so on. A Random Group Formation (RGF) is presented from which a Bayesian estimate is obtained based on minimal information: it provides the best prediction for the number of groups with $k$ elements, given the total number of elements, groups, and the number of elements in the largest group. For each specification of these three values, the RGF predicts a unique group distribution $N(k)\propto \exp(-bk)/k^{\gamma}$, where the power-law index $\gamma$ is a unique function of the same three values. The universality of the result is made possible by the fact that no system specific assumptions are made about the mechanism responsible for the group division. The direct relation between $\gamma$ and the total number of elements, groups, and the number of elements in the largest group, is calculated. The predictive power of the RGF model is demonstrated by direct comparison with data from a variety of systems. It is shown that $\gamma$ usually takes values in the interval $1\leq\gamma\leq 2$ and that the value for a given phenomena depends in a systematic way on the total size of the data set. The results are put in the context of earlier discussions on Zipf's and Gibrat's laws, $N(k)\propto k^{-2}$ and the connection between growth models and RGF is elucidated.

##### Place, publisher, year, edition, pages
IoP , 2011. Vol. 13, p. 043004-
##### National Category
Other Physics Topics
##### Research subject
Theoretical Physics
##### Identifiers
OAI: oai:DiVA.org:umu-42557DiVA, id: diva2:409650
##### Funder
Swedish Research Council, 2008-4449 Available from: 2011-04-11 Created: 2011-04-10 Last updated: 2017-12-11Bibliographically approved

#### Open Access in DiVA

##### File information
File name FULLTEXT02.pdfFile size 735 kBChecksum SHA-512
07d926f128c6328e8b8c7c0b6d750a1328ecb18325c9c011f92981ea4cf94deea1eec1e94b3bb98cdb0fdb50be098948eb2bb6015b5bfcdb0e367e674fa86cff
Type fulltextMimetype application/pdf

Publisher's full text

#### Search in DiVA

##### By author/editor
Baek, Seung KiMinnhagen, Petter
##### By organisation
Department of Physics
##### In the same journal
New Journal of Physics
##### On the subject
Other Physics Topics

#### Search outside of DiVA

The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available
doi
urn-nbn

#### Altmetric score

doi
urn-nbn
Total: 219 hits

Cite
Citation style
• apa
• ieee
• modern-language-association-8th-edition
• vancouver
• Other style
More styles
Language
• de-DE
• en-GB
• en-US
• fi-FI
• nn-NO
• nn-NB
• sv-SE
• Other locale
More languages
Output format
• html
• text
• asciidoc
• rtf