Benford's Law and the First Letter of Words
Umeå University, Faculty of Science and Technology, Department of Physics.
2018 (English). In: Physica A: Statistical Mechanics and its Applications, ISSN 0378-4371, E-ISSN 1873-2119, Vol. 512, p. 305-315. Article in journal (Other academic). Published.
Abstract [en]

A universal First-Letter Law (FLL) is derived and described. It predicts the percentages of first letters for words in novels. The FLL is akin to Benford’s law (BL) of first digits, which predicts the percentages of first digits in a collection of numerical data. Both are universal in the sense that FLL depends only on the number of letters in the alphabet, whereas BL depends only on the number of digits in the base of the number system. The existence of universal laws of this type appears counter-intuitive. Nonetheless, both describe data very well. Relations to some earlier works are given. FLL predicts that an English author on average starts about 16 out of 100 words with the English letter ‘t’. This is corroborated by data, even though an author can freely write anything. Fuller implications and the applicability of FLL remain for the future.
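
As a rough illustration of the two quantities the abstract compares, the sketch below computes the standard Benford first-digit probabilities, P(d) = log_b(1 + 1/d) for a number system of base b, and the empirical share of each first letter in a piece of text. The FLL formula itself is not given in this record, so it is not implemented here. The function names, the ASCII-only tokenizer, and the sample sentence are assumptions made for this example, not part of the paper.

```python
import math
import re
from collections import Counter


def benford_probabilities(base=10):
    """Benford's law: P(d) = log_base(1 + 1/d) for d = 1 .. base - 1."""
    return {d: math.log(1 + 1 / d, base) for d in range(1, base)}


def first_letter_shares(text):
    """Empirical share of each first letter among the words of `text`.

    Assumption: words are runs of ASCII letters, optionally joined by
    apostrophes (so "don't" counts as one word starting with 'd').
    """
    words = re.findall(r"[a-z]+(?:'[a-z]+)*", text.lower())
    if not words:
        return {}
    counts = Counter(word[0] for word in words)
    total = sum(counts.values())
    return {letter: n / total for letter, n in counts.most_common()}


if __name__ == "__main__":
    # Predicted first-digit percentages in base 10.
    for digit, p in benford_probabilities().items():
        print(f"digit {digit}: {100 * p:.1f}%")

    # Observed first-letter percentages in a toy sample (hypothetical text).
    sample = "The cat took the tea"
    for letter, share in first_letter_shares(sample).items():
        print(f"letter {letter}: {100 * share:.1f}%")
```

Running `first_letter_shares` on the full text of a novel would give the observed share of ‘t’ that the abstract compares against the predicted value of about 16 out of 100 words.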

Place, publisher, year, edition, pages
Elsevier, 2018. Vol. 512, p. 305-315
Keywords [en]
First-Letter Law, Benford’s law, universal frequency ladder, Random Group Formation, maximum entropy
National Category
Language Technology (Computational Linguistics)
Research subject
Linguistics; Theoretical Physics
Identifiers
URN: urn:nbn:se:umu:diva-143257
DOI: 10.1016/j.physa.2018.08.133
ISI: 000446151000026
OAI: oai:DiVA.org:umu-143257
DiVA, id: diva2:1167977
Available from: 2017-12-19. Created: 2017-12-19. Last updated: 2018-11-06. Bibliographically approved.

Author
Minnhagen, Petter
