Crime and Relationship: Exploring Gender Bias in NLP Corpora
Devinney, Hannah. Umeå University, Faculty of Science and Technology, Department of Computing Science; Umeå University, Faculty of Social Sciences, Umeå Centre for Gender Studies (UCGS). (Foundations of Language Processing)
Björklund, Jenny. Uppsala University. ORCID iD: 0000-0002-4954-4397
Björklund, Henrik. Umeå University, Faculty of Science and Technology, Department of Computing Science. ORCID iD: 0000-0002-4696-9787
2020 (English). Conference paper, Published paper (Refereed)
Abstract [en]

Gender bias in natural language processing (NLP) tools, deriving from implicit human bias embedded in language data, is an important and complicated problem on the road to fair algorithms. We leverage topic modeling to retrieve documents associated with particular gendered categories, and discuss how exploring these documents can inform our understanding of the corpora we may use to train NLP tools. This is a starting point for challenging the systemic power structures and producing a justice-focused approach to NLP.

Place, publisher, year, edition, pages
2020.
Keywords [en]
gender bias, topic modeling
National Category
Language Technology (Computational Linguistics); Gender Studies
Research subject
Computer Science; gender studies
Identifiers
URN: urn:nbn:se:umu:diva-177583
OAI: oai:DiVA.org:umu-177583
DiVA, id: diva2:1509712
Conference
SLTC 2020 – The Eighth Swedish Language Technology Conference, 25–27 November 2020, Online
Projects
EQUITBL
Available from: 2020-12-14 Created: 2020-12-14 Last updated: 2021-01-14 Bibliographically approved

Open Access in DiVA

fulltext (106 kB), 252 downloads
File information
File name: FULLTEXT01.pdf
File size: 106 kB
Checksum (SHA-512): 225c1115d9fda60ee0a8c034ddfca9b8c8488631bc3e3c5fb857b7846c54e6b39c56e3cedaa856e82d3de5ee284376c17b694a1e212aecfadb75daea445d379b
Type: fulltext
Mimetype: application/pdf
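
The SHA-512 checksum listed for the full text can be used to confirm that a downloaded copy is intact. The following is a minimal sketch, assuming the PDF has been saved locally; the function names `sha512_of_file` and `matches_record` and the local path in the usage note are hypothetical and not part of the record.

```python
import hashlib

# Checksum copied from the file information in this record (128 hex characters).
EXPECTED_SHA512 = (
    "225c1115d9fda60ee0a8c034ddfca9b8c8488631bc3e3c5fb857b7846c54e6b3"
    "9c56e3cedaa856e82d3de5ee284376c17b694a1e212aecfadb75daea445d379b"
)


def sha512_of_file(path, chunk_size=65536):
    """Return the hex SHA-512 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha512()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are never loaded whole.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_SHA512):
    """Hypothetical helper: True if the file's digest equals the listed checksum."""
    return sha512_of_file(path) == expected.lower()
```

For example, `matches_record("FULLTEXT01.pdf")` would return `True` only if the local file is byte-for-byte identical to the one the checksum was computed from.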

Authority records

Devinney, Hannah; Björklund, Jenny; Björklund, Henrik
