Change search
ReferencesLink to record
Permanent link

Direct link
Following Conversational Traces: Part 1: Creating a corpus with the ICWSM dataset.
Umeå University, Faculty of Arts, Humlab. Umeå University, Faculty of Arts, Modern Languages. Engelska.
2007 (English)Conference paper (Refereed)
Abstract [en]

This poster will present the methodology behind the creation of a linguistic corpus based on a subset of the 2007 International Conference on Weblogs and Social Media dataset. Posts from a small group of political bloggers were tagged for parts of speech and indexed into a corpus using the program Xairia. From this corpus, the political blogger subset will be investigated for register and referential information. Referential information,especially with regards to new and given information, will be compared against network placement both to identify network innovators as well as to compare network placement as a catalyst for innovation. The final section, Further Research, will outline the modifications necessary for the creation of a full-scale corpus based on the entire ICWSM 2006 dataset.

Place, publisher, year, edition, pages
2007. 2- p.
Keyword [en]
Weblogs, blog, network, community of practice, corpus, sociolinguistics, ICWSM
National Category
Language Technology (Computational Linguistics) Social Anthropology
URN: urn:nbn:se:umu:diva-6590OAI: diva2:146259
International Conference on Weblogs and Social Media, Boulder, Colorado, U.S.A., March 26-28, 2007.
Available from: 2007-12-14 Created: 2007-12-14 Last updated: 2013-07-08Bibliographically approved

Open Access in DiVA

No full text

Other links

Search in DiVA

By author/editor
Hendrick, Stephanie
By organisation
HumlabModern Languages
Language Technology (Computational Linguistics)Social Anthropology

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 36 hits
ReferencesLink to record
Permanent link

Direct link