Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Implementing a speech-to-text pipeline on the MICO platform
Umeå University, Faculty of Science and Technology, Department of Computing Science.ORCID iD: 0000-0002-4696-9787
Umeå University, Faculty of Science and Technology, Department of Computing Science.ORCID iD: 0000-0001-8503-0118
Umeå University, Faculty of Science and Technology, Department of Computing Science.ORCID iD: 0000-0002-1112-2981
Umeå University, Faculty of Science and Technology, Department of Computing Science.ORCID iD: 0000-0001-5496-5041
2016 (English)Report (Other academic)
Abstract [en]

MICO is an open-source platform for cross-media analysis, querying, and recommendation. It is the major outcome of the European research project Media in Context, and has been contributed to by academic and industrial partners from Germany, Austria, Sweden, Italy, and the UK. A central idea is to group sets of related media objects into multimodal content items, and to process and store these as logical units. The platform is designed to be easy to extend and adapt, and this makes it a useful building block for a diverse set of multimedia applications. To promote the platform and demonstrate its potential, we describe our work on a Kaldi-based speech-recognition pipeline.

Place, publisher, year, edition, pages
Umeå University, 2016.
Series
Report / UMINF, ISSN 0348-0542 ; 16.07
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-220303OAI: oai:DiVA.org:umu-220303DiVA, id: diva2:1833467
Available from: 2024-02-01 Created: 2024-02-01 Last updated: 2024-02-01Bibliographically approved

Open Access in DiVA

fulltext(94 kB)104 downloads
File information
File name FULLTEXT01.pdfFile size 94 kBChecksum SHA-512
784450988ced071297681869779be34e2b8c96fbc0f7e4c99ae54060d0eb5539f7931523d4ca60be2282a2514c24c7563296e20774cbf406165fe9175afa660f
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Björklund, HenrikBjörklund, JohannaDahlgren, AdamDemeke, Yonas

Search in DiVA

By author/editor
Björklund, HenrikBjörklund, JohannaDahlgren, AdamDemeke, Yonas
By organisation
Department of Computing Science
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 104 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 384 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf