Umeå University's logo

umu.sePublications
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
VOICE: Visual Oracle for Interaction, Conversation, and Explanation
King Abdullah University of Science and Technology (KAUST), Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division, Saudi Arabia.
King Abdullah University of Science and Technology (KAUST), Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division, Saudi Arabia.
Linköping University, Department of Science and Technology, Sweden.
King Abdullah University of Science and Technology (KAUST), Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division, Saudi Arabia.
Show others and affiliations
2025 (English)In: IEEE Transactions on Visualization and Computer Graphics, ISSN 1077-2626, E-ISSN 1941-0506Article in journal (Refereed) Epub ahead of print
Abstract [en]

We present VOICE, a novel approach to science communication that connects large language models' conversational capabilities with interactive exploratory visualization. VOICE introduces several innovative technical contributions that drive our conversational visualization framework. Based on the collected design requirements, we introduce a two-layer agent architecture that can perform task assignment, instruction extraction, and coherent content generation. We employ fine-tuning and prompt engineering techniques to tailor agents' performance to their specific roles and accurately respond to user queries. Our interactive text-to-visualization method generates a flythrough sequence matching the content explanation. In addition, natural language interaction provides capabilities to navigate and manipulate 3D models in real-time. The VOICE framework can receive arbitrary voice commands from the user and respond verbally, tightly coupled with a corresponding visual representation, with low latency and high accuracy. We demonstrate the effectiveness of our approach by implementing a proof-of-concept prototype and applying it to the molecular visualization domain: analyzing three 3D molecular models with multiscale and multi-instance attributes. Finally, we conduct a comprehensive evaluation of the system, including quantitative and qualitative analyses on our collected dataset, along with a detailed public user study and expert interviews. The results confirm that our framework and prototype effectively meet the design requirements and cater to the needs of diverse target users.

Place, publisher, year, edition, pages
IEEE, 2025.
Keywords [en]
Conversational visualization, explanatory visualization, multiscale data
National Category
Computer Sciences Human Computer Interaction
Identifiers
URN: urn:nbn:se:umu:diva-241710DOI: 10.1109/TVCG.2025.3579956PubMedID: 40522810Scopus ID: 2-s2.0-105008684015OAI: oai:DiVA.org:umu-241710DiVA, id: diva2:1981237
Funder
Knut and Alice Wallenberg Foundation, KAW 2019.0024Available from: 2025-07-03 Created: 2025-07-03 Last updated: 2025-07-03

Open Access in DiVA

fulltext(53861 kB)160 downloads
File information
File name FULLTEXT01.pdfFile size 53861 kBChecksum SHA-512
883260d671c1f8e3672d6da67223a4b62e94e04c78e64ce2421813a1758d1ba18aa2a310c1f859530a6a0f458f567c4bca44487b37fd6eb9aa67e34dfd9642ac
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Björklund, Johanna

Search in DiVA

By author/editor
Björklund, Johanna
By organisation
Department of Computing Science
In the same journal
IEEE Transactions on Visualization and Computer Graphics
Computer SciencesHuman Computer Interaction

Search outside of DiVA

GoogleGoogle Scholar
Total: 160 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 202 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf