VOICE: Visual Oracle for Interaction, Conversation, and ExplanationShow others and affiliations
2025 (English)In: IEEE Transactions on Visualization and Computer Graphics, ISSN 1077-2626, E-ISSN 1941-0506Article in journal (Refereed) Epub ahead of print
Abstract [en]
We present VOICE, a novel approach to science communication that connects large language models' conversational capabilities with interactive exploratory visualization. VOICE introduces several innovative technical contributions that drive our conversational visualization framework. Based on the collected design requirements, we introduce a two-layer agent architecture that can perform task assignment, instruction extraction, and coherent content generation. We employ fine-tuning and prompt engineering techniques to tailor agents' performance to their specific roles and accurately respond to user queries. Our interactive text-to-visualization method generates a flythrough sequence matching the content explanation. In addition, natural language interaction provides capabilities to navigate and manipulate 3D models in real-time. The VOICE framework can receive arbitrary voice commands from the user and respond verbally, tightly coupled with a corresponding visual representation, with low latency and high accuracy. We demonstrate the effectiveness of our approach by implementing a proof-of-concept prototype and applying it to the molecular visualization domain: analyzing three 3D molecular models with multiscale and multi-instance attributes. Finally, we conduct a comprehensive evaluation of the system, including quantitative and qualitative analyses on our collected dataset, along with a detailed public user study and expert interviews. The results confirm that our framework and prototype effectively meet the design requirements and cater to the needs of diverse target users.
Place, publisher, year, edition, pages
IEEE, 2025.
Keywords [en]
Conversational visualization, explanatory visualization, multiscale data
National Category
Computer Sciences Human Computer Interaction
Identifiers
URN: urn:nbn:se:umu:diva-241710DOI: 10.1109/TVCG.2025.3579956PubMedID: 40522810Scopus ID: 2-s2.0-105008684015OAI: oai:DiVA.org:umu-241710DiVA, id: diva2:1981237
Funder
Knut and Alice Wallenberg Foundation, KAW 2019.00242025-07-032025-07-032025-07-03