Vol. 20, No. 2, February 28, 2026
10.3837/tiis.2026.02.009
Abstract
This study investigates how interface modality in multimodal generative AI systems shapes user experience and satisfaction. Although systems such as OpenAI's GPT-4o increasingly support multimodal interaction, research has emphasized technical performance more than user-centered interface design. Addressing this gap, we examine how modality affects social presence, perceived enjoyment, concentration, preference, and satisfaction. A between-subjects experimental survey used imagined-interaction stimuli for three conditions: (1) GPT-4o's default text interface, (2) GPT-4o's native audio interface, and (3) a visually enhanced interface in which participants selected one of five MetaHuman characters, providing simplified, non-embodied visual cues to manipulate visual presence. A total of 922 participants were randomly assigned to conditions. Established scales measured all constructs. Data were analyzed using ANOVA and structural equation modeling (SEM). ANOVA revealed significant modality-based differences in social presence and perceived enjoyment, but not in concentration. SEM clarified this pattern: higher social presence significantly increased both concentration and enjoyment. Concentration strongly predicted enjoyment, which increased preference and, in turn, satisfaction. Notably, the text-based interface produced higher enjoyment than the MetaHuman-based visual condition, suggesting that simplified or non-realistic visual cues do not necessarily improve user experience. This is among the first empirical studies to test how visual interface elements in multimodal generative AI shape user experience via presence.
Cite this article
[IEEE Style]
D. Lee, "Effects of Visual Interface Elements within Generative AI Multimodal Systems on User Experience and Satisfaction," KSII Transactions on Internet and Information Systems, vol. 20, no. 2, pp. 813-829, 2026. DOI: 10.3837/tiis.2026.02.009.
[ACM Style]
Doyeon Lee. 2026. Effects of Visual Interface Elements within Generative AI Multimodal Systems on User Experience and Satisfaction. KSII Transactions on Internet and Information Systems, 20, 2, (2026), 813-829. DOI: 10.3837/tiis.2026.02.009.
[BibTeX Style]
@article{tiis:105896, title="Effects of Visual Interface Elements within Generative AI Multimodal Systems on User Experience and Satisfaction", author="Doyeon Lee", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2026.02.009}, volume={20}, number={2}, year="2026", month={February}, pages={813-829}}