Advances in consumer virtual reality (VR) and artificial intelligence (AI) have accelerated the use of immersive virtual learning environments (iVLEs) for skills training. Learner engagement is a critical determinant of training effectiveness, which can be shaped by VR system features (e.g., visual, auditory, and tactile immersion) coupled with interaction mechanics and instructional design integrated with the instructional behaviors of virtual human assistants (VHAs). Although visual and behavioral fidelity in VHAs have been extensively studied, functional fidelity (i.e., the extent to which the iVLE and/or VHAs support cognitive, perceptual, and motor processes required to perform a task regardless of visual realism), and particularly the temporal alignment of instructional guidance with learners’ cognitive and motor demands, remains underexamined. This article highlights research on VHAs in iVLEs with a special emphasis on temporal functional fidelity as an emerging requirement for synchronizing instructional support with user workload and task phases. By consolidating existing findings and highlighting gaps in current empirical work, this article outlines key implications for the design and evaluation of VHAs and identifies directions for future research aimed at optimizing instructional timing in iVLEs. The goal is to inform principled VHA design and clarify how fidelity dimensions should be integrated to support effective, pedagogically grounded immersive learning experiences.
Advances in computational processing power and the miniaturization of consumer electronics have substantially reduced the cost and increased the accessibility of immersive technologies, contributing to the rapid expansion of virtual reality (VR) as a widely available platform for simulation and training
[1]. VR enables users to interact within computer-generated environments through head-mounted displays (HMDs), motion-tracked controllers, and hand tracking that has been shown to reshape human–computer interaction and support immersive learning experiences
[2]. When combined with artificial intelligence (AI), these capabilities have accelerated the development of immersive virtual learning environments (iVLEs), including virtual simulations and serious games that provide adaptive instruction, enhance conceptual understanding, and improve knowledge retention
[3].
A central determinant of iVLE effectiveness is learner engagement, a multidimensional construct associated with persistence, achievement, and enjoyment
[4][5][6]. Engagement is particularly consequential in VR, where immersive experiences can increase motivation and support authentic learning and performance
[7]. Within iVLEs, virtual humans (VHs) used as instructional or representational agents, also referred to as virtual human assistants (VHAs), play a critical role in shaping engagement by mediating how learners attend to, interact with, and interpret the virtual learning experience. Research indicates that VHAs with appropriate representational qualities can strengthen embodiment and user acceptance compared to abstract representations, thereby supporting sustained involvement and task engagement
[8].
VHAs play an increasingly prominent role in iVLEs by providing instruction, feedback, and adaptive support
[9][10]. Their effectiveness is shaped by diverse fidelity dimensions, including visual, behavioral, auditory, cognitive, and functional, all of which influence learner perception and interaction
[9][11]. Although fidelity is often discussed broadly as the realism of a simulation
[12][13], research demonstrates considerable inconsistency in its definition and application
[14][15]. For example, structural fidelity refers to perceptual realism, whereas functional fidelity concerns how well task interactions align with learning objectives. In contrast, realism reflects subjective user perception and varies based on expectations and context
[16][17][18].
Efforts to increase fidelity have revealed challenges. Visual or behavioral hyper-realism may elicit uncanny valley responses
[19], and mismatches in audio fidelity or vocal characteristics can disrupt presence and engagement
[20]. Empirical findings remain mixed. More specifically, some studies report that higher fidelity enhances social presence and perceived attractiveness, whereas others show that medium fidelity animations can increase eeriness or bias user judgments
[21]. As VR research increasingly prioritizes photorealistic rendering
[22][23] and AI-driven interactivity
[24], the need to clarify how fidelity affects learning has become more pressing.
A key unresolved question concerns the optimal level and type of fidelity needed to support engagement and performance in iVLEs. Although fidelity is frequently assumed to improve learning, current evidence remains inconclusive
[25][26]. These findings suggest that learning effectiveness depends not only on how realistic a virtual environment or instructor (VHA) appears, but on how it is functionally applied
[14]. In this context, temporal functional fidelity (i.e., the alignment of instructional timing with learners’ cognitive and motor demands) has received comparatively limited attention despite its strong theoretical relevance for managing cognitive load and supporting procedural skill acquisition.
This article highlights key implications for VHA design and evaluation and identifies future research directions for optimizing instructional timing in iVLEs. The aim is to inform principled VHA design and clarify how fidelity dimensions should be integrated to support effective, pedagogically grounded immersive virtual learning.
This entry is adapted from the peer-reviewed paper 10.3390/encyclopedia6040077