Eye Tracking and Visual Attention to Live Streaming: Comparison
Please note this is a comparison between Version 1 by Hsuan Chu CHEN and Version 2 by Sirius Huang.

The COVID-19 pandemic has led to the development of a new business model, “Live Streaming + Ecommerce”, which is a new method for commercial sales that shares the goal of sustainable economic growth (SDG 8). As information technology finds its way into the digital lives of internet users, the real-time and interactive nature of live streaming has overturned the traditional entertainment experience of audio and video content, moving towards a more nuanced division of labor with multiple applications. Researchers have used eye tracking technology in order to understand consumers’visual search methods and decision-making processes.

  • eye tracking
  • ecommerce live broadcast
  • visual attention

1. Introduction

The United Nations launched 17 Sustainable Development Goals (SDGs) in 2015, including 169 targets that can be grouped into three pillars of sustainable development: economy, society, and the environment [1][2][1,2]. Interdisciplinary cooperation with other parties is required to achieve the SDGs, as the goals relate to all aspects of human endeavors, and in this context, business operations play an important role in supporting the achievement of the goals, as they involve many business activities that contribute to the improvement of living standards [3][4][5][6][3,4,5,6].
The impacts of the COVID-19 pandemic on governments, industries, and all business activities worldwide have seriously challenged the achievement of the SDGs and increased the complexity of the intersecting SDGs [7][8][7,8]. Due to this impact, the need for digital technology applications has increased, and the study published in The World in 2050 (TWI205), Transformations to Achieve the Sustainable Development Goals, recommends that the world adopt the United Nations SDGs as the basis for immediate key transformations [9]. Under the influence of COVID-19, many brick-and-mortar shops and businesses closed down; however, online businesses benefited from the growth of the “Stay-at-Home Economy” [10][11][12][10,11,12]. As a result, existing online business models flourished around the world during the pandemic, giving rise to a new business model: live ecommerce platforms, which share the goal of sustainable economic growth with SDG 8. This reinforces the fact that during the pandemic, going out has been reduced but not spending, and online and offline commerce can play complementary roles [13][14][15][13,14,15].
The main consumption scenario of consumers has changed from brick-and-mortar shops to business activities in an electronic form combining the internet and commerce (ecommerce), meaning the T2O (TV to online) business model, which links television programs with ecommerce to create an innovative model of combining live streaming with ecommerce. This business model has brought a new shopping experience to consumers and created the so-called “internet celebrity economy”, which has created opportunities for the development of live ecommerce [16]. Live ecommerce is a new business model driven by real-time interactions, which effectively stimulates consumers and develops consumers into fans to follow the live broadcast through entertaining live content that meets their needs [17][18][19][17,18,19], thus bringing money-catching (eye-catching) traffic to the live shows and, in turn, providing a new channel of interaction and communication between consumers and merchants [20]. Consequently, ecommerce platforms have been actively joining live ecommerce in recent years [21]; for example, Taobao, JD.com, Jumei Youpin, and Facebook fan pages have all launched live ecommerce features [22][23][22,23].
According to MIC’s survey data on live streaming in Taiwan, 31.3% of Taiwanese netizens follow a specific live streamer (a user who plays the role of a live stream host), and 78.4% of those who watch video and audios online have watched a live broadcast [24]. However, live streaming is not just a one-sided personal showcase for the live streamer, it also includes the interactions and reactions of the live viewers to the content of the live stream, which together form part of the live ecommerce platform. Thus, how to make consumers willing to learn about brands and products, share their ideas and needs, and receive immediate responses from live streamers have become the most important issues for live ecommerce to create a new commerce model different from the traditionally static way of displaying ecommerce products [25]. Most existing study methods of ecommerce live streaming are based on questionnaires or interviews, which are used to infer the participants’ mental activities. However, these study methods may be biased by the participants’ memory errors, introspective correctness errors, or subconscious attempts to conform to social expectations, which may result in the research data not being detailed enough or not being able to observe the overall cognitive process, causing invalid data and experimental bias [26]
Due to the ease of use and maturity of eye tracking technology [27][28][27,28], eye tracking research has been used in a wide range of research fields since its inception for reading behavior analyses, such as cognitive science, psychology, human factors engineering, human–computer interaction interfaces, virtual reality, digital learning, artificial intelligence, machine learning and so on [29][30][31][29,30,31]. Eye tracking technology uses the principle of image processing to capture the infrared rays reflected from the pupil and records eye movements using a special camera that locks onto the eyes to analyze the eye tracking process. The observation and analysis of eye movements also facilitate the study of users’ perceptual recognition or attention distribution, thus, inferring their cognitive processes [32][33][32,33]. Fixation and Saccade are two of the most commonly used methods to process data in eye tracking studies [34][35][36][34,35,36]. “Fixation” is defined as the state of visual attention to a specific area between one eye movement and the next, and is the primary means of receiving visual information. “Saccade” is defined as a continuous and rapid movement between eye fixation points, meaning a rapid eye movement that leads to a specific visual target [37][38][39][37,38,39]. The sequence of eye movements, which includes saccade and fixation, is known as the scan path. In addition, in the process of studying eye tracking technology, in order to record and analyze the eye movements of users’ browsing behavior in a physiologically and psychologically unaffected state, researchers must set regions of interest (ROIs) to observe and collect eye fixation and visual behavior at certain locations [40][41][42][43][44][40,41,42,43,44]. Based on the interactions of the ROIs and visual information, the latency of first fixation (LFF), duration of first fixation (DFF), total fixation durations (TFD), and number of fixations (NOF) are commonly used as evaluation criteria for processing eye movement data [45][46][45,46]. In recent years, the rapid development of the internet has led to the explosive growth of ecommerce, resulting in innovative models for live ecommerce. By recognizing the importance of the behavioral process, some researchers have used eye trackers to record the order in which consumers browse products, as well as the length of time they spend on messages in a particular browsing task or behavior, in order to understand their visual search methods and decision-making processes.

2. Business Model and Studies of Live Ecommerce

With the popularity of mobile vehicles, the rapid development of 5G, and the rise of the OTT (over-the-top) audio and video industry, live streaming applications have formed a whirlwind, and the in-depth combination of online live streaming and ecommerce has attracted a large amount of traffic and business opportunities for live ecommerce [47]. In view of this, many ecommerce companies are hoping to use the appeal, performance, and influence of internet celebrities to promote their products, thus, creating the so-called internet celebrity economy and increasing the likelihood of potential consumers [48]. The first year of the “live streaming + ecommerce” model was 2016, in which internet celebrities used their image, appeal, or influence to deepen consumers’ impressions of their products and brands, thus, turning their popularity into online profits, which creates a win–win economic model for ecommerce, sellers, and internet celebrities [49]. In recent years, many scholars have used questionnaires or interviews to explore how live ecommerce, meaning the new form of online media with interactive and real-time characteristics, increases consumers’ urges to buy and consume, as well as their desire to experience and be entertained, based on the premise that live ecommerce can attract consumers’ attention and satisfy their shopping needs through the TV to Online mode and TV Shopping [49][50][49,50]. Such studies are both effective and instructive; for example, one study showed that three types of motivations, namely “information motivation”, “social interaction motivation”, and “entertainment motivation”, have a positive impact on the frequency of use of live ecommerce. Moreover, the extroversion and openness of the users have a positive predictive effect on the motivation for, behavior in, and satisfaction level with live ecommerce. Using a structural equation model, a post-acceptance and continuous adoption model of information systems was used to explore user behavior on Twitch, which is a live webcasting platform, and the results showed a significant effect of cognitive interactivity on perceived usefulness, a significant effect of perceived usefulness on satisfaction and intention to continue using, and a significant effect of satisfaction on intention to continue using. Previous studies have indicated that highly motivated community participants are more likely to generate brand trust, which in turn increases the psychological attachment of fans to brands and leads to their long lasting relationship with the fan page [8]. It was also found that a live presence had a significant positive effect on online interactivity and message credibility, online interactivity and message credibility had a significant positive effect on identity, and identity and affordability had a significant positive effect on consumers’ purchase intention, where affordability mediated the effect of identity and purchase intention. A social motivation model with eight factors was constructed based on the use and satisfaction model to explore the level of engagement of four-sided live viewers, and the findings showed that less-viewed live platforms had a higher social engagement motivation than more-viewed live platforms [51][52][51,52].

3. Eye Tracking Technology and Related Studies

For most human beings, 80% of information processing relies on vision; thus, studying eye movements is considered to be the most effective tool in visual information processing and the most important source of sensory information in cognitive processing [36]. The main reason is that eye movements can be effectively used to capture the complex cognitive processes of human visual information, meaning they can be used as a technique to locate where people are looking [53][54][55][53,54,55]. Unlike previous cognitive processes for individuals, eye movements can also effectively characterize the information processed during the reading process and provide external behavioral indicators [56][57][58][56,57,58]. Therefore, eye tracking is considered to be a measurement technique that reflects the visual information process, helps researchers to record the fixation positions of the participants’ eyes at a given time, and displays the movement sequence and trajectories of the information. Eye tracking technology provides an important tool for the natural and online exploration of cognitive thinking, which reflects the correlation between eye movements and psychological changes in readers when receiving information [37]. This technique has been widely applied to understand the reading process, as well as other related topics and research experiments, such as eye movement characteristics, perceptual span, and information integration. Eye movement data are also used to examine the cognitive processes of different cognitive tasks, and in many studies related to eye tracking, total fixation durations, the number of fixations, and the sequence of fixations are the most frequently investigated variables [56][57][56,57]. When the eye is in fixation or saccade, it indicates visual attention. Eye movement processes, such as fixation duration, fixation position, and visual trajectory, can be used as bases for assessing whether visual behavior is being attended to [59]; for example, researchers have shown that the fixation durations and the number of fixations are directly related to preference, where shorter fixation durations or fewer fixations indicate a lower preference, while longer fixation durations or more frequent fixations indicate a higher preference [32][60][32,60]. An experiment was conducted with 33 participants using an eye-monitoring device to investigate the relationship between eye tracking and landscape preference, attention, image characteristics, and the number of fixations, and the results showed that the total number of fixations was influenced by preference, and that preference factors varied with personal factors, such as professional background or sex [32][34][61][32,34,61].

4. Eye Tracking Technology and Visual Attention

Recent advances in computer processing technology have led psychologists to believe that the human psychological function is similar to that of computers [62][63][62,63], i.e., the structure and process of human psychological functioning can be understood through the way that computers process information [64][65][64,65]. At the same time, the relationship between the perceptual information processing theory and cognitive processes has led many researchers to conclude that eye tracking technology is the most direct and effective way to study visual information processing, observe the reading patterns and visual attention of different learners, and even as a tool for teachers to diagnose learning disabilities [66]. The eye movements commonly observed and recorded in eye tracking are Fixation and Saccade. The Scan path refers to a series of fixation points and sweeps, which are the conscious eye movements associated with attention shifting, higher-level memory, and comprehension of cognitive processes. Another study investigated the relationship between students’ reading comprehension and visual attention when reading Chinese sentences with misplaced words by combining eye tracking technology and an electroencephalogram, and the results showed that misplaced words did not affect reading comprehension and that increasing the number of misplaced words in a sentence did not affect the fixation duration. An empirical analysis that used eye tracking technology was conducted to investigate how graphic design in e-books with a high and low correlation affected learners’ visual behavior and learning outcomes when learning single Spanish words [67][68][69][70][71][67,68,69,70,71]. Eye tracking technology was used to investigate the differences in the order of seeing and reading, as well as the cognitive span of native Japanese learners, when they viewed words in their first language (L1) and second language (L2). The empirical results of the study showed that for L2 readers, reading was more important than seeing, and there was a difference in parafoveal processing between L1 readers and L2 readers when reading target words in sentences [72][73][72,73]. In addition, in order to understand the effectiveness and concentration of learners in game-based learning, a study on the design and development of digital games using eye tracking technology found that visual trajectories were used to analyze learners’ learning processes, and that incorporating game features with rules and objectives in the games was effective in capturing learners’ attention [74][75][76][74,75,76]. Visual behavior, processes, and achievements in game-based learning (GBL) have been used to explore the differences between players with medium–high and low conceptual comprehension in visual behavior and game process in GBL using eye tracking technology [77]. The results suggested that players in the high comprehension group showed effective text reading strategies and better metacognitive control of visual attention during gameplay. Another study explored the impact of using a simulation environment with animated agents on the visual attention, emotion, performance, and perception, in order to assess how animated agents of emotion in simulation-based training affected the performance outcomes and perceptions of individuals interacting with the training application in real-time. The results of the study showed that both experienced and novice participants focused more visual attention on the animated agents than on other defined regions of interest in the simulation environment [78]. While multiple choice (MC) based on visual attention is an important form of testing to assess students’ academic performance in eLearning, MC question evaluation indices (e.g., correctness rate) only consider the correctness of the final choice but ignore the process that led the participants to select the answer. The experimental results showed that including this measure could reflect differences in fixation movements and help teachers infer the true academic level of students [79]. Eye tracking has been used to study students’ visual attention to solve upper-level physics questions, and to identify the differences in understanding and cognitive processing when solving questions in a graphical and abstract mathematical manner. The fixation patterns and associated eye-tracking measures suggested that the two visual strategies had different cognitive processes, and the different strategies led to different fixation patterns and learning outcomes [80]. A comparative analysis of Facebook online advertising attention using eye tracking technology was conducted to explore the eye tracking indices and Facebook advertising attention behavior [81]. VR eye tracking was used to explore how prior knowledge affected the visual attention and learning outcomes for Japanese mimicry and onomatopoeia, and the results of the study showed that learning in VR improved attention to learning [82][83][84][85][82,83,84,85]. A study investigating the educational effects of anthropology through a GBL approach was carried out to learn about the association between game immersion and visual attention distribution. The eye tracking technology showed that students who played anthropology in an immersive manner were more focused on the played character [86]. In a study of viewers’ visual attention to subtitles in home shopping broadcasts, eye tracking technology was used to obtain objective data on visual attention to home shopping subtitles and to determine the factors that draw the visual attention to subtitles on television home shopping screens. In addition, an eye tracking device was used to propose an effective method of producing and directing home shopping subtitles [87].