• DOI: 10.1016/j.compedu.2018.06.023
  • Corpus ID: 51881376

A systematic review of eye tracking research on multimedia learning

  • Ecenaz Alemdag , K. Cagiltay
  • Published in Comput. Educ. 1 October 2018
  • Education, Computer Science
  • Comput. Educ.

263 Citations

A review of eye tracking research on video-based learning, a narrative review of eye-tracking research on self-regulated learning from multimedia learning materials in higher education.

  • Highly Influenced

A systematic literature review for the use of eye-tracking in special education

Eye-tracking and metacognitive skills: a review on the use of eye-tracking for measuring students’ metacognitive skills in chemistry learning, using eye tracking technology to analyse cognitive load in multichannel activities in university students, a scoping review of webcam eye tracking in learning and education, a systematic review of eye-tracking technology in second language research, looking through the model’s eye: a systematic review of eye movement modeling example studies, using eye tracking to understand learners' attention in online live teaching, eye tracking as feedback tool in physics teacher education, 88 references, a review of using eye-tracking technology in exploring learning from 2000 to 2012, the use of eye tracking as a research and instructional tool in multimedia learning, the use of eye movements in the study of multimedia learning., eye-tracking as a research methodology in educational context: a spanning framework, using eye tracking to study on attention and recall in multimedia learning environments: the effects of design in learning, an eye-tracking study of cueing effects in multimedia learning, eye tracking as a tool to study and enhance cognitive and metacognitive processes in computer-based learning environments, the use of eye-gaze to understand multimedia learning, tracking learners' visual attention during a multimedia presentation in a real classroom, an examination of cognitive processing of multimedia information based on viewers' eye movements, related papers.

Showing 1 through 3 of 0 Related Papers

A systematic review of eye tracking research on multimedia learning

New citation alert added.

This alert has been successfully added and will be sent to:

You will be notified whenever a record that you have chosen has been cited.

To manage your alert preferences, click on the button below.

New Citation Alert!

Please log in to your account

Information & Contributors

Bibliometrics & citations, view options.

  • Kennel K Ruzika S (2024) Real-time Prediction of Students' Math Difficulties using Raw Data from Eye Tracking and Neural Networks Proceedings of the ACM on Human-Computer Interaction 10.1145/3655607 8 :ETRA (1-19) Online publication date: 28-May-2024 https://dl.acm.org/doi/10.1145/3655607
  • Zheng X Jiang Y Cheng H Nie A (2024) How to Arrange Texts and Pictures for Online Visitors — Comparing Basic Ceramic Display Forms with Eye Tracking Journal on Computing and Cultural Heritage 10.1145/3639938 17 :2 (1-23) Online publication date: 26-Mar-2024 https://dl.acm.org/doi/10.1145/3639938
  • Luo F Liu R Awoyemi I Crawford C Nasrin F Stephenson B Stone J Battestilli L Rebelsky S Shoop L (2024) Novel Insights into Elementary Girls' Experiences in Physiological Computing Proceedings of the 55th ACM Technical Symposium on Computer Science Education V. 1 10.1145/3626252.3630900 (764-770) Online publication date: 7-Mar-2024 https://dl.acm.org/doi/10.1145/3626252.3630900
  • Show More Cited By

Index Terms

Applied computing

Enterprise computing

Computing methodologies

Artificial intelligence

Philosophical/theoretical foundations of artificial intelligence

Human-centered computing

Human computer interaction (HCI)

Information systems

Information systems applications

Multimedia information systems

Social and professional topics

Professional topics

Computing education

Recommendations

Eye movements predict students' computer-based assessment performance of physics concepts in different presentation modalities.

Despite decades of studies on the link between eye movements and human cognitive processes, the exact nature of the link between eye movements and computer-based assessment performance still remains unknown. To bridge this gap, the present study ...

A review of eye tracking research on video-based learning

Eye tracking technology is increasingly used to understand individuals’ non-conscious, moment-to-moment processes during video-based learning. This review evaluated 44 eye tracking studies on video-based learning conducted between 2010 and 2021. ...

A head-mounted sensor-based eye tracking device: eye touch system

In this study, a new eye tracking system, namely Eye Touch, is introduced. Eye Touch is based on an eyeglasses-like apparatus on which IrDA sensitive sensors and IrDA light sources are mounted. Using inexpensive sensors and light sources instead of a ...

Information

Published in.

Elsevier Science Ltd.

United Kingdom

Publication History

Author tags.

  • Multimedia/hypermedia systems
  • Human-computer interface
  • Evaluation methodologies
  • Teaching/learning strategies
  • Research-article

Contributors

Other metrics, bibliometrics, article metrics.

  • 32 Total Citations View Citations
  • 0 Total Downloads
  • Downloads (Last 12 months) 0
  • Downloads (Last 6 weeks) 0
  • Hwang E Lee J (2024) Attention-based automatic editing of virtual lectures for reduced production labor and effective learning experience International Journal of Human-Computer Studies 10.1016/j.ijhcs.2023.103161 181 :C Online publication date: 1-Jan-2024 https://dl.acm.org/doi/10.1016/j.ijhcs.2023.103161
  • Villalobos M Videla R Grasso-Cladera A Costa-Cordella S Rossi A Fuchs N Parada F (2023) Mobile Brain/Body Imaging Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems 10.1177/10597123211072613 31 :5 (423-448) Online publication date: 1-Oct-2023 https://dl.acm.org/doi/10.1177/10597123211072613
  • Sparmann S Hüsing S Schulte C (2023) JuGaze: A Cell-based Eye Tracking and Logging Tool for Jupyter Notebooks Proceedings of the 23rd Koli Calling International Conference on Computing Education Research 10.1145/3631802.3631824 (1-11) Online publication date: 13-Nov-2023 https://dl.acm.org/doi/10.1145/3631802.3631824
  • Sharma K Giannakos M (2023) Carry-Forward Effect: Early scaffolding learning processes Proceedings of the 2023 Symposium on Learning, Design and Technology 10.1145/3594781.3594786 (43-52) Online publication date: 23-Jun-2023 https://dl.acm.org/doi/10.1145/3594781.3594786
  • Knippertz L Münz A Ruzika S (2023) Automated Detection of Geometric Structures in Gaze Data Proceedings of the 2023 Symposium on Eye Tracking Research and Applications 10.1145/3588015.3590108 (1-2) Online publication date: 30-May-2023 https://dl.acm.org/doi/10.1145/3588015.3590108
  • Yang F Wang H (2023) Tracking visual attention during learning of complex science concepts with augmented 3D visualizations Computers & Education 10.1016/j.compedu.2022.104659 193 :C Online publication date: 1-Feb-2023 https://dl.acm.org/doi/10.1016/j.compedu.2022.104659
  • Zhang S de Koning B Paas F (2023) Effects of mouse pointing on learning from labeled and unlabeled split-attention materials Computers in Human Behavior 10.1016/j.chb.2023.107673 143 :C Online publication date: 1-Jun-2023 https://dl.acm.org/doi/10.1016/j.chb.2023.107673

View options

Login options.

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Share this publication link.

Copying failed.

Share on social media

Affiliations, export citations.

  • Please download or close your previous search result export first before starting a new bulk export. Preview is not available. By clicking download, a status dialog will open to start the export process. The process may take a few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress. Download
  • Download citation
  • Copy citation

We are preparing your search results for download ...

We will inform you here when the file is ready.

Your file of search results citations is now ready.

Your search export query has expired. Please try again.

A systematic review of eye tracking research on multimedia learning

  • Ecenaz Alemdag Ecenaz Alemdag
  • Kursat Cagiltay Kursat Cagiltay
  • 1 October 2018
  • journal article
  • review article
  • Published by  Elsevier  in  Computers & Education
  • Vol. 125 , 413-428
  • https://doi.org/10.1016/j.compedu.2018.06.023
  • MULTIMEDIA/HYPERMEDIA SYSTEMS
  • HUMAN-COMPUTER INTERFACE
  • EVALUATION METHODOLOGIES
  • TEACHING/LEARNING STRATEGIES

This publication has 63 references indexed in Scilit:

  • Tracking learners' visual attention during a multimedia presentation in a real classroom Computers & Education, 2013
  • Comprehension of multimedia health information among older adults with chronic illness Visual Communication, 2012
  • An examination of cognitive processing of multimedia information based on viewers' eye movements Interactive Learning Environments, 2011
  • Using eye-tracking technology to investigate the redundant effect of multimedia web pages on viewers’ cognitive processes Computers in Human Behavior, 2011
  • Research trends in mobile and ubiquitous learning: a review of publications in selected journals from 2001 to 2010 British Journal of Educational Technology, 2011
  • Explaining the modality and contiguity effects: New insights from investigating students' viewing behaviour Applied Cognitive Psychology, 2009
  • Visual representations in science education: The influence of prior knowledge and cognitive load theory on instructional design principles Science Education, 2006
  • Does the modality principle hold for different media? A test of the method‐affects‐learning hypothesis Journal of Computer Assisted Learning, 2006
  • Eye movements in reading and information processing: 20 years of research. Psychological Bulletin, 1998
  • Constructing Mental Models of Machines from Text and Diagrams Journal of Memory and Language, 1993

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • v.6(11); 2020 Nov

Logo of heliyon

Multimedia tools in the teaching and learning processes: A systematic review

M.d. abdulrahaman.

a Department of Information and Communication Science, University of Ilorin, Ilorin, Nigeria

b Department of Telecommunication Science, University of Ilorin, Ilorin, Nigeria

A.A. Oloyede

N.t. surajudeen-bakinde.

c Department of Electrical and Electronics Engineering, University of Ilorin, Ilorin, Nigeria

L.A. Olawoyin

O.v. mejabi, y.o. imam-fulani.

d Department of Religions, Faculty of Arts, University of Ilorin, Ilorin, Nigeria

e Department of Mass Communication, University of Ilorin, Ilorin, Nigeria

Access to quality education is still a major bottleneck in developing countries. Efforts at opening the access to a large majority of citizens in developing nations have explored different strategies including the use of multimedia technology. This paper provides a systematic review of different multimedia tools in the teaching and learning processes with a view to examining how multimedia technologies have proven to be a veritable strategy for bridging the gap in the provision of unrestricted access to quality education and improved learners' performance. The review process includes conducting an extensive search of relevant scientific literature, selection of relevant studies using a pre-determined inclusion criteria, literature analysis, and synthesis of the findings of the various studies that have investigated how multimedia have been used for learning and teaching processes. The review examines various case study reports of multimedia tools, their success and limiting factors, application areas, evaluation methodologies, technology components, and age groups targeted by the tools. Future research directions are also provided. Apart from text and images, existing tools were found to have multimedia components such as audio, video, animation and 3-D. The study concluded that the majority of the multimedia solutions deployed for teaching and learning target the solution to the pedagogical content of the subject of interest and the user audience of the solution while the success of the different multimedia tools that have been used on the various target groups and subjects can be attributed to the technologies and components embedded in their development.

Education, Media in education, Teaching/learning strategies, Pedagogical issues, Systematic review

1. Introduction

Multimedia is a combination of more than one media type such as text (alphabetic or numeric), symbols, images, pictures, audio, video, and animations usually with the aid of technology for the purpose of enhancing understanding or memorization ( Guan et al., 2018 ). It supports verbal instruction with the use of static and dynamic images in form of visualization technology for better expression and comprehension ( Alemdag and Cagiltay, 2018 ; Chen and Liu, 2008 ). The hardware and software used for creating and running of multimedia applications is known as multimedia technology ( Kapi et al., 2017 ). Multimedia technology has some characteristics like integration, diversity, and interaction that enable people to communicate information or ideas with digital and print elements. The digital and print elements in this context refer to multimedia-based applications or tools used for the purpose of delivering information to people for better understanding of concepts.

Indeed, various aspects of human endeavours, especially the educational sector, are being transformed by the advent of Information and Communication Technology (ICT). ICT involves the use of hardware and software for the purpose of collecting, processing, storing, presenting, and sharing of information mostly in digital forms. Multimedia technology is an important aspect of ICT that deals with how information can be represented and presented digitally, using different media such as text, audio, video, among others ( Guan et al., 2018 ). It involves the combination of several technologies provide information in the best possible formats, packages, and sizes.

However, when used in the classroom or for educational purposes, the design quality and sophistication of multimedia application must be high enough to combine the different elements of the cognitive processes so as to achieve the best mimicking of the teacher. There are different types of multimedia applications available in the market today. These applications have been deployed for different educational purposes such as the works deployed for Mathematics classes, Social Sciences, Sciences, Physiology, Physics and Physical Education Studies ( Al-Hariri and Al-Hattami 2017 ; Anderson, 1993 ; Chen and Liu, 2008 ; Chen and Xia, 2012 ; Ilhan and Oruc, 2016 ; Jian-hua & Hong, 2012 ; Milovanovi et al., 2013 ; Shah and Khan, 2015 ).

The central problem, however, remains the same. Which is, the problem of how to use the applications to provide students with stimulating experience by delivering information for better understanding of concepts. While it is important to develop various applications for effective teaching delivery, each of these applications has its own focus area, peculiarities, target age, merits and demerits. Thus, the taxonomy and component synthesis for the development of the multimedia application need to be extensively investigated as these would affect the teaching delivery, learning and wider applicability. Some of the multimedia solutions have been deployed, tested and recorded significant success, while some did not record marginal success.

The success stories also vary with location, target age and deployment purposes. Therefore, the aim of this paper is to provide a systematic review of the scientific published studies that examined different multimedia tools in the teaching and learning process with a view to identifying the existing multimedia-based tools, understanding their usage, application areas and impacts on education system. In order words, the study, through a systematic review of literature, aims at identifying the existing multimedia-based tools for teaching and learning; understanding their usage and limiting factors, application areas, evaluation methodologies, technology components synthesis and impacts on education system.

To this end, the study is guided by the following research questions:

  • (1) What are the existing multimedia tools in teaching and learning?
  • (2) What type of multimedia component fits an audience?
  • (3) What types of multimedia components are adopted in the existing tools?
  • (4) What evaluation methodologies are useful for successful outcome?
  • (5) What factors aid success or failure in the use of multimedia tools for teaching and learning?

The outcome of this study is aimed at serving as a guide for teachers and education administrators while selecting multimedia tools and applications for teaching in schools. So, in this study, the taxonomy and component synthesis of some widely cited multimedia applications are provided. Various case studies and results are examined. Furthermore, barriers limiting the usage of ICT and multimedia in teaching and learning are identified; and some unresolved cases and future research decisions are outlined.

The subsequent parts of this paper include Section 2 , which is the literature review that examines multimedia technology and its place in teaching and learning; Section 3 , the research methodology; Section 4 , presentation of results; Section 5 , discussion of the findings; and Section 6 , the conclusion, recommendations and suggestions for future work.

2. Literature review

2.1. multimedia learning and teaching: concepts and resources.

Multimedia or digital learning resources assist learners to get on well with mental representations with the use of different media elements, which support information processing. Information, which is made up of content and sometimes learning activities, are presented with the use of the combination of text, image, video and audio by digital learning resources. It has been demonstrated, by research on using multimedia for learning, that there are more positive results observed in learners who combine picture and words than those who use words only ( Chen and Liu, 2008 ; Mayer, 2008 ). As stated in Eady and Lockyer (2013) , different pedagogy methods were implemented by the use of digital resources. Their paper presented how the authors were able to introduce topics to students, demonstrate to them, stimulate a group, make different text types available and engage students in an interactive manner.

Generally speaking, multimedia technology for educational purposes can be categorised according to whether they are used for teaching or for learning. Some of the different multimedia or digital learning resources are listed in Eady and Lockyer (2013) . Furthermore, according to Guan et al. (2018) , several studies have established the importance of multimedia technologies to education and the widespread adoption of multimedia tools. Multimedia generally involves the use of technology and the widespread adoption of multimedia applications in education is as a result of its many benefits ( Almara'beh et al., 2015 ). Some of the benefits of the multimedia application tools for teaching and learning are summarized as follows:

  • (1) Ability to turn abstract concepts into concrete contents
  • (2) Ability to presents large volumes of information within a limited time with less effort
  • (3) Ability to stimulates students' interest in learning
  • (4) Provides teacher with the ability to know students position in learning.

Multimedia designed for learning refers to the process of building mental representation from words and pictures in different contexts. They are designed to assist learning with tools which can be used in presentations, class room or laboratory learning, simulations, e-learning, computer games, and virtual reality, thereby allowing learners to process information both in verbal and pictorial forms ( Alemdag and Cagiltay, 2018 ). Multimedia designed for learning requires understanding of some theories such as cognitive theory of multimedia learning, which postulates three assumptions that describe how people learn from instructional multimedia materials. These assumptions can be phrased as dual-channel, limited capacity, and active processing ( Alemdag and Cagiltay, 2018 ). Dual-channel assumes that learners have many channels to separate visual and auditory information. The restricted/limited capacity assumes that there is a limit to the load of data that can be processed in each channel. Understanding these will allow teachers not overwhelming learners with much information. On the other hand, learners will be aware of their information processing limitations or capabilities. Active processing proposes that when it comes to information selection, organization, and integration, human beings are active agents and are capable of managing the forms of information they are interacting with.

The appropriate use of ICT in teaching transforms the learning environment from teacher-centred to learner-centred ( Coleman et al., 2016 ) just as it is transforming all aspects of human life ( Guan et al., 2018 ). Coleman et al., (2016) emphasised that the shifting from teaching to learning creates a student-centred learning where teachers are there as facilitators and not sages on the stages, thus changing the role of the teacher from knowledge transmitter to that of a facilitator, knowledge navigator and a co-learner. Keengwe et al., (2008a) concluded that the application of multi-media technologies ensures a very productive, interesting, motivating, interactive and quality delivery of classroom instruction while addressing diverse learners' needs.

2.2. Role of multimedia technology in teaching and learning

Technology is evolving and scholars in the areas of Information Technology (IT) and education technology are continuing to study how multimedia technologies can be harnessed for the enhancement of teaching and learning. A software tool can be used to expand teaching and learning in various fields. It is important to provide students with practical experience in most fields of learning.

The importance of multimedia technologies and applications in education as a teaching or learning tool cannot be over emphasized. This has been confirmed in several studies that have investigated the impact of multimedia technology to the education system. Milovanovi et al. (2013) demonstrated the importance of using multimedia tools in Mathematics classes and found that the multimedia tool greatly enhances students' learning. Several works exist that show that multimedia enhances students' learning ( Aloraini, 2012 ; Al-Hariri and Al-Hattami, 2017 ; Barzegar et al., 2012 ; Chen and Xia 2012 ; Dalacosta et al., 2009 ; Jian-hua & Hong, 2012 ; Janda, 1992 ; Keengwe et al., 2008b ; Kingsley and Boone, 2008 ; Shah and Khan, 2015 ; Taradi et al., 2005 ; Zin et al., 2013 ).

Multimedia communication has close similarities to face-to-face communications. It is less restricted than text and ensures better understanding ( Pea, 1991 ). Multimedia technology helps simplify abstract content, allows for differences from individuals and allows for coordination of diverse representation with a different perspective. The use of the computer-based technique as an interface between students and what they are learning with suitable fonts and design can be very valuable.

Certainly, multimedia technology brings about improvement in teaching and learning, however, there are a number of limitations in this technology for educational purposes. Some of these limitations include unfriendly programming or user interface, limited resources, lack of required knowledge and skill, limited time and high cost of maintenance among others ( Al-Ajmi and Aljazzaf, 2020 ; Putra, 2018 ).

2.3. Multimedia evaluation techniques

Evaluation entails assessing whether a multimedia programme fulfils the purposes set including being useful for its target audience. Kennedy and Judd (2007) make the point that developers of multimedia tools have expectations about the way they will be used which could be functional (focused on the interface) or educational (involving the learning designs, processes and outcomes). It is important to note that there are different methods used in the evaluation of multimedia and most evaluations entail experiments, comparisons and surveys. The primary goal is to balance assessment validity with efficiency of the evaluation process ( Mayer, 2005 ).

Survey research has two common key features – questionnaires (or interviews) and sampling, and is ideally suited for collecting data from a population that is too large to observe directly and is economical in terms of researcher time, cost and effort when compared to experimental research. However, survey research is subject to biases from the questionnaire design and sampling including non-response, social desirability and recall and may not allow researchers to have an in-depth understanding of the underlying reasons for respondent behaviour ( West, 2019 ; Kelley et al., 2003 ).

Generally, comparison studies follow the format of comparing outcome from an experimental group using the multimedia being evaluated against a control group. This method has been criticised for having inadequate treatment definition, not specifying all treatment dimensions and failure to measure treatment implementation, among others ( Yildiz and Atkins, 1992 ).

Faced with the subjective nature of surveys and the limitations from comparison studies, eye tracking and other student behaviour such as emotional response, provides information not consciously controlled by the student or researcher and is used as an objective data gathering technique. Eye tracking research is a multi-disciplinary field that tracks eye movements in response to visual stimuli ( Horsley et al., 2014 ). Data from eye-tracking allows researchers to validate empirically and objectively, how learners comprehend the multimedia content, the attention of the learner while analysing the multimedia content, and the cognitive demand of the content ( Molina et al., 2018 ). Eye tracking is quite interesting as it provides a useful source of information in the case of children. This is because gathering information using the traditional techniques is more difficult especially when it involves children's interests and preferences ( Molina et al., 2018 ).

Earlier attempts at analysing student behaviour while engaging with online material included analysing student access computer logs, and the frequency of participation and duration of participation ( Morris et al., 2005 ). Nie and Zhe (2020) demonstrated that the conventional method of manually analysing student behaviour is gradually becoming less effective compared to online classroom visual tracking. They found that the online classroom visual tracking behaviour can be divided into several components: selection, presentation, mapping, analysis and collection, as well as the analysis from students' facial expression.

Several works exist that use student behaviour tracking to examine how students interact with multimedia learning tools. For instance, Agulla et al. (2009) , incorporated in a learning management system (LMS), student behaviour tracking that provided information on how much time the student spent in front of the computer examining the contents. They did so through the use of face tracking, fingerprint and speaker verification. Alemdag and Cagiltay (2018) conducted a systematic review of eye-tracking research on multimedia learning and found that while this research method was on the rise it was mainly used to understand the effects of multimedia use among higher education students. They also identified that although eye movements were linked to how students select, organise and integrate information presented through multimedia technologies, metacognition and emotions were rarely investigated with eye movements.

Molina et al. (2018) used eye-tracking in evaluating multimedia use by primary school children. Some studies have used a combination of eye tracking data and verbal data in order to gain insight into the learners' cognitions during learning and how they perceived the learning material ( Stark et al., 2018 ).

As much as eye-tracking and other behavioural research present opportunity for objective evaluation, difficulty of interpretation is one of the limitations of eye-movement data ( Miller, 2015 ), and it is not surprising that the traditional methods of evaluation through questionnaire administration and surveys are still commonly used.

3. Research methodology

This study adopted a research design that involves a searching method for identifying the articles to be reviewed for solving a specific research problem. It includes a systematic review of the article contents for analysis and synthesis. The systematic review follows the procedure outlined in the Preferred Reporting Items for Systematic Reviews and Meta-analysis for Protocol (PRISMA-P) 2015 guideline as provided in the work of Moher et al. (2015) , an extension of Liberati et al. (2009) . The guideline is to facilitate a carefully planned and documented systematic review in a manner that promotes consistency, transparency, accountability and integrity of review articles. Although it was originally developed for the analysis of health related studies, it is now widely adopted in other fields of study. Furthermore, the study involves protocol that includes identifying the data sources for the search, the keywords for the search and the inclusion criteria. To aid in synthesis of the identified articles, key points from the articles are summarised in tables and quantifiable components are analysed.

3.1. Data sources

The quality of a systematic review starts with the data sources used for identifying the articles to be selected for the review. This requires a thorough search and scrutiny of existing literatures from variety of academic databases and journals. The academic databases and journals considered for this review include Science Direct, IEEE Explore, ACM Digital library, Google Scholar, Springer, Wiley Online Library, Taylor & Francis, EBSCOHOST, Web of Science, and Scopus. These databases are reputable bibliographic sources and journals or conference papers indexed in them are deemed reputable and of good quality.

3.2. Search keywords

In order to ensure appropriate primary search terms are used and relevant papers are carefully selected for the review purpose, the literature search method of Kitchenham et al. (2009) was adopted. While it is expected that searching on a main string should be sufficient for the query output to collect all related papers, this is not the case always; hence the inclusion of substrings. Some problems associated with the databases used for the study are:

  • • Some do not have automatic root recognition
  • • Some have limitation of how many words to use e.g. IEEE, 15 words
  • • Some databases offer advanced or expert search
  • • ACM, IEEE and others do not have anything, not even a precedence rule.

The search terms for relevant literatures in the academic databases and journals specified in section 3.1 , are: “multimedia”, “multimedia technology”, “multimedia technology + Education”, “ICT impact + Education”, “multimedia tools + Education”, “multimedia + Teaching”, “multimedia + Learning”, “Application Software + Education”, and “Digital + Education”.

3.3. Inclusion and exclusion criteria

For the purpose given, each of the articles from the consulted academic databases and libraries had an equal chance of being selected. In order to avoid bias in the selection, a clear principle was set and adopted to form the criteria for inclusion of papers. These criteria are presented in Table 1 .

Table 1

Inclusion criteria of articles.

S/NInclusion Criteria
1.The study focuses on multimedia technology, tools, software, application, impact, and deployment in education system
2.The article was peer reviewed and published in reputable journal or conference
3.The study is written in English language
4.The article is either a survey or research paper
5.The article is available online

Thus, the queries using the stated search strings led to a pool of 10,972 articles in the subjects of interest that were online and written in English. All publications found as at the time of the search, which was in May 2019, were included. Publication date constraint for including a paper in the study was not applied. The process of screening this pool of 10,972 articles to meet the purpose of the study is outlined in the next section.

3.4. Exclusion from pooled articles

The number of articles from the database keywords search were reduced in line with the elimination procedure outlined as follows:

  • i. elimination of paper based on unrelated title and elimination of duplications from various sources, leading to a reduction from 10,972 to 1,403;
  • ii. examination of the abstracts of the 1,403 articles and reduction from 1,403 to 505;
  • iii. elimination based on the direction of the article after reading through, leading to reduction from 505 to 78.

The elimination procedure is represented in Figure 1 which shows the flow of the procedure for screening the articles for the study.

Figure 1

Literature elimination process.

Table 2 provides a summary of the databases visited and the respective number of articles (from the final 78) that were obtained from that source.

Table 2

Search databases and number for articles.

S/NArticle SourcesURLNo. of ArticlesPercentage (%)
1.Science Direct 2532.1
2.Google Scholar 2025.6
3.IEEE Explore 1215.4
4.Springer 810.3
5.ACM Digital library 45.1
6.Web of Science 45.1
7.Taylor & Francis 22.6
8.EBSCOHOST 22.6
9.Wiley Online Library 11.3
Total =78100.0

Table 2 shows the percentage of the articles sourced from each academic database and reveals that Science Direct accounts for the highest number of the related articles with 25 (32%) papers, closely followed by Google Scholar 20 (26%) and IEEE Explore with 12 (15%) articles. Springer accounts for 8 articles, which represents 10% of the entire reviewed papers, while ACM Digital Library, Taylor and Francis, Web of science and EBSCOHOST contribute 4 (5%), 2 (3%), 4 (5%) and 2 (3%) respectively. The least paper is contributed by Wiley Online Library with one paper, which represents 1% of the entire papers reviewed for this study.

3.5. Data collection and synthesis of results

Based on the selection mechanism, 78 articles were shortlisted for analysis. Each article was reviewed and information extracted from it for tabulation. The information sought included the following: the type of multimedia tool used, the focus area of the tool, the technology that was deployed, the multimedia components used within the tool, how the tool was applied – whether for teaching or learning or both, the location where the tool was tested, and the target age on which the tool was tested. The researchers also tabulated impressions gleaned from the review in a “comments” column. If the tool was evaluated, then the evaluation methodology, target group, sample size, outcome, limitations of the methodology and whether or not the outcome could be generalized, were also presented.

In the next section, the insights from the articles reviewed are presented and some of the findings presented in tables for ease of analyses and synthesis.

After careful application of the procedures for selection as outlined in section 3 , each of the 78 shortlisted articles were subjected to a systematic review which involved extracting information as itemised in section 3.5 . Such information were tabulated for further analysis. Not all the articles were empirical based or contained the desired data items. Nineteen articles which were based on experimental work reported the details of the multimedia tool developed or deployed. Furthermore, 13 articles with details of the evaluation of the use of multimedia tools in teaching and learning were identified. Also revealed, were barriers to the use of multimedia. The findings from the systematic review are presented in this section.

The set of articles reviewed clearly emphasized the importance of multimedia technology to the improvement of teaching and learning environment. Several studies that have investigated the impact of ICT to education stated that multimedia technology has positive impact on the way teachers impart knowledge and the manner in which learners comprehend subject matters. The review also revealed that several multimedia-based tools exist, most of which are usually based on subject, field, age or level at various institutions of learning. In addition, some of the reviewed papers investigated the impact of teaching and/or learning with multimedia based instructional materials using descriptive, qualitative and quantitative research methods with different focus groups for both the pre-test and post-test conditions.

Nevertheless, despite the impact of multimedia tools on the improvement of teaching and learning activities, it could be counterproductive if the computer-based tools are not properly designed or the instructional materials are not well composed. The reviews showed that multimedia adoption in education requires adequate understanding of technology and multimedia types or components required to properly represent concepts or ideas. This implies that a teacher must understand the learners and know what technology or tool needs to be adopted at a given time for a set of targets. According to the reviews, the target groups determine the type of multimedia components employed while preparing instructional materials and the ways they are to be delivered. To provide context, a review of some of the analysed case studies are presented next.

Huang et al. (2017) explored the use of multimedia-based teaching materials that include three view diagrams (3D) and tangible 3D materials to teach 3D modelling course. This was aimed at determining the influence of multimedia technology in meta-cognitive behaviour of students. The authors employed lag sequential analysis as well as interview methods to examine the pattern transformation of students' meta-cognitive behaviour while solving problematic tasks. The evaluation results show that different teaching method and materials produce different meta-cognitive behaviours in student. The result further revealed that compare to traditional instructional instruments, using 3D tangible object in cognitive apprenticeship instruction stimulates more meta-cognitive behaviour. To teach an introductory course to control theory and programming in MATLAB, a video based multimedia guide was created by Karel and Tomas (2015) for distance learning students using Camtasia Studio 7 program. The software can record screen, edit video and create DVD menu. The impact of the multimedia aid tool was evaluated to be positive on the students based on the feedback.

Zhang (2012) created an online teaching and learning resource platform with interactive and integrated features. The platform was created with Macromedia Flash version 8.0, a form of Computer – Aided Drawing (CAD) software that is very easy to use. In an attempt to test student's professional cognition and operational skill cognition as well as learning satisfaction during learning phase, an experimentation technique that utilizes a non-equivalent pre-test and post-test control group was adopted. The evaluation revealed no significant difference between the groups in terms of professional cognition and operation skill cognition. However, it was noted that a significant difference exists in learning satisfaction, which shows a greater satisfaction in the coursework with multimedia Flash compare to that of the traditional learning method.

A web-based multimedia software is another popular educational tool designed to enhance teaching and learning. The major constraints of web-based learning are in its ability to provide personalised learning materials. Hwang et al., (2007) presented a web-based tool for creating and sharing annotations in their study. They then investigated the effect of the tool on learning using college students as a case study after four months of using the tool. The study concluded that there is value in understanding the use of collaborative learning through shared annotation. The paper also carried out a GEFT test on the students and concluded that there was no significant divergence between field – dependent and cognitive style students on the quantity of annotation. The paper also concluded that in the final examination, the tool provided a high motivation for students to study for their final exams.

Similarly, Bánsági and Rodgers (2018) developed a graphic web-based application in the educational sector for liquid – liquid extraction with the help of ternary phase diagram. The application allows chemical engineering students of the University of Manchester to draw liquid – liquid two – phase equilibrium curves and calculate mixture of phase separation among others. The application was put into use for testing purpose during which student usage figure as well as their opinions was sampled for both full – time taught and distance learning courses. The HTML 5, JavaScript, and Cascading Style Sheet (CSS) based application is interactive and easy to be used. In order to further analyse the web application developed, an iTeach questionnaire for the assessment of the efficiency of individual pedagogical approach was administered to students. The study revealed that students find the application useful as it has increased their level of understanding the course.

In order to teach students how to compose and continue delivering text based information in various media forms for current and emerging technologies, Blevins (2018) made students to search and analyse various multimedia technologies used in new media and capable of reflecting on their current and future works by adopting a scaffold project – based activities. The students were taught Augmented Reality (AR) software in a specific way with an assumption that such method will change next time students embark on AR project. After student's evaluation, the assumption was achieved even more than expected.

Ertugrul (2000) provided an overview of some lab view application software for teaching. The focus of the software was to seek for software use friendliness and compatibility faced by users. The paper provided recommendations for selection criterion. Even though the software applications have been found very useful and could compliment for conventional practical teaching particularly where there is shortage of laboratory facilities, the application is not suitable for engineering kind courses that requires hands on and intensive practical. Davies and Cormican (2013) identified the fundamental principles needed when designing a multimedia training tool or material for effective teaching and learning. The principles considered both students and an instructor's perspectives. Experiments were conducted in Ireland using a computer aided design (CAD) training environment. During data collection, mixed methods (i.e. interviews, surveys and a group discussion) were employed and findings showed that computer-based material is the most effective and popular way to learn. However, the costs, perceived lack of skill and insufficient support could be hindering factors.

The department of Computer Science in UiTMNegriSembian, developed three applications, namely, the Greenfoot, Visualization makes Array Easy (VAE) and e-TajweedYassin. The Greenfoot as a Teaching Tool in Object Oriented Programming is a tool that creates scenarios in order to ease visualization of 2D objects interaction in teaching object-oriented programming. The term “scenario” is used in Greenfoot to mean a project, and it has been used as a teaching aid for object-oriented programming (OOP) language introduction course. To ensure that a standard and quality application is built, the teaching aid was developed using System Development Life Cycle (SDLC). The Greenfoot-based scenario shows a great improvement in visualization and object element interaction and an impressive engagement of students during learning process. The application also provides clear illustration of object-oriented concepts to students and enabled them develop a game-like application from the scenario provided.

The Visualization makes Array Easy (VAE) on the other hand was created using the ADDIE model which is made up of Analysis, Design, Develop, Implement and Evaluate for instructional design. The analysis stage recognizes visualization technique as a key factor for enhancing students' understanding of programming concepts. The design stage of VAE took about a week to create a storyboard, while MS PowerPoint with i-Spring and Video Scribe formed the principal software for developing the application using storyboard as a guide. The VAE was instrumental in teaching students some hard programming concepts like Array. The results of the simple test with 60 students showed simulation technique of VAE to be effective in helping students to learn the concepts. To determine the effectiveness of VAE prototype, learnability, efficiency, memorability, accuracy and satisfaction of students were examined.

While the e-TajweedYaasin software was also developed using the ADDIE model (Analysis, Design, Develop, Implement and Evaluate) as an e-learning application, the tool was intended to aid students mastering tajweed and avoid common mistakes that were usually made by previous students who had undergone the course. During the analysis stage, visualization and interactive technique were recognised to be helpful in ensuring that students understand tajweed properly and are able to study with ease. The design stage involved the designing of the application layout with the focus on its easy accessibility to users. In addition, its user interface imitates the traditional teaching method called syafawiah. The development stage involved the use of MS PowerPoint with i-Spring features. The combination of audio, video and animation was more effective in comparison to text only in the promotion of learning. A sample of 51 students were selected to use the system and later, they were evaluated based on their ability to read the surah of Yaasin. A great improvement was observed as the number of mistakes had reduced to all the rules as students were enabled to better recognise and practice the tajweed for the surah of Yaasin ( Kapi et al., 2017 ).

Kapi et al. (2017) compared the effectiveness of three multimedia applications for effective teaching and learning. The applications considered were: Greenfoot Tool for programming; Visualisation Makes Array Easy (VAE) and e-TajweedYasin applications. The comparison looked into the design models used in meeting the desired instructional needs. Findings from the paper showed much more improved students' performance, learning and better understanding of subjects taught.

The advantages of using multimedia tools to teach Physics, which most students think is difficult, are enumerated in Jian-hua & Hong's (2012) work. They established that effective application of multimedia technology in university physics teaching can change the form of information, integrating graph, text, sound and image on PC, improving the expressive force of the teaching content so that the students can actively participate in multi-media activities via multi senses. High-quality university physics multimedia courseware is the best means to provide a variety of audio-visual images, which can show a lot of physical processes and phenomena vividly that is difficult by common means. The tool, especially, combines the advantages of multimedia courseware for university physics and that of traditional teaching of physics, and it greatly helped in improving teaching results of physics ( Jian-hua & Hong, 2012 ).

Two researchers developed a culturally responsive Visual Art Education module at the secondary level so as to assist the teachers to integrate and to implement a multicultural education in the teaching and learning practices at schools with the aim of enhancing students' knowledge and awareness regarding the elements of art and culture inherited by each race that makes up the multiracial society in Malaysia. Microsoft power point authoring tool was the technology with visual art materials including images and texts in a multimedia interactive teaching material for teaching 60 secondary school students, which resulted in accelerated teaching and learning processes with the IT skills of the teachers greatly improved ( Maaruf and Siraj, 2013 ).

Two control groups, pre-test and post-test, were selected for the implementation of a developed multimedia tool for 20 weeks. The tool, multimedia aided teaching (MAT) with text, audio, video and animation, was applied on 60 science students with age less than 15 years. The valid and reliable questionnaires were used as data collection tools. The Attitude Towards Science Scale (ATSS) was used to measure the attitude of both groups before and after treatments. The independent sample t-test was used to analyze the data. The results indicated that MAT is more effective than the traditional one. Students' attitude towards science improved with the use of MAT when compared to the traditional method of teaching ( Shah and Khan, 2015 ).

The effect of multimedia tools on the performance of 67 grade 4 students of social studies in Kayseri, Turkey was presented. Teaching tool with Computer representation with text, audio, video and animation as its components applied on a control group and an experimental group. The study concluded that academic performance of students in social studies was greatly improved when multimedia technique was applied as compared to traditional classroom ( Ilhan and Oruc, 2016 ).

Two samples of 60 senior secondary school II students in two different schools in Lagos State, Nigeria, were selected for the pre-test, post-test control group quasi experimental design in the research by Akinoso. Mathematics Achievement Test (MAT) with twenty-five questions from four topics namely: logarithm, percentage error, range, variance and standard deviation and circle theorems was the tool used. It was concluded that the students in the experimental group where multimedia tool was used performed better than those in the control group. It was equally inferred from the work that students' interest, motivation and participation increased according to the researcher and experimental group's teacher observations ( Akinoso, 2018 ).

Specifically, in the field of engineering, laboratory software applications can be used to provide an interface to providing practical alternatives to students depending on their requirement. Ertugrul (2000) provided a review of LabView software applications. The paper provided some knowledge about laboratory software tools used in the field of engineering and concluded that computer-based technology has advanced up to the stage where it can aid Engineering education at a significantly low price. The paper also highlighted some challenges faced by institutions in selecting and in the use of these software such as the need to upgrade software as the curriculum changes while also providing some future trends.

Zulkifli et al. (2008) examined a self-calibrating automated system for depressed cladding applications as they demonstrated utilizing the Laboratory Virtual Instrument Engineering Workbench (LabVIEW) software and General-Purpose Interface Bus (GPIB) interface. The presented model confirmed that the overall experiment time was reduced by 80% and data obtained is more accurate than caring out the experiment physically. Similarly, Teng et al. (2000) presented a Lab view as a teaching aid for use as power analyzer. The paper showed the tool allows for developmental speed to be accelerated as it is a connection between different workbench instruments.

The structured information extracted from the relevant reviewed articles are presented in the next sections. The systematic review enabled us to extract information from the reviewed articles on the type of multimedia tool the article described, what type of technology the tool deployed, what were the multimedia components utilized, and whether the tool applied to a teaching or learning scenario or both. Furthermore, results from articles reviewed for their evaluation studies are also presented including barriers to multimedia use.

4.1. Multimedia tools, technology, components and applications

The systematic review enabled us to extract information from the reviewed articles on the type of multimedia tool the article described, what type of technology the tool deployed, what were the multimedia components utilized, and whether the tool applied to a teaching or learning scenario or both. The results are presented in Table 3 .

Table 3

Summary of multimedia tools, technology, components and applications for education.

PublicationMultimedia ToolTechnologyStand-alone/ Multimedia ComponentsApplication
Web basedTeach-ingLearn-ing
Multimedia tool for teaching MathematicsComputer RepresentationStand-aloneText, Graphics, Audio and VideoYesYes
Teaching toolComputer RepresentationStand-aloneText, audio, video and animationYesYes
Graphic web-based applicationHTML 5, JavaScript, and Cascading Style Sheet (CSS)Web-basedGraphics, text, video, audioYesNo
Continue Multimedia Teaching Delivery ToolAugmented Reality (AR) softwareStand-aloneText, symbols, images, animations, video, audioYesYes
Multimedia Teaching and Learning ToolComputer Aided Design (CAD) Training EnvironmentStand-aloneText, image, audio, video, animationYesYes
Communication FormsWord-processing and presentationStand-aloneText, images, sound, videoYesYes
SkitchPresentationStand-aloneText and picturesYesYes
CENTRAOnlineWeb-basedAudio, videoNoYes
Virtual learning tool for Engineering EducationLab View Software Application based on sub-virtual instrumentWeb-basedText, diagram, signal (animation), illustrationYesYes
Three view diagrams and tangible materials3D modeling and PrintingStand-alone3D, graphics, text, audio, videoYesYes
Web-based applicationHTML, JavaScript, CSSWeb-basedText, image, annotationYesYes
Teaching tool for teaching social studiesComputer RepresentationWeb-basedText, audio, video and animationYesYes
Multimedia tool for teaching optimizationOnline university physics multimedia paper evaluation and Simulation LaboratoryStand-aloneGraphics, textYesYes
Visualisation Makes Array Easy (VAE)Video scribe and MS PowerPoint with i-springStand-aloneText, graphics, video, audio and animationYesYes
GreenfootObject-Oriented ProgrammingStand-aloneText, pictureYesYes
e-Tajweed YaasinMS-Power point and i-SpringStand-aloneText, audio, video, and animationNoYes
Video based multimedia guideCamtasia Studio 7 programStand-aloneVideo, textYesYes
Multimedia interactive teaching materialsMicrosoft power point authoring toolStand-aloneVisual art materials including images and textYesNo
Teaching toolComputer representationStand-aloneText, audio, video and animationYesYes
Multimedia Aided Teaching (MAT)WebWeb-basedText, audio, video and animationYesYes
Educational videosYouTubeWeb-basedText, audio, videoYesYes
Multimedia aid for teaching classical Chinese languageE – Book System and Draw Express software and tabletStand-aloneText, graphicsYesYes
Online teaching and learning resource platformMacromedia Flash version 8.0Web-basedText, audio, video, imageYesYes

Various multimedia tools were identified in the research papers reviewed. Perhaps, owing to the advancement in multimedia technology, several applications have been developed and deployed to enhance teaching skills and learning environment in many fields of study. These include subject specific tools such as that for teaching and learning Mathematics ( Akinoso, 2018 ), the Chinese language ( Wu and Chen, 2018 ), Physics ( Jian-hua & Hong, 2012 ) and for teaching Social Studies ( Ilhan and Oruc (2016) . All the multimedia tools were developed for teaching except the CENTRA tool ( Eady and Lockyer, 2013 ) and the e-Tajweed Yaasin tool ( Kapi et al., 2017 ). Likewise, all the tools handled learning except the web-based application reported by Bánsági and Rodgers (2018) and the multimedia interactive teaching material ( Maaruf and Siraj, 2013 ).

The tools fell into two categories: standalone or web-based. One-third were web-based (36%) while 65% were standalone.

Technologies identified varied widely. Multimedia tools used included advanced technologies such as computer representation ( Akinoso, 2018 ; Aloraini, 2012 ; Ilhan and Oruc, 2016 ; Milovanovic et al., 2013 ) and augmented reality ( Blevins, 2018 ). High-level web design and programming software were also utilized. For instance, Bánsági and Rodgers (2018) and Hwang et al. (2007) utilized HTML 5, JavaScript and Cascading Style Sheet (CSS), which are software commonly used for web site programming. Camtasia Studio 7 software was used in the development of a video based multimedia guide for teaching and learning ( Karel and Tomas, 2015 ).

A commonly used web design and animation software, Macromedia Flash, was also identified ( Zhang, 2012 ). Object-oriented programming software was reported by Kapi et al. (2017) in the Greenfoot multimedia tool reported by them. Some low end technologies such as word-processing ( Eady and Lockyer, 2013 ) and presentation software ( Kapi et al., 2017 ) were also utilised. Other technologies reported include the use of e-book ( Wu and Chen, 2018 ), computer aided design (CAD) ( Davies and Cormican, 2013 ) and YouTube ( Shoufan, 2019 ).

As shown in Table 3 , several multimedia components were identified. These included text, audio, video, image, animation, annotation and 3D, with several of the multimedia tools combining two or more components. However, the incorporation of 3D was reported only by Huang et al. (2017) . All the analysed papers incorporated text in the multimedia tool reported, except in the tool, CENTRA ( Eady and Lockyer, 2013 ). Animation was also embedded as part of the multimedia tool developed for visualisation ( Kapi et al., 2017 ), for teaching Social Studies ( Ilhan and Oruc, 2016 ), engineering virtual learning tool ( Ertugrul, 2000 ), CAD ( Davies and Cormican, 2013 ), augmented reality ( Blevins, 2018 ) and in tool for teaching Mathematics ( Akinoso, 2018 ). Figure 2 shows the trend in educational technology based on year of publication of the reviewed articles. The figure reveals that while incorporation of audio and video became common as from 2012, 3-D makes its first appearance in 2017. This suggests that as new ICTs emerge educators are likely to try them in the quest for the best learning experience possible.

Figure 2

Educational technology trend based on year of publication.

4.2. Multimedia tools test location and target age

In this section, information on the location where the multimedia tool was tested and the target age of the study group are presented as summarised in Table 4 . The table also includes comments about the articles that could not be captured under any of the tabulation headings.

Table 4

Summary of multimedia tools for education study locations.

PublicationMultimedia ToolTest LocationTarget AgeComments
Multimedia tool for teaching MathematicsLagos State, NigeriaSenior Secondary StudentsThe use of Multimedia technique increased the academic successes of students in Mathematics
Teaching ToolSaudi Arabia>15 years20 students sampled.
Graphic web based applicationUniversity of ManchesterChemical Engineering studentsPositive impact recorded
Continue Multimedia Teaching Delivery ToolNot specifiedStudentsDwells on application of AR in emerging technology
Teaching and learning ToolIreland>18 years28 instructors and 159 learners considered
Communication FormsAustraliaYear 4For History students
SkitchAustraliaYear 4For Creative Arts Lesson
CENTRACanada and AustraliaYear 4Enhanced students' collaborations within and outside their classrooms+
Lab View based on web and standalone (optional)Not specifiedEngineering studentsEnhanced learning of practical
Three view diagrams and tangible materialsVocational university in TaiwanUndergraduate students (16–22)The result of Lag sequential analysis used to test the impact of multimedia technology on students' meta-cognitive behaviour is positive.
Web-based applicationNot specifiedCollege studentsStudents were highly motivated
Teaching tool for teaching social studiesTurkeyGrade 4 (that is > 15 years)67 social science students sampled. Academic performance of students in social studies improved greatly
Multimedia tool for teaching optimizationUniversity
Not specified
Physics StudentsIntroduction of multimedia tools to teaching physics improved student performance
GreenFootUiTMNegeri, SembilanUndergraduate students (16–22)20 students sampled. The technology allows the creation of scenarios
Visualisation Makes Array Easy (VAE)UiTMNegeri, SembilanUndergraduate students (16–22)60 students sampled. Helps students to understand programming
E-TajweedYasinUiTMNegeri, SembilanUndergraduate students (16–22)51 students sampled. Helps students to identify and revise the Tajweed for suraYasin.
Video based multimedia guideUniversity in Czech republicStudents of control theory classPositive result was achieved based on student feedback
Multimedia interactive teaching materialsMalysiaSecondary School students60 secondary school students sampled.
Teaching ToolSerbia>15 years50 students sampled.
Multimedia Aided Teaching (MAT)Fazaia Inter College Malir, Karachi, Pakistan<15 years60 students sampled for 20 weeks. Used for instructional delivery
Educational videosNot specifiedOnline distance learning students105 educational videos collected from YouTube, analyzed using regression analysis.
Multimedia aid for teaching classical Chinese languageChinavocational school students5 students in experimental group were given tablet for evaluation. E-books improves learning activities
Online teaching and learning resource platformVocational school studentsNot specifiedEvaluation based on non-equivalent pre-test and post-test control group shows significant improvement in students' performance

The multimedia tools tested were reported in studies from various countries, including Nigeria ( Akinoso, 2018 ), Saudi Arabia ( Aloraini, 2012 ), England ( Bánsági and Rodgers, 2018 ), Ireland ( Davies and Cormican, 2013 ), Australia and Canada ( Eady and Lockyer, 2013 ), Taiwan ( Huang et al., 2017 ), Turkey ( Ilhan and Oruc, 2016 ) Czech republic ( Karel and Tomas, 2015 ), Malaysia ( Maaruf and Siraj, 2013 ), Serbia ( Milovanovic et al., 2013 ), Pakistan ( Shah and Khan, 2015 ) and China ( Wu and Chen, 2018 ).

Various age groups were targeted by the multimedia tool tests. A considerable proportion involved university students with ages starting from 16 or 18 years as specified in the articles ( Bánsági and Rodgers, 2018 ; Huang et al., 2017 ); Hwang et al., 2007 ; Jian-hua & Hong, 2012 ; Kapi et al., 2017 ; Karel and Tomas, 2015 ). Another group targeted were secondary school students ( Akinoso, 2018 ; Maaruf and Siraj, 2013 ) including vocational school students ( Wu and Chen, 2018 ). Shah and Khan (2015) reported testing their multimedia tool on children below the age of 15 years.

4.3. Evaluation methods of multimedia technology tools in education

The articles involving evaluation were examined to identify the methodologies used for the evaluation, the target groups and sample of the evaluation and the evaluation outcome. The limitations of the evaluation were also identified and whether or not the study outcome could be generalized. Thirteen articles were found and the results are presented in Table 5 .

Table 5

Summary of Evaluation methods of multimedia technology Tools in education.

PublicationFocus areaEvaluation methodTarget groupSample sizeOutcomeLimitationsGeneral-izable outcome
MathematicsExperimental investigationSecondary school students60Multimedia aids the teaching of mathematicsDuration of the experiment was not stated.
Two schools were chosen randomly, no definite number of sample size per group.
No
PhysiologySurvey (online)2 year University Students231Technology affects students achievementsStudy focused on students' interaction with curricular content, administrators, instructors, and other related personnel not considered.Yes
EducationExperimental - comparison with traditional methodUniversity female students40 (20 students for each group)Significant difference observed between the average marks of the two methods40 out of 400 female students were used for the study, representing only 10%.No
General courseSurveyUniversity students234The amount of students learning significantly increased compared to traditional method.Multimedia has no effect on participation and responsibility, team work, self- esteem and democracy skills of the students.No
Physical education studiesSurveyProfessor interviewUndisclosedMultimedia has positive influence on college physical education.The paper did not provide the methodology, sample space or size.No
ScienceExperimental (using animated cartoons)10–11 years179Motivations to learning aid to young people.The scope of the multimedia solution is narrow.Yes
Social scienceExperimental:-Teaching with multimedia
-Teaching without multimedia
4th grade students67Multimedia technique increased the academic success.Single lesson within social studies curriculum was considered
Both groups were chosen randomly, no definite number of sample size per group.
No
ScienceExperimental (using animated cartoons)Elementary school76Significant difference was determined in favour of post-test scoresQuasi experimental design was adopted and no control group used for the testing.No
Visual Art EducationSurvey: in-depth interviewSecondary school teachers2Multimedia usage resulted in accelerated teaching and learning processes.Very small sample size.No
General EducationSurveyAcademic staff6,139Restriction and limit on the use of social media among the academicsLow level of response rate, i.e. 10.5%.No
Mathematics classesExperimental: -Teaching with multimedia
-Teaching without multimedia
University students50 (25 each for experimental and control groups)Experimental group had significantly higher scoresOnly two lessons considered: Isometric transformations and regular polyhedral.No
ScienceExperimental: multimedia-aided teaching (MAT)Elementary students60 (30 students for each group)Learners become active participantsNo significant difference observed in academic performance.No
General studiesSurveyStudents272Students prefer structured texts with colour discrimi-nation.No experiment undertaken to validate the outcome.Yes

Evaluation of multimedia technology used for teaching and learning is important in establishing the efficacy of the tool. For determination of the impact of a developed tool, an experimental evaluation is more meaningful over a survey. However, the results from the analysis showed that the survey method for evaluation was used nearly as equally as the experimental design.

Experimental based evaluation was conducted by Akinoso (2018) , Aloraini (2012) , Ilhan and Oruc (2016) , and Shah and Khan (2015) in order to determine the effectiveness of the multimedia tool they developed. Another group of experimental evaluations involved designing the research for teaching with or without multimedia aids not necessarily developed by the research team which involved exposing 10–11 year olds ( Dalacosta et al., 2009 ) and elementary school students ( Kaptan and İzgi, 2014 ) to animated cartoons. Another of such evaluation was done by Milovanovi et al. (2013) , who used an experimental and control group to evaluate the impact of teaching a group of university students with multimedia.

In contrast, the survey method was used to elicit the opinion of respondents on the impact of the use of multimedia in teaching and learning and the target group were university students ( Al-Hariri and Al-Hattami, 2017 ; Barzegar et al., 2012 ), secondary school students ( Akinoso, 2018 ; Maaruf and Siraj, 2013 ); one involved interviewing the Professors ( Chen and Xia, 2012 ), another involved 4–10 year olds ( Manca and Ranieri, 2016 ) and a sample of 272 students whose ages were not specified ( Ocepek et al., 2013 ).

The focus areas in which the evaluations were conducted ranged from the sciences including mathematics ( Akinoso, 2018 ; Al-Hariri and Al-Hattami, 2017 ; Dalacosta et al., 2009 ; Kaptan and İzgi, 2014 ; Milovanovi et al., 2013 ) to the social sciences ( Ilhan and Oruc, 2016 ) and the arts ( Maaruf and Siraj, 2013 ). There were evaluations focused on education as a subject as well ( Aloraini, 2012 ; Chen and Xia, 2012 ; Maaruf and Siraj, 2013 ; Manca and Ranieri, 2016 ). While positive outcomes were generally reported, Ocepek et al. (2013) , specified that students in their evaluation study preferred structured texts with colour discrimination.

Sample sizes used in the studies varied widely, from Maaruf and Siraj (2013) that based their conclusions on an in-depth interview of teachers, to Manca and Ranieri (2016) that carried out a survey with a sample of 6,139 academic staff. However, the latter study reported a low response rate of 10.5%. One notable weakness identified was that the findings from all but one of the studies could not be generalized. Reasons for this ranged from inadequate sample size, the exposure being limited to a single lesson, or the sampling method and duration of the experiment not explicitly stated.

4.4. Identified barriers to multimedia use in teaching and learning

The review revealed some challenges that could be barriers to the use of multimedia tools in teaching and learning. Some of these barriers, as found in the reviewed articles, are highlighted as follows:

  • • Attitudes and beliefs towards the use of technology in education. Findings from literatures and surveys have shown high resistant to change and negative attitude towards adoption and use of ICT in education ( Cuban et al., 2001 ; Said et al., 2009 ; Snoeyink and Ertmer, 2001 ). In some findings, some respondents perceived no benefits ( Mumtaz, 2000 ; Snoeyink and Ertmer, 2001 ; Yuen and Ma, 2002 ).
  • • Lack of teachers' confidence in the use of technology and resistance to change ( Bosley and Moon, 2003 ; Fabry& Higgs, 1997 ; Said et al., 2009 ).
  • • Lack of basic knowledge and ICT skills for adoption and use of multimedia tools ( Akbaba-Altun, 2006 ; Bingimlas, 2009 ; Cagiltay et al., 2001 )
  • • Lack of access to computing resources such as hardware and software ( Akbaba-Altun, 2006 ; Bosley and Moon, 2003 ; Cinar, 2002 ; Mumtaz, 2000 ; Taylor and Todd, 1995 )
  • • Lack of technical, administrative and financial supports ( Akbaba-Altun, 2006 ; Cinar, 2002 ; Said et al., 2009 ; Goktas et al., 2013 )
  • • Others include lack of instructional content, basic knowledge and skills, physical environment and lack of time to learn new technologies ( Akbaba-Altun, 2006 ; Cinar, 2002 ; Said et al., 2009 ).

5. Discussion

The findings from the systematic review are discussed in this section with a view to answering the research questions posed. The questions bordered on identifying the existing multimedia tools for teaching and learning and the multimedia components adopted in the tools, the type of audience best suited to a certain multimedia component, the methods used when multimedia in teaching and learning are being evaluated and the success or failure factors to consider.

5.1. Multimedia tools in teaching and learning

The review revealed that multimedia tools have been developed to enhance teaching and learning for various fields of study. The review also shows that multimedia tools are delivered using different technologies and multimedia components, and can be broadly categorized as web-based or standalone.

From the review, it was found that standalone multimedia tools were more than twice (64%) the number of tools that were web-based (36%). Standalone tools are a category of teaching and learning aids which are not delivered or used over the internet, but authored to be installed, copied, loaded and used on teachers or students' personal computers (PCs) or workstations. Standalone tools are especially useful for teaching and practicing new concepts such as 3D technology for modelling and printing ( Huang et al., 2017 ) or understanding augmented reality (AR) software ( Blevins, 2018 ). Microsoft Powerpoint is a presentation tool used in some of the reviewed articles and is usually done with standalone systems.

Standalone tools were favoured over web-based tools probably because the internet is not a requirement which makes the tool possible to deploy in all settings. This means that teachers and students in suburban and rural areas that are digitally excluded, can benefit from such a multimedia tool. This system is considered most useful because a majority of the populace in most developing countries are socially and educationally excluded due to a lack of the necessary resources for teaching and learning. The need to sustainably run an online learning environment may be difficult, and therefore, the standalone, provides a better fit for such settings. However, the problem with a standalone application or system is the platform dependency. For instance, a Windows based application can only run on a windows platform. Also, there will be slow convergence time when there is modification in the curricular or modules, since, each system will run offline and has to be updated manually or completely replaced from each location where the tool is deployed.

The other category, web-based multimedia tools, are authored using web authoring tools and delivered online for teaching and learning purposes. About one-third of the tools identified from the review were web-based although they were used largely in university teaching and learning. Examples of these tools are: online teaching and learning resource platform ( Zhang, 2012 ), graphic web-based application ( Bánsági and Rodgers, 2018 ), multimedia tool for teaching optimization ( Jian-hua & Hong, 2012 ), and educational videos on YouTube ( Shoufan, 2019 ).

One of the benefits of the web based multimedia solution is that it is online and centralized over the internet. Part of its advantages is easy update and deployment in contrast to the standalone multimedia system. The major requirements on the teachers and learners' side are that a web browser is installed and that they have an internet connection. Also, the multimedia web application is platform independent; it does not require any special operating system to operate. The same multimedia application can be accessed through a web browser regardless of the learners' operations system. However, when many people access the resource at the same time, this could lead to congestion, packet loss and retransmission. This scenario happens often when large classes take online examinations at the same time. Also, the data requirements for graphics or applications developed with the combination of video, audio and text may differs with system developed with only pictures and text. Hence, the web based system can only be sustainably run with stable high speed internet access.

A major weakness of web-based multimedia tools is the challenge posed for low internet penetration communities and the cost of bandwidth for low-income groups. As access to the internet becomes more easily accessible, it is expected that the advantages of deploying a web-based multimedia solution will far outweigh the disadvantages and more of such tools would be web-based.

5.2. Components, technology and applications of multimedia tools in education

The results from the review revealed that most of the existing multimedia tools in education consist of various multimedia components such as text, symbol, image, audio, video and animation, that are converged in technologies such as 3D ( Huang et al., 2017 ), Camtasia Studio 7 software ( Karel and Tomas, 2015 ), Macromedia Flash ( Zhang, 2012 ), HTML5, JavaScript, CSS ( Bánsági and Rodgers, 2018 ; Eady and Lockyer, 2013 ; Chen and Liu, 2008 ; Shah and Khan, 2015 ; Shoufan, 2019 ). As shown in Figure 3 , the analysis confirms that text (26.8%) is the predominant multimedia component being used in most of the educational materials while other components such as videos (19.5%), audios (18.3%), images (18.3%) and animation (11.0%) are fairly used in teaching and learning multimedia materials. However, annotation and 3D technologies are least incorporated.

Figure 3

Proportion of multimedia components in reviewed articles.

How these components are combined is shown in Figure 4 . Perhaps, the combination of these four major components (text, video, audio, image) provides the best outcome for the learner and points to the place of text as a most desired multimedia component. The components used also reflect the type of subject matter being addressed. For instance, the audio component is important for language classes while video and image components are stimulating in Biology classes, for example, due to the need for visual perception for the learners. It is, therefore, imperative to note that the choice of the combination of these components could yield variable impacts to learners. Hence, it can be deduced from the studies that most of the tools are applied either as teaching or/and learning aids depending on the nature of the audience and teacher.

Figure 4

Use of various multimedia combinations.

In Figure 4 , we provided the analysis of the component combination of the data set reviewed. The multimedia components combinations range from two to six. This was grouped based on the multimedia components combination employed in each of the data set. Group 1 (G1) represents the number of multimedia application with the combination of Text, Image, audio, Video, and 3D. G2 consists of video and audio, while G13 combines all the multimedia components except the 3D.

Furthermore, a majority of the multimedia applications considered four (4) and two (2) combinations of components in their design as shown in Figure 5 . Tools with five and six components were very few and as the figure reveals, all the tools used at least two components.

Figure 5

Multimedia tools and the number of components utilized.

These findings stress the fact that application of multimedia tools in education and the multimedia component incorporated, are audience, subject, curricula and teacher-specific and the tool needs to be well articulated and structured to achieve its goals.

5.3. Targeted multimedia solutions

Our systematic review also revealed that most multimedia solutions deployed for teaching and learning target the solution to the pedagogical content of the subject of interest (see Table 4 ) and the user audience of the solution ( Table 5 ). Several studies highlighted in Tables  4 and ​ and5 5 showcase multimedia tools used for mathematics classes ( Akinoso, 2018 ; Milovanovi et al., 2013 ), Social science ( Ilhan and Oruc, 2016 ), Physiology ( Al-Hariri and Al-Hattami, 2017 ), Physics ( Jian-hua and Hong, 2012 ), in Chemical engineering ( Bánsági and Rodgers, 2018 ) and teaching of Chinese language ( Wu and Chen, 2018 ). In addition, multimedia tools were utilized for teaching specific principles such as in control theory ( Karel and Tomas, 2015 ) and teaching of arrays ( Kapi et al., 2017 ). That multimedia solutions are subject-based is not surprising given that multimedia involves relaying information using different forms of communication. It follows that multimedia solution developers need to incorporate some text, video, audio, still photographs, sound, animation, image and interactive contents in a manner that best conveys the desired content for teaching or to aid learning.

As stated earlier, the review revealed a variety of user types for the multimedia solutions reported. It is noteworthy that a large proportion of the studies where the target audience were university students, a mixture of graphics, text, audio, video and sometimes animation was utilized ( Aloraini 2012 ; Blevins, 2018 ; Huang et al., 2017 ; Shah and Khan, 2015 ). While a sizeable number of solutions were targeted at secondary school students (such as Maaruf and Siraj, 2013 , Kapi et al., 2017 , and Ilhan and Oruc, 2016 ), very few studies were identified that targeted students less than 15 years in age. Shah and Khan (2015) targeted a multimedia teaching aid that incorporated text, audio, video and animation. Perhaps the absence of multimedia tools targeted at very young children may be as a result of the inclusion criteria used for identifying articles for the review.

5.4. Success factors

The success of the different multimedia tools that have been used on the various target groups and subjects can be attributed to the technologies and components embedded as shown in Tables  4 and ​ and5. 5 . In most cases where text, audio, video, graphics and animations were the components of choice, significant improvements in teaching and learning are used, as reported in the studies reviewed ( Blevins, 2018 ; Huang et al., 2017 ; Zhang, 2012 ).

These studies also implemented technologies such as 3D modelling and printing; Macromedia flash version 8.0 and augmented reality (AR) software respectively. It is worthy of note that all the above-mentioned multimedia tools were applicable in both the teaching and learning processes. Another set of tools with components being text, audio, video and animation, excluding graphics, and equally applied in both the teaching and learning processes, adopted computer representation as their technologies ( Aloraini, 2012 ; Ilhan and Oruc, 2016 ; Milovanovic et al., 2013 ). Teaching and learning were equally greatly improved in these cases.

5.5. Evaluation methodologies

Our systematic review included a synthesis of the methodologies described by the reviewed articles for evaluating the multimedia tools that they present as shown in the summary in Table 5 . The evaluation methodologies appeared to be different depending on the type of multimedia tool, technology components, deployment strategies, and application area and target groups. However, two main evaluation methods were identified - experimental investigations and the survey methodology.

The experimental approach involved the use of an experimental group and a control group, where the assessment of the impact of the multimedia tool on the students' performance on the experimental group was compared with the performance of the control group who were taught the same content without the use of the multimedia tool. This experimental approach is a widely practiced evaluation method and has proven to be effective. It was deployed by Aloraini (2012) , Milovanovi et al. (2013) , Kaptan and İzgi (2014) , Shah and Khan (2015) , Ilhan and Oruc (2016) and Akinoso (2018) in their studies in the subject area of education, social sciences, general science, science, education and mathematics classes respectively.

Survey, as an evaluation approach which was used in 46% of the studies reviewed, involved the use of questionnaires that were administered to gather opinion on the perceived impact of the multimedia tool from a targeted group of respondents. From the systematic review, it was found that the questionnaire administration approach also varied. The data collection could be face-to-face interview ( Al-Hariri and Al-Hattami, 2017 ; Barzegar et al., 2012 ; Chen and Xia, 2012 ), or online survey ( Armenteros et al., 2013 ; Wang et al., 2020 ).

The difficulty of determining impact from a survey is related to the weaknesses associated with instrument design and sampling biases. It is our opinion that the perceived impact of the technology components used in the development of the multimedia tools may not be accurately ascertained using survey when compared with the actual deployment and experimentation with the multimedia tool that takes place in experimentation approach. Besides, in the survey approach, judgment is merely based on perceptions. Interestingly, the simplicity and ease of the survey method makes it a good option for evaluating larger target groups, and its findings can be generalised when the statistical condition is satisfied ( Krejcie and Morgan, 1970 ).

Although the evaluation studies analysed had publication dates as recently as 2015 to 2018, none reported any objective data collection such as from eye-tracking or other behavioural data. Perhaps, this may be due to our search keyword terms not being wide enough to identify multimedia evaluation studies that used objective data gathering. It could also be that the cost, time and effort needed to collect objective data means that many studies incorporating evaluation are avoiding this route.

5.6. Barriers to multimedia use in teaching and learning

Several barriers to multimedia use in teaching and learning were revealed as a result of the review. Such barriers include resistance to the adoption of ICT, lack of teachers' confidence in the use of technology, resistance to change on the part of teachers, a lack of ICT skills and lack of access to ICT resources. Other barriers identified were the lack of support, lack of time to learn new technologies, lack of instructional content, and the physical environment in which multimedia delivery took place. Some studies reported respondents that perceived no benefits from the use of multimedia. These barriers certainly affect both the integration of multimedia in teaching and learning and the uptake of the multimedia tool.

Most of the barriers identified could be classified into three groups with a major one being the fear or resistance to change. This means that change management must be an integral part of multimedia tools development and deployment in order to achieve the desired goal. Also, barriers such as lack of time and lack of resources should not be underestimated. Some of the studies reported providing the hardware for the multimedia application and such an approach should be considered. Most multimedia tools are ICT driven and as such the identified barrier of lack of ICT skills is an important aspect that must be addressed. This can be done as part of the change process and would also boost the confidence of teachers to incorporate multimedia for teaching.

It is important that the multimedia tool is designed and developed with the end-goal in mind. As indicated, some recipients of multimedia applications did not see any benefit in its use. This means that the multimedia tool should be designed to provide an experience that is worth the teachers and students' time, attention and effort.

6. Conclusions and future research direction

A lot of work has been done to highlight the effectiveness of multimedia as a teaching and learning aid. This paper provides a systematic review of studies on the use of multimedia in education in order to identify the multimedia tools being commonly used to aid teaching and learning. The paper did a systematic review of extant literature that reported studies that have been carried out to determine the extent to which multimedia has been successful in improving both teaching and learning, and challenges of using multimedia for leaning and teaching.

We note, however, that our review, especially of the studies on evaluation of multimedia, leaned more to the outcome from the studies rather than the process. Some of the information that was not captured include how the classroom teacher's mastery of the technology influences the attractiveness of the tool to the learner, both visually and through the content and if the multimedia tool allowed for learners' participation. Also, while studies on multimedia evaluation was of interest to us, this search phrase was not part of the search phrases used. A future review could incorporate these for a richer perspective.

It is obvious from the review that researchers have explored several multimedia in order to develop teaching and learning tools either based on the web or standalone using different technologies. It is observed that there exist several multimedia tools in education, but the proliferation of the tools is attributed to the evolvement of technologies over the years and the continuous teachers' efforts to improving knowledge delivery with respect to the subject areas and target audience. It is also revealed that most multimedia solutions deployed for teaching and learning target the solution to the pedagogical content of the subject of interest and the user audience of the solution. The success of the different multimedia tools that have been used on the various target groups and subjects is also attributed to the technologies and components embedded.

Furthermore, the evaluation methodologies and learning outcomes of the deployment of multimedia tools appeared to be different depending on the type of multimedia tool, technology components, deployment strategies, and application area and target groups. The two main evaluation methodologies identified from the various studies reported in the articles we reviewed were the experimental investigations and the survey methodology.

Attitudes and beliefs towards the use of technology in education, lack of teachers' confidence and resistance to change, lack of basic knowledge and ICT skills, lack of technical, administrative and financial supports, lack of physical environment are some of the barriers identified in the various articles reviewed. These barriers affect the integration of multimedia in education.

For future work, efforts should be made to explore mobile technology with several multimedia components in order to enhance teaching and learning processes across a diverse group of learners in the primary, secondary, vocational, and higher institutions of learning. Such research efforts would be significant in increasing inclusiveness and narrowing the educational divide. Also, research into the change management process for overcoming the barriers to multimedia adoption would be of interest.

Declarations

Author contribution statement.

All authors listed have significantly contributed to the development and the writing of this article.

Funding statement

This work was supported by Tertiary Education Trust Fund (TetFund), Ministry of Education, Federal Government of Nigeria 2016–2017 Institutional Based Research Grant.

Competing interest statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.

  • Agulla E.G., Rúa E.A., Castro J.L.A., Jiménez D.G., Rifón L.A. 2009 11th IEEE International Symposium on Multimedia. 2009. Multimodal biometrics-based student attendance measurement in learning management systems; pp. 699–704. [ Google Scholar ]
  • Akbaba-Altun S. Complexity of integrating computer technologies into education in Turkey. Educ. Technol. Soc. 2006; 9 (1):176–187. [ Google Scholar ]
  • Akinoso O. Effect of the use of multimedia on students' performance in secondary school mathematics. Global Media J. 2018; 16 (30):1–8. [ Google Scholar ]
  • Al-Ajmi N.A.H., Aljazzaf Z.M. Factors influencing the use of multimedia technologies in teaching English language in Kuwait. Int. J. Emerg. Technol. Learn. 2020; 15 (5):212–234. [ Google Scholar ]
  • Alemdag E., Cagiltay K. A systematic review of eye tracking research on multimedia learning. Comput. Educ. 2018; 125 :413–428. 2018. [ Google Scholar ]
  • Al-Hariri M.T., Al-Hattami A.A. Impact of students' use of technology on their learning achievements in physiology courses at the University of Dammam. J. Taibah Univ. Med. Sci. 2017; 12 (1):82–85. [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • Almara'beh H., Amer E.F., Sulieman A. The effectiveness of multimedia learning tools in education. Int. J. Adv. Res. Comput. Sci. Software Eng. 2015; 5 (12):761–764. [ Google Scholar ]
  • Aloraini S. The impact of using multimedia on students’ academic achievement in the College of Education at King Saud University. Kind Saud Univ. J. King Saud Univ. Lang. Transl. 2012; 24 :75–82. 2012. [ Google Scholar ]
  • Anderson R.E. IEA Computers in Education Study, Department of Sociology, University of Minnesota. 1993. Computers in American schools 1992: an overview: a national report from the international IEA computers in education study. [ Google Scholar ]
  • Armenteros M., Liaw S.S., Fernández M., Díaz R.F., Sánchez R.A. Surveying FIFA instructors' behavioral intention toward the multimedia teaching materials. Comput. Educ. 2013; 61 :91–104. [ Google Scholar ]
  • Bánsági T., Jr., Rodgers T.L. Graphic web–apps for teaching ternary diagrams and liquid–liquid extraction. Educ. Chem. Eng. 2018; 22 :27–34. [ Google Scholar ]
  • Barzegar N., Farjad S., Hosseini N. The effect of teaching model based on multimedia and network on the student learning (case study: guidance schools in Iran) Procedia Soc. Behav. Sci. 2012; 47 :1263–1267. 2012. [ Google Scholar ]
  • Bingimlas K. Barriers to the successful integration of ICT in teaching and learning environments: a review of the literature. Eurasia J. Math. Sci. Technol. Educ. 2009; 5 (3):235–245. [ Google Scholar ]
  • Blevins B. Teaching digital literacy composing concepts: focusing on the layers of augmented reality in an era of changing technology. Comput. Compos. 2018; 50 :21–38. [ Google Scholar ]
  • Bosley C., Moon S. Centre for Guidance Studies, University of Derby; 2003. Review of Existing Literature on the Use of Information and Communication Technology within an Educational Context. [ Google Scholar ]
  • Cagiltay K., Cakiroglu J., Cagiltay N., Cakiroglu E. Teachers’ perspectives about the use of computer in education. H. U. J. Educ. 2001; 21 (1):19–28. [ Google Scholar ]
  • Chen H.Y., Liu K.Y. Web-based synchronized multimedia lecture system design for teaching/learning Chinese as second language. Comput. Educ. 2008; 50 (3):693–702. [ Google Scholar ]
  • Chen S., Xia Y. Research on application of multimedia technology in college physical education. Procedia Eng. 2012; 29 (2012):4213–4217. [ Google Scholar ]
  • Cinar A. METU; Ankara, Turkey: 2002. Teachers’ Computer Use at Basic Education Schools: Identifying Contributing Factors. Unpublished master’s thesis. [ Google Scholar ]
  • Coleman L.O., Gibson P., Cotten S.R., Howell-Moroney M., Stringer K. Integrating computing across the curriculum: the impact of internal barriers and training intensity on computer integration in the elementary school classroom. J. Educ. Comput. Res. 2016; 54 (2):275–294. [ Google Scholar ]
  • Cuban L., Kirkpatrick H., Peck C. High access and low use of technology in high school classrooms: explaining an apparent paradox. Am. Educ. Res. J. 2001; 38 (4):813–834. [ Google Scholar ]
  • Dalacosta K., Kamariotaki-Paparrigopoulou M., Palyvos J.A., Spyrellis N. Multimedia application with animated cartoons for teaching science in elementary education. Comput. Educ. 2009; 52 (4):741–748. [ Google Scholar ]
  • Davies W., Cormican K. An analysis of the use of multimedia technology in computer aided design training: towards effective design goals. Procedia Technol. 2013; 9 :200–208. 2013. [ Google Scholar ]
  • Eady M.J., Lockyer L. Queensland University of Technology; Australia: 2013. “Tools for Learning: Technology and Teaching Strategies,” Learning to Teach in the Primary School; p. 71. [ Google Scholar ]
  • Ertugrul N. Towards virtual laboratories: a survey of LabVIEW-based teaching/learning tools and future trends. Int. J. Eng. Educ. 2000; 16 (3):171–180. [ Google Scholar ]
  • Fabry D., Higgs J. Barriers to the effective use of technology in education. J. Educ. Comput. 1997; 17 (4):385–395. [ Google Scholar ]
  • Goktas Y., Gedik N., Baydas O. Enablers and barriers to the use of ICT in primary schools in Turkey: a comparative study of 2005–2011. Comput. Educ. 2013; 68 :211–222. [ Google Scholar ]
  • Guan N., Song J., Li D. On the advantages of computer multimedia-aided English teaching. Procedia Comput. Sci. 2018; 131 :727–732. 2018. [ Google Scholar ]
  • Horsley M., Eliot M., Knight B.A., Reilly R. Springer; Cham, Switzerland: 2014. Current Trends in Eye Tracking Research. [ Google Scholar ]
  • Huang T.C., Chen M.Y., Lin C.Y. Exploring the behavioral patterns transformation of learners in different 3D modeling teaching strategies. Comput. Hum. Behav. 2017; 92 :670–678. 2017. [ Google Scholar ]
  • Hwang W.Y., Wang C.Y., Sharples M. A study of multimedia annotation of Web-based materials. Comput. Educ. 2007; 48 (4):680–699. [ Google Scholar ]
  • Ilhan G.O., Oruc S. Effect of the use of multimedia on students' performance: a case study of social studies class. Educ. Res. Rev. 2016; 11 (8):877–882. [ Google Scholar ]
  • Janda K. Multimedia in political science: sobering lessons from a teaching experiment. J. Educ. Multimedia Hypermedia. 1992; 1 (3):341–354. [ Google Scholar ]
  • Jian-hua S., Hong L. Explore the effective use of multimedia technology in college physics teaching. 2012 International Conference on Future Electr. Power Energy Syst. Explore. 2012; 17 :1897–1900. [ Google Scholar ]
  • Kapi A.Y., Osman N., Ramli R.Z., Taib J.M. Multimedia education tools for effective teaching and learning. J. Telecommun. Electron. Comput. Eng. 2017; 9 (2-8):143–146. [ Google Scholar ]
  • Kaptan F., İzgi Ü. The effect of use concept cartoons attitudes of first grade elementary students towards science and technology course. Procedia Soc. Behav. Sci. 2014; 116 :2307–2311. 2014. [ Google Scholar ]
  • Karel P., Tomas Z. Multimedia teaching aid for students of basics of control theory in Matlab and Simulink. Procedia Eng. 2015; 100 :150–158. 2015. [ Google Scholar ]
  • Keengwe S., Onchwari G., Wachira P. The use of computer toolsto support meaningful learning. Educ. Technol. Rev. 2008; 16 (1):77–92. [ Google Scholar ]
  • Keengwe J., Onchwari G., Wachira P. Computer technology integration and student learning: barriers and promise. J. Sci. Educ. Technol. 2008; 17 :560–565. 2008. [ Google Scholar ]
  • Kelley K., Clark B., Brown V., Sitzia J. Good practice in the conduct and reporting of survey research. Int. J. Qual. Health Care. 2003; 15 (3):261–266. [ PubMed ] [ Google Scholar ]
  • Kennedy G.E., Judd T.S. Expectations and reality: evaluating patterns of learning behaviour using audit trails. Comput. Educ. 2007; 49 (3):840–855. [ Google Scholar ]
  • Kingsley K.V., Boone R. Effects of multimedia software on achievement of middle school students in an American history class. J. Res. Technol. Educ. 2008; 41 (2):203–221. [ Google Scholar ]
  • Kitchenham B., Brereton O.P., Budgen D., Turner M., Bailey J., Linkman S. Systematic literature reviews in software engineering–a systematic literature review. Inf. Software Technol. 2009; 51 (1):7–15. [ Google Scholar ]
  • Krejcie R.V., Morgan D.W. Determining sample size for research activities. Educ. Psychol. Meas. 1970; 30 (3):607–610. [ Google Scholar ]
  • Liberati A., Altman D.G., Tetzlaff J., Mulrow C., Gøtzsche P.C., Ioannidis J.P., Clarke M., Devereaux P.J., Kleijnen J., Moher D. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. Ann. Intern. Med. 2009; 151 (4):65. [ PubMed ] [ Google Scholar ]
  • Maaruf S.Z., Siraj S. The state of technology and the arts-interactive multimedia in enhancing culturally responsive pedagogy. Procedia Soc. Behav. Sci. 2013; 103 :1171–1180. [ Google Scholar ]
  • Manca S., Ranieri M. Facebook and the others.Potentials and obstacles of social media for teaching in higher education. Comput. Educ. 2016; 95 :216–230. [ Google Scholar ]
  • Mayer R.E. Cognitive theory of multimedia learning. Camb. handb. Multimed Learn. 2005; 41 :31–48. [ Google Scholar ]
  • Mayer R.E. Applying the science of learning: evidence-based principles for the design of multimedia instruction. Am. Psychol. 2008; 63 (8):760–769. [ PubMed ] [ Google Scholar ]
  • Miller B.W. Using reading times and eye-movements to measure cognitive engagement. Educ. Psychol. 2015; 50 (1):31–42. [ Google Scholar ]
  • Milovanovic M., Obradovic J., Milajic A. Application of interactive multimedia tools in teaching mathematics--examples of lessons from geometry. Turk. Online J. of Educ. Technol.-TOJET. 2013; 12 (1):19–31. [ Google Scholar ]
  • Moher D., Shamseer L., Clarke M., Ghersi D., Liberati A., Petticrew M., Shekelle P., Stewart L.A. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst. Rev. 2015; 4 (1):1. [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • Molina A.I., Navarro O., Ortega M., Lacruz M. Evaluating multimedia learning materials in primary education using eye tracking. Comput. Stand. Interfac. 2018; 59 :45–60. [ Google Scholar ]
  • Morris L.V., Finnegan C., Wu S.S. Tracking student behavior, persistence, and achievement in online courses. Internet High Educ. 2005; 8 (3):221–231. [ Google Scholar ]
  • Mumtaz S. Factors affecting teachers’ use of information and communications technology: a review of the literature. J. Inf. Technol. Teach. Educ. 2000; 9 (3):319–341. [ Google Scholar ]
  • Nie Y., Zhe Y. On-line classroom visual tracking and quality evaluation by an advanced feature mining technique. Signal Process. Image Commun. 2020; 84 (May):115817. [ Google Scholar ]
  • Ocepek U., Bosnić Z., Šerbec I.N., Rugelj J. Exploring the relation between learning style models and preferred multimedia types. Comput. Educ. 2013; 69 :343–355. 2013. [ Google Scholar ]
  • Pea R.D. Learning through multimedia. IEEE Comput. Grap. Appl. 1991; 11 (4):58–66. [ Google Scholar ]
  • Putra C.A. Utilization of multimedia technology for instructional media. J. ICT Educ. 2018; 5 :1–8. 2018. [ Google Scholar ]
  • Said A., Lin L., Jim P. Barriers to adopting technology for teaching and learning in Oman. Comput. Educ. 2009; 53 :575–590. [ Google Scholar ]
  • Shah I., Khan M. Impact of multimedia-aided teaching on students’ academic achievement and attitude at elementary level. US China Educ. Rev. 2015; 5 (5):349–360. [ Google Scholar ]
  • Shoufan A. Estimating the cognitive value of YouTube's educational videos: a learning analytics approach. Comput. Hum. Behav. 2019; 92 :450–458. [ Google Scholar ]
  • Snoeyink R., Ertmer P. Thrust into technology: how veteran teachers respond. J. Educ. Technol. Syst. 2001; 30 (1):85–111. [ Google Scholar ]
  • Stark L., Brünken R., Park B. Emotional text design in multimedia learning: a mixed-methods study using eye tracking. Comput. Educ. 2018; 120 :185–196. [ Google Scholar ]
  • Taradi S.K., Taradi M., Radic K., Pokrajac N. Blending problem-based learning with Web technology positively impacts student learning outcomes in acid-base physiology. Adv. Physiol. Educ. 2005; 29 (1):35–39. [ PubMed ] [ Google Scholar ]
  • Taylor S., Todd P.A. Understanding information technology usage: a test of competing models. Inf. Syst. Res. 1995; 6 (2):144–176. [ Google Scholar ]
  • Teng J.H., Chan S.Y., Lee J.C., Lee R. Vol. 1. 2000. A LabVIEW based virtual instrument for power analyzers; pp. 179–184. (2000 International Conference on Power System Technology. Proceeding s (Cat.No. 00EX409) ). [ Google Scholar ]
  • Wang C., Fang T., Gu Y. Learning performance and behavioral patterns of online collaborative learning: impact of cognitive load and affordances of different multimedia. Comput. Educ. 2020; 143 :103683. [ Google Scholar ]
  • West J. 2019. Data Collection. https://www.researchconnections.org/childcare/datamethods/survey.jsp Retrieved on 3 Sept 2020 from: [ Google Scholar ]
  • Wu T.T., Chen A.C. Combining e-books with mind mapping in a reciprocal teaching strategy for a classical Chinese course. Comput. Educ. 2018; 116 (2020):64–80. [ Google Scholar ]
  • Yildiz R., Atkins M.J. ERIC No. ED 350 978; 1992. How to Evaluate Multimedia Simulations: Learning from the Past. [ Google Scholar ]
  • Yuen A.H., Ma W.W. Gender differences in teacher computer acceptance. J. Technol. Teach Educ. 2002; 10 (3):365–382. [ Google Scholar ]
  • Zhang F. Significances of multimedia technologies training. Phys. Procedia. 2012; 33 :2005–2010. 2012. [ Google Scholar ]
  • Zin M.Z.M., Sakat A.A., Ahmad N.A., Bhari A. Relationship between the multimedia technology and education in improving learning quality. Procedia Soc. Behav. Sci. 2013; 90 :351–355. 2013. [ Google Scholar ]
  • Zulkifli M.Z., Harun S.W., Thambiratnam K., Ahmad H. Self-calibrating automated characterization system for depressed cladding EDFA applications using LabVIEW software with GPIB. IEEE Trans. Instrument. Meas. 2008; 57 (11):2677–2681. [ Google Scholar ]

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings
  • My Bibliography
  • Collections
  • Citation manager

Save citation to file

Email citation, add to collections.

  • Create a new collection
  • Add to an existing collection

Add to My Bibliography

Your saved search, create a file for external citation management software, your rss feed.

  • Search in PubMed
  • Search in NLM Catalog
  • Add to Search

Eye-tracking technology in medical education: A systematic review

Affiliations.

  • 1 a Department of Surgery and Cancer , Imperial College, St Mary's Hospital , London , UK.
  • 2 b Frimley NHS Trust , Surrey , UK.
  • PMID: 29172823
  • DOI: 10.1080/0142159X.2017.1391373

Background: Eye-tracking technology is an established research tool within allied industries such as advertising, psychology and aerospace. This review aims to consolidate literature describing the evidence for use of eye-tracking as an adjunct to traditional teaching methods in medical education.

Methods: A systematic literature review was conducted in line with STORIES guidelines. A search of EMBASE, OVID MEDLINE, PsycINFO, TRIP database, and Science Direct was conducted until January 2017. Studies describing the use of eye-tracking in the training, assessment, and feedback of clinicians were included in the review.

Results: Thirty-three studies were included in the final qualitative synthesis. Three studies were based on the use of gaze training, three studies on the changes in gaze behavior during the learning curve, 17 studies on clinical assessment and six studies focused on the use of eye-tracking methodology as a feedback tool. The studies demonstrated feasibility and validity in the use of eye-tracking as a training and assessment method.

Conclusions: Overall, eye-tracking methodology has contributed significantly to the training, assessment, and feedback practices used in the clinical setting. The technology provides reliable quantitative data, which can be interpreted to give an indication of clinical skill, provide training solutions and aid in feedback and reflection. This review provides a detailed summary of evidence relating to eye-tracking methodology and its uses as a training method, changes in visual gaze behavior during the learning curve, eye-tracking methodology for proficiency assessment and its uses as a feedback tool.

PubMed Disclaimer

Similar articles

  • Eye tracking for skills assessment and training: a systematic review. Tien T, Pucher PH, Sodergren MH, Sriskandarajah K, Yang GZ, Darzi A. Tien T, et al. J Surg Res. 2014 Sep;191(1):169-78. doi: 10.1016/j.jss.2014.04.032. Epub 2014 Apr 24. J Surg Res. 2014. PMID: 24881471 Review.
  • Eye-Tracking Technology in Surgical Training. Merali N, Veeramootoo D, Singh S. Merali N, et al. J Invest Surg. 2019 Nov;32(7):587-593. doi: 10.1080/08941939.2017.1404663. Epub 2017 Dec 18. J Invest Surg. 2019. PMID: 29252051 Review.
  • How visual search relates to visual diagnostic performance: a narrative systematic review of eye-tracking research in radiology. van der Gijp A, Ravesloot CJ, Jarodzka H, van der Schaaf MF, van der Schaaf IC, van Schaik JPJ, Ten Cate TJ. van der Gijp A, et al. Adv Health Sci Educ Theory Pract. 2017 Aug;22(3):765-787. doi: 10.1007/s10459-016-9698-1. Epub 2016 Jul 19. Adv Health Sci Educ Theory Pract. 2017. PMID: 27436353 Free PMC article. Review.
  • Eye-tracking in surgery: a systematic review. Bapna T, Valles J, Leng S, Pacilli M, Nataraja RM. Bapna T, et al. ANZ J Surg. 2023 Nov;93(11):2600-2608. doi: 10.1111/ans.18686. Epub 2023 Sep 5. ANZ J Surg. 2023. PMID: 37668263 Review.
  • Student and educator experiences of maternal-child simulation-based learning: a systematic review of qualitative evidence protocol. MacKinnon K, Marcellus L, Rivers J, Gordon C, Ryan M, Butcher D. MacKinnon K, et al. JBI Database System Rev Implement Rep. 2015 Jan;13(1):14-26. doi: 10.11124/jbisrir-2015-1694. JBI Database System Rev Implement Rep. 2015. PMID: 26447004
  • Analyzing Eye Paths Using Fractals. Newport RA, Liu S, Di Ieva A. Newport RA, et al. Adv Neurobiol. 2024;36:827-848. doi: 10.1007/978-3-031-47606-8_42. Adv Neurobiol. 2024. PMID: 38468066
  • Shifting Perspectives: A proposed framework for analyzing head-mounted eye-tracking data with dynamic areas of interest and dynamic scenes. Tzamaras HM, Wu HL, Moore JZ, Miller SR. Tzamaras HM, et al. Proc Hum Factors Ergon Soc Annu Meet. 2023 Sep;67(1):953-958. doi: 10.1177/21695067231192929. Epub 2023 Oct 25. Proc Hum Factors Ergon Soc Annu Meet. 2023. PMID: 38450120
  • Application of eye-tracking in nursing research: A scoping review. Hu H, Li H, Wang B, Zhang M, Wu B, Wu X. Hu H, et al. Nurs Open. 2024 Feb;11(2):e2108. doi: 10.1002/nop2.2108. Nurs Open. 2024. PMID: 38391099 Free PMC article. Review.
  • Artificial Intelligence developments in medical education: a conceptual and practical framework. Masters K. Masters K. MedEdPublish (2016). 2020 Oct 26;9:239. doi: 10.15694/mep.2020.000239.1. eCollection 2020. MedEdPublish (2016). 2020. PMID: 38058891 Free PMC article.
  • Nontechnically speaking: A review of tools and methods in the teaching and assessment of nontechnical skills in emergency medicine training. Brevik C, Miller D, Kendall J, Michael S. Brevik C, et al. AEM Educ Train. 2023 Nov 12;7(6):e10911. doi: 10.1002/aet2.10911. eCollection 2023 Dec. AEM Educ Train. 2023. PMID: 37974662 No abstract available.

Publication types

  • Search in MeSH

LinkOut - more resources

Full text sources.

  • Spiral, Imperial College Digital Repository
  • Taylor & Francis

Other Literature Sources

  • figshare - Data
  • scite Smart Citations

full text provider logo

  • Citation Manager

NCBI Literature Resources

MeSH PMC Bookshelf Disclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • View all journals
  • Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • Open access
  • Published: 06 August 2024

Development of an eye-tracking system based on a deep learning model to assess executive function in patients with mental illnesses

  • Minah Kim 1 , 2   na1 ,
  • Jungha Lee 3   na1 ,
  • Soo Yong Lee 4 ,
  • Minji Ha 3 ,
  • Inkyung Park 3 ,
  • Jiseon Jang 1 ,
  • Moonyoung Jang 1 , 2 ,
  • Sunghyun Park 1 &
  • Jun Soo Kwon 1 , 2 , 3 , 5  

Scientific Reports volume  14 , Article number:  18186 ( 2024 ) Cite this article

155 Accesses

Metrics details

  • Neuroscience

Patients with mental illnesses, particularly psychosis and obsessive‒compulsive disorder (OCD), frequently exhibit deficits in executive function and visuospatial memory. Traditional assessments, such as the Rey‒Osterrieth Complex Figure Test (RCFT), performed in clinical settings require time and effort. This study aimed to develop a deep learning model using the RCFT and based on eye tracking to detect impaired executive function during visuospatial memory encoding in patients with mental illnesses. In 96 patients with first-episode psychosis, 49 with clinical high risk for psychosis, 104 with OCD, and 159 healthy controls, eye movements were recorded during a 3-min RCFT figure memorization task, and organization and immediate recall scores were obtained. These scores, along with the fixation points indicating eye-focused locations in the figure, were used to train a Long Short-Term Memory + Attention model for detecting impaired executive function and visuospatial memory. The model distinguished between normal and impaired executive function, with an F 1 score of 83.5%, and identified visuospatial memory deficits, with an F 1 score of 80.7%, regardless of psychiatric diagnosis. These findings suggest that this eye tracking-based deep learning model can directly and rapidly identify impaired executive function during visuospatial memory encoding, with potential applications in various psychiatric and neurological disorders.

Similar content being viewed by others

a systematic review of eye tracking research on multimedia learning

Eye movement as a biomarker of impaired organizational strategies during visual memory encoding in obsessive–compulsive disorder

a systematic review of eye tracking research on multimedia learning

Eye-tracking indices of impaired encoding of visual short-term memory in familial Alzheimer’s disease

a systematic review of eye tracking research on multimedia learning

Unveiling Trail Making Test: visual and manual trajectories indexing multiple executive processes

Introduction.

Psychiatric disorders commonly manifest from underlying brain dysfunctions, often resulting in cognitive deficits across various neuropsychological domains 1 . The Rey‒Osterrieth Complex Figure Test (RCFT) has been used as a neuropsychological measure in clinical and research settings to evaluate visuospatial memory and executive function, such as organizational strategy and planning 2 . Notably, impairment of these functions is commonly reported in both patients with psychotic disorders and those with obsessive‒compulsive disorder (OCD) 3 , 4 , 5 , 6 , 7 , 8 . Both patient groups often exhibit significantly lower RCFT immediate recall scores and organization scores than healthy controls (HCs), stemming from executive function deficits during visuospatial memory encoding 3 , 5 , 7 , suggesting that these deficits could serve as transdiagnostic markers across these disorders. These cognitive deficits are also often observed in patients with various psychiatric and neurological disorders, such as bipolar disorder, Alzheimer’s disease, Parkinson’s disease, closed head injury, autism, epilepsy, encephalitis, multiple sclerosis, and ischemic stroke, who exhibit brain dysfunctions 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 .

Although the well-established conventional RCFT is a valuable tool for assessing cognitive deficits across various psychiatric and neurological disorders, it has several limitations. The RCFT indirectly assesses executive function through a drawing task, which makes it challenging to link observable behavioral-level phenotypes, such as poor drawing, to underlying brain dysfunction. The RCFT can also be affected by a patient’s visuomotor function 17 ; in addition, the administration and scoring systems of the RCFT are time-consuming, labor-intensive, complex, and subject to scoring variability among clinicians owing to human biases 18 , 19 .

Considerable efforts have been made to overcome the limitations of the RCFT scoring system, including the development of an automated scoring system using photos of RCFT drawings and a deep learning algorithm 20 , 21 , 22 , 23 , the implementation of a tablet-based digital drawing assessment 24 , and the adoption of a simpler method for scoring organizational strategies (0 or 1 points) 25 . Although these approaches have made substantial advancements in addressing the complexity, labor intensity, and scoring variability of the scoring system, there are still limitations in its administration given that it is a time-consuming, visuomotor function-affected, indirect drawing test.

To address the remaining limitations, a previous study from our laboratory successfully identified an eye movement biomarker that can be used to rapidly and directly assess impaired organizational strategy during the RCFT in patients with OCD 26 . However, the application of the results of that study to other psychiatric and neurological disorders is limited because the biomarker relies solely on calculations of eye gaze distribution and is based on the assumption that patients with OCD exhibit weak central coherence, focusing on narrow details of the RCFT figure. Thus, there is a need to develop a data-driven deep learning-based assessment model that is not constrained by a single disease characteristic and exhibits increased speed, simplicity, and directness and to extend its applicability to a broader range of psychiatric and neurological disorders. The development of this model would also be consistent with recent interest in assessing cognitive functions such as visual memory and attention in patients with mental illnesses using eye tracking, as eye movements provide real-time insights into the cognitive activities involved in how gaze interacts with visuospatial stimuli during the visual encoding process 27 , 28 , 29 , 30 , 31 , 32 .

Therefore, this study aimed to establish an assessment model using eye tracking and deep learning in patients with early psychosis, including those with first-episode psychosis (FEP), patients at clinical high risk (CHR) for psychosis, patients with OCD and HCs. The primary aim was to explore the effectiveness of the model as a rapid, simple, and direct assessment of impaired executive function in these patients. FEP and OCD patients were selected because of their notable impairments in executive function and visuospatial memory 3 , 5 , whereas CHR patients were selected because they included both patients with early psychosis and those with nonspecific psychiatric symptoms, such as depression and anxiety 33 . We hypothesized that the RCFT assessment model developed in this study would be able to distinguish normal and impaired executive function as well as identify visuospatial memory impairment on the basis of eye movements during the memorization of the RCFT figure, regardless of the specific psychiatric diagnosis, with increased speed, simplicity, and directness.

Participant characteristics

The demographic and clinical characteristics of the participants in each diagnostic group are summarized in Table 1 . The demographic characteristics of the participants, stratified by normal or impaired executive function and normal or impaired visuospatial memory, are summarized in Table 2 . The participants with normal executive function, as measured by the organization T score, and normal visuospatial memory, as measured by the immediate recall T score, had a greater intelligence quotient (IQ) than did the subjects with impaired executive function (t = 2.801, p  = 0.013) and impaired visuospatial memory (t = 6.832, p  =  < 0.001), respectively. There were more females than males with impaired executive function than with normal executive function (χ 2  = 4.620, p  = 0.032). Age, years of education, handedness and the proportion of participants wearing glasses were not different between the groups with normal and impaired executive function or visuospatial memory.

RCFT performance and long short-term memory (LSTM) + Attention model results

Analysis of covariance (ANCOVA) with IQ as a covariate revealed that participants with normal executive function had higher organization T scores (F = 249.031, p  < 0.001) than did participants with impaired executive function. ANCOVA using sex and IQ as covariates revealed that the participants with normal executive function had higher total organization scores (F = 50.382, p  < 0.001), fragmentation scores (F = 4.301, p  = 0.039), and planning scores (F = 102.373, p  < 0.001) than did the participants with impaired executive function. Similarly, ANCOVA with IQ as a covariate revealed that participants with normal visuospatial memory had higher immediate recall T scores (F = 87.960, p  < 0.001), total immediate recall scores (F = 200.415, p  < 0.001), configural presence scores (F = 242.611, p  < 0.001), configural accuracy scores (F = 70.500, p  < 0.001), cluster presence scores (F = 89.203, p  < 0.001), cluster accuracy scores (F = 52.323, p  < 0.001), and detail presence scores (F = 39.270, p  < 0.001) than did participants with impaired visuospatial memory. There was no difference in immediate recall time (F = 2.073, p  = 0.151) between participants with normal and impaired visuospatial memory. The ANCOVA results for RCFT performance between HCs and patients with normal and impaired executive function and visuospatial memory are described in Table S1 in the Supplementary material.

Table 3 shows the deep learning performance results of the LSTM + Attention model 34 , 35 for the two classifications. The model achieved F 1 scores of 83.5 and 80.7% and area under the receiver operating characteristic curve (AUROC) values of 60.7% (Fig.  1 a) and 69.9% (Fig.  1 b) for distinguishing between normal and impaired executive function and between normal and impaired visuospatial memory, respectively.

figure 1

Gaze fixation sequence map showing the order of eye movements during the 3-min memorization of the Rey‒Osterrieth Complex Figure Test (RCFT) figure. ( a ) Receiver operating characteristic (ROC) curve of the Long Short-Term Memory (LSTM) + Attention model classification for normal and impaired executive function. ( b ) ROC curve of the LSTM + Attention model classification for normal and impaired visuospatial memory. ( c ) The order of gaze fixation in the patient with the lowest organization T score (i.e., < 20). ( d ) The order of gaze fixation in the patient with the highest organization T score (i.e., 70). The numbers within the circles indicate the order of gaze fixation, starting from 0. Abbreviation: AUROC, area under the ROC curve.

Model interpretation results

Eye gaze sequence maps are displayed in Fig.  1 c and d. The order of gaze fixation was concentrated in a narrow area and horizontally distributed, with a low number of fixations in the patient with the lowest organization T score (i.e., < 20; Fig.  1 c). Conversely, the order of gaze fixation was widely and evenly distributed across the figure in the patient with the highest organization T score (i.e., 70; Fig.  1 d). The eye movement comparison results between the normal and impaired groups are provided in Tables 4 and 5 .

This study aimed to develop an eye-tracking and deep learning-based RCFT assessment model for evaluating impaired executive function during visuospatial memory encoding in the RCFT that is faster, simpler, and more direct. The model achieved high performance in assessing impairment in early psychosis and OCD patients on the basis of their sequential eye movements while they were memorizing the RCFT figure. This assessment is performed regardless of specific psychiatric diagnoses, as this impairment is shared across these disorders. These results indicate that eye movements during the encoding of highly complex figures reflect executive function during visuospatial memory encoding, serving as a transdiagnostic biomarker of impairment in early psychosis and OCD. Additionally, since the assessment model in this study utilized a data-driven deep learning technique that does not require handcrafted feature selection according to specific disease hypotheses, there is the potential to extend the use of this model beyond early psychosis and OCD to other psychiatric and neurological disorders that also exhibit impaired executive function, poor performance in the RCFT in the form of disorganized and fragmented drawings, and difficulties in visuospatial integration 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 .

The model also offered a rapid and simple measure of impaired executive function, taking only 3 min from administration to assessment through computerized eye movement recording and deep learning analysis. This process is considerably faster and simpler than the traditional RCFT scoring system and the automated scoring systems reported in previous studies, which still require a prior drawing process 20 , 21 , 22 , 23 . This improved model is beneficial and easy to apply in real-world clinical and research settings, saving a significant amount of labor and time and reducing human scoring variability. Moreover, the model enabled a more direct assessment, as sequential eye movements reflected real-time visuospatial information processing 36 and indicated how subjects strategically encoded, planned, and organized the figure. Thus, eye-tracking measurements can bridge the gap between the behavioral-level phenotype and brain dysfunction by capturing the inner workings of executive function during visuospatial memory encoding.

In the interpretation of the results of the deep learning model, the gaze fixation sequence maps in Fig.  1 c and d show distinct eye movement patterns between patients with the lowest and highest organization scores. In the patient with the lowest organization score, exploration of the figure was limited, and the patient lacked a strategy and focused simply on horizontal movements without attending to important areas. In contrast, in the patient with the highest organization score, exploration was more structured and extensive, capturing a larger picture and focusing sequentially on various critical areas. In addition, quantitative differences in eye movements revealed that participants with impaired executive function and visuospatial memory spent less time looking at the figure and exhibited fewer fixations and saccades than the participants in the normal group did, indicating less effective and comprehensive encoding of the visuospatial information within the figure (Tables 4 and 5 ). These ineffective, disorganized, and limited eye movement patterns in the impaired group seem to be consistent with previous research findings that patients with executive function deficits have difficulties processing the overall RCFT figure and utilize a fragmented and piecemeal approach 3 , 5 , 7 . Overall, these differences in eye movement patterns between the impaired and normal groups may have contributed to the ability of the deep learning model to distinguish between them.

There are several limitations in this study. First, this assessment model was initially developed using data from individuals with early psychosis and OCD and classified their functions into only normal and impaired, limiting its applicability to individuals with other psychiatric and neurological disorders. Additionally, the impairment criteria in this study were stringent, making it challenging to detect patients with mild impairment. Nonetheless, this model has the potential to expand beyond binary classification and include a wider range of psychiatric and neurological disorders. Second, the small number of participants in the impaired group resulted in highly imbalanced datasets, which limits the validity and reliability of the assessment model in this study. To address this, the split dataset was stratified to maintain class label proportions consistent with those of the original dataset, and synthetic minority oversampling technique (SMOTE) 37 data augmentation was implemented, as in previous studies with similarly imbalanced datasets 38 , 39 , 40 . However, our results should be interpreted with caution because of the relatively small and imbalanced sample size of the original dataset. Third, most patients were taking medication at the time of the eye-tracking RCFT. Therefore, it is necessary to consider the medication effect when interpreting the study results, as this study did not investigate the impact of medication on patients' RCFT performance or eye movement markers. However, given that the assessment model aims to encompass various psychiatric and neurological disorders in future research, these findings remain promising, as they indicate effectiveness of the model even in the presence of potential influences from medication. Fourth, there was a significant difference in IQ between the normal and impaired groups. Although the statistical group comparison was conducted with IQ as a covariate, the deep learning model, LSTM + Attention, does not account for or exclude the potential impact of cognitive function on eye movement markers in its classification.

Although the RCFT is a well-established tool for evaluating executive function during visuospatial memory encoding, its administration and scoring pose difficulties because of its time-consuming nature, indirect measurement, and scoring variability. While a previous study from our laboratory identified an eye movement biomarker to detect impaired executive function with enhanced speed and directness, its utility was limited to OCD because it was based on OCD-specific characteristics. Therefore, we developed an RCFT assessment model that integrated eye tracking and deep learning, which not only offered a more direct, rapid, and simplified evaluation of impaired executive function but also demonstrated the potential for wider applicability to other disorders, as it was data driven and did not rely on singular disease hypotheses. Future studies could benefit from including various psychiatric and neurological disorders and utilizing explainable artificial intelligence to identify key features distinguishing between individuals with normal and impaired executive function during visuospatial memory encoding.

Participants

We analyzed data from 408 participants, including 96 patients with FEP, 49 patients at CHR for psychosis, 104 patients with OCD, and 159 HCs. FEP patients and CHR individuals were recruited from both the inpatient and outpatient clinics of the Department of Neuropsychiatry and the Seoul Youth Clinic ( www.youthclinic.org ) at Seoul National University Hospital (SNUH). In this study, the FEP patients included individuals who were diagnosed with schizophrenia, schizoaffective disorder, or schizophreniform disorder according to the Structured Clinical Interview for the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Axis I Disorders (SCID-I) with an illness duration of less than 2 years. Psychotic symptoms were assessed using the Positive and Negative Syndrome Scale (PANSS). To confirm the CHR status of the participants, the Structured Interview for Prodromal Symptoms (SIPS) 41 was used. Prodromal symptoms were assessed using the validated Korean version of the Scale of Prodromal Symptoms (SOPS) 41 , 42 . Patients with OCD were recruited from the OCD clinic ( www.ocd.snu.ac.kr ) at SNUH and fulfilled the Diagnostic and Statistical Manual of Mental Disorders-IV (DSM-IV) criteria for OCD. The Yale‒Brown Obsessive Compulsive Scale (Y-BOCS) 43 was used to evaluate the severity of OCD. The Hamilton Rating Scale for Depression (HAM-D) 44 and the Hamilton Rating Scale for Anxiety (HAM-A) 45 were used to evaluate the severity of depressive and anxious symptoms, respectively. The HCs were recruited using internet advertisements. To screen for the presence of psychiatric disorders or symptoms, the HCs underwent an assessment employing the Structured Clinical Interview for DSM-IV Non-Patient Version (SCID-NP). HCs with a past or current axis I diagnosis or first- to third-degree biological relatives with a history of psychotic disorders were not eligible to participate in this study. We assessed participants' IQ using the Korean version of the Wechsler Adult Intelligence Scale (K-WAIS) 46 .

All participants were assessed according to the exclusion criteria, which included the presence of neurological conditions, significant head injuries, substance abuse or dependency (except for nicotine), and intellectual disability (IQ < 70). We provided thorough explanations of the research procedures and obtained written informed consent from all participants (IRB no. H-1110-009-380, H-1201-008-392). For participants younger than 18, consent was also obtained from their parents. This study adhered to the principles outlined in the Declaration of Helsinki (2013) and received approval from the Institutional Review Board of SNUH (IRB No. H-2306-210-1445).

Eye movement data acquisition and the RCFT

Eye movement data were obtained during an eye-tracking experiment in which participants were instructed to view and memorize the RCFT figure for a duration of 3 min (Fig.  2 ), as detailed by Kim et al. 26 . Briefly, the RCFT figure was presented on a 19-inch monitor with a screen resolution of 1280 × 1024 pixels using Experiment Builder v.2.1.45 software (SR Research, Ottawa, Ontario, Canada). During the experiment, the participant's head was positioned on a chin rest in a room with low lighting. The distance between the chin rest and the monitor was 70 cm, and the participant had a horizontal viewing angle of 22° and a vertical viewing angle of 17°. Before eye movements were measured, a nine-point calibration and verification process was conducted. The data were collected at a 1,000-Hz sampling rate and exported through the EyeLink 1000 (SR Research) eye tracking device.

figure 2

Eye tracking-based Rey‒Osterrieth Complex Figure Test (RCFT) procedure used in this study. Memorization of an RCFT figure for 3 min was followed by immediate recall of the figure.

The key parameters collected were gaze fixation point coordinates, indicating where the eyes briefly paused to focus and acquire new information 47 , and time in milliseconds. Upon the completion of the eye-tracking session, the participants were instructed to reproduce the RCFT figure from memory, akin to the immediate recall condition of the RCFT. During this drawing task, response times were recorded, and an experimenter systematically tracked the participant’s reproduction of the figure. This meticulous monitoring aimed to assess organizational strategies as a substitution for the RCFT copy condition. A skilled evaluator manually assessed the participants’ drawings using the Boston Qualitative Scoring System (BQSS) 48 . This assessment aimed to evaluate the participants' organizational and immediate recall performance in the RCFT. The organization and immediate recall scores were subsequently categorized according to the BQSS clinical interpretation criteria as normal (score: 40–70) or impaired (score: < 39). The participants were grouped as follows: a normal executive function group with normal organization scores, a normal visuospatial memory group with normal immediate recall scores, an impaired executive function group with impaired organization scores and an impaired visuospatial memory group with impaired immediate recall scores.

LSTM + attention model

In this study, we utilized an LSTM model using Python and PyTorch 49 to analyze time series eye movement fixations during the memorization of the RCFT figure. The effectiveness of the LSTM model in handling sequential data was a key factor in its selection, especially since sequential temporal relationships might play a significant role in participants' effective memorization and organization of the RCFT figure. Additionally, the LSTM model was combined with an attention mechanism. Time series fixation sequences were input recursively into the model, facilitating the learning of patterns and relationships within sequential eye movement fixations. The acquired representations at each timestamp were summed by attention coefficients to obtain the final sequence representation. The resulting representations were fed into a single-layer classifier to determine the probabilities of sequences belonging to a specific class (normal or impaired). The model was trained with a sequence size of 32 fixation points, and the learning rate was set to α = 0.005 over the training course. The fixation dataset was split 70/30 into training and testing sets, and the split was stratified to preserve class label proportions similar to those of the original dataset. The evaluation metrics used in the LSTM + Attention model included recall (sensitivity), precision, AUROC, and F 1 score. The AUROC and F 1 score were utilized to determine the accuracy of the model in highly imbalanced datasets, as in this study. The overall modeling workflow is described in Fig.  3 .

figure 3

Overall workflow of modeling in this study. Abbreviations: Org, organization; IR, immediate recall; SMOTE, synthetic minority oversampling technique.

Data augmentation

In this study, the dataset was highly imbalanced, with a significant disparity between the majority class (e.g., 385 participants in the normal group) and the minority class (e.g., 23 participants in the impaired group). Imbalanced datasets cause problems for learning algorithms that expect an even distribution across classes, leading to bias favoring the majority class 50 . To address this, data augmentation is commonly employed to achieve an ideal balance, e.g., a 50:50 ratio, by artificially expanding the training dataset for enhanced reliability. Thus, the minority class (impaired group) in our training datasets was oversampled using SMOTE. The imbalance was maintained in the test dataset to represent the real-world distribution.

Statistical analysis

SPSS v.26.0 (IBM, Armonk, NY, USA) was used for the statistical analyses, and the significance level was set at p  < 0.05. Comparisons of demographic and clinical characteristics across groups were performed using independent t tests or Welch's t tests if the variances were not equal for continuous variables and chi-square tests for categorical variables. Group comparisons of RCFT scores were performed using ANCOVA with IQ or IQ and sex as covariates.

Model interpretation

Owing to the limited interpretability of the decision-making process of the LSTM + Attention model, additional visual interpretations and statistical analyses were conducted to interpret the results of the model in this study. First, a gaze fixation sequence map was created to explore participants' visuospatial information processing and organization of the RCFT figure in sequence. Second, eye movement measures, including the number of fixations, average duration of fixation (ms), average saccade amplitude and duration (ms), and number of blinks and saccades, were compared between the normal and impaired groups to identify quantitative differences in eye movements. Saccades refer to rapid eye movements between fixations.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Keefe, R. S. The contribution of neuropsychology to psychiatry. Am. J. Psychiatry 152 , 6–15 (1995).

Article   CAS   PubMed   Google Scholar  

Rey, A. L’examen psychologique dans les cas d’encephalopathie traumatique. Arch. Psychol. 28 , 215–285 (1941).

Google Scholar  

Kim, M. S., Namgoong, Y. & Youn, T. Effect of organizational strategy on visual memory in patients with schizophrenia. Psychiatry Clin. Neurosci. 62 , 427–434 (2008).

Article   PubMed   Google Scholar  

Penadés, R., Catalán, R., Andrés, S., Salamero, M. & Gastó, C. Executive function and nonverbal memory in obsessive-compulsive disorder. Psychiatry Res. 133 , 81–90 (2005).

Savage, C. R. et al. Organizational strategies mediate nonverbal memory impairment in obsessive–compulsive disorder. Biol. Psychiatry 45 , 905–916 (1999).

Scala, S. et al. Verbal and visual–spatial memory impairment in youth at familial risk for schizophrenia or affective psychosis: A pilot study. Schizophr. Res. 144 , 122–128 (2013).

Article   PubMed   PubMed Central   Google Scholar  

Seidman, L. J., Lanca, M., Kremen, W. S., Faraone, S. V. & Tsuang, M. T. Organizational and visual memory deficits in schizophrenia and bipolar psychoses using the Rey-Osterrieth complex figure: Effects of duration of illness. J. Clin. Exp. Neuropsychol. 25 , 949–964 (2003).

Shin, N. Y., Lee, T. Y., Kim, E. & Kwon, J. S. Cognitive functioning in obsessive-compulsive disorder: A meta-analysis. Psychol. Med. 44 , 1121–1130 (2014).

Bigler, E. D., Rosa, L., Schultz, F., Hall, S. & Harris, J. Rey-Auditory verbal learning and Rey-Osterrieth complex figure design performance in Alzheimer’s disease and closed head injury. J. Clin. Psychol. 45 , 277–280 (1989).

Cardillo, R., Lievore, R. & Mammarella, I. C. Do children with and without autism spectrum disorder use different visuospatial processing skills to perform the Rey-Osterrieth complex figure test?. Autism Res. 15 , 1311–1323 (2022).

Correa-Ghisays, P. et al. Visual memory dysfunction as a neurocognitive endophenotype in bipolar disorder patients and their unaffected relatives. Evidence from a 5-year follow-up Valencia study. J. Affect. Disord. 257 , 31–37 (2019).

Dimitrov, I. et al. Application of the Rey-Osterrieth complex figure test for assessment of cognitive impairment in multiple sclerosis. Scr. Sci. Med. 47 , 59–64 (2015).

Finke, C. et al. Evaluation of cognitive deficits and structural hippocampal damage in encephalitis with leucine-rich, glioma-inactivated 1 antibodies. JAMA Neurol. 74 , 50–59 (2017).

Grossman, M. et al. Visual construction impairments in Parkinson’s disease. Neuropsychology 7 , 536–547 (1993).

Article   Google Scholar  

Hernandez, M. T. et al. Attention, memory, and behavioral adjustment in children with frontal lobe epilepsy. Epilepsy Behav. 4 , 522–536 (2003).

Schaapsmeerders, P. et al. Long-term cognitive impairment after first-ever ischemic stroke in young adults. Stroke 44 , 1621–1628 (2013).

VanGilder, J. L., Lohse, K. R., Duff, K., Wang, P. & Schaefer, S. Y. Evidence for associations between Rey-Osterrieth complex figure test and motor skill learning in older adults. Acta Psychol. 214 , 103261 (2021).

Fastenau, P. S., Bennett, J. M. & Denburg, N. L. Application of psychometric standards to scoring system evaluation: Is “new” necessarily “improved”?. J. Clin. Exp. Neuropsychol. 18 , 462–472 (1996).

Tupler, L. A., Welsh, K. A., Asare-Aboagye, Y. & Dawson, D. V. Reliability of the Rey-Osterrieth complex figure in use with memory-impaired patients. J. Clin. Exp. Neuropsychol. 17 , 566–579 (1995).

Langer, N. et al. The AI neuropsychologist: Automatic scoring of memory deficits with deep learning. bioRxiv . https://doi.org/10.1101/2022.06.15.496291 (2022).

Park, J. Y., Seo, E. H., Yoon, H. J., Won, S. & Lee, K. H. Automating Rey complex figure test scoring using a deep learning-based approach: A potential large-scale screening tool for cognitive decline. Alzheimers Res. Ther. 15 , 145 (2023).

Vogt, J. et al. Automated scoring of the Rey-Osterrieth complex figure test using a deep-learning algorithm. Arch. Clin. Neuropsychol. 34 , 836–836 (2019).

Youn, Y. C. et al. Use of the clock drawing test and the Rey-Osterrieth complex figure test-copy with convolutional neural networks to predict cognitive impairment. Alzheimers Res. Ther. 13 , 85 (2021).

Article   CAS   PubMed   PubMed Central   Google Scholar  

Petilli, M. A., Daini, R., Saibene, F. L. & Rabuffetti, M. Automated scoring for a Tablet-based Rey Figure copy task differentiates constructional, organisational, and motor abilities. Sci. Rep. 11 , 14895 (2021).

Article   ADS   CAS   PubMed   PubMed Central   Google Scholar  

Smith, S. R. et al. Psychometrics of a simple method for scoring organizational approach on the Rey-Osterrieth complex figure. J. Neuropsychol. 1 , 39–51 (2007).

Kim, M. et al. Eye movement as a biomarker of impaired organizational strategies during visual memory encoding in obsessive–compulsive disorder. Sci. Rep. 11 , 18402 (2021).

Yi, L. et al. Children with autism spectrum disorder scan own-race faces differently from other-race faces. J. Exp. Child Psychol. 141 , 177–186 (2016).

Bradley, M. C. et al. Obsessive–compulsive symptoms and attentional bias: An eye-tracking methodology. J. Behav. Ther. Exp. Psychiatry 50 , 303–308 (2016).

Haque, R. U. et al. VisMET: A passive, efficient, and sensitive assessment of visuospatial memory in healthy aging, mild cognitive impairment, and Alzheimer’s disease. Learn. Mem. 26 , 93–100 (2019).

Vail, A. K. et al. Visual attention in schizophrenia: Eye contact and gaze aversion during clinical interactions. In 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII) 490–497 (IEEE, 2017).

Navalón, P. et al. Attentional biases to emotional scenes in schizophrenia: An eye-tracking study. Biol. Psychol. 160 , 108045 (2021).

Pavisic, I. M. et al. Eye-tracking indices of impaired encoding of visual short-term memory in familial Alzheimer’s disease. Sci. Rep. 11 , 8696 (2021).

Addington, J. et al. Comorbid diagnoses for youth at clinical high risk of psychosis. Schizophr. Res. 190 , 90–95 (2017).

Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9 , 1735–1780 (1997).

Mnih, V., Heess, N., Graves, A. & Kavukcuoglu, K. Recurrent models of visual attention. In NIPS'14: Proceedings of the 27th International Conference on Neural Information Processing Systems 2204–2212 (MIT Press, 2014).

Wolf, K. et al. Neurocognitive development of the resolution of selective visuo-spatial attention: Functional MRI evidence from object tracking. Front. Psychol. 9 , 1106 (2018).

Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 16 , 321–357 (2002).

Ahmed, Z. A. T. et al. Applying eye tracking with deep learning techniques for early-stage detection of autism spectrum disorders. Data 8 , 168 (2023).

Priyadarshini, I. Autism screening in toddlers and adults using deep learning and fair AI techniques. Future Internet 15 , 292 (2023).

Sims, S. & Conati, C. A neural architecture for detecting confusion in eye-tracking data. arXiv preprint arXiv:2003.06434 (2020).

Miller, T. J. et al. Prodromal assessment with the structured interview for prodromal syndromes and the scale of prodromal symptoms: Predictive validity, interrater reliability, and training to reliability. Schizophr. Bull. 29 , 703–715 (2003).

Jung, M. H. et al. The reliability and validity of the Korean version of the structured interview for prodromal syndrome. Psychiatry Investig. 7 , 257–263 (2010).

Goodman, W. K. et al. The Yale-Brown obsessive compulsive scale. I. Development, use, and reliability. Arch. Gen. Psychiatry 46 , 1006–1011 (1989).

Hamilton, M. A rating scale for depression. J. Neurol. Neurosurg. Psychiatry 23 , 56–62 (1960).

Hamilton, M. The assessment of anxiety states by rating. Br. J. Med. Psychol. 32 , 50–55 (1959).

Kim, Z. Two-and four-subtest short forms of the Korean-Weschler Adult Intelligence. Scale Seoul J. Psychiatry 19 , 121–126 (1994).

Blignaut, P. Fixation identification: The optimum threshold for a dispersion algorithm. Atten. Percept. Psychophys. 71 , 881–895 (2009).

Somerville, J., Tremont, G. & Stern, R. A. The Boston qualitative scoring system as a measure of executive functioning in Rey-Osterrieth complex figure performance. J. Clin. Exp. Neuropsychol. 22 , 613–621 (2000).

Paszke, A. et al. PyTorch: An imperative style, high-performance deep learning library. In Proceedings of the 33rd International Conference on Neural Information Processing Systems 8026–8037 (Curran Associates Inc., 2019).

Fotouhi, S., Asadi, S. & Kattan, M. W. A comprehensive data level analysis for cancer diagnosis on imbalanced data. J. Biomed. Inform. 90 , 103089 (2019).

Download references

Acknowledgements

This research was supported by the Bio & Medical Technology Development Program, the Brain Science Convergence Research Program through the National Research Foundation of Korea (NRF) and the KBRI basic research program through the Korea Brain Research Institute, funded by the Ministry of Science & ICT (2021M3A9E408078412, RS-2023-00266120, and 21-BR-03-01).

Author information

These authors contributed equally: Minah Kim and Jungha Lee.

Authors and Affiliations

Department of Neuropsychiatry, Seoul National University Hospital, Seoul, Republic of Korea

Minah Kim, Jiseon Jang, Moonyoung Jang, Sunghyun Park & Jun Soo Kwon

Department of Psychiatry, Seoul National University College of Medicine, 101 Daehak-no, Chongno-gu, Seoul, 03080, Republic of Korea

Minah Kim, Moonyoung Jang & Jun Soo Kwon

Department of Brain and Cognitive Sciences, Seoul National University College of Natural Sciences, Seoul, Republic of Korea

Jungha Lee, Minji Ha, Inkyung Park & Jun Soo Kwon

Kim Jaechul Graduate School of Artificial Intelligence, KAIST, Daejeon, Republic of Korea

Soo Yong Lee

Institute of Human Behavioral Medicine, SNU-MRC, Seoul, Republic of Korea

Jun Soo Kwon

You can also search for this author in PubMed   Google Scholar

Contributions

Authors M.K., J.L., and J.S.K. contributed to the conception and design of the study. Authors M.K., J.L., M.H., I.P., J.J., M.J., and S.P. collected the data. Authors J.L. and S.Y.L. performed the data analysis. Authors M.K. and J.L. wrote the first draft of the manuscript. Authors S.Y.L., M.H., I.P., J.J., M.J., S.P., and J.S.K. interpreted the data and critically edited the manuscript. Authors M.K. and J.S.K. contributed to the conception of the study, interpreted the data, and provided critical comments regarding the manuscript. All the authors contributed to manuscript revision and read and approved the submitted version.

Corresponding author

Correspondence to Jun Soo Kwon .

Ethics declarations

Competing interests.

The authors declare no competing interests.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary information., rights and permissions.

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/ .

Reprints and permissions

About this article

Cite this article.

Kim, M., Lee, J., Lee, S.Y. et al. Development of an eye-tracking system based on a deep learning model to assess executive function in patients with mental illnesses. Sci Rep 14 , 18186 (2024). https://doi.org/10.1038/s41598-024-68586-2

Download citation

Received : 05 May 2024

Accepted : 25 July 2024

Published : 06 August 2024

DOI : https://doi.org/10.1038/s41598-024-68586-2

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Eye tracking
  • Deep learning
  • Early psychosis
  • Obsessive‒compulsive disorder
  • Executive function
  • Visuospatial memory

By submitting a comment you agree to abide by our Terms and Community Guidelines . If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Quick links

  • Explore articles by subject
  • Guide to authors
  • Editorial policies

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

a systematic review of eye tracking research on multimedia learning

Eye-tracking research in interactive language learning environments: A systematic review

  • Published: 18 April 2024

Cite this article

a systematic review of eye tracking research on multimedia learning

  • Xiaojing Li 1  

262 Accesses

Explore all metrics

This comprehensive systematic review synthesizes thirty-four peer-reviewed articles published between 2010 and 2022, utilizing eye-tracking research within interactive language learning environments. Following the PRISMA scheme for article selection, this review illuminates both the affordances and challenges of eye-tracking technology in enhancing language learning outcomes. Through a methodical examination, including sensitivity and specificity analysis of relevant databases such as JSTOR, EBSCOHost, and ProQuest, this study not only underscores the potential of eye-tracking technology in identifying effective instructional strategies and personalizing instruction but also addresses significant challenges like equipment cost and complexity. Theoretically, this review advances our understanding of the cognitive processes involved in language learning by detailing how eye-tracking data can reveal patterns of attention allocation and information processing that are essential for effective CALL design. Pedagogically, it suggests that educators can leverage these insights to develop more engaging and effective language learning interventions that cater to the diverse needs of learners. By highlighting specific instances where eye-tracking technology has facilitated improved learning outcomes, this review sets a foundation for future research to explore innovative ways to integrate visual attention analysis in language education. Future research directions are proposed for continuing to harness eye-tracking technology’s utility in both theoretical exploration and practical application in language learning research and CALL design.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save.

  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime

Price includes VAT (Russian Federation)

Instant access to the full article PDF.

Rent this article via DeepDyve

Institutional subscriptions

a systematic review of eye tracking research on multimedia learning

Similar content being viewed by others

a systematic review of eye tracking research on multimedia learning

Eye-Tracking Research in Computer-Mediated Language Learning

a systematic review of eye tracking research on multimedia learning

Eye-Tracking as a Research Method in Language Testing

Data availability.

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

Alkan, S., & Cagiltay, K. (2007). Studying computer game learning experience through eye tracking. British Journal of Educational Technology , 38 (3), 538–542. https://doi.org/10.1111/j.1467-8535.2007.00721.x .

Article   Google Scholar  

Aryadoust, V., & Ang, B. H. (2021). Exploring the frontiers of eye tracking research in language studies: A novel co-citation scientometric review. Computer Assisted Language Learning , 34 (7), 898–933. https://doi.org/10.1080/09588221.2019.1647251 .

Bacca-Acosta, J., Tejada, J., Fabregat, R., et al. (2022). Scaffolding in immersive virtual reality environments for learning English: An eye tracking study. Educational Technology Research and Development , 70 , 339–362. https://doi.org/10.1007/s11423-021-10068-7 .

Bacca-Acosta, J., Tejada, J., Fabregat, R., Kinshuk, & Guevara, J. (2022b). Scaffolding in immersive virtual reality environments for learning English: An eye tracking study. Educational Technology Research and Development , 1–24. https://doi.org/10.1007/s11423-021-10068-7 .

Bahari, A. (2021). Computer-assisted Language Proficiency Assessment Tools and Strategies. Open Learning: The Journal of Open Distance and e-Learning , 36 (1), 61–87. https://doi.org/10.1080/02680513.2020.1726738 .

Article   MathSciNet   Google Scholar  

Bahari, A. (2022). Affordances and challenges of Technology-assisted Language Learning for Motivation: A systematic review. Interactive Learning Environments https// . https://doi.org/10.1080/10494820.2021.2021246 .

Bahari, A. (2023). Challenges and affordances of cognitive load management in Technology-assisted Language Learning: A systematic review. International Journal of Human-Computer Interaction , 38 (3), 1–15. https://doi.org/10.1080/10447318.2021.2019957 .

Bahari, A., Wu, S., & Ayres, P. (2023). Improving computer–assisted Language Learning through the Lens of Cognitive load. Educational Psychology Review , 35 (53), 1–30. https://doi.org/10.1007/s10648-023-09764-y .

Benati, A. (2022). The effects of structured input and traditional instruction on the acquisition of the English causative passive forms: An eye-tracking study measuring accuracy in responses and processing patterns. Language Teaching Research , 26 (6), 1231–1251. https://doi.org/10.1177/1362168820928577 .

Boers, F., Warren, P., Grimshaw, G., & Siyanova-Chanturia, A. (2017). On the benefits of multimodal annotations for vocabulary uptake from reading. Computer Assisted Language Learning , 30 (7), 709–725. https://doi.org/10.1080/09588221.2017.1356335 .

Bolzer, M., Strijbos, J. W., & Fischer, F. (2015). Inferring mindful cognitive-processing of peer‐feedback via eye‐tracking: Role of feedback‐characteristics, fixation‐durations and transitions. Journal of Computer Assisted Learning , 31 (5), 422–434. https://doi.org/10.1111/jcal.12091 .

Chen, Y. T., Li, M., Huang, C. Q., Han, Z. M., Hwang, G. J., & Yang, G. (2022). Promoting deep writing with immersive technologies: An SVVR-supported Chinese composition writing approach for primary schools. British Journal of Educational Technology , 53 (6), 2071–2091. https://doi.org/10.1111/bjet.13247 .

Chen, Y. T., Li, M., & Cukurova, M. (2023a). Unleashing imagination: An effective pedagogical approach to integrate into spherical video-based virtual reality to improve students’ creative writing. Education and Information Technologies , 1–25. https://doi.org/10.1007/s10639-023-12115-7 .

Chen, Y., Li, M., Huang, C., Cukurova, M., & Ma, Q. (2023b). A Systematic Review of Research on Immersive Technology-enhanced writing education: The current state and a Research Agenda. IEEE Transactions on Learning Technologies . https://doi.org/10.1109/TLT.2023.3341420 .

Chen, F., Xia, Q., Feng, Y., Wang, L., & Peng, G. (2023c). Learning challenging L2 sounds via computer-assisted training: Audiovisual training with an airflow model. Journal of Computer Assisted Learning , 39 (1), 34–48. https://doi.org/10.1111/jcal.12724 .

Chen, Y. T., Li, M., Cukurova, M., & Jong, M. S. Y. (2024). Incorporation of peer-feedback into the pedagogical use of spherical video‐based virtual reality in writing education. British Journal of Educational Technology , 55 (2), 519–540. https://doi.org/10.1111/bjet.13376 .

Chien, K. P., Tsai, C. Y., Chen, H. L., Chang, W. H., & Chen, S. (2015). Learning differences and eye fixation patterns in virtual and physical science laboratories. Computers & Education , 82 , 191–201. https://doi.org/10.1016/j.compedu.2014.11.023 .

Chun, D., Kern, R., & Smith, B. (2016). Technology in language use, language teaching, and language learning. The Modern Language Journal , 100 (S1), 64–80. https://doi.org/10.1111/modl.12302 .

Clark, R. C., & Mayer, R. E. (2016). E-learning and the science of instruction: Proven guidelines for consumers and designers of multimedia learning . Wiley.

Cook, A. E., Wei, W., & Preziosi, M. A. (2018). The use of ocular-motor measures in a convergent approach to studying cognitive load. In R. Z. Zheng (Ed.), Cognitive load measurement and application: A theoretical framework for meaningful research and practice (pp. 112–128). Routledge.

Dagienė, V., Jasutė, E., & Dolgopolovas, V. (2021). Professional development of in-service teachers: Use of eye tracking for language classes, case study. Sustainability , 13 (22), 12504. https://doi.org/10.3390/su132212504 .

De Koning, B. B., Tabbers, H. K., Rikers, R. M., & Paas, F. (2010). Attention guidance in learning from a complex animation: Seeing is understanding? Learning and Instruction , 20 (2), 111–122. https://doi.org/10.1016/j.learninstruc.2009.02.010 .

Frenck-Mestre, C. (2005). Eye-movement recording as a tool for studying syntactic processing in a second language: A review of methodologies and experimental findings. Second Language Research , 21 (2), 175–198. https://doi.org/10.1191/0267658305sr257oa .

Gass, S., Winke, P., Isbell, D. R., & Ahn, J. (2019). How captions help people learn languages: A working-memory, eye-tracking study. Language Learning & Technology , 23 (2), 84–104.

Google Scholar  

Godfroid, A., & Schmidtke, J. (2013). What do eye movements tell us about awareness? A triangulation of eye-movement data, verbal reports and vocabulary learning scores. In J. M. Bergsleithner, S. N. Frota, & J. K. Yoshioka (Eds.), Noticing and second language acquisition: Studies in honor of Richard Schmidt (pp. 183–205). University of Hawai‛i, National Foreign Language Resource Center.

Godfroid, A., Boers, F., & Housen, A. (2013). An eye for words: Gauging the role of attention in incidental L2 vocabulary acquisition by means of eye-tracking. Studies in Second Language Acquisition , 35 (3), 483–517. https://doi.org/10.1017/S0272263113000119 .

Godfroid, A., Loewen, S., Jung, S., Park, J. H., Gass, S., & Ellis, R. (2015). Timed and untimed grammaticality judgments measure distinct types of knowledge: Evidence from eye-movement patterns. Studies in Second Language Acquisition , 37 (2), 269–297. https://doi.org/10.1017/S0272263114000850 .

Godfroid, A., Ahn, J., Choi, I. N. A., Ballard, L., Cui, Y., Johnston, S., & Yoon, H. J. (2018). Incidental vocabulary learning in a natural reading context: An eye-tracking study. Bilingualism: Language and Cognition , 21 (3), 563–584. https://doi.org/10.1191/0267658305sr257oa .

Hegarty, M. (2010). Components of spatial intelligence. In Psychology of learning and motivation (Vol. 52, pp. 265–297). Academic Press. https://doi.org/10.1016/S00797421(10)52007-3 .

Hou, J., Lee, J. F., & Doherty, S. (2022). A study of the effects of mobile media on L2 text processing: Beyond offline comprehension accuracy measures. Computers & Education , 182 , 104466. https://doi.org/10.1016/j.compedu.2022.104466 .

Indrarathne, B., & Kormos, J. (2017). Attentional processing of input in explicit and implicit conditions: An eye-tracking study. Studies in Second Language Acquisition , 39 (3), 401–430. https://doi.org/10.1017/S027226311600019X .

Jarodzka, H., & Brand-Gruwel, S. (2017). Tracking the reading eye: Towards a model of real‐world reading. Journal of Computer Assisted Learning , 33 (3), 193–201. https://doi.org/10.1111/jcal.12189 .

Kang, H. (2014). Establishing wiki design principles to advance wiki-based learning: An eye tracking study. International Journal of Social Media and Interactive Learning Environments , 2 (3), 258–270. https://doi.org/10.1504/IJSMILE.2014.064206 .

Kang, H., Kweon, S. O., & Choi, S. (2022). Using eye-tracking to examine the role of first and second language glosses. Language Teaching Research , 26 (6), 1252–1273. https://doi.org/10.1177/1362168820928567 .

Kao, G. Y. M., Chiang, X. Z., & Foulsham, T. (2019). Reading behavior and the effect of embedded selfies in role-playing picture e-books: An eye-tracking investigation. Computers & Education , 136 , 99–112. https://doi.org/10.1016/j.compedu.2019.03.010 .

Knoblich, G., Ohlsson, S., & Raney, G. E. (2001). An eye movement study of insight problem solving. Memory & Cognition , 29 (7), 1000–1009. https://doi.org/10.3758/BF03195762 .

Kobylyanskaya, S. (2022). Speech and Eye Tracking features for L2 Acquisition: A Multimodal Experiment. In M. M. Rodrigo, N. Matsuda, A. I. Cristea, & V. Dimitrova (Eds.), Artificial Intelligence in Education. Posters and late breaking results, Workshops and tutorials, Industry and Innovation Tracks, practitioners’ and Doctoral Consortium. AIED 2022 (Vol. 13356). Springer. Lecture Notes in Computer Science https://doi.org/10.1007/978-3-031-11647-6_8 .

Koć-Januchta, M. M., Höffler, T. N., Prechtl, H., & Leutner, D. (2020). Is too much help an obstacle? Effects of interactivity and cognitive style on learning with dynamic versus non-dynamic visualizations with narrative explanations. Educational Technology Research and Development , 68 , 2971–2990. https://doi.org/10.1007/s11423-020-09822-0 .

Lai, M. L., Tsai, M. J., Yang, F. Y., Hsu, C. Y., Liu, T. C., Lee, S. W. Y., & Tsai, C. C. (2013). A review of using eye-tracking technology in exploring learning from 2000 to 2012. Educational Research Review , 10 , 90–115. https://doi.org/10.1016/j.edurev.2013.10.001 .

Lawson, A. P., & Mayer, R. E. (2022). The power of voice to convey emotion in multimedia instructional messages. International Journal of Artificial Intelligence in Education , 32 (4), 971–990. https://doi.org/10.1007/s40593-021-00282-y .

Liberman, L., & Dubovi, I. (2023). The effect of the modality principle to support learning with virtual reality: An eye-tracking and electrodermal activity study. Journal of Computer Assisted Learning , 39 (2), 547–557. https://doi.org/10.1111/jcal.12763 .

Liou, H. C. (2012). The roles of Second Life in a college computer-assisted language learning (CALL) course in Taiwan, ROC. Computer Assisted Language Learning , 25 (4), 365–382. https://doi.org/10.1080/09588221.2011.597766 .

Liu, P. L. (2014). Using eye tracking to understand learners’ reading process through the concept-mapping learning strategy. Computers & Education , 78 , 237–249. https://doi.org/10.1016/j.compedu.2014.05.011 .

Liu, T. S. W., Liu, Y. T., & Chen, C. Y. D. (2019). Meaningfulness is in the eye of the reader: Eye-tracking insights of L2 learners reading e-books and their pedagogical implications. Interactive Learning Environments , 27 (2), 181–199. https://doi.org/10.1080/10494820.2018.1451901 .

Mayer, R. E. (2010). Unique contributions of eye-tracking research to the study of learning with graphics. Learning and Instruction , 20 (2), 167–171. https://doi.org/10.1016/j.learninstruc.2009.02.012 .

Meißner, M., & Oll, J. (2019). The promise of eye-tracking methodology in organizational research: A taxonomy, review, and future avenues. Organizational Research Methods , 22 (2), 590–617. https://doi.org/10.1177/1094428117744882 .

Mohamed, A. A. (2018). Exposure frequency in L2 reading: An eye-movement perspective of incidental vocabulary learning. Studies in Second Language Acquisition , 40 (2), 269–293. https://doi.org/10.1017/S0272263117000092 .

Mohsen, M. A. (2016). The use of help options in multimedia listening environments to aid language learning: A review. British Journal of Educational Technology , 47 (6), 1232–1242. https://doi.org/10.1111/bjet.12305 .

Ozcelik, E., Arslan-Ari, I., & Cagiltay, K. (2010). Why does signaling enhance multimedia learning? Evidence from eye movements. Computers in Human Behavior , 26 (1), 110–117. https://doi.org/10.1016/j.chb.2009.09.001 .

Paas, F., Renkl, A., & Sweller, J. (2003). Cognitive load theory and instructional design: Recent developments. Educational Psychologist , 38 (1), 1–4. https://doi.org/10.1207/S15326985EP3801_1 .

Pellicer-Sánchez, A. (2016). Incidental L2 vocabulary acquisition from and while reading: An eye-tracking study. Studies in Second Language Acquisition , 38 (1), 97–130. https://doi.org/10.1017/S0272263115000224 .

Ponce, H. R., Mayer, R. E., Loyola, M. S., López, M. J., & Méndez, E. E. (2018). When two computer-supported learning strategies are better than one: An eye-tracking study. Computers & Education , 125 , 376–388. https://doi.org/10.1016/j.compedu.2018.06.024 .

Qu-Lee, J., & Balcetis, E. (2022). The call for ecological validity is right but missing perceptual idiosyncrasies is wrong. Behavioral and Brain Sciences , 45 (1), 1–15. https://doi.org/10.1017/S0140525X21000807 .

Russell, M. C. (2005). Hotspots and hyperlinks: Using eye-tracking to supplement usability testing. Usability News , 7 (2), 1–11.

Salmerón, L., Naumann, J., García, V., & Fajardo, I. (2017). Scanning and deep processing of information in hypertext: An eye tracking and cued retrospective think-aloud study. Journal of Computer Assisted Learning , 33 (3), 222–233. https://doi.org/10.1111/jcal.12152 .

Schugar, H. R., Smith, C. A., & Schugar, J. T. (2013). Teaching with interactive picture e-books in grades K–6. The Reading Teacher , 66 (8), 615–624. https://doi.org/10.1002/trtr.1168 .

Shadiev, R., & Li, D. (2023). A review study on eye-tracking technology usage in immersive virtual reality learning environments. Computers & Education , 196 , 104681. https://doi.org/10.1016/j.compedu.2022.104681 .

Siyanova-Chanturia, A., Conklin, K., & Schmitt, N. (2011). Adding more fuel to the fire: An eye-tracking study of idiom processing by native and non-native speakers. Second Language Research , 27 (2), 251–272. https://doi.org/10.1177/0267658310382068 .

Spit, S., Andringa, S., Rispens, J., & Aboh, E. O. (2022). The Effect of Explicit instruction on implicit and explicit linguistic knowledge in Kindergartners. Language Learning and Development , 18 (2), 201–228. https://doi.org/10.1080/15475441.2021.1941968 .

Tai, T. Y., & Chen, H. H. J. (2021). The impact of immersive virtual reality on EFL learners’ listening comprehension. Journal of Educational Computing Research , 59 (7), 1272–1293. https://doi.org/10.1177/0735633121994291 .

Takacs, Z. K., Swart, E. K., & Bus, A. G. (2015). Benefits and pitfalls of multimedia and interactive features in technology-enhanced storybooks: A meta-analysis. Review of Educational Research , 85 (4), 698–739. https://doi.org/10.3102/0034654314566989 .

Wang, J., Antonenko, P., & Dawson, K. (2020). Does visual attention to the instructor in online video affect learning and learner perceptions? An eye-tracking analysis. Computers & Education , 146 , 103779. https://doi.org/10.1016/j.compedu.2019.103779 .

Winke, P., Gass, S., & Sydorenko, T. (2013). Factors influencing the use of captions by foreign language learners: An eye-tracking study. The Modern Language Journal , 97 (1), 254–275. https://doi.org/10.1111/j.1540-4781.2013.01432.x .

Yaros, R. A., & Cook, A. E. (2011). Attention versus learning of online content: Preliminary findings from an eye-tracking study. International Journal of Cyber Behavior Psychology and Learning (IJCBPL) , 1 (4), 49–69. https://doi.org/10.4018/ijcbpl.2011100104 .

Download references

The authors declares that they have no relevant or material financial interests that relate to the research described in this paper.

Author information

Authors and affiliations.

School of Foreign Languages, Xinyang Normal University, Xinyang, Henan, 464000, China

Xiaojing Li

You can also search for this author in PubMed   Google Scholar

Contributions

Xiaojing Li is the sole author of the work.

Corresponding author

Correspondence to Xiaojing Li .

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Li, X. Eye-tracking research in interactive language learning environments: A systematic review. Educ Inf Technol (2024). https://doi.org/10.1007/s10639-024-12648-5

Download citation

Received : 24 August 2023

Accepted : 21 March 2024

Published : 18 April 2024

DOI : https://doi.org/10.1007/s10639-024-12648-5

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Interactive learning environments (ILEs)
  • Systematic review
  • Challenges and affordances
  • Computer-assisted language learning (CALL)
  • Find a journal
  • Publish with us
  • Track your research

Login into Epistemonikos

Your browser is ancient! Upgrade to a different browser to experience this site.

Perform simple searches, like the ones you use in Google. A single term for a condition and another for an intervention may suffice.

For instance,

  • ...vitamin C cold
  • ...vaccine autism

There is no real need of using a taxonomy, or boolean terms, such as AND or OR (they work, but are rarely needed. If you are really fond of this kind of search strategies, you can use our advanced search).

You can search in your own language or in English. You can even combine terms from different languages.

A systematic review of eye tracking research on multimedia learning.

a systematic review of eye tracking research on multimedia learning

Export Citation

Create your own Matrix Beta

Evidence related with this article: (not yet available)

Broad syntheses, systematic reviews, primary studies, external links:, available languages for this document.

IMAGES

  1. A systematic review of eye‐tracking‐based research on animated

    a systematic review of eye tracking research on multimedia learning

  2. (PDF) Mobile Eye Tracking Research in Inclusive Classrooms: Children’s

    a systematic review of eye tracking research on multimedia learning

  3. (PDF) The effectiveness of eye tracking in the diagnosis of cognitive

    a systematic review of eye tracking research on multimedia learning

  4. (PDF) Evaluation of Multimedia Educational Materials Using Eye Tracking

    a systematic review of eye tracking research on multimedia learning

  5. Classification of eye‐tracking metrics employed in animated multimedia

    a systematic review of eye tracking research on multimedia learning

  6. (PDF) A study of eye tracking technology and its applications

    a systematic review of eye tracking research on multimedia learning

COMMENTS

  1. A systematic review of eye tracking research on multimedia learning

    This study provides a current systematic review of eye tracking research in the domain of multimedia learning. The particular aim of the review is to explore how cognitive processes in multimedia learning are studied with relevant variables through eye tracking technology. To this end, 52 articles, including 58 studies, were analyzed.

  2. A systematic review of eye tracking research on multimedia learning

    In their systematic review of eye-tracking research in multimedia learning, [14] found that most studies have been conducted with university students, providing little empirical evidence for ...

  3. A systematic review of eye tracking research on multimedia learning

    A review of eye tracking research on video-based learning. This review sought to uncover how the utilisation of eye tracking technology has advanced understandings of the mechanisms underlying effective video-based learning and what type of caution should be exercised when interpreting the findings of these studies.

  4. A systematic review of eye‐tracking‐based research on animated

    The most challenging task in eye-tracking-based multimedia research is to establish a relationship between eye-tracking metrics (or cognitive processes) and learners' performance scores. Additionally, there are current debates about the effectiveness of animations (or simulations) in promoting learning in multimedia settings.

  5. A systematic review of eye tracking research on multimedia learning

    The particular aim of the review is to explore how cognitive processes in mult... Highlights •Multimedia learning research with eye tracking technology is on the rise.•Eye movements of college students using science materials were mainly analyzed.•Eye movements were associated w...

  6. A systematic review of eye‐tracking‐based research on animated

    Objectives: As a result, the current study aimed to review eye tracking-based. research on learners' cognitive processes in the animated/simulated multimedia. learning domain. Method: For this ...

  7. A review of eye tracking research on video-based learning

    Eye tracking technology is increasingly used to understand individuals' non-conscious, moment-to-moment processes during video-based learning. This review evaluated 44 eye tracking studies on video-based learning conducted between 2010 and 2021. Specifically, the review sought to uncover how the utilisation of eye tracking technology has advanced understandings of the mechanisms underlying ...

  8. PDF A systematic review of eye tracking research on multimedia learning

    A systematic review of eye tracking research on multimedia learning, Computers & Education (2018), doi: 10.1016/j.compedu.2018.06.023. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting ...

  9. A systematic review of eye tracking research on multimedia learning

    A systematic review of eye tracking research on multimedia learning; A systematic review of eye tracking research on multimedia learning. Ecenaz Alemdag. Ecenaz Alemdag; Kursat Cagiltay. Kursat Cagiltay; Top Cited Papers. Publisher Website . Google Scholar . Cite Download Share Download. 1 October 2018;

  10. A systematic review of eye tracking research on multimedia learning

    This study provides a current systematic review of eye tracking research in the domain of multimedia learning. The particular aim of the review is to explore how cognitive processes in multimedia learning are studied with relevant variables through eye tracking technology. To this end, 52 articles, including 58 studies, were analyzed. Remarkable results are that (1) there is a burgeoning ...

  11. Multimedia tools in the teaching and learning processes: A systematic

    Alemdag and Cagiltay (2018) conducted a systematic review of eye-tracking research on multimedia learning and found that while this research method was on the rise it was mainly used to understand the effects of multimedia use among higher education students. They also identified that although eye movements were linked to how students select ...

  12. Measuring (meta)cognitive processes in multimedia learning: Matching

    Even though recent eye tracking studies used the analysis of scanpaths or gaze patterns to investigate (meta)cognitive processes (e.g., Bühler et al., 2024; Stark et al., 2024; Tjon et al., 2023), the use of simple eye tracking indicators is widespread in research on multimedia learning as they are easy to calculate.

  13. PDF Eye-tracking research in interactive language learning ...

    this gap by presenting a systematic review of eye-tracking research in interactive language learning environments. It aims to critically analyze existing literature, pose ... ries of multimedia learning and theories specific to language acquisition. Theo-retical frameworks were critically analyzed to understand how they influenced

  14. PDF Using eye-tracking in education: review of empirical research and

    Alemdag and Cagiltay (2018) conducted a systematic review of eye-tracking for the research of multimedia learning. This work reported the deploy-ment of temporal and count scales of eye-tracking in studying the selection, organization, and integration of multimedia information. The authors advocated more research on using

  15. Full article: Using Eye Tracking Technology to Analyse Cognitive Load

    2.1. Eye tracking as a method of measuring learning in multimedia environments. Eye tracking technology is a non-invasive technique that facilitates the recording and measurement of certain cognitive processes, as well as the inference of metacognitive processes that occur during the learning process (Asish et al., Citation 2022; Tong & Nie, Citation 2022; van Marlen et al., Citation 2022).

  16. The Use of Eye Tracking as a Research and Instructional Tool in

    The present chapter summarizes the state of the art of using eye tracking in research on multimedia learning. It first provides an overview the various eye tracking parameters that have been used ...

  17. PDF A systematic review of eye tracking research on multimedia learning

    A Systematic Review of Eye Tracking Research on Multimedia Learning Abstract This study provides a current systematic review of eye tracking research in the domain of multimedia learning. The particular aim of the review is to explore how cognitive processes in multimedia learning are studied with relevant variables through eye tracking technology.

  18. Eye-tracking technology in medical education: A systematic review

    Background: Eye-tracking technology is an established research tool within allied industries such as advertising, psychology and aerospace. This review aims to consolidate literature describing the evidence for use of eye-tracking as an adjunct to traditional teaching methods in medical education. Methods: A systematic literature review was ...

  19. Development of an eye-tracking system based on a deep learning ...

    These findings suggest that this eye tracking-based deep learning model can directly and rapidly identify impaired executive function during visuospatial memory encoding, with potential ...

  20. Eye-tracking research in interactive language learning ...

    This comprehensive systematic review synthesizes thirty-four peer-reviewed articles published between 2010 and 2022, utilizing eye-tracking research within interactive language learning environments. Following the PRISMA scheme for article selection, this review illuminates both the affordances and challenges of eye-tracking technology in enhancing language learning outcomes. Through a ...

  21. Eye-tracking study provides valuable insights into learning ...

    "Eye-tracking research provides valuable data to improve the learning process, making it more personalized, effective and engaging," says the KTU Ph.D. student, a co-author of the study.

  22. A systematic review of eye tracking research on multimedia learning

    This study provides a current systematic review of eye tracking research in the domain of multimedia learning. The particular aim of the review is to explore how cognitive processes in multimedia learning are studied with relevant variables through eye tracking technology. To this end, 52 articles, including 58 studies, were analyzed.

  23. Eye-tracking study provides valuable insights into learning mathematics

    Eye tracking technology on children's mathematical education: systematic review. Frontiers in Education , 2024; 9 DOI: 10.3389/feduc.2024.1386487 Cite This Page :