You're using an outdated browser. Please upgrade to a modern browser for the best experience.
Vision Transformer for Real-Time Video Action Recognition
Academic Video Service
All videos are free for registered users. Please login to proceed.
  • View Times: 63
  • |
  • Update Date: 12 Oct 2023
  • action recognition
  • vision transformer
  • cloud solution
Video Introduction

This video is adapted from 10.3390/make5040067

Among computer vision tasks, video analysis, including action recognition and event detection, has significantly progressed in recent decades. Action recognition refers to a set of algorithms that identify a specific human action or an event in a series of video frames, such as playing a musical instrument or scoring a goal in a soccer match.

Full Transcript
1000/1000
Hot Most Recent

Confirm

Are you sure to Delete?
Yes No
Cite
If you have any further questions, please contact Encyclopedia Editorial Office.
Sarraf, S.; Kabia, M. Vision Transformer for Real-Time Video Action Recognition. Encyclopedia. Available online: https://encyclopedia.pub/video/video_detail/937 (accessed on 05 December 2025).
Sarraf S, Kabia M. Vision Transformer for Real-Time Video Action Recognition. Encyclopedia. Available at: https://encyclopedia.pub/video/video_detail/937. Accessed December 05, 2025.
Sarraf, Saman, Milton Kabia. "Vision Transformer for Real-Time Video Action Recognition" Encyclopedia, https://encyclopedia.pub/video/video_detail/937 (accessed December 05, 2025).
Sarraf, S., & Kabia, M. (2023, October 12). Vision Transformer for Real-Time Video Action Recognition. In Encyclopedia. https://encyclopedia.pub/video/video_detail/937
Sarraf, Saman and Milton Kabia. "Vision Transformer for Real-Time Video Action Recognition." Encyclopedia. Web. 12 October, 2023.
Academic Video Service