Class-incremental learning network for real-time anomaly recognition in surveillance environmentsShow others and affiliations
2026 (English)In: Pattern Recognition, ISSN 0031-3203, E-ISSN 1873-5142, Vol. 170, article id 112064Article in journal (Refereed) Published
Abstract [en]
The rise in crime rates has become a significant cause of property and life losses, necessitating the development of intelligent video surveillance systems for enhanced monitoring in law enforcement, transportation, and environmental contexts. However, the accurate identification of abnormal activities in real-time video surveillance systems remains a challenging task. Existing surveillance systems struggle with the vast amount of video streaming, making manual 24/7 monitoring impractical and error-prone. Traditional anomaly detection methods often process the entire dataset's feature set, which can be limiting in complex scenarios, leading to incorrect predictions, especially with challenging patterns or inter-class similarities. Therefore, this paper addresses the limitations of automatic video anomaly recognition systems by developing a vision transformer-based class-incremental learning network (CILAR-Net). The CILAR-Net leverages a vision transformer to extract spatiotemporal features from surveillance video frames, followed by a GRU network for anomaly recognition. The incremental learning approach enables the model to adapt to new classes without retraining. The CILAR-Net is validated on challenging anomaly recognition datasets, including UCF-Crime, LAD-2000, and RWF-2000, showcasing a state-of-the-art performance. Comparative analysis with existing methods demonstrates the effectiveness of CILAR-Net, which achieves an accuracy of 53.03%, 79.07%, and 93.46%, with improvements of 2.03%, 9.67%, and 0.20% from state-of-the-art methods on the UCF-Crime, LAD-2000, and RWF-2000 datasets, respectively. These results highlight the practical advantage and robustness of our method in enhancing anomaly recognition performance across diverse datasets. This article addresses significant research gaps in anomaly recognition by providing a robust and efficient solution for real-world surveillance applications.
Place, publisher, year, edition, pages
Elsevier, 2026. Vol. 170, article id 112064
Keywords [en]
GRU, Incremental learning, Surveillance videos, Video anomaly recognition, Vision transformer
National Category
Computer Sciences Computer Systems
Identifiers
URN: urn:nbn:se:umu:diva-242338DOI: 10.1016/j.patcog.2025.112064Scopus ID: 2-s2.0-105010919496OAI: oai:DiVA.org:umu-242338DiVA, id: diva2:1985764
2025-07-282025-07-282025-07-28Bibliographically approved