Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Latency robust cooperative perception using asynchronous feature fusion
Umeå University, Faculty of Science and Technology, Department of Applied Physics and Electronics.
Umeå University, Faculty of Science and Technology, Department of Applied Physics and Electronics. Rise Research Institutes of Sweden, Sweden.ORCID iD: 0000-0002-0562-2082
2025 (English)In: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV): proceedings, IEEE, 2025, p. 1-10Conference paper, Published paper (Refereed)
Abstract [en]

Recent advancements in cooperative perception have showcased substantial improvements compared to single-agent perception. Nonetheless, the inherent latency present in such systems can dramatically impair their effectiveness. In this paper, we propose a Latency Robust Cooperative Perception framework, named LRCP, to compensate for the effect of temporal asynchrony. The intuition of LRCP is to directly fuse asynchronous bird's-eye view (BEV) features instead of estimating aligned features. To achieve this, we first propose a novel flow prediction module that uses cached past BEV features to predict the flow with a non-discrete time delay at the BEV feature level. Then, the predicted flow is employed to guide the spatial sam-pling location of interests. Our approach substantially en-hances the robustness of temporal asynchronous cooper-ative perception. Specifically, we achieved robust performance across a range of latencies up to 500 ms, with a per-formance degradation of only 1 percent point for AP@0.5 metric and 4 percent points for AP@0.7 metric at 500ms on two public datasets (V2X-Sim and Dair- V2x). Code to reproduce our results is available at https://github.com/JesseWong333/LRCP.

Place, publisher, year, edition, pages
IEEE, 2025. p. 1-10
Series
Proceedings (IEEE Workshop on Applications of Computer Vision), ISSN 2472-6737, E-ISSN 2642-9381
Keywords [en]
cooperative perception
National Category
Communication Systems
Identifiers
URN: urn:nbn:se:umu:diva-238466DOI: 10.1109/WACV61041.2025.00476Scopus ID: 2-s2.0-105003635055ISBN: 979-8-3315-1083-1 (electronic)ISBN: 979-8-3315-1084-8 (print)OAI: oai:DiVA.org:umu-238466DiVA, id: diva2:1957144
Conference
2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025, Tucson, Arizona, February 26 - March 6, 2025
Available from: 2025-05-08 Created: 2025-05-08 Last updated: 2026-04-13Bibliographically approved
In thesis
1. Cooperative perception for next-generation autonomous vehicles
Open this publication in new window or tab >>Cooperative perception for next-generation autonomous vehicles
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Samverkande perception för nästa generations autonoma fordon
Abstract [en]

Cooperative perception has emerged as a key paradigm for enhancing environmental understanding in multi-agent systems by fusing sensory information from multiple agents to achieve more comprehensive and accurate perception than single-agent approaches.Despite its demonstrated benefits, existing cooperative perception methods face critical limitations in practical deployments, primarily due to model heterogeneity, latency, and limited communication bandwidth.

This Ph.D. thesis addresses the gap between the theoretical promise of cooperative perception and its practical deployment by systematically investigating how to design cooperative perception systems that are robust, efficient, and scalable under realistic constraints. The main objective of this research is to develop unified frameworks that enable effective multi-agent perception.

To this end, the thesis proposes a series of novel methods targeting these challenges.First, as a foundational study, InputMix is proposed to balance the contributions of heterogeneous sensors in joint training scenarios. Second, an intermediate model-agnostic cooperative perception framework is introduced to enable modular training and seamless collaboration among agents with heterogeneous models. Third, the Latency-Robust Cooperative Perception (LRCP) framework is developed to mitigate the adverse effects of temporal misalignment among agents. Fourth, a lightweight, codebook-free feature compression framework is designed to reduce communication overhead while preserving perceptual performance. Finally, these components are integrated into a unified framework.

Extensive experiments on public benchmark datasets demonstrate that the proposed methods achieve perception performance comparable to the ideal scenario under latency constraints, while enabling effective collaboration among heterogeneous agents and substantially reducing communication bandwidth.

The main contributions of this thesis lie in establishing practical cooperative perception frameworks that collectively address multiple fundamental challenges in multi-agent perception. The findings of this research have broader implications for large-scale autonomous systems, including connected autonomous vehicles and distributed robotic platforms, where reliable cooperative perception under communication and system heterogeneity constraints is essential.

Place, publisher, year, edition, pages
Umeå: Umeå University, 2026. p. 65
Keywords
Cooperative Perception, Autonomous Driving
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
urn:nbn:se:umu:diva-251909 (URN)978-91-6850-019-5 (ISBN)978-91-6850-020-1 (ISBN)
Public defence
2026-05-07, NAT.D.440, 09:00 (English)
Opponent
Supervisors
Available from: 2026-04-16 Created: 2026-04-13 Last updated: 2026-04-14Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Wang, JunjieNordström, Tomas

Search in DiVA

By author/editor
Wang, JunjieNordström, Tomas
By organisation
Department of Applied Physics and Electronics
Communication Systems

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 259 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf