Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Communication efficient cooperative perception via codebook-free vector quantization
Umeå University, Faculty of Science and Technology, Department of Applied Physics and Electronics.ORCID iD: 0009-0006-0350-3043
Umeå University, Faculty of Science and Technology, Department of Applied Physics and Electronics.ORCID iD: 0009-0006-7418-6989
Umeå University, Faculty of Science and Technology, Department of Applied Physics and Electronics. Department of Applied Physics and Electronics, Umeå University, Umeå, Sweden.ORCID iD: 0000-0002-0562-2082
2026 (English)In: IEEE Access, E-ISSN 2169-3536, Vol. 14, p. 42353-42365Article in journal (Refereed) Published
Abstract [en]

Recent advances in cooperative perception have demonstrated significant performance improvements over single-agent perception. In practice, cooperative perception methods often exchange intermediate multi-channel BEV feature maps at a high frame rate, which can impose a substantial burden on available V2X bandwidth and hinder the practical deployment of such systems. To mitigate bandwidth limitations, we introduce Codebook-Free Quantization for Cooperative Perception (CFQ4CP), a lightweight feature-compression framework. Specifically, a learned neural network, combined with a simple rounding operation, maps continuous latent features to compact integer codes. The discrete representations, instead of high-dimensional feature maps, are then transmitted through V2X communication. Subsequently, these features are reconstructed from the integer codes via a lightweight neural network compression decoder. Unlike existing compression methods that rely on fixed codebooks and nearest-neighbor lookups, our approach generates highly compact integer representations that eliminate the need for codebook alignment between transmitting and receiving agents. We evaluated the proposed framework on two real-world cooperative perception datasets, V2V4Real and DAIR-V2X. Experimental results demonstrate that CFQ4CP achieves perception accuracy competitive with full-precision models, even when each feature vector in the bird’s eye view map is quantized to extremely low bitwidths (e.g., 1 or 2 bits), results in reducing the communication costs by thousands of times.

Place, publisher, year, edition, pages
IEEE, 2026. Vol. 14, p. 42353-42365
Keywords [en]
Cooperative perception, autonomous driving
National Category
Communication Systems Telecommunications Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-251911DOI: 10.1109/access.2026.3674083ISI: 001723069100023Scopus ID: 2-s2.0-105033370093OAI: oai:DiVA.org:umu-251911DiVA, id: diva2:2052351
Funder
Swedish Research Council, 2022-06725Available from: 2026-04-13 Created: 2026-04-13 Last updated: 2026-04-13Bibliographically approved
In thesis
1. Cooperative perception for next-generation autonomous vehicles
Open this publication in new window or tab >>Cooperative perception for next-generation autonomous vehicles
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Samverkande perception för nästa generations autonoma fordon
Abstract [en]

Cooperative perception has emerged as a key paradigm for enhancing environmental understanding in multi-agent systems by fusing sensory information from multiple agents to achieve more comprehensive and accurate perception than single-agent approaches.Despite its demonstrated benefits, existing cooperative perception methods face critical limitations in practical deployments, primarily due to model heterogeneity, latency, and limited communication bandwidth.

This Ph.D. thesis addresses the gap between the theoretical promise of cooperative perception and its practical deployment by systematically investigating how to design cooperative perception systems that are robust, efficient, and scalable under realistic constraints. The main objective of this research is to develop unified frameworks that enable effective multi-agent perception.

To this end, the thesis proposes a series of novel methods targeting these challenges.First, as a foundational study, InputMix is proposed to balance the contributions of heterogeneous sensors in joint training scenarios. Second, an intermediate model-agnostic cooperative perception framework is introduced to enable modular training and seamless collaboration among agents with heterogeneous models. Third, the Latency-Robust Cooperative Perception (LRCP) framework is developed to mitigate the adverse effects of temporal misalignment among agents. Fourth, a lightweight, codebook-free feature compression framework is designed to reduce communication overhead while preserving perceptual performance. Finally, these components are integrated into a unified framework.

Extensive experiments on public benchmark datasets demonstrate that the proposed methods achieve perception performance comparable to the ideal scenario under latency constraints, while enabling effective collaboration among heterogeneous agents and substantially reducing communication bandwidth.

The main contributions of this thesis lie in establishing practical cooperative perception frameworks that collectively address multiple fundamental challenges in multi-agent perception. The findings of this research have broader implications for large-scale autonomous systems, including connected autonomous vehicles and distributed robotic platforms, where reliable cooperative perception under communication and system heterogeneity constraints is essential.

Place, publisher, year, edition, pages
Umeå: Umeå University, 2026. p. 65
Keywords
Cooperative Perception, Autonomous Driving
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
urn:nbn:se:umu:diva-251909 (URN)978-91-6850-019-5 (ISBN)978-91-6850-020-1 (ISBN)
Public defence
2026-05-07, NAT.D.440, 09:00 (English)
Opponent
Supervisors
Available from: 2026-04-16 Created: 2026-04-13 Last updated: 2026-04-14Bibliographically approved

Open Access in DiVA

fulltext(2244 kB)37 downloads
File information
File name FULLTEXT01.pdfFile size 2244 kBChecksum SHA-512
ba5d13dedef36ba9400ce7bf474b68255e7280dadb7c6028343a608d55354c6d74d7adfc71ec564990fbf461814f6e2735648800624de77397863b07ddaa3583
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Wang, JunjieGao, ZeyuNordström, Tomas

Search in DiVA

By author/editor
Wang, JunjieGao, ZeyuNordström, Tomas
By organisation
Department of Applied Physics and Electronics
In the same journal
IEEE Access
Communication SystemsTelecommunicationsComputer Sciences

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 178 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf