Urban Population Prediction with Deep Learning for Enhanced 5G Simulations
2025 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesis
Abstract [en]
Precise knowledge of population distribution at fine spatial resolutions is essential for numerous socio-economic, environmental, and infrastructure planning tasks, including city planning and telecommunication simulation. However, traditional census data are often outdated, infrequently collected, and aggregated at coarse administrative levels, making them unsuitable for applications requiring high-resolution population estimates. To address this challenge, this thesis - developed in collaboration with Tietoevry - presents the Multimodal Architecture for Population Synthesis (MAPS), a deep learning framework for estimating residential building-level populations.
The approach integrates heterogeneous data sources in a completely new dataset - OpenStreetMap (OSM) features, building geometries, building heights, and Google Street View imagery - with supervision from two ground truth sources: building-level population from the Swedish state personal address register (SPAR) and aggregated zone-level population from Statistics Sweden. A multitask learning setup combining a building-level regression loss with a zone-level loss is proposed to bridge the granularity gap between these sources, optimizing accuracy over two different granularities. Additionally, two auxiliary models are developed to fill in missing data: one for building type classification, HouseCat, achieving 95% accuracy, and another for floor count estimation, FloorCast, reaching a mean absolute error of 1.0.
The proposed method is the first of its kind predicting population on the building level with easily accesible geospatial data while using a combined zone- and building-level loss training strategy. The MAPS method achieves a 64% reduction in mean squared error on the building-level compared to Tietoevry’s baseline model and attains a mean absolute percentage error of just 21% at the zone level.
Place, publisher, year, edition, pages
2025. , p. 72
Series
UMNAD ; 1573
Keywords [en]
AI, artificial intelligence, computing science, computer science, deep learning, machine learning, computer vision, telecom, 5G, population prediction, population estimation, building type prediction, open street map, geospatial data, geospatial, streetview, google earth, floor prediction, regression, resnet, census, random forest, multilayered perceptron, Building-level Prediction, Multimodal Learning, Multitask Learning, City Simulation, Urban Planning, 5G network simulation
Keywords [sv]
datavetenskap, artificiell intelligens, Befolkningsestimering, Byggnadsnivåprediktion, Geodata, Djupinlärning, Multimodal inlärning, Multitask-inlärning, Google Street View, OpenStreetMap, Stadssimulering, Stadsplanering, 5G-nätverkssimulering, Husklassificering, Våningsprediktion, Populationsmodell, Random Forest, Bildigenkänning, ResNet18, Maskininlärning
National Category
Artificial Intelligence Computer Sciences Computer Vision and Learning Systems Telecommunications
Identifiers
URN: urn:nbn:se:umu:diva-241839OAI: oai:DiVA.org:umu-241839DiVA, id: diva2:1980404
External cooperation
Tietoevry
Educational program
Master of Science Programme in Computing Science and Engineering
Presentation
2025-06-04, 10:00 (English)
Supervisors
Examiners
2025-07-022025-07-022025-07-02Bibliographically approved