Umeå University's logo

umu.sePublications
Change search
Link to record
Permanent link

Direct link
Publications (6 of 6) Show all publications
Blöcker, C., Smiljanic, J., Scholtes, I. & Rosvall, M. (2022). Similarity-based link prediction from modular compression of network flows. In: Proceedings of the First Learning on Graphs Conference: . Paper presented at LOG 2022, 1st Learning on Graphs Conference, Virtual, December9-12, 2022 (pp. 52:1-52:18). ML Research Press
Open this publication in new window or tab >>Similarity-based link prediction from modular compression of network flows
2022 (English)In: Proceedings of the First Learning on Graphs Conference, ML Research Press , 2022, p. 52:1-52:18Conference paper, Published paper (Refereed)
Abstract [en]

Node similarity scores are a foundation for machine learning in graphs for clustering, node classification, anomaly detection, and link prediction with applications in biological systems, information networks, and recommender systems. Recent works on link prediction use vector space embeddings to calculate node similarities in undirected networks with good performance. Still, they have several disadvantages: limited interpretability, need for hyperparameter tuning, manual model fitting through dimensionality reduction, and poor performance from symmetric similarities in directed link prediction. We propose MapSim, an information-theoretic measure to assess node similarities based on modular compression of network flows. Unlike vector space embeddings, MapSim represents nodes in a discrete, non-metric space of communities and yields asymmetric similarities in an unsupervised fashion. We compare MapSim on a link prediction task to popular embedding-based algorithms across 47 networks and find that MapSim's average performance across all networks is more than 7% higher than its closest competitor, outperforming all embedding methods in 11 of the 47 networks. Our method demonstrates the potential of compression-based approaches in graph representation learning, with promising applications in other graph learning tasks.

Place, publisher, year, edition, pages
ML Research Press, 2022
Series
Proceedings of Machine Learning Research, E-ISSN 2640-3498 ; 198
National Category
Computer Sciences Computer Systems
Identifiers
urn:nbn:se:umu:diva-212276 (URN)2-s2.0-85164537856 (Scopus ID)
Conference
LOG 2022, 1st Learning on Graphs Conference, Virtual, December9-12, 2022
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut and Alice Wallenberg FoundationSwedish Research Council, 2016-00796
Available from: 2023-07-20 Created: 2023-07-20 Last updated: 2023-07-20Bibliographically approved
Vranić, A., Smiljanic, J. & Dankulov, M. M. (2022). Universal growth of social groups: empirical analysis and modeling. Journal of Statistical Mechanics: Theory and Experiment, 2022(12), Article ID 123402.
Open this publication in new window or tab >>Universal growth of social groups: empirical analysis and modeling
2022 (English)In: Journal of Statistical Mechanics: Theory and Experiment, E-ISSN 1742-5468, Vol. 2022, no 12, article id 123402Article in journal (Refereed) Published
Abstract [en]

Social groups are fundamental elements of any social system. Their emergence and evolution are closely related to the structure and dynamics of a social system. Research on social groups was primarily focused on the growth and the structure of the interaction networks of social system members and how members’ group affiliation influences the evolution of these networks. The distribution of groups’ size and how members join groups has not been investigated in detail. Here we combine statistical physics and complex network theory tools to analyze the distribution of group sizes in three data sets, Meetup groups based in London and New York and Reddit. We show that all three distributions exhibit log-normal behavior that indicates universal growth patterns in these systems. We propose a theoretical model that combines social and random diffusion of members between groups to simulate the roles of social interactions and members’ interest in the growth of social groups. The simulation results show that our model reproduces growth patterns observed in empirical data. Moreover, our analysis shows that social interactions are more critical for the diffusion of members in online groups, such as Reddit, than in offline groups, such as Meetup. This work shows that social groups follow universal growth mechanisms that need to be considered in modeling the evolution of social systems.

Place, publisher, year, edition, pages
Institute of Physics (IOP), 2022
Keywords
network dynamics, random graphs, networks, scaling in socio-economic systems, stochastic processes
National Category
Other Physics Topics
Identifiers
urn:nbn:se:umu:diva-201954 (URN)10.1088/1742-5468/aca0e9 (DOI)000894090400001 ()2-s2.0-85143908186 (Scopus ID)
Available from: 2022-12-28 Created: 2022-12-28 Last updated: 2024-07-04Bibliographically approved
Smiljanic, J., Blöcker, C., Edler, D. & Rosvall, M. (2021). Mapping flows on weighted and directed networks with incomplete observations. Journal of Complex Networks, 9(6), Article ID cnab044.
Open this publication in new window or tab >>Mapping flows on weighted and directed networks with incomplete observations
2021 (English)In: Journal of Complex Networks, ISSN 2051-1310, E-ISSN 2051-1329, Vol. 9, no 6, article id cnab044Article in journal (Refereed) Published
Abstract [en]

Detecting significant community structure in networks with incomplete observations is challenging because the evidence for specific solutions fades away with missing data. For example, recent research shows that flow-based community detection methods can highlight spurious communities in sparse undirected and unweighted networks with missing links. Current Bayesian approaches developed to overcome this problem do not work for incomplete observations in weighted and directed networks that describe network flows. To overcome this gap, we extend the idea behind the Bayesian estimate of the map equation for unweighted and undirected networks to enable more robust community detection in weighted and directed networks. We derive an empirical Bayes estimate of the transitions rates that can incorporate metadata information and show how an efficient implementation in the community-detection method Infomap provides more reliable communities even with a significant fraction of data missing.

Place, publisher, year, edition, pages
Oxford University Press, 2021
Keywords
community detection, directed and weighted networks, incomplete data, the map equation
National Category
Other Physics Topics
Identifiers
urn:nbn:se:umu:diva-194470 (URN)10.1093/comnet/cnab044 (DOI)000797304300006 ()2-s2.0-85128774619 (Scopus ID)
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut and Alice Wallenberg FoundationSwedish Research Council, 2016-00796
Note

Errata: "Correction to “Mapping flows on weighted and directed networks with incomplete observations”, Journal of Complex Networks, Volume 10, Issue 2, April 2022, cnac010, https://doi.org/10.1093/comnet/cnac010"

Available from: 2022-05-06 Created: 2022-05-06 Last updated: 2022-12-08Bibliographically approved
Smiljanic, J., Edler, D. & Rosvall, M. (2020). Mapping flows on sparse networks with missing links. Physical review. E, 102(1), Article ID 012302.
Open this publication in new window or tab >>Mapping flows on sparse networks with missing links
2020 (English)In: Physical review. E, ISSN 2470-0045, E-ISSN 2470-0053, Vol. 102, no 1, article id 012302Article in journal (Refereed) Published
Abstract [en]

Unreliable network data can cause community-detection methods to overfit and highlight spurious structures with misleading information about the organization and function of complex systems. Here we show how to detect significant flow-based communities in sparse networks with missing links using the map equation. Since the map equation builds on Shannon entropy estimation, it assumes complete data such that analyzing undersampled networks can lead to overfitting. To overcome this problem, we incorporate a Bayesian approach with assumptions about network uncertainties into the map equation framework. Results in both synthetic and real-world networks show that the Bayesian estimate of the map equation provides a principled approach to revealing significant structures in undersampled networks.

Place, publisher, year, edition, pages
American Physical Society, 2020
National Category
Computer Sciences Other Physics Topics
Identifiers
urn:nbn:se:umu:diva-173895 (URN)10.1103/PhysRevE.102.012302 (DOI)000550381200011 ()2-s2.0-85089465455 (Scopus ID)
Funder
Swedish Research Council, 2016-00796
Available from: 2020-08-06 Created: 2020-08-06 Last updated: 2023-03-24Bibliographically approved
Blöcker, C., Smiljanic, J., Scholtes, I. & Rosvall, M.Similarity-based Link Prediction from Modular Compression of Network Flows.
Open this publication in new window or tab >>Similarity-based Link Prediction from Modular Compression of Network Flows
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Node similarity scores constitute a foundation for machine learning in graphs. Besides clustering, node classification, and anomaly detection, they are a basis for link prediction with critical applications in biological systems, information networks, and recommender systems. Recent works on link prediction use vector space embeddings to calculate node similarities. While these methods can provide good performance in undirected networks, they have several disadvantages: limited interpretability, problem-specific hyperparameter tuning, manual model fitting through dimensionality reduction, and poor performance of symmetric similarities in directed link prediction. To address these issues, we propose MapSim, a novel information-theoretic approach to assess node similarities based on modular compression of network flows. Different from vector space embeddings, MapSim represents nodes in a discrete, non-metric space of communities and yields asymmetric similarities suitable to predict directed and undirected links in an unsupervised fashion. The resulting similarities can be explained based on a network's hierarchical modular organisation, facilitating interpretability. MapSim naturally accounts for Occam's razor, leading to parsimonious representations of clusters at multiple scales. Addressing unsupervised link prediction, we compare MapSim to popular embedding-based algorithms across 47 data sets of networks from a few hundred to hundreds of thousands of nodes and millions of links. Our analysis shows that MapSim's average performance across all networks is more than 7% higher than its closest competitor, outperforming all embedding methods in 14 of the 47 networks, and a more than 33% better worst-case performance. Our method demonstrates the potential of compression-based approaches in graph representation learning, with promising applications in other graph learning tasks.

Keywords
machine learning, network analysis, graph learning, representation learning, link prediction, minimum description length
National Category
Computer Sciences Other Computer and Information Science
Identifiers
urn:nbn:se:umu:diva-199604 (URN)
Available from: 2022-09-22 Created: 2022-09-22 Last updated: 2022-09-27
Edler, D., Smiljanić, J., Holmgren, A., Antonelli, A. & Rosvall, M.Variable Markov dynamics as a multifocal lens to map multiscale complex networks.
Open this publication in new window or tab >>Variable Markov dynamics as a multifocal lens to map multiscale complex networks
Show others...
(English)Manuscript (preprint) (Other academic)
Keywords
network science, community detection, Infomap
National Category
Computer Sciences Other Physics Topics
Identifiers
urn:nbn:se:umu:diva-201174 (URN)
Available from: 2022-11-22 Created: 2022-11-22 Last updated: 2022-11-23
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-0124-1909

Search in DiVA

Show all publications