Umeå universitets logga

umu.sePublikationer
Ändra sökning
Länk till posten
Permanent länk

Direktlänk
Publikationer (6 of 6) Visa alla publikationer
Blöcker, C., Smiljanic, J., Scholtes, I. & Rosvall, M. (2022). Similarity-based link prediction from modular compression of network flows. In: Proceedings of the First Learning on Graphs Conference: . Paper presented at LOG 2022, 1st Learning on Graphs Conference, Virtual, December9-12, 2022 (pp. 52:1-52:18). ML Research Press
Öppna denna publikation i ny flik eller fönster >>Similarity-based link prediction from modular compression of network flows
2022 (Engelska)Ingår i: Proceedings of the First Learning on Graphs Conference, ML Research Press , 2022, s. 52:1-52:18Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Node similarity scores are a foundation for machine learning in graphs for clustering, node classification, anomaly detection, and link prediction with applications in biological systems, information networks, and recommender systems. Recent works on link prediction use vector space embeddings to calculate node similarities in undirected networks with good performance. Still, they have several disadvantages: limited interpretability, need for hyperparameter tuning, manual model fitting through dimensionality reduction, and poor performance from symmetric similarities in directed link prediction. We propose MapSim, an information-theoretic measure to assess node similarities based on modular compression of network flows. Unlike vector space embeddings, MapSim represents nodes in a discrete, non-metric space of communities and yields asymmetric similarities in an unsupervised fashion. We compare MapSim on a link prediction task to popular embedding-based algorithms across 47 networks and find that MapSim's average performance across all networks is more than 7% higher than its closest competitor, outperforming all embedding methods in 11 of the 47 networks. Our method demonstrates the potential of compression-based approaches in graph representation learning, with promising applications in other graph learning tasks.

Ort, förlag, år, upplaga, sidor
ML Research Press, 2022
Serie
Proceedings of Machine Learning Research, E-ISSN 2640-3498 ; 198
Nationell ämneskategori
Datavetenskap (datalogi) Datorsystem
Identifikatorer
urn:nbn:se:umu:diva-212276 (URN)2-s2.0-85164537856 (Scopus ID)
Konferens
LOG 2022, 1st Learning on Graphs Conference, Virtual, December9-12, 2022
Forskningsfinansiär
Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut och Alice Wallenbergs StiftelseVetenskapsrådet, 2016-00796
Tillgänglig från: 2023-07-20 Skapad: 2023-07-20 Senast uppdaterad: 2023-07-20Bibliografiskt granskad
Vranić, A., Smiljanic, J. & Dankulov, M. M. (2022). Universal growth of social groups: empirical analysis and modeling. Journal of Statistical Mechanics: Theory and Experiment, 2022(12), Article ID 123402.
Öppna denna publikation i ny flik eller fönster >>Universal growth of social groups: empirical analysis and modeling
2022 (Engelska)Ingår i: Journal of Statistical Mechanics: Theory and Experiment, ISSN 1742-5468, E-ISSN 1742-5468, Vol. 2022, nr 12, artikel-id 123402Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Social groups are fundamental elements of any social system. Their emergence and evolution are closely related to the structure and dynamics of a social system. Research on social groups was primarily focused on the growth and the structure of the interaction networks of social system members and how members’ group affiliation influences the evolution of these networks. The distribution of groups’ size and how members join groups has not been investigated in detail. Here we combine statistical physics and complex network theory tools to analyze the distribution of group sizes in three data sets, Meetup groups based in London and New York and Reddit. We show that all three distributions exhibit log-normal behavior that indicates universal growth patterns in these systems. We propose a theoretical model that combines social and random diffusion of members between groups to simulate the roles of social interactions and members’ interest in the growth of social groups. The simulation results show that our model reproduces growth patterns observed in empirical data. Moreover, our analysis shows that social interactions are more critical for the diffusion of members in online groups, such as Reddit, than in offline groups, such as Meetup. This work shows that social groups follow universal growth mechanisms that need to be considered in modeling the evolution of social systems.

Ort, förlag, år, upplaga, sidor
Institute of Physics (IOP), 2022
Nyckelord
network dynamics, random graphs, networks, scaling in socio-economic systems, stochastic processes
Nationell ämneskategori
Annan fysik
Identifikatorer
urn:nbn:se:umu:diva-201954 (URN)10.1088/1742-5468/aca0e9 (DOI)000894090400001 ()2-s2.0-85143908186 (Scopus ID)
Tillgänglig från: 2022-12-28 Skapad: 2022-12-28 Senast uppdaterad: 2022-12-28Bibliografiskt granskad
Smiljanic, J., Blöcker, C., Edler, D. & Rosvall, M. (2021). Mapping flows on weighted and directed networks with incomplete observations. Journal of Complex Networks, 9(6), Article ID cnab044.
Öppna denna publikation i ny flik eller fönster >>Mapping flows on weighted and directed networks with incomplete observations
2021 (Engelska)Ingår i: Journal of Complex Networks, ISSN 2051-1310, E-ISSN 2051-1329, Vol. 9, nr 6, artikel-id cnab044Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Detecting significant community structure in networks with incomplete observations is challenging because the evidence for specific solutions fades away with missing data. For example, recent research shows that flow-based community detection methods can highlight spurious communities in sparse undirected and unweighted networks with missing links. Current Bayesian approaches developed to overcome this problem do not work for incomplete observations in weighted and directed networks that describe network flows. To overcome this gap, we extend the idea behind the Bayesian estimate of the map equation for unweighted and undirected networks to enable more robust community detection in weighted and directed networks. We derive an empirical Bayes estimate of the transitions rates that can incorporate metadata information and show how an efficient implementation in the community-detection method Infomap provides more reliable communities even with a significant fraction of data missing.

Ort, förlag, år, upplaga, sidor
Oxford University Press, 2021
Nyckelord
community detection, directed and weighted networks, incomplete data, the map equation
Nationell ämneskategori
Annan fysik
Identifikatorer
urn:nbn:se:umu:diva-194470 (URN)10.1093/comnet/cnab044 (DOI)000797304300006 ()2-s2.0-85128774619 (Scopus ID)
Forskningsfinansiär
Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut och Alice Wallenbergs StiftelseVetenskapsrådet, 2016-00796
Anmärkning

Errata: "Correction to “Mapping flows on weighted and directed networks with incomplete observations”, Journal of Complex Networks, Volume 10, Issue 2, April 2022, cnac010, https://doi.org/10.1093/comnet/cnac010"

Tillgänglig från: 2022-05-06 Skapad: 2022-05-06 Senast uppdaterad: 2022-12-08Bibliografiskt granskad
Smiljanic, J., Edler, D. & Rosvall, M. (2020). Mapping flows on sparse networks with missing links. Physical review. E, 102(1), Article ID 012302.
Öppna denna publikation i ny flik eller fönster >>Mapping flows on sparse networks with missing links
2020 (Engelska)Ingår i: Physical review. E, ISSN 2470-0045, E-ISSN 2470-0053, Vol. 102, nr 1, artikel-id 012302Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Unreliable network data can cause community-detection methods to overfit and highlight spurious structures with misleading information about the organization and function of complex systems. Here we show how to detect significant flow-based communities in sparse networks with missing links using the map equation. Since the map equation builds on Shannon entropy estimation, it assumes complete data such that analyzing undersampled networks can lead to overfitting. To overcome this problem, we incorporate a Bayesian approach with assumptions about network uncertainties into the map equation framework. Results in both synthetic and real-world networks show that the Bayesian estimate of the map equation provides a principled approach to revealing significant structures in undersampled networks.

Ort, förlag, år, upplaga, sidor
American Physical Society, 2020
Nationell ämneskategori
Datavetenskap (datalogi) Annan fysik
Identifikatorer
urn:nbn:se:umu:diva-173895 (URN)10.1103/PhysRevE.102.012302 (DOI)000550381200011 ()2-s2.0-85089465455 (Scopus ID)
Forskningsfinansiär
Vetenskapsrådet, 2016-00796
Tillgänglig från: 2020-08-06 Skapad: 2020-08-06 Senast uppdaterad: 2023-03-24Bibliografiskt granskad
Blöcker, C., Smiljanic, J., Scholtes, I. & Rosvall, M.Similarity-based Link Prediction from Modular Compression of Network Flows.
Öppna denna publikation i ny flik eller fönster >>Similarity-based Link Prediction from Modular Compression of Network Flows
(Engelska)Manuskript (preprint) (Övrigt vetenskapligt)
Abstract [en]

Node similarity scores constitute a foundation for machine learning in graphs. Besides clustering, node classification, and anomaly detection, they are a basis for link prediction with critical applications in biological systems, information networks, and recommender systems. Recent works on link prediction use vector space embeddings to calculate node similarities. While these methods can provide good performance in undirected networks, they have several disadvantages: limited interpretability, problem-specific hyperparameter tuning, manual model fitting through dimensionality reduction, and poor performance of symmetric similarities in directed link prediction. To address these issues, we propose MapSim, a novel information-theoretic approach to assess node similarities based on modular compression of network flows. Different from vector space embeddings, MapSim represents nodes in a discrete, non-metric space of communities and yields asymmetric similarities suitable to predict directed and undirected links in an unsupervised fashion. The resulting similarities can be explained based on a network's hierarchical modular organisation, facilitating interpretability. MapSim naturally accounts for Occam's razor, leading to parsimonious representations of clusters at multiple scales. Addressing unsupervised link prediction, we compare MapSim to popular embedding-based algorithms across 47 data sets of networks from a few hundred to hundreds of thousands of nodes and millions of links. Our analysis shows that MapSim's average performance across all networks is more than 7% higher than its closest competitor, outperforming all embedding methods in 14 of the 47 networks, and a more than 33% better worst-case performance. Our method demonstrates the potential of compression-based approaches in graph representation learning, with promising applications in other graph learning tasks.

Nyckelord
machine learning, network analysis, graph learning, representation learning, link prediction, minimum description length
Nationell ämneskategori
Datavetenskap (datalogi) Annan data- och informationsvetenskap
Identifikatorer
urn:nbn:se:umu:diva-199604 (URN)
Tillgänglig från: 2022-09-22 Skapad: 2022-09-22 Senast uppdaterad: 2022-09-27
Edler, D., Smiljanić, J., Holmgren, A., Antonelli, A. & Rosvall, M.Variable Markov dynamics as a multifocal lens to map multiscale complex networks.
Öppna denna publikation i ny flik eller fönster >>Variable Markov dynamics as a multifocal lens to map multiscale complex networks
Visa övriga...
(Engelska)Manuskript (preprint) (Övrigt vetenskapligt)
Nyckelord
network science, community detection, Infomap
Nationell ämneskategori
Datavetenskap (datalogi) Annan fysik
Identifikatorer
urn:nbn:se:umu:diva-201174 (URN)
Tillgänglig från: 2022-11-22 Skapad: 2022-11-22 Senast uppdaterad: 2022-11-23
Organisationer
Identifikatorer
ORCID-id: ORCID iD iconorcid.org/0000-0003-0124-1909

Sök vidare i DiVA

Visa alla publikationer