Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A stratified review of COVID-19 infection forecasting and an efficient methodology using multiple domain-based transfer learning
Department of Computer Science, South Asian University, New Delhi, India; Department of Genetics Genomics and Informatics, The University of Tennessee Health Science Center, Memphis, Tennessee, USA.
Umeå University, Faculty of Science and Technology, Department of Computing Science.ORCID iD: 0000-0002-7204-8228
Department of Computer Science, South Asian University, New Delhi, India.ORCID iD: 0000-0001-7122-7622
2025 (English)In: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 262, article id 125277Article in journal (Refereed) Published
Abstract [en]

The initial outbreak of COVID-19 was reported in December 2019, China. The pandemic has led to unforeseen challenges, causing unimaginable devastation of the economic and social disruption since its inception. An effective approach for forecasting infections will be beneficial for the health sector and administration in better strategic planning and proficient management of all necessary schemes towards preventive and curative treatments. Most existing studies consider image dataset for COVID-19 prediction, whereas studies involving structural data are very rare. Thus, initially the main focus of this paper is to provide an exhaustive review that discusses about COVID-19 forecasting papers with emphasis on structural data. Then, this paper introduces a pioneering approach to COVID-19 infection forecasting, utilizing structural datasets instead of traditional image datasets. It presents a novel multi-source transfer-learning framework to enhance prediction accuracy, integrating demographic, economic, and COVID-19 data for intra-provincial spread forecasts. The COVID-19 forecasting depends on several parameters such as its current statistics, geographical area, population density and economic status like GDP etc. However, the dataset generated for an individual province of a country is alone inadequate for the precise forecast, as it faces data scarcity. Thus, transfer learning helps in such cases, where the dataset has been collected from multiple provinces. Since, it is a time-series data, thus we also consider lagged features for efficient prediction of COVID cases. Thus, apart from the detailed review, this study also aims to develop robust machine learning models by proposing a novel and efficient multi-source transfer learning technique for accurate forecasting of COVID-19 in a province. The proposed approach has been evaluated over a wide range of datasets involving sixty-two different provinces belonging to a diverse set of countries. We also performed hyperparameter tuning using Bayesian optimisation to optimise the machine learning models used. Later, we performed Friedman and Nemenyi test to compare the results generated from different models. Empirical evidence proved that forecasting using the proposed approach is much more precise with the simpler models such as Decision Trees as compared to complex models. In cases of data scarcity, when target domain data could not be used for training/fine-tuning the models simpler models are far more powerful due to their generalization capabilities than complex models. Hence, the proposed methodology is promising and valuable for governments and organizations to deal with the challenges of any pandemic outbreak for better healthcare planning and management, even when the data is in scarcity.

Place, publisher, year, edition, pages
Elsevier, 2025. Vol. 262, article id 125277
Keywords [en]
COVID-19, gross domestic product (GDP), infection forecasting, machine learning regression models, multi-source domain dataset, multi-source transfer learning, province-specific data.
National Category
Computer Sciences Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:umu:diva-229231DOI: 10.1016/j.eswa.2024.125277ISI: 001355011800001Scopus ID: 2-s2.0-85206521878OAI: oai:DiVA.org:umu-229231DiVA, id: diva2:1895449
Available from: 2024-09-05 Created: 2024-09-05 Last updated: 2025-04-24Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Garg, Sonakshi

Search in DiVA

By author/editor
Garg, SonakshiMuhuri, Pranab K.
By organisation
Department of Computing Science
In the same journal
Expert systems with applications
Computer SciencesProbability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 103 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf