Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Estimation of hazard ratios from observational data with applications related to stroke
Umeå University, Faculty of Social Sciences, Umeå School of Business and Economics (USBE), Statistics.ORCID iD: 0000-0002-9313-3499
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The objective of this thesis is to examine some challenges that may emerge when conducting time-to-event studies based on observational data. Time-to-event (also called survival) is a setting that involves analyzing how different factors may influence the length of time until an individual experiences the event of interest. This type of analysis is commonly applied in fields such as medical research and epidemiology. In this thesis, which focuses on stroke, we are interested in the time to a recurrent stroke or the death of a patient who survived a first stroke.

Hazard ratios are one of the main parameters estimated in time-to-event studies. Hazard ratios involve comparing the risk of experiencing the event between two groups, usually a treated group and an untreated group.  They can also involve other factors, such as different age groups. Hazard ratios can be estimated from the data by using the Cox regression model.

Observational data, in contrast to experimental data, involves data collected without any intervention or random assignment of treatment to the individuals. Confounders, that is, variables that distort or obscure the true relationship between treatment and outcome, are always present and need to be controlled for in observational studies.

National registers are an important source of observational data. A national registry is a centralized database or system that collects, stores, and maintains information about a specific population or group of individuals within a country. Sweden is known for its detailed and complete national registers. In this thesis, data from the Swedish Stroke Register (Riksstroke) is used to study factors related to stroke.

In time-to-event studies involving observational data, several challenges may arise for the researcher during data analysis. Some individuals may not experience the event during the observation period and thus the information about their time until the event is incomplete. These individuals are considered as censored. Some individuals may experience another event rather than the one of interest, a competing risk. Additionally, models must be properly constructed, with researchers selecting variables and determining the suitable functional form.

Four papers are included in the thesis. Paper I demonstrates how to handle competing risks in survival analysis. The study involves comparing individuals with and without standard modifiable risk factors and their risks of a recurrent stroke or death using data from the Swedish Stroke Register.

The estimation of marginal hazard ratios is a common theme in the other three papers. All involve simulation studies in order to extend methods and explore best practices when estimating marginal hazard ratios.

Paper II explores non-parametric methods that can be used as alternatives to more traditional parametric methods when balancing datasets in order to estimate a marginal hazard ratio. A case study was also conducted using data from the Swedish Stroke Register involving the prescription of anticoagulants at hospital discharge after a stroke.

Paper III is about how censoring affects marginal hazard ratio estimation, even with perfect balancing of the dataset. We study this issue, taking into consideration varying effect sizes and censoring rates. A procedure to attenuate the problem is also studied.

Paper IV concerns covariate selection in the case of high-dimensional data. High-dimensional data involves cases in which the number of covariates in the study is comparable to the number of individuals, and therefore covariate selection methods are needed. In the paper, we explore some of these methods and suggest a best-performing procedure. As Paper II, Paper IV involves a case study of anticoagulant prescription using data from the Swedish Stroke Register.

Place, publisher, year, edition, pages
Umeå University, 2024. , p. 19
Series
Statistical studies, ISSN 1100-8989 ; 57
Keywords [en]
survival analysis, causal inference, hazard ratios, marginal hazard ratio, stroke, balancing
National Category
Probability Theory and Statistics
Research subject
Statistics
Identifiers
URN: urn:nbn:se:umu:diva-219201ISBN: 978-91-8070-240-9 (print)ISBN: 978-91-8070-241-6 (electronic)OAI: oai:DiVA.org:umu-219201DiVA, id: diva2:1825506
Public defence
2024-02-02, Hörsal NBET.A.101, Norra Beteendevetarhuset, Mediegränd 14, 907 36, Umeå, 10:00 (English)
Opponent
Supervisors
Available from: 2024-01-12 Created: 2024-01-09 Last updated: 2024-01-10Bibliographically approved
List of papers
1. Recurrent ischemic stroke and mortality in stroke patients without standard modifiable risk factors: an analysis of the riksstroke registry
Open this publication in new window or tab >>Recurrent ischemic stroke and mortality in stroke patients without standard modifiable risk factors: an analysis of the riksstroke registry
Show others...
(English)Manuscript (preprint) (Other academic)
National Category
Public Health, Global Health, Social Medicine and Epidemiology
Identifiers
urn:nbn:se:umu:diva-218977 (URN)
Available from: 2024-01-03 Created: 2024-01-03 Last updated: 2024-01-09
2. Performance of modeling and balancing approach methods when using weights to estimate treatment effects in observational time-to-event settings
Open this publication in new window or tab >>Performance of modeling and balancing approach methods when using weights to estimate treatment effects in observational time-to-event settings
2023 (English)In: PLOS ONE, E-ISSN 1932-6203, Vol. 18, no 12, article id e0289316Article in journal (Refereed) Published
Abstract [en]

In observational studies weighting techniques are often used to overcome bias due to confounding. Modeling approaches, such as inverse propensity score weighting, are popular, but often rely on the correct specification of a parametric model wherein neither balance nor stability are targeted. More recently, balancing approach methods that directly target covariate imbalances have been proposed, and these allow the researcher to explicitly set the desired balance constraints. In this study, we evaluate the finite sample properties of different modeling and balancing approach methods, when estimating the marginal hazard ratio, through Monte Carlo simulations. The use of the different methods is also illustrated by analyzing data from the Swedish stroke register to estimate the effect of prescribing oral anticoagulants on time to recurrent stroke or death in stroke patients with atrial fibrillation. In simulated scenarios with good overlap and low or no model misspecification the balancing approach methods performed similarly to the modeling approach methods. In scenarios with bad overlap and model misspecification, the modeling approach method incorporating variable selection performed better than the other methods. The results indicate that it is valuable to use methods that target covariate balance when estimating marginal hazard ratios, but this does not in itself guarantee good performance in situations with, e.g., poor overlap, high censoring, or misspecified models/balance constraints.

Place, publisher, year, edition, pages
Public Library of Science (PLoS), 2023
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:umu:diva-218671 (URN)10.1371/journal.pone.0289316 (DOI)38060567 (PubMedID)2-s2.0-85179800320 (Scopus ID)
Available from: 2023-12-27 Created: 2023-12-27 Last updated: 2024-01-09Bibliographically approved
3. Impact of non-informative censoring on propensity score based estimation of marginal hazard ratios
Open this publication in new window or tab >>Impact of non-informative censoring on propensity score based estimation of marginal hazard ratios
(English)Manuscript (preprint) (Other academic)
Keywords
Survival Analysis, Censoring, Marginal Hazard Ratio, Causal Inference, Simulation
National Category
Probability Theory and Statistics
Research subject
Statistics
Identifiers
urn:nbn:se:umu:diva-218975 (URN)
Funder
Swedish Research Council
Available from: 2024-01-03 Created: 2024-01-03 Last updated: 2024-01-09
4. Covariate selection for the estimation of marginal hazard ratios in high-dimensional data
Open this publication in new window or tab >>Covariate selection for the estimation of marginal hazard ratios in high-dimensional data
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Hazard ratios are frequently reported in time-to-event and epidemiological studies to assess treatment effects. In observational studies, the combination of propensity score weights with the Cox proportional hazards model facilitates the estimation of the marginal hazard ratio (MHR). The methods for estimating MHR are analogous to those employed for estimating common causal parameters, such as the average treatment effect. However, MHR estimation in the context of high-dimensional data remain unexplored. This paper seeks to address this gap through a simulation study that consider variable selection methods from causal inference combined with a recently proposed multiply robust approach for MHR estimation. Additionally, a case study utilizing stroke register data is conducted to demonstrate the application of these methods. The results from the simulation study indicate that the double selection covariate selection method is preferable to several other strategies when estimating MHR. Nevertheless, the estimation can be further improved by employing the multiply robust approach to the set of propensity score models obtained during the double selection process.

National Category
Probability Theory and Statistics
Research subject
Statistics; Statistics
Identifiers
urn:nbn:se:umu:diva-218976 (URN)
Available from: 2024-01-03 Created: 2024-01-03 Last updated: 2024-01-09

Open Access in DiVA

fulltext(667 kB)376 downloads
File information
File name FULLTEXT01.pdfFile size 667 kBChecksum SHA-512
6dc1207a0f5b0891aca58a259fa30712b4de79c870927f8e3f307a3779b905d67a53118c132e91c432fe8a58b5abd4ee9a414cff52d53383a8aff1e8c28836be
Type fulltextMimetype application/pdf
spikblad(170 kB)35 downloads
File information
File name SPIKBLAD01.pdfFile size 170 kBChecksum SHA-512
563e61d2a8e2f687f6471907fccb1de4eb9fcc1e0b2d09e5bf1708c5896287748c325483335a6f366bf9a036bce263231731bb128fe3748df10622857a1ec580
Type spikbladMimetype application/pdf
omslag(1088 kB)30 downloads
File information
File name COVER01.pdfFile size 1088 kBChecksum SHA-512
0c0428d1256e08d88c726d179f1142a19f98d6950edb6190466b475c3c2ba6c5e23cf02f43a0697ffa404f19990d68a6ddaef8a26e79b2feb0ad385b0570ad31
Type coverMimetype application/pdf

Authority records

Barros, Guilherme

Search in DiVA

By author/editor
Barros, Guilherme
By organisation
Statistics
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar
Total: 376 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 481 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf