Umeå universitets logga

umu.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Estimation of hazard ratios from observational data with applications related to stroke
Umeå universitet, Samhällsvetenskapliga fakulteten, Handelshögskolan vid Umeå universitet, Statistik.ORCID-id: 0000-0002-9313-3499
2024 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

The objective of this thesis is to examine some challenges that may emerge when conducting time-to-event studies based on observational data. Time-to-event (also called survival) is a setting that involves analyzing how different factors may influence the length of time until an individual experiences the event of interest. This type of analysis is commonly applied in fields such as medical research and epidemiology. In this thesis, which focuses on stroke, we are interested in the time to a recurrent stroke or the death of a patient who survived a first stroke.

Hazard ratios are one of the main parameters estimated in time-to-event studies. Hazard ratios involve comparing the risk of experiencing the event between two groups, usually a treated group and an untreated group.  They can also involve other factors, such as different age groups. Hazard ratios can be estimated from the data by using the Cox regression model.

Observational data, in contrast to experimental data, involves data collected without any intervention or random assignment of treatment to the individuals. Confounders, that is, variables that distort or obscure the true relationship between treatment and outcome, are always present and need to be controlled for in observational studies.

National registers are an important source of observational data. A national registry is a centralized database or system that collects, stores, and maintains information about a specific population or group of individuals within a country. Sweden is known for its detailed and complete national registers. In this thesis, data from the Swedish Stroke Register (Riksstroke) is used to study factors related to stroke.

In time-to-event studies involving observational data, several challenges may arise for the researcher during data analysis. Some individuals may not experience the event during the observation period and thus the information about their time until the event is incomplete. These individuals are considered as censored. Some individuals may experience another event rather than the one of interest, a competing risk. Additionally, models must be properly constructed, with researchers selecting variables and determining the suitable functional form.

Four papers are included in the thesis. Paper I demonstrates how to handle competing risks in survival analysis. The study involves comparing individuals with and without standard modifiable risk factors and their risks of a recurrent stroke or death using data from the Swedish Stroke Register.

The estimation of marginal hazard ratios is a common theme in the other three papers. All involve simulation studies in order to extend methods and explore best practices when estimating marginal hazard ratios.

Paper II explores non-parametric methods that can be used as alternatives to more traditional parametric methods when balancing datasets in order to estimate a marginal hazard ratio. A case study was also conducted using data from the Swedish Stroke Register involving the prescription of anticoagulants at hospital discharge after a stroke.

Paper III is about how censoring affects marginal hazard ratio estimation, even with perfect balancing of the dataset. We study this issue, taking into consideration varying effect sizes and censoring rates. A procedure to attenuate the problem is also studied.

Paper IV concerns covariate selection in the case of high-dimensional data. High-dimensional data involves cases in which the number of covariates in the study is comparable to the number of individuals, and therefore covariate selection methods are needed. In the paper, we explore some of these methods and suggest a best-performing procedure. As Paper II, Paper IV involves a case study of anticoagulant prescription using data from the Swedish Stroke Register.

Ort, förlag, år, upplaga, sidor
Umeå University, 2024. , s. 19
Serie
Statistical studies, ISSN 1100-8989 ; 57
Nyckelord [en]
survival analysis, causal inference, hazard ratios, marginal hazard ratio, stroke, balancing
Nationell ämneskategori
Sannolikhetsteori och statistik
Forskningsämne
statistik
Identifikatorer
URN: urn:nbn:se:umu:diva-219201ISBN: 978-91-8070-240-9 (tryckt)ISBN: 978-91-8070-241-6 (digital)OAI: oai:DiVA.org:umu-219201DiVA, id: diva2:1825506
Disputation
2024-02-02, Hörsal NBET.A.101, Norra Beteendevetarhuset, Mediegränd 14, 907 36, Umeå, 10:00 (Engelska)
Opponent
Handledare
Tillgänglig från: 2024-01-12 Skapad: 2024-01-09 Senast uppdaterad: 2024-01-10Bibliografiskt granskad
Delarbeten
1. Recurrent ischemic stroke and mortality in stroke patients without standard modifiable risk factors: an analysis of the riksstroke registry
Öppna denna publikation i ny flik eller fönster >>Recurrent ischemic stroke and mortality in stroke patients without standard modifiable risk factors: an analysis of the riksstroke registry
Visa övriga...
(Engelska)Manuskript (preprint) (Övrigt vetenskapligt)
Nationell ämneskategori
Folkhälsovetenskap, global hälsa, socialmedicin och epidemiologi
Identifikatorer
urn:nbn:se:umu:diva-218977 (URN)
Tillgänglig från: 2024-01-03 Skapad: 2024-01-03 Senast uppdaterad: 2024-01-09
2. Performance of modeling and balancing approach methods when using weights to estimate treatment effects in observational time-to-event settings
Öppna denna publikation i ny flik eller fönster >>Performance of modeling and balancing approach methods when using weights to estimate treatment effects in observational time-to-event settings
2023 (Engelska)Ingår i: PLOS ONE, E-ISSN 1932-6203, Vol. 18, nr 12, artikel-id e0289316Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

In observational studies weighting techniques are often used to overcome bias due to confounding. Modeling approaches, such as inverse propensity score weighting, are popular, but often rely on the correct specification of a parametric model wherein neither balance nor stability are targeted. More recently, balancing approach methods that directly target covariate imbalances have been proposed, and these allow the researcher to explicitly set the desired balance constraints. In this study, we evaluate the finite sample properties of different modeling and balancing approach methods, when estimating the marginal hazard ratio, through Monte Carlo simulations. The use of the different methods is also illustrated by analyzing data from the Swedish stroke register to estimate the effect of prescribing oral anticoagulants on time to recurrent stroke or death in stroke patients with atrial fibrillation. In simulated scenarios with good overlap and low or no model misspecification the balancing approach methods performed similarly to the modeling approach methods. In scenarios with bad overlap and model misspecification, the modeling approach method incorporating variable selection performed better than the other methods. The results indicate that it is valuable to use methods that target covariate balance when estimating marginal hazard ratios, but this does not in itself guarantee good performance in situations with, e.g., poor overlap, high censoring, or misspecified models/balance constraints.

Ort, förlag, år, upplaga, sidor
Public Library of Science (PLoS), 2023
Nationell ämneskategori
Sannolikhetsteori och statistik
Identifikatorer
urn:nbn:se:umu:diva-218671 (URN)10.1371/journal.pone.0289316 (DOI)38060567 (PubMedID)2-s2.0-85179800320 (Scopus ID)
Tillgänglig från: 2023-12-27 Skapad: 2023-12-27 Senast uppdaterad: 2024-01-09Bibliografiskt granskad
3. Impact of non-informative censoring on propensity score based estimation of marginal hazard ratios
Öppna denna publikation i ny flik eller fönster >>Impact of non-informative censoring on propensity score based estimation of marginal hazard ratios
(Engelska)Manuskript (preprint) (Övrigt vetenskapligt)
Nyckelord
Survival Analysis, Censoring, Marginal Hazard Ratio, Causal Inference, Simulation
Nationell ämneskategori
Sannolikhetsteori och statistik
Forskningsämne
statistik
Identifikatorer
urn:nbn:se:umu:diva-218975 (URN)
Forskningsfinansiär
Vetenskapsrådet
Tillgänglig från: 2024-01-03 Skapad: 2024-01-03 Senast uppdaterad: 2024-01-09
4. Covariate selection for the estimation of marginal hazard ratios in high-dimensional data
Öppna denna publikation i ny flik eller fönster >>Covariate selection for the estimation of marginal hazard ratios in high-dimensional data
(Engelska)Manuskript (preprint) (Övrigt vetenskapligt)
Abstract [en]

Hazard ratios are frequently reported in time-to-event and epidemiological studies to assess treatment effects. In observational studies, the combination of propensity score weights with the Cox proportional hazards model facilitates the estimation of the marginal hazard ratio (MHR). The methods for estimating MHR are analogous to those employed for estimating common causal parameters, such as the average treatment effect. However, MHR estimation in the context of high-dimensional data remain unexplored. This paper seeks to address this gap through a simulation study that consider variable selection methods from causal inference combined with a recently proposed multiply robust approach for MHR estimation. Additionally, a case study utilizing stroke register data is conducted to demonstrate the application of these methods. The results from the simulation study indicate that the double selection covariate selection method is preferable to several other strategies when estimating MHR. Nevertheless, the estimation can be further improved by employing the multiply robust approach to the set of propensity score models obtained during the double selection process.

Nationell ämneskategori
Sannolikhetsteori och statistik
Forskningsämne
statistik; statistik
Identifikatorer
urn:nbn:se:umu:diva-218976 (URN)
Tillgänglig från: 2024-01-03 Skapad: 2024-01-03 Senast uppdaterad: 2024-01-09

Open Access i DiVA

fulltext(667 kB)562 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 667 kBChecksumma SHA-512
6dc1207a0f5b0891aca58a259fa30712b4de79c870927f8e3f307a3779b905d67a53118c132e91c432fe8a58b5abd4ee9a414cff52d53383a8aff1e8c28836be
Typ fulltextMimetyp application/pdf
spikblad(170 kB)43 nedladdningar
Filinformation
Filnamn SPIKBLAD01.pdfFilstorlek 170 kBChecksumma SHA-512
563e61d2a8e2f687f6471907fccb1de4eb9fcc1e0b2d09e5bf1708c5896287748c325483335a6f366bf9a036bce263231731bb128fe3748df10622857a1ec580
Typ spikbladMimetyp application/pdf
omslag(1088 kB)38 nedladdningar
Filinformation
Filnamn COVER01.pdfFilstorlek 1088 kBChecksumma SHA-512
0c0428d1256e08d88c726d179f1142a19f98d6950edb6190466b475c3c2ba6c5e23cf02f43a0697ffa404f19990d68a6ddaef8a26e79b2feb0ad385b0570ad31
Typ coverMimetyp application/pdf

Person

Barros, Guilherme

Sök vidare i DiVA

Av författaren/redaktören
Barros, Guilherme
Av organisationen
Statistik
Sannolikhetsteori och statistik

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 562 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

isbn
urn-nbn

Altmetricpoäng

isbn
urn-nbn
Totalt: 593 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf