Motivation
Real-world data (RWD), such as data from electronic health records, disease registries, administrative claims for billing, and cell lines, provide unprecedented opportunity to propel generation of real-world evidence (RWE) to advance precision medicine and improve health outcomes. However, RWD are rarely collected specifically for research purposes from a rigorously designed study; thus, RWE is vulnerable to multiple sources of bias leading to suboptimal decisions. Key challenges of research with RWD that may lead to bias include uneven data quality, data sparseness, participant heterogeneity, selection bias, and informative outcome-dependent follow-up. In this session, we will describe advances in statistical methods to address these challenges. Methods are motivated by and applied to settings such as observational longitudinal studies, risk prediction, hybrid-design randomized trials, and genetic testing.
This collection of talks will explore a breadth of topics in RWD, which will enable comparison and contrast of salient features of different RWD platforms. This topic is particularly relevant to a wide population of biostatisticians working in the health sciences owing to a proliferation of efforts to link RWD to more traditional research data resources, such as epidemiologic observational or intervention study data. Therefore, a session comprising discussion of different RWD platforms for the diverse purposes of observational studies, risk prediction, hybrid randomized trials, and genetic testing will enhance cross-talk among biostatisticians working in multiple applications. Ultimately, the audience will learn the opportunities as well as the challenges arising from RWD to generate RWE with the goal of improving health.
Session speakers are diverse across multiple dimensions: region (USA, Canada, UK), gender (three women, two men), career stage (from assistant to full professor), affiliation (school of medicine, mathematics/statistics, data science).
Proposed Speakers & Discussant
Janie Coulombe, Universite de Montreal (Canada)
- Multiply robust longitudinal causal inference with informative monitoring times with an application to lifestyle interventions in electronic health records
Ying Lu, Stanford University School of Medicine (USA)
- Quality assessment of real-world data and their impact on randomized trials linked to registry data with an application to amyotrophic lateral sclerosis
Hongsheng Dai, Newcastle University (UK)
- Novel mixture models to address patient heterogeneity in real-world genetic data to advance precision medicine with an application to the Cancer Cell Line Encyclopedia
Jessica Gronsbell, University of Toronto (Canada)
- Estimation and evaluation of semi-supervised learning models in settings with large proportions of unlabeled data with application to outcome prediction from electronic health records
Michelle Shardell, University of Maryland School of Medicine (USA)
- Semiparametric models of longitudinal data with time-varying covariates and latent effects to address informative observation times with application to U.S. Medicare claims data
--