User Tools

Site Tools


research:largescalepred

This is an old revision of the document!


Proof of concept study for large-scale patient-level predictive modelling in the OHDSI data network

Objective: The objective of the large-scale patient-level predictive modelling study is to develop models using 5 commonly used classifiers for a single ‘at risk’ cohort (pharmaceutically treated depression cohort) and 22 ‘outcome’ cohorts. The study is implemented across the OHDSI collaborator network to externally validate the models and assess their transportabilities across the world.

Rationale: Observational Health Data Sciences and Informatics (OHDSI) holds the promise of making massive-scale, patient-specific predictive modeling a reality. The data is stored in the common data model (CDM) enables uniform and transparent analysis. The large standardized populations contain rich data to build highly predictive large-scale models and also provide immediate opportunity to serve large communities of patients who are in most need of improved quality of care. Effective exploitation of these massive dataset to develop patient-level prediction models demands a standardized pipeline for both model development and evaluation.

A patient level prediction model problem is defined by an ‘at risk’ cohort (the group of people we wish to do the prediction for), the ‘outcome’ cohort (the outcome we wish to predict) and the ‘at-risk’ period (time window relative to the start of the at risk cohort index date). At present only a limited number of conditions have existing patient level prediction models and little is known about the feasibility of utilizing observational databases for clinically useful patient level prediction models at scale (for all suitable ‘at risk’ and ‘outcome’ cohort pairs). We want to fill this gap by using the observational databases to determine a very large number of ‘at-risk’ and ‘outcome’ pairs and develop prediction models for all these pairs. This study is the start of that challenging but extremely interesting journey.

Project Lead(s): Peter Rijnbeek, Jenna Reps

Coordinating Institution(s): Erasmus MC Rotterdam, The Netherlands

Additional Participants: Martijn Schuemie, Marc Suchard, Patrick Ryan

Full Protocol: To be added

Initial Proposal Date: 2016-09-23

Launch Date: 2016-09-23

Study Closure Date: Pending

Results Submission: Upload the file export/studyResult.zip in the output folder to the study coordinator: submitResults(“c:/temp/study_results/export”, key = “<key>”, secret = “<secret>”)
Where key and secret are the credentials provided to you personally by the study coordinator.

Requirements

CDM: V5

Table Accessed: person, drug_exposure, observations

Database Dialects: SQL Server, Postgres, Oracle

Software: R, Python

Code

Discussion

Datasets Run

  • CCAE, MDCD, MDCR, OPTUM
research/largescalepred.1474464612.txt.gz · Last modified: 2016/09/21 13:30 by prijnbeek