User Tools

Site Tools


development:data_architecture

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Last revision Both sides next revision
development:data_architecture [2016/08/22 18:54]
frank_defalco
development:data_architecture [2016/08/23 13:20]
frank_defalco
Line 5: Line 5:
   * **Standardized Data** : Data that has been converted into the OMOP CDM format. ​ These data are said to exist in the "//CDM schema//"​.   * **Standardized Data** : Data that has been converted into the OMOP CDM format. ​ These data are said to exist in the "//CDM schema//"​.
   * **Derived Data** : Data that is the result of some form of processing on data that came from the OMOP CDM format. ​ These data are said to exist in the "//​results schema//"​   * **Derived Data** : Data that is the result of some form of processing on data that came from the OMOP CDM format. ​ These data are said to exist in the "//​results schema//"​
-  * **Administrative ​Data** : Data that is created or maintained to describe a concept, process or system function within the broader OHDSI architecture. These data are said to exist in the "//​OHDSI schema//"​.+  * **OHDSI Platform ​Data** : Data that is created or maintained to describe a concept, process or system function within the broader OHDSI architecture. These data are said to exist in the "//​OHDSI schema//"​.
  
 === Data Life-Cycle === === Data Life-Cycle ===
Line 11: Line 11:
 There is a life-cycle of data within the OHDSI data architecture. ​ Data primarily originates from a person level data source. ​ These data sources include electronic health records, administrative claims records, clinical trial data and billing data among others. ​ These person level data sources are typically received in their own proprietary format which we refer to as the "​Native Format"​. ​ The approach used within OHDSI is to standardize these sources of person level data by performing [[documentation:​etl_best_practices|ETL processes]] and converting them from their Native format to the [[documentation:​cdm|OMOP Common Data Model (CDM)]] format. ​ Once this conversion is complete the intent is for the CDM to be a read only data source. There is a life-cycle of data within the OHDSI data architecture. ​ Data primarily originates from a person level data source. ​ These data sources include electronic health records, administrative claims records, clinical trial data and billing data among others. ​ These person level data sources are typically received in their own proprietary format which we refer to as the "​Native Format"​. ​ The approach used within OHDSI is to standardize these sources of person level data by performing [[documentation:​etl_best_practices|ETL processes]] and converting them from their Native format to the [[documentation:​cdm|OMOP Common Data Model (CDM)]] format. ​ Once this conversion is complete the intent is for the CDM to be a read only data source.
  
-Once a data source has been converted from the native schema to the CDM schema the OHDSI tools provide features to perform many different analyses. ​ These analyses include high level population level descriptive statistics and characterization,​ definition and generation of cohorts, study specification and population level estimation among others. ​ The processes that implement these features generate new data that has been derived from the CDM data schema. ​ This derived data is then stored in the results schema. ​ The administrative ​data or metadata that describes the concepts and processes that were used to derive these results from the CDM schema are stored in the OHDSI schema.+Once a data source has been converted from the native schema to the CDM schema the OHDSI tools provide features to perform many different analyses. ​ These analyses include high level population level descriptive statistics and characterization,​ definition and generation of cohorts, study specification and population level estimation among others. ​ The processes that implement these features generate new data that has been derived from the CDM data schema. ​ This derived data is then stored in the results schema. ​ The OHDSI Platform ​data or metadata that describes the concepts and processes that were used to derive these results from the CDM schema are stored in the OHDSI schema.
  
  
  
development/data_architecture.txt ยท Last modified: 2016/08/23 13:24 by frank_defalco