User Tools

Site Tools


projects:workgroups:nlp-wg

This is an old revision of the document!


OHDSI Natural Language Processing Working Group

Objective

The primary goal of the NLP working group is to promote the use of textual information from Electronic Health Records (EHRs) for observational studies under the OHDSI umbrella. To facilitate this objective, the group will develop methods and software that can be implemented to utilize clinical text for studies by the OHDSI community.

Project Lead

Hua Xu

Project Co-leads

Jon Duke, Nigam Shah, Noemie Elhadad

Plan

  1. IRB for use of clinical text
  2. Clinical text data storage and representation schema
  3. NLP tools/pipelines for ETL
  4. Use cases, e.g, phenotyping for cohort selection using NLP outputs

Ongoing Projects

2018

  1. Rules for defining term_exists – led by Stephane Meystre - COMPLETED
  2. Mapping of CUIs to standard terminology – Juan Banda - COMPLETED - https://github.com/thepanacealab/OHDSIananke
  3. Mapping of Note Types to LOINC/standard vocabulary –Karthik Natarajan, Ruth Reeves, Jon Duke and Hua Xu– Report type list discussion
  4. Landscape Analysis of section identifier systems and proposal of a standard terminology for use – Hua Xu, Karthik Natarajan
  5. Examples and rules for term_temporal – led by George Hripsack (Sunny)
  6. Standardization of term_modifiers and values – Hua Xu

2019

  1. Mapping of Note Types to LOINC/standard vocabulary –Karthik Natarajan, Ruth Reeves, Jon Duke and Hua Xu– Report type list discussion
  2. Landscape Analysis of section identifier systems and proposal of a standard terminology for use – Hua Xu, Karthik Natarajan
  3. Examples and rules for term_temporal – led by George Hripsack (Sunny)
  4. Standardization of term_modifiers and values – Hua Xu
  5. Evaluate and revise textual CDM tables by sharing practical issues and lessons learnt during ETL for processing textual data into CDM, Usecases – Ruth Reeves,
  6. Develop tools (within Atlas) to facilitate uses of NLP data for cohort building/phenotyping : Collaborate with eMERGE consortium:
  7. Conduct cross-site studies that use textual data
  8. Continue developing other NLP resources

Participants

  • Hua Xu
  • Anupama Gururaj
  • Nigam Shah
  • Noemie Elhadad
  • Jon Duke
  • Alexandre Yahi
  • Thomas Ginter
  • Olga Patterson
  • George Hripsack
  • Vojtech Huser
  • Mark Khayter
  • Karthik Natarajan
  • Min Jiang
  • Scott DuVall
  • Abraham Hartzema
  • David Sontag
  • Arnab Bose
  • Lian Hu
  • Jan A Kors
  • J van Der Lei
  • Peter R Rijnbeek
  • Vivienne Zhu
  • Bob Patterson
  • Michael Gurley
  • Xiaoling Chen
  • Hongfang Liu
  • Hong Yu
  • Stephane Meystre
  • Timothy Miller
  • Wendy Chapman
  • Feifan Liu
  • Paris Nicolas
  • Mark Dredze
  • Masoud Rouhizadeh
  • Malcolm McRoberts
  • Nishanth Parameshwar Pavinkurve
  • Carol Friedman

Upcoming Meeting Dates

  • March 13th, 2019
  • April 10th, 2019
  • May 8th, 2019
  • June 12th, 2019
  • July 10th, 2019

Repository

Proposal for concepts detected by NLP

create a new table called NOTE_NLP with the following columns

  • note_id (integer) links to NOTE.note_id (foreign key)
  • note_concept_id (integer) concept_id of a term found in the note
  • certainty (real number 0-100) how certain is the NLP pipeline that this concept is present in the note
  • offset position of where in the note was the concept detected
  • span (integer) number of characters from offset where the concept was detected
  • negation_flag (string of length 1 (or boolean)) indicates if the concept is negated

16ohdsi_nlp_schema_updated.pdf

https://docs.google.com/document/d/1ykYVJTQ5MuI7eh_Nk7xzt44EzNjVs71nq2LIsC_RlOg/edit

Start Date

August 2015

WG Agenda/Minutes/Recordings

Meetings

  Schedule: Second Wednesday of every month at 2pm Eastern Time

THE DECEMBER MEETING IS CANCELLED

  
  The next meeting will be on March 13th, 2019
  
  Meetings in 2019:
  
    * April 10th
    * May 8th
    * June 12th
    * July 10th
  
  Call-in: 
  1) Dial +1 (571) 317-3122 (United States) 
     Please see below for international call-in numbers
  2) Enter conference ID: 707-196-421 
  Screen Sharing: [[https://global.gotomeeting.com/join/707196421]]
  More phone numbers
  * Australia : +61 2 8355 1039
  * Austria : +43 7 2088 2172
  * Belgium : +32 (0) 42 68 0180
  * Canada : +1 (647) 497-9379
  * Denmark : +45 89 88 05 39
  * Finland : +358 (0) 931 58 4588
  * France : +33 (0) 170 950 589
  * Germany : +49 (0) 692 5736 7301
  * Ireland : +353 (0) 15 360 757
  * Italy : +39 0 294 75 15 37
  * Netherlands : +31 (0) 108 080 116
  * New Zealand : +64 9 801 0294
  * Norway : +47 21 51 81 86
  * Spain : +34 911 23 4248
  * Sweden : +46 (0) 852 500 516
  * Switzerland : +41 (0) 435 0824 41
  * United Kingdom : +44 (0) 330 221 0099
projects/workgroups/nlp-wg.1550093662.txt.gz · Last modified: 2019/02/13 21:34 by anu_gururaj2