Minutes_Meeting_01062016

Attendees

Hua Xu, Jon Duke, George Hripcsak, Karthik Natarajan, Anupama Gururaj, Mark Khayter, Min Jiang, Don Torok, Alexandre Yahi, Andrew Williams

Agenda

  1. IRB for use of clinical text
  2. Clinical text data storage and representation schema
  3. NLP tools/pipelines for ETL
  4. Use cases, e.g., phenotyping for cohort selection using NLP outputs
  5. Discussion

Minutes

  1. General IRB document for use of clinical text, to be approved by all contributors and posted online: almost completed
  2. Collect a minimum set of modifiers for all clinical entities that supports the use of rules to derive clinical concepts: Alex
    • cTAKES is being run on clinical notes programmatically. Alex will present the minimal model in the next meeting.
  3. Aggregate and share note-type metadata from various sources: Karthik
    • LOINC note type mapping would be a very useful resource. We should generate a hierarchical representation of note types as an ontology. Karthik will present his work to date at the next meeting.
    • Existing ontology for note types to be shared : Vanderbilt (Hua) and Regenstrief (Jon)
  4. Simple search set up for MT samples: Min
  5. Presentation
  6. Updates from Annual meeting
    • The OHDSI community showed extensive interest in the text processing aspect. Suggestions for improving the current projects were received during the meeting.
  7. IRB for use of clinical text
    • IRB language pertaining to the textual part of the record is being compiled from multiple sources.
    • Anu will collect this language and generate a generic document for use as an example.
    • Once approval of the document is obtained from the contributors, the document will be posted online for use by the OHDSI community.
  8. Clinical text data storage and representation schema
    • A minimum set of modifiers for all clinical entities that supports the use of rules to derive clinical concepts will be generated by Alex (Columbia).
    • To classify notes for the representation schema, note metadata will be generated, with each note type defined in detail and mapped to LOINC codes.
    • Note types from different institutions will be collected. George will share hierarchical note type metadata. Also, we will collect note type metadata from Josh Denny at Vanderbilt. All the collected material will be aggregated by Karthik.
  9. NLP tools/pipelines for ETL
    • The plan is to develop a set of wrappers for multiple NLP tools (currently cTAKES and MetaMap) for conversion of output to the OHDSI textual data schema.
    • To get an idea of the updates in cTAKES, the group needs to invite Guergana Savova to present and demo cTAKES during the January call.
    • To prioritize the work, the group will focus on positive concepts first, for high-confidence extraction of named entities from text.
  10. Use cases, e.g., phenotyping for cohort selection using NLP outputs
    • To define the syntax for storing phenotypes, two aspects can be considered:
      1. set of data elements or features on which an algorithm functions
      2. formulation of the phenotype definition
    • In order to represent the NLP output, query-based phenotyping will be the first focus of the group.
    • For machine-learning-based algorithms, the NLP output will be accessed outside of the CDM.
    • Is ElasticSearch (ES) a good first step in this area? ES should be considered here more as a tool for cohort building and selection than for phenotyping. For this purpose, it is a good starting point.
    • Finding patients for clinical trials will be used as a use case here. ES could serve as an explorer for feature selection in the phenotyping process.
    • Action item: Simple search set up for MT samples by next meeting by Min.
    • Use MIMIC-II and MIMIC-III as demo datasets for the tools being developed by the group.
  11. Discussion
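
The wrapper idea in item 9 and the query-based phenotyping idea in item 10 can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `TextConcept` fields, the two input shapes (`tool_a`, `tool_b`), and the codes `CODE_A` and `CODE_B` are hypothetical and do not reflect the actual output formats of cTAKES or MetaMap, nor the OHDSI textual data schema, which the minutes do not specify. The sketch only shows the general shape: one adapter per tool, a shared row type, a filter that keeps positive concepts, and a simple query over the result.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Set


@dataclass(frozen=True)
class TextConcept:
    """One extracted concept in a tool-independent shape (hypothetical fields)."""
    person_id: int
    note_id: int
    concept_code: str
    negated: bool
    source_tool: str


def from_tool_a(person_id: int, note_id: int, mentions: Iterable[dict]) -> List[TextConcept]:
    """Adapter for a hypothetical tool that reports polarity as +1 or -1."""
    return [
        TextConcept(person_id, note_id, m["cui"], m["polarity"] < 0, "tool_a")
        for m in mentions
    ]


def from_tool_b(person_id: int, note_id: int, mentions: Iterable[dict]) -> List[TextConcept]:
    """Adapter for a hypothetical tool that reports a boolean 'neg' flag."""
    return [
        TextConcept(person_id, note_id, m["code"], bool(m["neg"]), "tool_b")
        for m in mentions
    ]


def positive_only(rows: Iterable[TextConcept]) -> List[TextConcept]:
    """Keep only non-negated mentions (the 'positive concepts first' priority)."""
    return [r for r in rows if not r.negated]


def cohort_with_all(rows: Iterable[TextConcept], required: Set[str]) -> Set[int]:
    """Query-based selection: persons with a positive mention of every required code."""
    seen: Dict[int, Set[str]] = {}
    for r in positive_only(rows):
        seen.setdefault(r.person_id, set()).add(r.concept_code)
    return {pid for pid, codes in seen.items() if required <= codes}


# Made-up example data: person 1 has both codes affirmed, person 2 has one negated.
rows = (
    from_tool_a(1, 10, [{"cui": "CODE_A", "polarity": 1}])
    + from_tool_b(1, 11, [{"code": "CODE_B", "neg": False}])
    + from_tool_a(2, 12, [{"cui": "CODE_A", "polarity": -1}])
)
cohort = cohort_with_all(rows, {"CODE_A", "CODE_B"})
print(cohort)  # {1}
```

Keeping negation as an explicit field, rather than dropping negated mentions inside each adapter, leaves room for the minimal modifier set from item 8 to be added later without changing the adapters' contract.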

Action Items

  1. IRB for use of clinical text
  2. Clinical text data storage and representation schema
  3. NLP tools/pipelines for ETL
  4. Use cases, e.g., phenotyping for cohort selection using NLP outputs
  5. Discussion
projects/workgroups/wg_meeting_01062016.1454106624.txt.gz · Last modified: 2016/01/29 22:30 by anu_gururaj