Minutes_Meeting_10072015

Attendees

Hua Xu, Jon Duke, Noemie Elhadad, Anupama Gururaj, Alexandre Yahi, Thomas Ginter, Olga Patterson, George Hripcsak, Vojtech Huser

Agenda

  1. IRB for use of clinical text
  2. Clinical text data storage and representation schema
    1. Presentation by Dr. Noemie Elhadad
    2. Title: NLP schemas and clinical NLP tools in ShARe
      1. The output of converting unstructured text can take the form of structured data, a bag of words, or word embeddings. Structured data and bag of words are the most useful in the current context.
      2. The ShARe schema for structured output combines work from many initiatives, such as SHARP and THYME
    3. Discussion – Next steps
      1. The table structure for storing concept-level NLP outputs is to be determined
      2. It is sufficient to start with structured output
      3. A concept table with a concept ID in each row, along with the associated note IDs, should be generated
      4. The OMOP vocabulary is to be used to aggregate concepts to a higher level, in order to manage and condense the number of concepts
      5. The next step is to go through all the columns exhaustively for all attributes, merge them, and then decide which attributes should be used in the table
  3. NLP tools/pipelines for ETL
  4. Use cases, e.g., phenotyping for cohort selection using NLP outputs
    1. Presentation by Dr. Jon Duke, Title: Regenstrief NLP platform and approach to validation of phenotypes
    2. Discussion – Next steps
  5. Discussion
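
Illustration of the concept table discussed under agenda item 2: the sketch below is only one possible reading of the "concept ID per row with note IDs" idea, with concepts rolled up to a higher level before grouping. The row layout, the function name build_concept_table, and all concept and note IDs are hypothetical. The descendant-to-ancestor mapping is a stand-in for a lookup against the OMOP vocabulary (for example, its concept_ancestor table). None of this was decided by the workgroup.

```python
from collections import defaultdict

# Hypothetical concept-level NLP output: one row per (note_id, concept_id) mention.
# All IDs below are made up for illustration.
nlp_concept_rows = [
    {"note_id": 1, "concept_id": 101},
    {"note_id": 1, "concept_id": 102},
    {"note_id": 2, "concept_id": 102},
    {"note_id": 3, "concept_id": 201},
]

# Hypothetical descendant -> ancestor mapping, standing in for a lookup
# against the OMOP vocabulary (e.g. the concept_ancestor table).
ancestor_of = {101: 100, 102: 100, 201: 200}


def build_concept_table(rows, ancestors):
    """Return {rolled-up concept_id: sorted list of note_ids mentioning it}."""
    notes_by_concept = defaultdict(set)
    for row in rows:
        # Fall back to the concept itself when no ancestor is known.
        rolled_up = ancestors.get(row["concept_id"], row["concept_id"])
        notes_by_concept[rolled_up].add(row["note_id"])
    return {cid: sorted(notes) for cid, notes in sorted(notes_by_concept.items())}


concept_table = build_concept_table(nlp_concept_rows, ancestor_of)
for concept_id, note_ids in concept_table.items():
    print(concept_id, note_ids)
```

Rolling up before grouping condenses the number of rows, which is the stated purpose of using the OMOP vocabulary. Which attributes each row should carry beyond the concept ID and note IDs is still open, per the next steps above.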
projects/workgroups/minutes.1445541228.txt.gz · Last modified: 2015/10/22 19:13 by anu_gururaj