Extracting Symptom Burden from Clinical Narratives of Cancer Patients Using NLP - PowerPoint Presentation

342 views
Uploaded On 2022-08-04

Extracting Symptom Burden from Clinical Narratives of Cancer Patients Using NLP - PPT Presentation

Meliha Yetisgen University of Washington Ahmad Halwani University of Utah Ozlem Uzuner George Mason University Treatment Diagnosis Remission Survivorship End of Life pain drowsiness ID: 935961

cancer symptoms annotation symptom symptoms cancer symptom annotation standard gold patients extraction active resolved present values information learning aim

Link:

Copy

Embed:

<iframe width="560" height="315" src="https://www.docslides.com/embed/935961" frameborder="0" allowfullscreen></iframe>

Download Presentation from below link

Download Presentation The PPT/PDF document "Extracting Symptom Burden from Clinical ..." is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.

Presentation Transcript

Slide1

Extracting Symptom Burden from Clinical Narratives of Cancer Patients Using NLP

Meliha

Yetisgen

University of Washington

Ahmad Halwani

University of Utah

Ozlem

Uzuner

George Mason University

Slide2

Treatment

DiagnosisRemission

SurvivorshipEnd of Life

pain

drowsiness

Shortness of breath

constipation

anxiety

anorexia

depression

tiredness

Most patients experience symptoms throughout their cancer journey leading to decreased quality of life

Only a minority of patients are referred to supportive oncology services.

Utilization of symptom burden information allows for prioritizing and facilitating patient referral to SOS, decreasing symptom burden in patients with cancer, improving QOL.

Symptom burden is documented in provider notes.

Anywhere between 25% to over 50% of patients on a single visit reported symptoms

Slide3

Example provider noteSince last seen, Mr. Doe reports continued daily fatigue and lack of energy that is worse in the afternoon and limits his performance status; he can perform ADLs but tires easily and need to take a nap in the evening. Lower extremity

edema noted immediately after infusion (likely fluid overload) is now resolved. No chest pain or shortness of breath. His lymphadenopathy is improved but persists and his neck node continues to bother him. A week after his last cycle, he thought he felt a fever but did not take his temperature. This resolved by the next day. Appetite is decreased though oral intake is overall preserved. No chills. Daily

night sweats have resolved since starting treatment. No weight loss. The peripheral neuropathy he developed after cycle 1 (tingling in feet bilaterally, no weakness) is stable and does not bother him much.

Present symptomsAbsent symptoms

Present but Improved symptomsResolved symptoms

Present symptomsPresent symptomsPresent symptoms

Absent symptoms

Slide4

Challenges Richness of language clinicians use in describing details of symptoms High cost of gold standard annotations needed to train and evaluate information extraction models Language use and reporting style differences across institutions

Slide5

Aim 2: Detailed symptom extraction with NLPTwo major cancer centers:Seattle Cancer Care Alliance (SCCA)

Huntsman Cancer Institute (HCI)Two cancer patient populations:Prostate cancerDiffuse Large B Cell Lymphoma (DLBCL)Gold standard annotation with Active learning

Novel deep learning based text processing architectureEvaluation over two institutes for two cancer populations to ensure generalizability

Aim 1: Multi-institution gold standard annotation Goal: We will build Natural Language Processing technologies that can extract symptom information, along with their detailed attributes from clinical narratives.

Outcomes: Detailed symptom schema & annotation guidelines Active learning framework for sampling content rich notes to decrease the manual annotation effortFirst de-identified gold standard corpora annotated with symptoms

Outcomes:

Language modelsOpen source release of the NLP tools R21 Project – Start date May’2021

Slide6

Aim 1 - Symptom Schema Trigger / Symptom Name (e.g., pain, headache) Assertion*Values: present, absent, possible, conditional, not-patient Change of state Values: no_change, worsened, improved, resolved Severity Values: mild, moderate, severe

Etiology Values: disease, treatment AnatomyCharacteristics DurationFrequency

Image source: https://www.cdc.gov/dengue/symptoms/index.html

Slide7

Example Annotations

To capture this complex representation, many annotated training instances are required to train robust IE models

Slide8

Gold standard annotation with Active LearningSample selection criteria:Informativeness: reduce classification uncertaintyDiversity: describe variation in selected samplesPrevious work on Social Determinants of Health gold standard corpora annotation

Lybarger K, Ostendorf M, Yetisgen M. Annotating social determinants of health using active learning, and characterizing determinants using neural event extraction. J Biomed Inform. 2021 Jan;113:103631.

Slide9

Annotation planGold standard: Training set (75%) Random sampling + Active learningHeld-out Test set (25%)Random sampling: To capture true distribution of the dataset Ongoing work & Current progress: Datasets are created6 years of clinical notes of prostate cancer + DLBCL patients from SCCA and HCI

De-identified by an in-house de-identification toolAnnotation guidelines are finalized 6 medical students are working on annotation tasksQuery function design is in progress

Slide10

Aim 2 - Information Extraction Framework Lybarger K, Ostendorf M, Thompson M, Yetisgen M. Extracting COVID-19 Diagnoses and Symptoms From Clinical Text: A New Annotated Corpus and Neural Event Extraction Framework. J Biomed Inform. 2021 Mar 26;:103761.

ArgumentPrecision

RecallF1Trigger0.810.850.83Assertion

0.770.800.79Change 0.450.050.09

Severity 0.450.310.37 Anatomy0.780.500.61

Characteristics0.660.250.36Duration0.540.560.55Frequency0.600.510.55

Baseline Framework

Slide11

Thank you and Questions