Pragmatic approaches toward automated extraction and understanding of large scale health-related texts

Xie, Tianyi

2021

Abstract

Natural language processing (NLP) has become a widely used tool in many areas and has drawn considerable attention in the health informatics community. It comprises a series of processes that allow informaticists to extract and understand the information hidden in unstructured text. Such processes can help clinicians make more accurate decisions, filter for more useful information, and better understand public health-related social norms. However, because of the uniqueness of health-related content, the regular NLP workflow has limited application due to the challenges of annotation. This thesis primarily focuses on narrowing the annotation gap by integrating deep learning NLP approaches into the workflow to reduce the annotation effort, or to realize semi-automatic and automatic annotation in certain tasks. In this dissertation, I first present a deep learning-based phenotyping system that extracts blood pressure readings from unstructured clinical notes. The workflow employs a pre-filtering approach that reduces the annotation workload and can be applied in different domains. The second part presents an extractive text summarization system that utilizes the information in the abstracts of scientific publications. The system uses a self-supervised approach that requires no annotation, yet produces a classifier that can detect which content in the body text of a publication should be extracted. In the third part, I propose a workflow that performs info-surveillance of COVID-19 on social media. Using a small set of annotated social media posts, the workflow can monitor the trend and sentiment of the different topics discussed on social media across times and locations.

Details

Title

Pragmatic approaches toward automated extraction and understanding of large scale health-related texts

Author

Xie, Tianyi (Computer Science)

Contributor

Contributor: ProQuest (Firm)
Degree Granting Institution: University of North Carolina at Charlotte
Thesis Advisor: Ge, Yaorong
Committee Member: Chen, Shi
Committee Member: Park, Albert
Committee Member: Yang, Jing

Date

2021

Publisher

University of North Carolina at Charlotte

Subjects

Computer science

Link to This Page

Handle: http://hdl.handle.net/20.500.13093/etd:2833

Publication Type

doctoral dissertations

Pagination

1 online resource (116 pages) : PDF

File Format

application/pdf

Degree Type

Ph.D.

Usage Statement

This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). For additional information, see http://rightsstatements.org/page/InC/1.0/
Copyright is held by the author unless otherwise indicated.
