• Home
  • Advanced Search
  • Directory of Libraries
  • About lib.ir
  • Contact Us
  • History

عنوان
Knowledge Discovery through Linking Multiple Heterogeneous, Unstructured Data Streams:

پدید آورنده
Alodadi, Mohammad Saad S.

موضوع
Computer science,Information science,Information technology

رده

کتابخانه
Center and Library of Islamic Studies in European Languages

محل استقرار
استان: Qom ـ شهر: Qom

Center and Library of Islamic Studies in European Languages

تماس با کتابخانه : 32910706-025

NATIONAL BIBLIOGRAPHY NUMBER

Number
TLpq2478031553

LANGUAGE OF THE ITEM

.Language of Text, Soundtrack etc
انگلیسی

TITLE AND STATEMENT OF RESPONSIBILITY

Title Proper
Knowledge Discovery through Linking Multiple Heterogeneous, Unstructured Data Streams:
General Material Designation
[Thesis]
First Statement of Responsibility
Alodadi, Mohammad Saad S.
Title Proper by Another Author
A Case of Clinical Notes Mining
Subsequent Statement of Responsibility
Janeja, Vandana P.

.PUBLICATION, DISTRIBUTION, ETC

Name of Publisher, Distributor, etc.
University of Maryland, Baltimore County
Date of Publication, Distribution, etc.
2020

PHYSICAL DESCRIPTION

Specific Material Designation and Extent of Item
184

DISSERTATION (THESIS) NOTE

Dissertation or thesis details and type of degree
Ph.D.
Body granting the degree
University of Maryland, Baltimore County
Text preceding or following the note
2020

SUMMARY OR ABSTRACT

Text of Note
The focus of this dissertation is to discover patterns of significant relationships across unstructured, heterogeneous data streams. One such example domain is in Electronic Health Records (EHR) for text in treatments, tests, and diagnoses, particularly in clinical notes. While applying large and automated analysis on the unstructured data can be much more complicated than on structured data, it can potentially provide better support and information for decision making. From the health care providers' perspective, the large number of medical tests of a patient has further increased the complexity in identifying and determining an accurate diagnosis for each patient. In addition, the EHR should be able to support Clinical Decision Support Systems (CDS) for providing health professionals with the most recent and related biomedical literature to improve the decision-making process concerning a given patient's record. However, the current EHR systems lack this ability. This creates a gap between the advances in the biomedical domain and day to day practices within EHR systems. In this dissertation, we propose several methods to overcome these challenges. For extracting knowledge from heterogeneous unstructured data, we propose a weighted association rules mining method to extract significant entities from the unstructured data and generate weighted association rules among them. We also expand our data by utilizing ontology-based expansion. Our discovered rules reveal non-trivial interdependencies which can help support practitioners' decision, such as during clinical interventions. Our frequency-based methods generate rules with higher interestingness and relatedness rate provided by a health provider. Furthermore, on a temporal use case in EHR, our method shows an increase in rules in later days of hospital admission, which imitate the secondary diagnoses phenomenon, which are conditions that coexist at the time of admission that develop subsequently to the principal diagnosis. Our preliminary results with proposed weighted transactional item representation show promising results in identifying strongly related entities, for example, in medical entities (diagnosis, test, treatments). To improve the literature search for professionals, we propose and evaluate multiple query expansion and re-ranking methods. The expansion query methods rely on vocabularies and different embedding models. The re-ranking method relies on dual embedding retrieval indexes that focus on latent features compared to standard explicit terms. We experiment with these methods on publicly available ad-hoc information retrieval tasks in biomedical literature. The results show that combining latent features and explicit features increases the precision of the results. We performed an extrinsic evaluation on multiple datasets (a biomedical literature database and health forum data). We found that our embedding model can be generalized to offer data representation of medical textual data (such as biomedical literature, clinical notes, or medical social media data).

TOPICAL NAME USED AS SUBJECT

Computer science
Information science
Information technology

PERSONAL NAME - PRIMARY RESPONSIBILITY

Alodadi, Mohammad Saad S.
Janeja, Vandana P.

ELECTRONIC LOCATION AND ACCESS

Electronic name
 مطالعه متن کتاب 

p

[Thesis]
276903

a
Y

Proposal/Bug Report

Warning! Enter The Information Carefully
Send Cancel
This website is managed by Dar Al-Hadith Scientific-Cultural Institute and Computer Research Center of Islamic Sciences (also known as Noor)
Libraries are responsible for the validity of information, and the spiritual rights of information are reserved for them
Best Searcher - The 5th Digital Media Festival