fbpx Using Latent Dirichlet Allocation and Text Mining Techniques for Understanding Medical Literature |ARAB AMERICAN UNIVERSITY
Contact information for Technical Support and Student Assistance ... Click here

Using Latent Dirichlet Allocation and Text Mining Techniques for Understanding Medical Literature

Authors: 
Saadat M. Alhashmi
Mohammed Maree
Zaina Saadeddin
ISSN: 
2312-5381
Journal Name: 
International Journal of Computing
Volume: 
20
Issue: 
4
Pages From: 
506
To: 
512
Date: 
Friday, December 31, 2021
Keywords: 
text mining, data analysis, medical domain, trending topics, word association rules
Abstract: 
Over the past few years, numerous studies and research articles have been published in the medical literature review domain. The topics covered by these researches included medical information retrieval, disease statistics, drug analysis, and many other fields and application domains. In this paper, we employ various text mining and data analysis techniques in an attempt to discover trending topics and topic concordance in the healthcare literature and data mining field. This analysis focuses on healthcare literature and bibliometric data and word association rules applied to 1945 research articles that had been published between the years 2006 and 2019. Our aim in this context is to assist saving time and effort required for manually summarizing large-scale amounts of information in such a broad and multi-disciplinary domain. To carry out this task, we employ topic modeling techniques through the utilization of Latent Dirichlet Allocation (LDA), in addition to various document and word embedding and clustering approaches. Findings reveal that since 2010 the interest in the healthcare big data analysis has increased significantly, as demonstrated by the five most commonly used topics in this domain.
Attachments: