Publication Type : Book Chapter
Publisher : Springer Netherlands
Source : Lecture Notes in Computational Vision and Biomechanics, Springer Netherlands, Volume 31, p.227-237 (2019)
Campus : Coimbatore
School : School of Engineering
Center : Center for Computational Engineering and Networking
Department : Electronics and Communication
Year : 2019
Abstract : Many healthcare-related applications use pretrained embeddings, but these are often trained over general corpus which is mostly downstreamed to certain particular application. One problem noticed among such embeddings is that these are not efficient across various health text applications and even less number of research describe evaluation of these embedding for health domain. In this paper, distributional embedding model is performed to acquire a word representation on data crawled from Journal of Medical Case Reports. This distributed embedding model is analyzed qualitatively and quantitatively over crawled corpus. Qualitative evaluation is employed by cosine similarity on different categories and is visually represented. Quantitative evaluation performed with parts of speech tagging and entity recognition. The embedding model attained a cross-validation accuracy of 91.70% in parts of speech tagging for GENIA corpus and ensured 83% accuracy in the entity recognition of i2b2 clinical data. © Springer Nature Switzerland AG 2019.
Cite this Research Publication : J. R. Naveen, Ganesh, H. B. Barathi, M. Kumar, A., and Dr. Soman K. P., “Distributed representation of healthcare text through qualitative and quantitative analysis”, in Lecture Notes in Computational Vision and Biomechanics, vol. 31, Springer Netherlands, 2019, pp. 227-237.