| Sign up

PDF (1.6 MB)

Cite

EndNote(RIS) BibTeX

Collect

Collect

Submit Manuscript

Open Access

Fusion Model for Tentative Diagnosis Inference Based on Clinical Narratives

Ying Yu^{¹^,³}, Junwen Duan^²(), Min Li^²

1School of Computer Science and Engineering, Central South University, Changsha 410083, China

2Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China

3School of Computer Science, University of South China, Hengyang 421001, China

Show Author Information

Abstract

In general, physicians make a preliminary diagnosis based on patients’ admission narratives and admission conditions, largely depending on their experiences and professional knowledge. An automatic and accurate tentative diagnosis based on clinical narratives would be of great importance to physicians, particularly in the shortage of medical resources. Despite its great value, little work has been conducted on this diagnosis method. Thus, in this study, we propose a fusion model that integrates the semantic and symptom features contained in the clinical text. The semantic features of the input text are initially captured by an attention-based Bidirectional Long Short-Term Memory (BiLSTM) network. The symptom concepts, recognized from the input text, are then vectorized by using the term frequency-inverse document frequency method based on the relations between symptoms and diseases. Finally, two fusion strategies are utilized to recommend the most potential candidate for the international classification of diseases code. Model training and evaluation are performed on a public clinical dataset. The results show that both fusion strategies achieved a promising performance, in which the best performance obtained a top-3 accuracy of 0.7412.

Keywords

tentative diagnosis clinical narrative Bidirectional Long Short-Term Memory (BiLSTM)Term Frequency-Inverse Document Frequency (TF-IDF)fusion strategy

References

[1]

I.

Boas

, Early and tentative diagnosis of gastrointestinal carcinoma, Am. J. Cancer, vol. 15, no. 3, pp. 1586–1589, 1931.

[2]

Y.

Yu

, M.

Li

, L.

Liu

, Y.

Li

, and J.

Wang

, Clinical big data and deep learning: Applications, challenges, and future outlooks, Big Data Mining and Analytics, vol. 2, no. 4, pp. 288–305, 2019.

Crossref Google Scholar

[3]

E.

Choi

, M. T.

Bahadori

, J. A.

Kulas

, A.

Schuetz

, W. F.

Stewart

, and J.

Sun

, RETAIN: An interpretable predictive model for healthcare using reverse time attention mechanism, in Proc. 30^th Int. Conf. on Neural Information Processing Systems, Barcelona, Spain, 2016, pp. 3512–3520.

[4]

F.

Ma

, R.

Chitta

, J.

Zhou

, Q.

You

, T.

Sun

, and J.

Gao

, Dipole: Diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks, in Proc. 23^rd ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, Halifax, Canada, 2017, pp. 1903–1911.

Crossref Google Scholar

[5]

F.

Ma

, J.

Gao

, Q.

Suo

, Q.

You

, J.

Zhou

, and A.

Zhang

, Risk prediction on electronic health records with prior medical knowledge, in Proc. 24^th ACM SIGKDD Int. Conf. on Knowledge Discovery & Data, London, UK, 2018, pp. 1910–1919.

Crossref Google Scholar

[6]

H.

Liang

, B. Y.

Tsui

, H.

Ni

, C. C. S.

Valentim

, S. L.

Baxter

, G.

Liu

, W.

Cai

, D. S.

Kermany

, X.

Sun

, J.

Chen

, et al., Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence, Nat. Med., vol. 25, no. 3, pp. 433–438, 2019.

Crossref Google Scholar

[7]

A.

Graves

and J.

Schmidhuber

, Framewise phoneme classification with bidirectional LSTM and other neural network architectures, Neural Netw., vol. 18, nos. 5&6, pp. 602–610, 2005.

Crossref Google Scholar

[8]

A. R.

Aronson

, Effective mapping of biomedical text to the UMLS metathesaurus: The MetaMap program, in Proc. AMIA 2001, Washington, DC, USA, 2001, p. 17.

[9]

G.

Salton

, A.

Wong

, and C. S.

Yang

, A vector space model for automatic indexing, Commun. ACM, vol. 18, no. 11, pp. 613–620, 1975.

Crossref Google Scholar

[10]

Y.

Yu

, M.

Li

, L.

Liu

, F. X.

Wu

, and J.

Wang

, Tentative diagnosis prediction via deep understanding of patient narratives, in Proc. 2019 IEEE Int. Conf. on Bioinformatics and Biomedicine (BIBM), San Diego, CA, USA, 2019, pp. 1000–1003.

Crossref Google Scholar

[11]

A. N.

Jagannatha

and H.

Yu

, Bidirectional RNN for medical event detection in electronic health records, in Proc. 2016 Conf. North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, CA, USA, 2016, pp. 473–482.

Crossref Google Scholar

[12]

Y.

Luo

, Recurrent neural networks for classifying relations in clinical notes, J. Biomed. Inform., vol. 72, pp. 85–95, 2017.

Crossref Google Scholar

[13]

S.

Gao

, M. T.

Young

, J. X.

Qiu

, H. J.

Yoon

, J. B.

Christian

, P. A.

Fearn

, G. D.

Tourassi

, and A.

Ramanthan

, Hierarchical attention networks for information extraction from cancer pathology reports, J. Am. Med. Inform. Assoc., vol. 25, no. 3, pp. 321–330, 2018.

Crossref Google Scholar

[14]

L.

Gligic

, A.

Kormilitzin

, P.

Goldberg

, and A.

Nevado-Holgado

, Named entity recognition in electronic health records using transfer learning bootstrapped neural networks, Neural Netw., vol. 121, pp. 132–139, 2020.

Crossref Google Scholar

[15]

S.

Gehrmann

, F.

Dernoncourt

, Y.

Li

, Y.

Li

, E. T.

Carlson

, J. T.

Wu

, J.

Welt

, J.

Foote

Jr., E. T.

Moseley

, D. W.

Grant

, et al., Comparing rule-based and deep learning models for patient phenotyping, arXiv preprint arXiv: 1703.08705, 2017.

[16]

H.

Shi

, P.

Xie

, Z.

Hu

, M.

Zhang

, and E. P.

Xing

, Towards automated ICD coding using deep learning, arXiv preprint arXiv: 1711.04075, 2017.

[17]

W.

Ning

, M.

Yu

, and R.

Zhang

, A hierarchical method to automatically encode Chinese diagnoses through semantic similarity estimation, BMC Med. Inform. Decis. Mak., vol. 16, p. 30, 2016.

Crossref Google Scholar

[18]

T.

Baumel

, J.

Nassour-Kassis

, R.

Cohen

, M.

Elhadad

, and N.

Elhadad

, Multi-label classification of patient notes a case study on ICD code assignment, arXiv preprint arXiv: 1709.09587, 2017.

[19]

M.

Li

, Z.

Fei

, M.

Zeng

, F. X.

Wu

, Y.

Li

, Y.

Pan

, and J.

Wang

, Automated ICD-9 coding via a deep learning approach, IEEE/ACM Trans. Comput. Biol. Bioinform., vol. 16, no. 4, pp. 1193–1202, 2019.

Crossref Google Scholar

[20]

M.

Zeng

, M.

Li

, Z.

Fei

, Y.

Yu

, Y.

Pan

, and J.

Wang

, Automatic ICD-9 coding via deep transfer learning, Neurocomputing, vol. 324, pp. 43–50, 2019.

Crossref Google Scholar

[21]

Y.

Wu

, M.

Zeng

, Z.

Fei

, Y.

Yu

, F. X.

Wu

, and M.

Li

, KAICD: A knowledge attention-based deep learning framework for automatic ICD coding, Neurocomputing, vol. 469, pp. 376–383, 2022.

Crossref Google Scholar

[22]

Z.

Liu

, B.

Tang

, X.

Wang

, and Q.

Chen

, De-identification of clinical notes via recurrent neural network and conditional random field, J. Biomed. Inform., vol. 75, pp. S34–S42, 2017.

Crossref Google Scholar

[23]

F.

Dernoncourt

, J. Y.

Lee

, O.

Uzuner

, and P.

Szolovits

, De-identification of patient notes with recurrent neural networks, J. Am. Med. Inform. Assoc., vol. 24, no. 3, pp. 596–606, 2017.

Crossref Google Scholar

[24]

Y.

Yu

, M.

Li

, L.

Liu

, Z.

Fei

, F. X.

Wu

, and J.

Wang

, Automatic ICD code assignment of Chinese clinical notes based on multilayer attention BiRNN, J. Biomed. Inform., vol. 91, p. 103114, 2019.

Crossref Google Scholar

[25]

G. B.

Moody

and R. G.

Mark

, A database to support development and evaluation of intelligent intensive care monitoring, in Proc. of Computers in Cardiology 1996, Indianapolis, IN, USA, 2002, pp. 657–660.

[26]

M.

Saeed

, M.

Villarroel

, A. T.

Reisner

, G.

Clifford

, L. W.

Lehman

, G.

Moody

, T.

Heldt

, T. H.

Kyaw

, B.

Moody

, and R. G.

Mark

, Multiparameter intelligent monitoring in intensive care II: A public-access intensive care unit database, Crit. Care Med., vol. 39, no. 5, pp. 952–960, 2011.

Crossref Google Scholar

[27]

A. E. W.

Johnson

, T. J.

Pollard

, L.

Shen

, L. W. H.

Lehman

, M.

Feng

, M.

Ghassemi

, B.

Moody

, P.

Szolovits

, L. A.

Celi

, and R. G.

Mark

, MIMIC-III, a freely accessible critical care database, Sci. Data, vol. 3, p. 160035, 2016.

Crossref Google Scholar

[28]

A. R.

Aronson

and F. M.

Lang

, An overview of MetaMap: Historical perspective and recent advances, J. Am. Med. Inform. Assoc., vol. 17, no. 3, pp. 229–236, 2010.

Crossref Google Scholar

[29]

P.

Sondhi

, J.

Sun

, H.

Tong

, and C.

Zhai

, SympGraph: A framework for mining clinical notes through symptom relation graphs, in Proc. 18^th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, Beijing, China, 2012, pp. 1167–1175.

Crossref Google Scholar

[30]

S.

Hochreiter

and J.

Schmidhuber

, Long short-term memory, Neural Comput., vol. 9, no. 8, pp. 1735–1780, 1997.

Crossref Google Scholar

[31]

A.

Graves

and N.

Jaitly

, Towards end-to-end speech recognition with recurrent neural networks, in Proc. 31^st Int. Conf. on Int. Conf. on Machine Learning, Beijing, China, 2014, pp. 1764–1772.

[32]

J.

Chorowski

, D.

Bahdanau

, D.

Serdyuk

, K.

Cho

, and Y.

Bengio

, Attention-based models for speech recognition, in Proc. 28^th Int Conf on Neural Information Processing Systems, Montreal, Canada, 2015, pp. 577–585.

[33]

Q.

You

, H.

Jin

, Z.

Wang

, C.

Fang

, and J.

Luo

, Image captioning with semantic attention, in Proc. 2016 IEEE Conf. on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 4651–4659.

Crossref Google Scholar

[34]

P.

Zhou

, W.

Shi

, J.

Tian

, Z.

Qi

, B.

Li

, H.

Hao

, and B.

Xu

, Attention-based bidirectional long short-term memory networks for relation classification, in Proc. 54^th Annu. Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Berlin, Germany, 2016, pp. 207–212.

Crossref Google Scholar

[35]

L.

Luo

, Z.

Yang

, P.

Yang

, Y.

Zhang

, L.

Wang

, H.

Lin

, and J.

Wang

, An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition, Bioinformatics, vol. 34, no. 8, pp. 1381–1388 2017.

Crossref Google Scholar

[36]

A.

Vaswani

, N.

Shazeer

, N.

Parmar

, J.

Uszkoreit

, L.

Jones

, A. N.

Gomez

, Ł

Kaiser

, and I.

Polosukhin

, Attention is all you need, in Proc. 31^st Int. Conf. on Neural Information Processing Systems, Long Beach, CA, USA, 2017, pp. 6000–6010.

[37]

K. S.

Jones

, A statistical interpretation of term specificity and its application in retrieval, J. Doc., vol. 28, no. 1, pp. 11–21, 1972.

Crossref Google Scholar

[38]

X.

Zhou

, J.

Menche

, A. L.

Barabási

, and A.

Sharma

, Human symptoms-disease network, Nat. Commun., vol. 5, p. 4212, 2014.

Crossref Google Scholar

Tsinghua Science and Technology

Volume 28 Issue 4,
August 2023

Pages 686-695

DOI: 10.26599/TST.2022.9010049

Cite this article:

Yu Y, Duan J, Li M. Fusion Model for Tentative Diagnosis Inference Based on Clinical Narratives. Tsinghua Science and Technology, 2023, 28(4): 686-695. https://doi.org/10.26599/TST.2022.9010049

About Us

Learn about Open Access

Tsinghua University Press

Publish with Us

Peer Review Policy

Copyright and Licensing

Article Processing Charge

Contact Us

Journal Collaboration: Yao Meng (Ms.)✉️ +86-10-83470574

Technical Support: Kuo Zhao (Mr.)✉️ +86-10-83470507

Media Contact: Hao Jin (Mr.)✉️ +86-10-83470559

Address: Floor 6, Tower B, Xueyan Building, Shuangqing Road, Haidian District, Beijing 100084, China.

SciOpen——中国科技期刊卓越行动计划支持项目

Copyright © 2025 Tsinghua University Press Ltd.

京ICP备 10035462号-42 京公网安备11010802044758号