Relation Extraction between Medical Entities using Deep Learning Approach

Ruchi Patel, Sanjay Tanwani, Chhaya Patidar


Medical discharge summaries and patient prescriptions contain a wide variety of medical terms. Extracting the semantic relations between these terms is essential for discovering significant medical knowledge, and relation classification is one of the central tasks of biomedical information extraction. Automatically identifying relations between diseases, tests and treatments can improve the quality of patient care. This paper presents a deep learning based system for relation extraction between medical entities, in which a convolutional neural network performs relation classification. The system is divided into four modules: word embedding, feature extraction, convolution and a softmax classifier. Its output is the set of classified relations between medical entities. The data set provided by the i2b2 2010 challenge is used for relation detection; it contains a total of 9070 relations in the test data and 5262 relations in the training data. Performance on the relation extraction task is evaluated using precision and recall. The system achieved an average precision of 75% and an average recall of 72%. Its performance is compared with that of the award-winning systems that participated in the i2b2 challenge.
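
The four modules named in the abstract can be illustrated with a minimal forward-pass sketch. It is not the authors' implementation: the label set, the dimensions, the position-based features, the tanh activation, the max-over-time pooling and the random (untrained) weights are all assumptions made for illustration, and the function names are hypothetical.

```python
import math
import random

# Hypothetical label set and sizes, chosen only for illustration.
RELATIONS = ["treatment-problem", "test-problem", "problem-problem", "none"]
EMB_DIM, POS_DIM, N_FILTERS, WINDOW = 8, 2, 6, 3

rng = random.Random(0)  # fixed seed so the sketch is reproducible


def rand_matrix(rows, cols):
    """Random weights standing in for trained parameters."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def embed(tokens, table):
    """Module 1 (word embedding): one vector per token.
    Vectors are random here; a real system would load pretrained ones."""
    for tok in tokens:
        if tok not in table:
            table[tok] = [rng.uniform(-0.5, 0.5) for _ in range(EMB_DIM)]
    return [table[tok] for tok in tokens]


def position_features(n_tokens, e1_idx, e2_idx):
    """Module 2 (feature extraction, assumed form): scaled distance of
    each token to the two entity mentions."""
    return [[(i - e1_idx) / n_tokens, (i - e2_idx) / n_tokens]
            for i in range(n_tokens)]


def convolve_and_pool(rows, filters):
    """Module 3 (convolution): slide each filter over WINDOW-token spans,
    apply tanh, and keep the maximum activation per filter."""
    pooled = []
    for f in filters:
        best = -math.inf
        for start in range(len(rows) - WINDOW + 1):
            span = [x for row in rows[start:start + WINDOW] for x in row]
            best = max(best, math.tanh(sum(w * x for w, x in zip(f, span))))
        pooled.append(best)
    return pooled


def softmax(scores):
    """Module 4 (softmax classifier): turn scores into probabilities."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(tokens, e1_idx, e2_idx, table, filters, out_weights):
    emb = embed(tokens, table)
    pos = position_features(len(tokens), e1_idx, e2_idx)
    rows = [e + p for e, p in zip(emb, pos)]  # concatenate features per token
    pooled = convolve_and_pool(rows, filters)
    scores = [sum(w * x for w, x in zip(ws, pooled)) for ws in out_weights]
    return softmax(scores)


table = {}
filters = rand_matrix(N_FILTERS, WINDOW * (EMB_DIM + POS_DIM))
out_weights = rand_matrix(len(RELATIONS), N_FILTERS)
tokens = "the infection was treated with antibiotics".split()
probs = classify(tokens, e1_idx=1, e2_idx=5, table=table,
                 filters=filters, out_weights=out_weights)
predicted = RELATIONS[probs.index(max(probs))]
```

Because the weights are untrained, the predicted label is arbitrary; the sketch only shows how a sentence with two marked entities could flow through embedding, feature extraction, convolution and softmax to yield one probability per relation class.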

Keywords: Convolutional Neural Network; Feature Extraction; Relation Classification; Word Embedding.





This work is licensed under a Creative Commons Attribution 3.0 License.