Biomedical named entity recognition based on recurrent neural networks with different extended methods Online publication date: Fri, 14-Oct-2016
by Dingxin Song; Lishuang Li; Liuke Jin; Degen Huang
International Journal of Data Mining and Bioinformatics (IJDMB), Vol. 16, No. 1, 2016
Abstract: Biomedical Named Entity Recognition (bio-NER) has become essential to the text mining and knowledge discovery tasks in biomedical field. However, the performance of traditional NER systems is limited to the construction of complex hand-designed features which are derived from various linguistic analyses and may only adapted to specified domain. In this paper, we mainly focus on building a simple and efficient system for bio-NER based on Recurrent Neural Network (RNN) where complex hand-designed features are replaced with word embeddings. Furthermore, the system is extended by the predicted information from the prior node and external context information (topical information & clustering information). During the training process, the word embeddings are fine-tuned by the neural network. The experiments conducted on the BioCreative II GM data set demonstrate RNN models outperform CRF model and Deep Neural Networks (DNNs) and the extended RNN model performs better than the original RNN, achieving 82.47% F-score.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Data Mining and Bioinformatics (IJDMB):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com