REVISTA
Information

TODAS

Redirigiendo al acceso original de articulo en 15 segundos...

Inicio / Information / Vol: 6 Par: 4 (2015) / Artículo

ARTÍCULO

TITULO

Effects of Semantic Features on Machine Learning-Based Drug Name Recognition Systems: Word Embeddings vs. Manually Constructed Dictionaries

Shengyu Liu

Buzhou Tang

Qingcai Chen and Xiaolong Wang

Resumen

Semantic features are very important for machine learning-based drug name recognition (DNR) systems. The semantic features used in most DNR systems are based on drug dictionaries manually constructed by experts. Building large-scale drug dictionaries is a time-consuming task and adding new drugs to existing drug dictionaries immediately after they are developed is also a challenge. In recent years, word embeddings that contain rich latent semantic information of words have been widely used to improve the performance of various natural language processing tasks. However, they have not been used in DNR systems. Compared to the semantic features based on drug dictionaries, the advantage of word embeddings lies in that learning them is unsupervised. In this paper, we investigate the effect of semantic features based on word embeddings on DNR and compare them with semantic features based on three drug dictionaries. We propose a conditional random fields (CRF)-based system for DNR. The skip-gram model, an unsupervised algorithm, is used to induce word embeddings on about 17.3 GigaByte (GB) unlabeled biomedical texts collected from MEDLINE (National Library of Medicine, Bethesda, MD, USA). The system is evaluated on the drug-drug interaction extraction (DDIExtraction) 2013 corpus. Experimental results show that word embeddings significantly improve the performance of the DNR system and they are competitive with semantic features based on drug dictionaries. F-score is improved by 2.92 percentage points when word embeddings are added into the baseline system. It is comparative with the improvements from semantic features based on drug dictionaries. Furthermore, word embeddings are complementary to the semantic features based on drug dictionaries. When both word embeddings and semantic features based on drug dictionaries are added, the system achieves the best performance with an F-score of 78.37%, which outperforms the best system of the DDIExtraction 2013 challenge by 6.87 percentage points.

Palabras claves

drug name recognition - word embeddings - drug information extraction - biomedical texts

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 6 Parte: 4 (2015)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Applied Sciences
Information
Informatics

DOI

https://doi.org/10.3390/info6040848

Artículos similares

Modelling a Spatial-Motion Deep Learning Framework to Classify Dynamic Patterns of Videos

Acceso

Sandeli Priyanwada Kasthuri Arachchi, Timothy K. Shih and Noorkholis Luthfil Hakim

Video classification is an essential process for analyzing the pervasive semantic information of video content in computer vision. Traditional hand-crafted features are insufficient when classifying complex video information due to the similarity of visu... ver más

Revista: Applied Sciences

Exploring the Importance of Entities in Semantic Ranking

Acceso

Zhenyang Li, Guangluan Xu, Xiao Liang, Feng Li, Lei Wang and Daobing Zhang

In recent years, entity-based ranking models have led to exciting breakthroughs in the research of information retrieval. Compared with traditional retrieval models, entity-based representation enables a better understanding of queries and documents. How... ver más

Revista: Information

Semantic analysis of color, light and transparency in Islamic architecture of Iran

Acceso

Behnam Sarbakhshian

The subject of color, including single colors or compounds like Haft Rang (seven colors), which conveys the psychological effects and mystical interpretation of color, has a palpable presence in the realm of arts, especially in architecture. On the other... ver más

Revista: Innovaciencia

The Intra-Class and Inter-Class Relationships in Style Transfer

Acceso

Xin Cui, Meng Qi, Yi Niu and Bingxin Li

Neural style transfer, which has attracted great attention in both academic research and industrial engineering and demonstrated very exciting and remarkable results, is the technique of migrating the semantic content of one image to different artistic s... ver más

Revista: Applied Sciences

Effects of Semantic Features on Machine Learning-Based Drug Name Recognition Systems: Word Embeddings vs. Manually Constructed Dictionaries

Acceso

Shengyu Liu, Buzhou Tang, Qingcai Chen and Xiaolong Wang

Revista: Information

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas